Data Center / Cloud – NVIDIA Technical BlogNews and tutorials for developers, data scientists, and IT admins2025-04-28T23:18:38Zhttp://www.open-lab.net/blog/feed/Meenakshi Kaushik<![CDATA[NVIDIA NIM Operator 2.0 Boosts AI Deployment with NVIDIA NeMo Microservices Support]]>http://www.open-lab.net/blog/?p=993092025-04-28T23:18:38Z2025-04-29T16:00:00ZThe first release of NVIDIA NIM Operator simplified the deployment and lifecycle management of inference pipelines for NVIDIA NIM microservices, reducing the...
]]>Emily Sakata<![CDATA[Announcing NVIDIA Secure AI General Availability]]>http://www.open-lab.net/blog/?p=990642025-04-23T22:23:18Z2025-04-23T22:23:11ZAs many enterprises move to running AI training or inference on their data, the data and the code need to be protected, especially for large language models...
]]>Brian Sparks<![CDATA[NVIDIA Helps Build AI Factories Faster Than Ever with NVIDIA DGX SuperPOD]]>http://www.open-lab.net/blog/?p=985792025-04-17T19:35:28Z2025-04-11T18:35:30ZIn a cavernous room at an undisclosed location in Japan, a digital revolution is unfolding. Racks of servers stand like giants, their sleek frames linked by...
]]>Graham Lopez<![CDATA[Just Released: NVIDIA HPC SDK v25.3]]>http://www.open-lab.net/blog/?p=986462025-04-17T19:35:30Z2025-04-10T20:20:32ZThe HPC SDK v25.3 release includes support for NVIDIA Blackwell GPUs and an optimized allocator for Arm CPUs.
]]>Matheen Raza<![CDATA[Delivering NVIDIA Accelerated Computing for Enterprise AI Workloads with Rafay]]>http://www.open-lab.net/blog/?p=985332025-04-22T23:52:20Z2025-04-09T20:09:43ZThe worldwide adoption of generative AI has driven massive demand for accelerated compute hardware globally. In enterprises, this has accelerated the deployment...
]]>Christian Munley<![CDATA[Stanford Das Lab Accelerates RNA Folding Research with NVIDIA DGX Cloud]]>http://www.open-lab.net/blog/?p=968402025-04-17T19:35:35Z2025-04-09T16:00:00ZThe Das Lab at Stanford is revolutionizing RNA folding research with a unique approach that leverages community involvement and accelerated computing. With the...
]]>Ashraf Eassa<![CDATA[NVIDIA Blackwell Delivers Massive Performance Leaps in MLPerf Inference v5.0]]>http://www.open-lab.net/blog/?p=983672025-04-23T19:41:12Z2025-04-02T18:14:48ZThe compute demands for large language model (LLM) inference are growing rapidly, fueled by the combination of growing model sizes, real-time latency...
]]>Ronen Dar<![CDATA[NVIDIA Open Sources Run:ai Scheduler to Foster Community Collaboration]]>http://www.open-lab.net/blog/?p=980942025-04-22T23:59:16Z2025-04-01T09:00:00ZToday, NVIDIA announced the open-source release of the KAI Scheduler, a Kubernetes-native GPU scheduling solution, now available under the Apache 2.0 license....
]]>Ameya Parab<![CDATA[Practical Tips for Preventing GPU Fragmentation for Volcano Scheduler]]>http://www.open-lab.net/blog/?p=981712025-04-03T18:44:56Z2025-03-31T20:00:54ZAt NVIDIA, we take pride in tackling complex infrastructure challenges with precision and innovation. When Volcano faced GPU underutilization in their NVIDIA...
]]>Brad Smith<![CDATA[A New Era in Data Center Networking with NVIDIA Silicon Photonics-based Network Switching]]>http://www.open-lab.net/blog/?p=979172025-04-03T18:45:19Z2025-03-27T16:00:00ZNVIDIA is breaking new ground by integrating silicon photonics directly with its NVIDIA Quantum and NVIDIA Spectrum switch ICs. At GTC 2025, we announced the...
]]>Pradyumna Desale<![CDATA[Automating AI Factories with NVIDIA Mission Control]]>http://www.open-lab.net/blog/?p=980122025-04-03T18:47:00Z2025-03-25T18:45:11ZAdvanced AI models such as DeepSeek-R1 are proving that enterprises can now build cutting-edge AI models specialized with their own data and expertise. These...
]]>Brad Nemire<![CDATA[Upcoming Event: NVIDIA at KubeCon and CloudNativeCon Europe]]>http://www.open-lab.net/blog/?p=979702025-03-24T17:15:13Z2025-03-24T17:15:10ZAttending KubeCon? Meet NVIDIA at booth S750, join our startup mixer, or stop by our 15+ sessions.
]]>Andrew Fear<![CDATA[NVIDIA Demonstrates GeForce NOW for Game AI Inference and Streamlined Hands-on Opportunities]]>http://www.open-lab.net/blog/?p=978252025-04-17T18:17:43Z2025-03-20T17:34:38ZNVIDIA cloud gaming service GeForce NOW is providing developers and publishers with new tools to bring their games to more gamers��and offer new experiences...
]]>Uttara Kumar<![CDATA[Boost Llama Model Performance on Microsoft Azure AI Foundry with NVIDIA TensorRT-LLM]]>http://www.open-lab.net/blog/?p=970082025-04-23T00:07:01Z2025-03-20T15:00:00ZMicrosoft, in collaboration with NVIDIA, announced transformative performance improvements for the Meta Llama family of models on its Azure AI Foundry platform....
]]>Phoebe Lee<![CDATA[NVIDIA Virtual GPU 18.0 Enables VDI for AI on Every Virtualized Platform]]>http://www.open-lab.net/blog/?p=976182025-04-23T00:07:56Z2025-03-19T20:00:00ZNVIDIA Virtual GPU (vGPU) technology unlocks AI capabilities within Virtual Desktop Infrastructure (VDI), making it more powerful and versatile than ever...
]]>Dave Salvator<![CDATA[NVIDIA Blackwell Ultra for the Era of AI Reasoning]]>http://www.open-lab.net/blog/?p=967612025-03-20T22:34:30Z2025-03-19T18:00:15ZFor years, advancements in AI have followed a clear trajectory through pretraining scaling: larger models, more data, and greater computational resources lead...
]]>Jonathan Ferrer Mestres<![CDATA[NVIDIA Earth-2 Powers Regional AI Weather Forecasting in the United Arab Emirates]]>http://www.open-lab.net/blog/?p=970742025-04-23T00:27:21Z2025-03-19T16:01:00ZIn the United Arab Emirates (UAE), extreme weather events disrupt daily life, delaying flights, endangering transportation, and complicating urban planning....
]]>TJ Chen<![CDATA[Shrink Genomics and Single-Cell Analysis Time to Minutes with NVIDIA Parabricks and NVIDIA AI Blueprints]]>http://www.open-lab.net/blog/?p=969792025-03-20T18:33:12Z2025-03-19T15:00:00ZNVIDIA Parabricks is a scalable genomics analysis software suite that solves omics challenges with accelerated computing and deep learning to unlock new...
]]>Vishal Ganeriwala<![CDATA[Seamlessly Scale AI Across Cloud Environments with NVIDIA DGX Cloud Serverless Inference]]>http://www.open-lab.net/blog/?p=971922025-03-20T17:07:54Z2025-03-18T21:22:51ZNVIDIA DGX Cloud Serverless Inference is an auto-scaling AI inference solution that enables application deployment with speed and reliability. Powered by NVIDIA...
]]>Emily Potyraj<![CDATA[Measure and Improve AI Workload Performance with NVIDIA DGX Cloud Benchmarking]]>http://www.open-lab.net/blog/?p=975482025-03-20T17:07:42Z2025-03-18T21:21:17ZAs AI capabilities advance, understanding the impact of hardware and software infrastructure choices on workload performance is crucial for both technical...
]]>Amr Elmeleegy<![CDATA[Introducing NVIDIA Dynamo, A Low-Latency Distributed Inference Framework for Scaling Reasoning AI Models]]>http://www.open-lab.net/blog/?p=952742025-04-23T00:15:55Z2025-03-18T17:50:00ZNVIDIA announced the release of NVIDIA Dynamo today at GTC 2025. NVIDIA Dynamo is a high-throughput, low-latency open-source inference serving framework for...
]]>1Ashraf Eassa<![CDATA[NVIDIA Blackwell Delivers World-Record DeepSeek-R1 Inference Performance]]>http://www.open-lab.net/blog/?p=973522025-04-23T00:23:25Z2025-03-18T17:41:42ZNVIDIA announced world-record DeepSeek-R1 inference performance at NVIDIA GTC 2025. A single NVIDIA DGX system with eight NVIDIA Blackwell GPUs can achieve over...
]]>1Ben Williams<![CDATA[Networking Reliability and Observability at Scale with NCCL 2.24]]>http://www.open-lab.net/blog/?p=967312025-04-23T00:32:27Z2025-03-13T16:30:00ZThe NVIDIA Collective Communications Library (NCCL) implements multi-GPU and multinode (MGMN) communication primitives optimized for NVIDIA GPUs and networking....
]]>Anu Srivastava<![CDATA[Lightweight, Multimodal, Multilingual Gemma 3 Models Are Streamlined for Performance]]>http://www.open-lab.net/blog/?p=967702025-04-23T00:33:31Z2025-03-12T08:45:00ZBuilding AI systems with foundation models requires a delicate balancing of resources such as memory, latency, storage, compute, and more. One size does not fit...
]]>Gregory Kimball<![CDATA[Efficient ETL with Polars and Apache Spark on NVIDIA Grace CPU]]>http://www.open-lab.net/blog/?p=968072025-04-23T00:33:58Z2025-03-11T18:30:00ZThe NVIDIA Grace CPU Superchip delivers outstanding performance and best-in-class energy efficiency for CPU workloads in the data center and in the cloud. The...
]]>Nikhil Gupta<![CDATA[Optimizing Compile Times for CUDA C++]]>http://www.open-lab.net/blog/?p=967752025-04-23T00:36:07Z2025-03-10T18:02:27ZIn modern software development, time is an incredibly valuable resource, especially during the compilation process. For developers working with CUDA C++ on...
]]>Shelby Thomas<![CDATA[Ensuring Reliable Model Training on NVIDIA DGX Cloud]]>http://www.open-lab.net/blog/?p=967892025-03-24T18:36:43Z2025-03-10T16:26:44ZTraining AI models on massive GPU clusters presents significant challenges for model builders. Because manual intervention becomes impractical as job scale...
]]>Michelle Horton<![CDATA[Featured Data Center and Cloud Sessions at NVIDIA GTC 2025]]>http://www.open-lab.net/blog/?p=969142025-03-07T23:58:04Z2025-03-07T23:21:09ZExplore the latest innovations in data center and cloud with sessions showcasing the full capabilities of the NVIDIA accelerated computing platform.
]]>Gareth Sylvester-Bradley<![CDATA[Supercharging Live Media Workflows with NVIDIA NIM and NVIDIA Holoscan for Media]]>http://www.open-lab.net/blog/?p=966502025-03-06T19:26:34Z2025-03-05T21:11:31ZNVIDIA Holoscan for Media is an NVIDIA-accelerated platform designed for multi-vendor live production and AI. It will be showcased at GTC, highlighting NVIDIA...
]]>Sangjune Park<![CDATA[Spotlight: NAVER Place Optimizes SLM-Based Vertical Services with NVIDIA TensorRT-LLM]]>http://www.open-lab.net/blog/?p=962792025-04-23T02:32:43Z2025-02-28T17:57:49ZNAVER is a popular South Korean search engine company that offers Naver Place, a geo-based service that provides detailed information about millions of...
]]>Tom Augspurger<![CDATA[High-Performance Remote IO With NVIDIA KvikIO]]>http://www.open-lab.net/blog/?p=965822025-03-06T19:26:42Z2025-02-27T17:55:52ZWorkloads processing large amounts of data, especially those running on the cloud, will often use an object storage service (S3, Google Cloud Storage, Azure...
]]>1Charu Chaubal<![CDATA[NVIDIA AI Enterprise Adds Support for NVIDIA H200 NVL]]>http://www.open-lab.net/blog/?p=964242025-04-23T02:34:39Z2025-02-24T22:37:47ZNVIDIA AI Enterprise is the cloud-native software platform for the development and deployment of production-grade AI solutions. The latest release of the NVIDIA...
]]>John Linford<![CDATA[Spotlight: University of Tokyo Uses NVIDIA Grace Hopper for Groundbreaking Energy-Efficient Seismic Research]]>http://www.open-lab.net/blog/?p=961782025-04-23T02:44:05Z2025-02-20T16:00:00ZSupercomputers are the engines of groundbreaking discoveries. From predicting extreme weather to advancing disease research and designing safer, more efficient...
]]>Tim Lustig<![CDATA[Featured Networking Sessions at NVIDIA GTC 2025]]>http://www.open-lab.net/blog/?p=961892025-02-20T15:52:20Z2025-02-17T05:34:30ZExplore the latest advancements in AI infrastructure, acceleration, and security from March 17-21.
]]>Leigh Engel<![CDATA[Simplify System Memory Management with the Latest NVIDIA GH200 NVL2 Enterprise RA]]>http://www.open-lab.net/blog/?p=960792025-04-23T02:45:13Z2025-02-13T21:26:30ZNVIDIA Enterprise Reference Architectures (Enterprise RAs) can reduce the time and cost of deploying AI infrastructure solutions. They provide a streamlined...
]]>2Gomathy Venkata Krishnan<![CDATA[LLM Model Pruning and Knowledge Distillation with NVIDIA NeMo Framework]]>http://www.open-lab.net/blog/?p=934512025-04-23T02:53:00Z2025-02-12T17:54:52ZModel pruning and knowledge distillation are powerful cost-effective strategies for obtaining smaller language models from an initial larger sibling. ...
]]>Brad Nemire<![CDATA[Featured Energy Sessions at NVIDIA GTC 2025]]>http://www.open-lab.net/blog/?p=959952025-02-20T15:54:13Z2025-02-11T18:19:43ZLearn from energy leaders using HPC and AI to boost exploration, production, and fuel delivery, while enhancing power grid reliability and resiliency.
]]>Emily Potyraj<![CDATA[NVIDIA DGX Cloud Introduces Ready-To-Use Templates to Benchmark AI Platform Performance]]>http://www.open-lab.net/blog/?p=955582025-04-23T02:52:54Z2025-02-11T17:00:00ZIn the rapidly evolving landscape of AI systems and workloads, achieving optimal model training performance extends far beyond chip speed. It requires a...
]]>Ivan Goldwasser<![CDATA[NVIDIA Grace CPU Integrates with the Arm Software Ecosystem]]>http://www.open-lab.net/blog/?p=956382025-04-23T02:52:39Z2025-02-10T18:45:22ZThe NVIDIA Grace CPU is transforming data center design by offering a new level of power-efficient performance. Built specifically for data center scale, the...
]]>Pranav Marathe<![CDATA[Just Released: Tripy, a Python Programming Model For TensorRT]]>http://www.open-lab.net/blog/?p=959472025-02-10T17:08:43Z2025-02-10T17:08:40ZExperience high-performance inference, usability, intuitive APIs, easy debugging with eager mode, clear error messages, and more.
]]>Pradeep Ramani<![CDATA[OpenAI Triton on NVIDIA Blackwell Boosts AI Performance and Programmability]]>http://www.open-lab.net/blog/?p=953882025-04-23T02:48:06Z2025-02-05T18:00:00ZMatrix multiplication and attention mechanisms are the computational backbone of modern AI workloads. While libraries like NVIDIA cuDNN provide highly optimized...
]]>Shruthii Sathyanarayanan<![CDATA[Streamline Collaboration Across Local and Cloud Systems with NVIDIA AI Workbench]]>http://www.open-lab.net/blog/?p=957202025-04-23T02:48:08Z2025-02-05T18:00:00ZNVIDIA AI Workbench is a free development environment manager to develop, customize, and prototype AI applications on your GPUs. AI Workbench provides a...
]]>Taylor Allison<![CDATA[Accelerating AI Storage by up to 48% with NVIDIA Spectrum-X Networking Platform and Partners]]>http://www.open-lab.net/blog/?p=954322025-04-23T02:48:15Z2025-02-04T15:00:00ZAI factories rely on more than just compute fabrics. While the East-West network connecting the GPUs is critical to AI application performance, the storage...
]]>Matthew Nicely<![CDATA[Just Released: CUTLASS 3.8]]>http://www.open-lab.net/blog/?p=957162025-02-06T19:33:50Z2025-02-03T23:54:16ZProvides support for the NVIDIA Blackwell SM100 architecture. CUTLASS is a collection of CUDA C++ templates and abstractions for implementing high-performance...
]]>Sylvain Jeaugey<![CDATA[New Scaling Algorithm and Initialization with NVIDIA Collective Communications Library 2.23]]>http://www.open-lab.net/blog/?p=954122025-04-23T02:48:19Z2025-01-31T22:47:37ZThe NVIDIA Collective Communications Library (NCCL) implements multi-GPU and multinode communication primitives optimized for NVIDIA GPUs and networking. NCCL...
]]>Matthew Nicely<![CDATA[Just Released: NVIDIA cuDNN 9.7]]>http://www.open-lab.net/blog/?p=956702025-02-06T19:33:52Z2025-01-31T21:23:42ZBringing support for NVIDIA Blackwell architecture across data center and GeForce products, NVIDIA cuDNN 9.7 delivers speedups of up to 84% for FP8 Flash...
]]>Zachary Bourque<![CDATA[Dynamic Loading in the CUDA Runtime]]>http://www.open-lab.net/blog/?p=939582025-04-23T14:57:41Z2025-01-31T20:03:32ZHistorically, the GPU device code is compiled alongside the application with offline tools such as nvcc. In this case, the GPU device code is managed internally...
]]>Prem Sagar Gali<![CDATA[Mastering the cudf.pandas Profiler for GPU Acceleration]]>http://www.open-lab.net/blog/?p=953512025-04-23T15:00:07Z2025-01-30T17:00:00ZIn the world of Python data science, pandas has long reigned as the go-to library for intuitive data manipulation and analysis. However, as data volumes grow,...
]]>Nick Comly<![CDATA[Optimize AI Inference Performance with NVIDIA Full-Stack Solutions]]>http://www.open-lab.net/blog/?p=953102025-04-23T15:02:06Z2025-01-24T16:00:00ZThe explosion of AI-driven applications has placed unprecedented demands on both developers, who must balance delivering cutting-edge performance with managing...
]]>Martin Cimmino<![CDATA[Continued Pretraining of State-of-the-Art LLMs for Sovereign AI and Regulated Industries with iGenius and NVIDIA DGX Cloud]]>http://www.open-lab.net/blog/?p=950122025-01-23T19:54:22Z2025-01-16T12:00:00ZIn recent years, large language models (LLMs) have achieved extraordinary progress in areas such as reasoning, code generation, machine translation, and...
]]>Harry Petty<![CDATA[Transforming Data Centers into AI Factories for the 5th Industrial Revolution]]>http://www.open-lab.net/blog/?p=948792025-01-23T19:54:25Z2025-01-14T19:58:01ZIn a recent DC Anti-Conference Live presentation, Wade Vinson, chief data center distinguished engineer at NVIDIA, shared insights based upon work by NVIDIA...
]]>Dror Goldenberg<![CDATA[Powering the Next Wave of DPU-Accelerated Cloud Infrastructures with NVIDIA DOCA Platform Framework]]>http://www.open-lab.net/blog/?p=948892025-01-23T19:54:26Z2025-01-13T17:30:25ZOrganizations are increasingly turning to accelerated computing to meet the demands of generative AI, 5G telecommunications, and sovereign clouds. NVIDIA has...
]]>Brad Nemire<![CDATA[NVIDIA Project DIGITS, A Grace Blackwell AI Supercomputer On Your Desk]]>http://www.open-lab.net/blog/?p=947652025-01-23T19:54:30Z2025-01-09T18:19:00ZPowered by the new GB10 Grace Blackwell Superchip, Project DIGITS can tackle large generative AI models of up to 200B parameters.
]]>5Charu Chaubal<![CDATA[New Whitepaper: NVIDIA AI Enterprise Security]]>http://www.open-lab.net/blog/?p=944752024-12-20T20:56:54Z2024-12-20T00:41:33ZThis white paper details our commitment to securing the NVIDIA AI Enterprise software stack. It outlines the processes and measures NVIDIA takes to ensure...
]]>Dmitriy Tishechkin<![CDATA[Spotlight: Stone Ridge Technology Accelerates Reservoir Simulation Workflows with NVIDIA PhysicsNeMo on AWS]]>http://www.open-lab.net/blog/?p=943232025-03-18T18:07:35Z2024-12-19T18:00:00ZRisk and uncertainty inherent in energy exploration include unknown geological parameters, variations in fluid and rock properties, boundary conditions, and...
]]>Emeka Obiodu<![CDATA[Five Takeaways from NVIDIA 6G Developer Day 2024]]>http://www.open-lab.net/blog/?p=938402024-12-16T21:28:03Z2024-12-18T20:30:00ZNVIDIA 6G Developer Day 2024 brought together members of the 6G research and development community to share insights and learn new ways of engaging with NVIDIA...
]]>Michelle Horton<![CDATA[Top Posts of 2024 Highlight NVIDIA NIM, LLM Breakthroughs, and Data Science Optimization]]>http://www.open-lab.net/blog/?p=935662024-12-16T18:34:16Z2024-12-16T18:34:14Z2024 was another landmark year for developers, researchers, and innovators working with NVIDIA technologies. From groundbreaking developments in AI inference to...
]]>0Sophia Schuur<![CDATA[An Introduction to NVIDIA Air]]>http://www.open-lab.net/blog/?p=927492024-12-23T02:15:20Z2024-12-12T22:18:40ZThe advent of AI has introduced a new type of data center, the AI factory, purpose-built from the ground up to handle AI workloads. AI workloads can...
]]>Alberto Carpentieri<![CDATA[Advancing Solar Irradiance Prediction with NVIDIA Earth-2]]>http://www.open-lab.net/blog/?p=935962025-01-07T20:18:50Z2024-12-12T17:57:29ZAs global electricity demand continues to rise, traditional sources of energy are increasingly unsustainable. Energy providers are facing pressure to reduce...
]]>Tim Lustig<![CDATA[Integration of NVIDIA BlueField DPUs with WEKA Client Boosts AI Workload Efficiency]]>http://www.open-lab.net/blog/?p=935782024-12-12T19:35:12Z2024-12-12T17:45:46ZWEKA, a pioneer in scalable software-defined data platforms, and NVIDIA are collaborating to unite WEKA's state-of-the-art data platform solutions with powerful...
]]>Leigh Engel<![CDATA[Deploying NVIDIA H200 NVL at Scale with New Enterprise Reference Architecture]]>http://www.open-lab.net/blog/?p=936862024-12-12T19:35:14Z2024-12-12T00:40:45ZLast month at the Supercomputing 2024 conference, NVIDIA announced the availability of NVIDIA H200 NVL, the latest NVIDIA Hopper platform. Optimized for...
]]>Amr Elmeleegy<![CDATA[Spotlight: Perplexity AI Serves 400 Million Search Queries a Month Using NVIDIA Inference Stack]]>http://www.open-lab.net/blog/?p=933962025-03-18T18:26:38Z2024-12-05T17:58:43ZThe demand for AI-enabled services continues to grow rapidly, placing increasing pressure on IT and infrastructure teams. These teams are tasked with...
]]>Carl (Izzy) Putterman<![CDATA[TensorRT-LLM Speculative Decoding Boosts Inference Throughput by up to 3.6x]]>http://www.open-lab.net/blog/?p=928472025-01-11T17:32:51Z2024-12-02T23:09:43ZNVIDIA TensorRT-LLM support for speculative decoding now provides over 3x the speedup in total token throughput. TensorRT-LLM is an open-source library that...
]]>3Amr Elmeleegy<![CDATA[NVIDIA TensorRT-LLM Multiblock Attention Boosts Throughput by More Than 3x for Long Sequence Lengths on NVIDIA HGX H200]]>http://www.open-lab.net/blog/?p=925912024-12-12T19:47:20Z2024-11-22T00:53:18ZGenerative AI models are advancing rapidly. Every generation of models comes with a larger number of parameters and longer context windows. The Llama 2 series...
]]>1Bethann Noble<![CDATA[Deploying Fine-Tuned AI Models with NVIDIA NIM]]>http://www.open-lab.net/blog/?p=916962024-12-17T00:07:21Z2024-11-21T22:04:57ZFor organizations adapting AI foundation models with domain-specific data, the ability to rapidly create and deploy fine-tuned models is key to efficiently...
]]>Ian Pegler<![CDATA[Advancing Ansys Workloads with NVIDIA Grace and NVIDIA Grace Hopper]]>http://www.open-lab.net/blog/?p=924962024-12-12T19:38:41Z2024-11-21T17:30:00ZAccelerated computing is enabling giant leaps in performance and energy efficiency compared to traditional CPU computing. Delivering these advancements requires...
]]>Phoebe Lee<![CDATA[Powering AI-Augmented Workloads with NVIDIA and Windows 365]]>http://www.open-lab.net/blog/?p=917092024-12-12T19:38:43Z2024-11-21T17:26:44ZWe are entering a new era of AI-powered digital workflow, where Windows 365 Cloud PCs are dynamic platforms that host AI technologies and reshape traditional...
]]>Ashraf Eassa<![CDATA[Llama 3.2 Full-Stack Optimizations Unlock High Performance on NVIDIA GPUs]]>http://www.open-lab.net/blog/?p=901422024-11-22T23:11:53Z2024-11-19T16:00:00ZMeta recently released its Llama 3.2 series of vision language models (VLMs), which come in 11B parameter and 90B parameter variants. These models are...
]]>Szymon Karpi��ski<![CDATA[Fusing Epilog Operations with Matrix Multiplication Using nvmath-python]]>http://www.open-lab.net/blog/?p=920982025-04-01T18:19:57Z2024-11-18T18:30:00Znvmath-python (Beta) is an open-source Python library, providing Python programmers with access to high-performance mathematical operations from NVIDIA CUDA-X...
]]>1Bethann Noble<![CDATA[NVIDIA NIM 1.4 Ready to Deploy with 2.4x Faster Inference]]>http://www.open-lab.net/blog/?p=921722024-11-20T04:40:21Z2024-11-16T00:41:54ZThe demand for ready-to-deploy high-performance inference is growing as generative AI reshapes industries. NVIDIA NIM provides production-ready microservice...
]]>Amr Elmeleegy<![CDATA[Streamlining AI Inference Performance and Deployment with NVIDIA TensorRT-LLM Chunked Prefill]]>http://www.open-lab.net/blog/?p=920522024-11-15T17:59:38Z2024-11-15T17:59:35ZIn this blog post, we take a closer look at chunked prefill, a feature of NVIDIA TensorRT-LLM that increases GPU utilization and simplifies the deployment...
]]>Rob Nertney<![CDATA[Exploring the Case of Super Protocol with Self-Sovereign AI and NVIDIA Confidential Computing]]>http://www.open-lab.net/blog/?p=912162025-02-04T19:53:37Z2024-11-14T22:01:38ZConfidential and self-sovereign AI is a new approach to AI development, training, and inference where the user��s data is decentralized, private, and...
]]>25David Wills<![CDATA[NVIDIA DOCA 2.9 Enhances AI and Cloud Computing Infrastructure with New Performance and Security Features]]>http://www.open-lab.net/blog/?p=918292025-01-15T17:31:35Z2024-11-14T15:00:00ZNVIDIA DOCA enhances the capabilities of NVIDIA networking platforms by providing a comprehensive software framework for developers to leverage hardware...
]]>Sukru Burc Eryilmaz<![CDATA[NVIDIA Blackwell Doubles LLM Training Performance in MLPerf Training v4.1]]>http://www.open-lab.net/blog/?p=918072024-11-14T17:10:37Z2024-11-13T16:00:00ZAs models grow larger and are trained on more data, they become more capable, making them more useful. To train these models quickly, more performance,...
]]>Phoebe Lee<![CDATA[Spotlight: Accelerating into AI with VDI]]>http://www.open-lab.net/blog/?p=917042024-11-14T19:06:48Z2024-11-12T19:56:22ZThe key to starting in AI may be right under your nose. It��s all about seeing the potential in the tools and resources that you already have. Adopt a crawl,...
]]>Chelsea Gomatam<![CDATA[Discover New Biological Insights with Accelerated Pangenome Alignment in NVIDIA Parabricks]]>http://www.open-lab.net/blog/?p=912202024-11-14T17:10:48Z2024-11-04T17:39:18ZNVIDIA Parabricks is a scalable genomics analysis software suite that solves omics challenges with accelerated computing and deep learning to unlock new...
]]>1Tyler Whitehouse<![CDATA[Frictionless Collaboration and Rapid Prototyping in Hybrid Environments with NVIDIA AI Workbench]]>http://www.open-lab.net/blog/?p=912342024-11-14T17:10:49Z2024-11-04T17:30:00ZNVIDIA AI Workbench is a free development environment manager that streamlines data science, AI, and machine learning (ML) projects on systems of choice. The...
]]>Amr Elmeleegy<![CDATA[NVIDIA GH200 Superchip Accelerates Inference by 2x in Multiturn Interactions with Llama Models]]>http://www.open-lab.net/blog/?p=908972024-11-06T02:24:56Z2024-10-28T15:00:00ZDeploying large language models (LLMs) in production environments often requires making hard trade-offs between enhancing user interactivity and increasing...
]]>1Michael Yh Wang<![CDATA[Bridging the CUDA C++ Ecosystem and Python Developers with Numbast]]>http://www.open-lab.net/blog/?p=900862024-10-31T16:26:15Z2024-10-24T16:30:00ZBy enabling CUDA kernels to be written in Python similar to how they can be implemented within C++, Numba bridges the gap between the Python ecosystem and the...
]]>Jihyun Yang<![CDATA[Spotlight: Accelerating HPC in Energy with AWS Energy HPC Orchestrator and NVIDIA Energy Samples]]>http://www.open-lab.net/blog/?p=903672024-10-31T16:21:17Z2024-10-24T16:00:00ZThe energy industry��s digital transformation requires a substantial increase in computational demands for key HPC workloads and applications. This trend is...
]]>1Max Bazalii<![CDATA[Building AI Agents to Automate Software Test Case Creation]]>http://www.open-lab.net/blog/?p=903872025-02-17T05:12:03Z2024-10-24T16:00:00ZIn software development, testing is crucial for ensuring the quality and reliability of the final product. However, creating test plans and specifications can...
]]>1Michelle Horton<![CDATA[Maximizing Energy and Power Efficiency in Applications with NVIDIA GPUs]]>http://www.open-lab.net/blog/?p=901002024-10-30T18:55:08Z2024-10-16T16:50:10ZAs the demand for high-performance computing (HPC) and AI applications grows, so does the importance of energy efficiency. NVIDIA Principal Developer Technology...
]]>Charlie Huang<![CDATA[Scale High-Performance AI Inference with Google Kubernetes Engine and NVIDIA NIM]]>http://www.open-lab.net/blog/?p=901982024-10-30T18:57:03Z2024-10-16T16:30:00ZThe rapid evolution of AI models has driven the need for more efficient and scalable inferencing solutions. As organizations strive to harness the power of AI,...
]]>Anurag Gudahttps://www.linkedin.com/in/anuragguda/<![CDATA[Simplify AI Application Development with NVIDIA Cloud Native Stack]]>http://www.open-lab.net/blog/?p=899702024-10-29T21:00:38Z2024-10-16T16:00:00ZIn the rapidly evolving landscape of AI and data science, the demand for scalable, efficient, and flexible infrastructure has never been higher. Traditional...
]]>Matan Raz<![CDATA[Future-Proof Your Networking Stack with NVIDIA DOCA-OFED]]>http://www.open-lab.net/blog/?p=902992024-10-17T18:18:56Z2024-10-15T20:28:17ZThe NVIDIA DOCA software platform unlocks the potential of the NVIDIA BlueField networking platform and provides all needed host drivers for NVIDIA BlueField...
]]>Rob Davis<![CDATA[Supermicro Launches NVIDIA BlueField-Powered JBOF to Optimize AI Storage]]>http://www.open-lab.net/blog/?p=902422024-10-17T18:18:58Z2024-10-15T16:35:00ZThe growth of AI is driving exponential growth in computing power and a doubling of networking speeds every few years. Less well-known is that it��s also...
]]>1Itay Ozery<![CDATA[Powering Next-Generation AI Networking with NVIDIA SuperNICs]]>http://www.open-lab.net/blog/?p=901762024-11-01T14:27:00Z2024-10-15T16:30:00ZIn the era of generative AI, accelerated networking is essential to build high-performance computing fabrics for massively distributed AI workloads. NVIDIA...
]]>Amr Elmeleegy<![CDATA[NVIDIA Contributes NVIDIA GB200 NVL72 Designs to Open Compute Project]]>http://www.open-lab.net/blog/?p=901822024-11-26T20:56:44Z2024-10-15T16:30:00ZDuring the 2024 OCP Global Summit, NVIDIA announced that it has contributed the NVIDIA GB200 NVL72 rack and compute and switch tray liquid cooled designs to the...
]]>2Nathan Patterson<![CDATA[Learning Fluid Flow with AI-Enabled Virtual Wind Tunnels]]>http://www.open-lab.net/blog/?p=878612024-11-25T17:28:18Z2024-10-14T18:39:40ZThere��s never enough time to do everything, even in engineering education. Employers want engineers capable of wielding simulation tools to expedite iterative...
]]>Michelle Horton<![CDATA[AI Research Revs Up EV Charging for Large-Scale Optimization, Speed, and Savings]]>http://www.open-lab.net/blog/?p=901192024-10-21T16:29:21Z2024-10-14T15:54:39ZElectric vehicle (EV) charging is getting a jolt with an innovative new AI algorithm that boosts efficiency, reduces cost, and keeps the grid from...
]]>Alexandra Junk<![CDATA[Transforming CFD?Simulations with ML Using NVIDIA PhysicsNeMo]]>http://www.open-lab.net/blog/?p=897582025-03-19T17:38:54Z2024-10-11T18:36:14ZSimulations play a critical role in advancing science and engineering, especially in the vast field of fluid dynamics. However, high-fidelity fluid simulations...
]]>Ivan Goldwasser<![CDATA[NVIDIA Grace CPU Delivers World-Class Data Center Performance and Breakthrough Energy Efficiency]]>http://www.open-lab.net/blog/?p=900872024-11-06T02:26:22Z2024-10-09T19:00:00ZNVIDIA designed the NVIDIA Grace CPU to be a new kind of high-performance, data center CPU��one built to deliver breakthrough energy efficiency and optimized...
]]>Nick Comly<![CDATA[Boosting Llama 3.1 405B Throughput by Another 1.5x on NVIDIA H200 Tensor Core GPUs and NVLink Switch]]>http://www.open-lab.net/blog/?p=900402024-11-22T23:12:12Z2024-10-09T15:00:00ZThe continued growth of LLMs capability, fueled by increasing parameter counts and support for longer contexts, has led to their usage in a wide variety of...
]]>1Soma Velayutham<![CDATA[Bringing AI-RAN to a Telco Near You]]>http://www.open-lab.net/blog/?p=899202024-11-12T04:34:20Z2024-10-08T14:00:00ZInferencing for generative AI and AI agents will drive the need for AI compute infrastructure to be distributed from edge to central clouds. IDC predicts that...
]]>William Raveane<![CDATA[Optimizing Microsoft Bing Visual Search with NVIDIA Accelerated Libraries]]>http://www.open-lab.net/blog/?p=898312024-11-14T16:23:01Z2024-10-07T21:11:06ZMicrosoft Bing Visual Search enables people around the world to find content using photographs as queries. The heart of this capability is Microsoft's TuringMM...
]]>Tanya Lenz<![CDATA[Webinar: Accelerating Python with GPUs]]>http://www.open-lab.net/blog/?p=896592024-10-17T19:07:02Z2024-10-02T18:00:00ZJoin us on October 9 to learn how your applications can benefit from NVIDIA CUDA Python software initiatives.
]]>Candice Mudrick<![CDATA[Revolutionizing Cloud Gaming and Graphics Rendering with NVIDIA GDN]]>http://www.open-lab.net/blog/?p=895212024-10-17T19:07:06Z2024-10-01T17:00:00ZGaming has always pushed the boundaries of graphics hardware. The most popular games typically required robust GPU, CPU, and RAM resources on a user��s PC or...
]]>Shiva Krishna Merla<![CDATA[Managing AI Inference Pipelines on Kubernetes with NVIDIA NIM Operator]]>http://www.open-lab.net/blog/?p=895412024-10-17T19:07:07Z2024-09-30T21:50:06ZDevelopers have shown a lot of excitement for NVIDIA NIM microservices, a set of easy-to-use cloud-native microservices that shortens the time-to-market and...
]]>4Nick Comly<![CDATA[Low Latency Inference Chapter 2: Blackwell is Coming. NVIDIA GH200 NVL32 with NVLink Switch Gives Signs of Big Leap in Time to First Token Performance]]>http://www.open-lab.net/blog/?p=889382024-11-29T21:06:06Z2024-09-26T21:44:00ZMany of the most exciting applications of large language models (LLMs), such as interactive speech bots, coding co-pilots, and search, need to begin responding...