Deep dive

Oct 10, 2024
Advanced RAG Techniques for Telco O-RAN Specifications Using NVIDIA NIM Microservices
Mobile communication standards play a crucial role in the telecommunications ecosystem by harmonizing technology protocols to facilitate interoperability...
8 MIN READ

Oct 09, 2024
NVIDIA Grace CPU Delivers World-Class Data Center Performance and Breakthrough Energy Efficiency
NVIDIA designed the NVIDIA Grace CPU to be a new kind of high-performance, data center CPU—one built to deliver breakthrough energy efficiency and optimized...
8 MIN READ

Oct 09, 2024
Boosting Llama 3.1 405B Throughput by Another 1.5x on NVIDIA H200 Tensor Core GPUs and NVLink Switch
The continued growth of LLMs capability, fueled by increasing parameter counts and support for longer contexts, has led to their usage in a wide variety of...
8 MIN READ

Oct 08, 2024
Rapidly Triage Container Security with the Vulnerability Analysis NVIDIA NIM Agent Blueprint
Addressing software security issues is becoming more challenging as the number of vulnerabilities reported in the CVE database continues to grow at an...
2 MIN READ

Oct 08, 2024
Accelerate Large Linear Programming Problems with NVIDIA cuOpt
The evolution of linear programming (LP) solvers has been marked by significant milestones over the past century, from Simplex to the interior point method...
10 MIN READ

Oct 08, 2024
Bringing AI-RAN to a Telco Near You
Inferencing for generative AI and AI agents will drive the need for AI compute infrastructure to be distributed from edge to central clouds. IDC predicts that...
14 MIN READ

Oct 07, 2024
Accelerating Reality Capture Workflows with AI and NVIDIA RTX GPUs
Reality capture creates highly accurate, detailed, and immersive digital representations of environments. Innovations in site scanning and accelerated data...
10 MIN READ

Oct 07, 2024
Optimizing Microsoft Bing Visual Search with NVIDIA Accelerated Libraries
Microsoft Bing Visual Search enables people around the world to find content using photographs as queries. The heart of this capability is Microsoft's TuringMM...
11 MIN READ

Oct 02, 2024
Building LLM-Powered Production Systems with NVIDIA NIM and Outerbounds
With the rapid expansion of language models over the past 18 months, hundreds of variants are now available. These include large language models (LLMs), small...
15 MIN READ

Oct 01, 2024
Revolutionizing Cloud Gaming and Graphics Rendering with NVIDIA GDN
Gaming has always pushed the boundaries of graphics hardware. The most popular games typically required robust GPU, CPU, and RAM resources on a user’s PC or...
7 MIN READ

Oct 01, 2024
Evolving AI-Powered Game Development with Retrieval-Augmented Generation
Game development is a complex and resource-intensive process, particularly when using advanced tools like Unreal Engine. Developers find themselves navigating...
6 MIN READ

Sep 30, 2024
Managing AI Inference Pipelines on Kubernetes with NVIDIA NIM Operator
Developers have shown a lot of excitement for NVIDIA NIM microservices, a set of easy-to-use cloud-native microservices that shortens the time-to-market and...
5 MIN READ

Sep 30, 2024
Advancing Quantum Algorithm Design with GPTs
AI techniques like large language models (LLMs) are rapidly transforming many scientific disciplines. Quantum computing is no exception. A collaboration between...
8 MIN READ

Sep 26, 2024
Low Latency Inference Chapter 2: Blackwell is Coming. NVIDIA GH200 NVL32 with NVLink Switch Gives Signs of Big Leap in Time to First Token Performance
Many of the most exciting applications of large language models (LLMs), such as interactive speech bots, coding co-pilots, and search, need to begin responding...
8 MIN READ

Sep 26, 2024
Harnessing Data with AI to Boost Zero Trust Cyber Defense
Modern cyber threats have grown increasingly sophisticated, posing significant risks to federal agencies and critical infrastructure. According to Deloitte,...
8 MIN READ

Sep 25, 2024
Deploying Accelerated Llama 3.2 from the Edge to the Cloud
Expanding the open-source Meta Llama collection of models, the Llama 3.2 collection includes vision language models (VLMs), small language models (SLMs), and an...
6 MIN READ