LLMs

Apr 29, 2025

Structuring Applications to Secure the KV Cache

When interacting with transformer-based models like large language models (LLMs) and vision-language models (VLMs), the structure of the input shapes the...

11 MIN READ

Apr 28, 2025

Advancing Cybersecurity Operations with Agentic AI Systems

The age of passive AI is over. A new era is beginning, where AI doesn’t just respond—it thinks, plans, and acts. The rapid advancement of large language...

15 MIN READ

Apr 24, 2025

Benchmarking Agentic LLM and VLM Reasoning for Gaming with NVIDIA NIM

This is the first post in the LLM Benchmarking series, which shows how to use GenAI-Perf to benchmark the Meta Llama 3 model when deployed with NVIDIA NIM....

7 MIN READ

Apr 23, 2025

Spotlight: Qodo Innovates Efficient Code Search with NVIDIA DGX

Large language models (LLMs) have enabled AI tools that help you write more code faster, but as we ask these tools to take on more and more complex tasks, there...

8 MIN READ

Apr 23, 2025

Announcing NVIDIA Secure AI General Availability

As many enterprises move to running AI training or inference on their data, the data and the code need to be protected, especially for large language models...

3 MIN READ

Apr 23, 2025

Enhance Your AI Agent with Data Flywheels Using NVIDIA NeMo Microservices

Enterprise data is constantly changing. This presents significant challenges for maintaining AI system accuracy over time. As organizations increasingly rely on...

12 MIN READ

Apr 16, 2025

Announcing ComputeEval, an Open-Source Framework for Evaluating LLMs on CUDA

Large language models (LLMs) are revolutionizing how developers code and how they learn to code. For seasoned and junior developers alike, today’s...

4 MIN READ

Apr 16, 2025

Developing an AI-Powered Tool for Automatic Citation Validation Using NVIDIA NIM

The accuracy of citations is crucial for maintaining the integrity of both academic and AI-generated content. When citations are inaccurate or wrong, they can...

9 MIN READ

Apr 10, 2025

Curating Biological Findings from Scientific Literature with NVIDIA NIM

Scientific papers are highly heterogeneous, often employing diverse terminologies for the same entities, using varied methodologies to study biological...

7 MIN READ

Apr 09, 2025

Prevent LLM Hallucinations with the Cleanlab Trustworthy Language Model in NVIDIA NeMo Guardrails

As more enterprises integrate LLMs into their applications, they face a critical challenge: LLMs can generate plausible but incorrect responses, known as...

9 MIN READ

Apr 08, 2025

Build Enterprise AI Agents with Advanced Open NVIDIA Llama Nemotron Reasoning Models

This updated post was originally published on March 18, 2025. Organizations are embracing AI agents to enhance productivity and streamline operations. To...

12 MIN READ

Apr 02, 2025

NVIDIA Blackwell Delivers Massive Performance Leaps in MLPerf Inference v5.0

The compute demands for large language model (LLM) inference are growing rapidly, fueled by the combination of growing model sizes, real-time latency...

10 MIN READ

Apr 02, 2025

LLM Benchmarking: Fundamental Concepts

The past few years have witnessed the rise in popularity of generative AI and large language models (LLMs), as part of a broad AI revolution. As LLM-based...

14 MIN READ

Mar 26, 2025

Deploying the NVIDIA AI Blueprint for Cost-Efficient LLM Routing

Since the release of ChatGPT in November 2022, the capabilities of large language models (LLMs) have surged, and the number of available models has grown...

7 MIN READ

Mar 20, 2025

Boost Llama Model Performance on Microsoft Azure AI Foundry with NVIDIA TensorRT-LLM

Microsoft, in collaboration with NVIDIA, announced transformative performance improvements for the Meta Llama family of models on its Azure AI Foundry platform....

4 MIN READ

Mar 19, 2025

NVIDIA Virtual GPU 18.0 Enables VDI for AI on Every Virtualized Platform

NVIDIA Virtual GPU (vGPU) technology unlocks AI capabilities within Virtual Desktop Infrastructure (VDI), making it more powerful and versatile than ever...

4 MIN READ