NIM

Apr 29, 2025

Spotlight: Personal AI Brings AI Receptionists to Small Business Owners with NVIDIA Riva

It's 10 p.m. on a Tuesday when the phone rings at the Sapochnick Law Firm, a specialized law practice in San Diego, California. The caller, a client of the...

6 MIN READ

Apr 29, 2025

NVIDIA NIM Operator 2.0 Boosts AI Deployment with NVIDIA NeMo Microservices Support

The first release of NVIDIA NIM Operator simplified the deployment and lifecycle management of inference pipelines for NVIDIA NIM microservices, reducing the...

5 MIN READ

Apr 24, 2025

Benchmarking Agentic LLM and VLM Reasoning for Gaming with NVIDIA NIM

This is the first post in the LLM Benchmarking series, which shows how to use GenAI-Perf to benchmark the Meta Llama 3 model when deployed with NVIDIA NIM.?...

7 MIN READ

Apr 23, 2025

Enhance Your AI Agent with Data Flywheels Using NVIDIA NeMo Microservices

Enterprise data is constantly changing. This presents significant challenges for maintaining AI system accuracy over time. As organizations increasingly rely on...

12 MIN READ

Apr 16, 2025

Developing an AI-Powered Tool for Automatic Citation Validation Using NVIDIA NIM

The accuracy of citations is crucial for maintaining the integrity of both academic and AI-generated content. When citations are inaccurate or wrong, they can...

9 MIN READ

Apr 15, 2025

NVIDIA Llama Nemotron Ultra Open Model Delivers Groundbreaking Reasoning Accuracy

AI is no longer just about generating text or images—it’s about deep reasoning, detailed problem-solving, and powerful adaptability for real-world...

7 MIN READ

Apr 10, 2025

Curating Biological Findings from Scientific Literature with NVIDIA NIM

Scientific papers are highly heterogeneous, often employing diverse terminologies for the same entities, using varied methodologies to study biological...

7 MIN READ

Apr 09, 2025

Delivering NVIDIA Accelerated Computing for Enterprise AI Workloads with Rafay

The worldwide adoption of generative AI has driven massive demand for accelerated compute hardware globally. In enterprises, this has accelerated the deployment...

8 MIN READ

Apr 09, 2025

Prevent LLM Hallucinations with the Cleanlab Trustworthy Language Model in NVIDIA NeMo Guardrails

As more enterprises integrate LLMs into their applications, they face a critical challenge: LLMs can generate plausible but incorrect responses, known as...

9 MIN READ

Apr 02, 2025

LLM Benchmarking: Fundamental Concepts

The past few years have witnessed the rise in popularity of generative AI and large language models (LLMs), as part of a broad AI revolution. As LLM-based...

14 MIN READ

Mar 26, 2025

Deploying the NVIDIA AI Blueprint for Cost-Efficient LLM Routing

Since the release of ChatGPT in November 2022, the capabilities of large language models (LLMs) have surged, and the number of available models has grown...

7 MIN READ

Mar 25, 2025

Accelerating the Future of Transportation with SES AI's NVIDIA-Powered Innovation for Electric Vehicles

Electric vehicles (EVs) are transforming transportation, but challenges such as cost, longevity, and range remain barriers to widespread adoption. At the heart...

6 MIN READ

Mar 25, 2025

Kickstart Your AI Journey on RTX AI PCs and Workstations with NVIDIA NIM Microservices

With emerging use cases such as digital humans, agents, podcasts, images, and video generation, generative AI is changing the way we interact with PCs. This...

7 MIN READ

Mar 19, 2025

Guiding Generative Molecular Design with Experimental Feedback Using Oracles

Generative chemistry with AI has the potential to revolutionize how scientists approach drug discovery and development, health, and materials science and...

9 MIN READ

Mar 18, 2025

Seamlessly Scale AI Across Cloud Environments with NVIDIA DGX Cloud Serverless Inference

NVIDIA DGX Cloud Serverless Inference is an auto-scaling AI inference solution that enables application deployment with speed and reliability. Powered by NVIDIA...

9 MIN READ

Mar 18, 2025

Introducing NVIDIA Dynamo, A Low-Latency Distributed Inference Framework for Scaling Reasoning AI Models

NVIDIA announced the release of NVIDIA Dynamo today at GTC 2025. NVIDIA Dynamo is a high-throughput, low-latency open-source inference serving framework for...

14 MIN READ