Generative AI

Apr 16, 2025
Announcing ComputeEval, an Open-Source Framework for Evaluating LLMs on CUDA
Large language models (LLMs) are revolutionizing how developers code and how they learn to code. For seasoned or junior developers alike, today’s...
4 MIN READ

Apr 16, 2025
Efficient Federated Learning in the Era of LLMs with Message Quantization and Streaming
Federated learning (FL) has emerged as a promising approach for training machine learning models across distributed data sources while preserving data privacy....
8 MIN READ

Apr 15, 2025
NVIDIA Llama Nemotron Ultra Open Model Delivers Groundbreaking Reasoning Accuracy
AI is no longer just about generating text or images—it’s about deep reasoning, detailed problem-solving, and powerful adaptability for real-world...
7 MIN READ

Apr 15, 2025
Event: Data Filtering Challenge for Training Edge Language Models
You’re invited to join the challenge. Develop and apply innovative data filtering techniques to curate datasets that enhance edge LM performance.
1 MIN READ

Apr 10, 2025
Just Released: NVIDIA Llama Nemotron Ultra as NVIDIA NIM
Try NVIDIA Llama Nemotron Ultra as an NVIDIA NIM microservice. At only 253B total parameters, it offers reasoning performance that meets or beats top open...
1 MIN READ

Apr 10, 2025
Curating Biological Findings from Scientific Literature with NVIDIA NIM
Scientific papers are highly heterogeneous, often employing diverse terminologies for the same entities, using varied methodologies to study biological...
7 MIN READ

Apr 09, 2025
Just Released: NVIDIA AI Workbench 2025.03.10
NVIDIA AI Workbench 2025.03.10 features streamlined onboarding and enhanced UX for multicontainer projects.
1 MIN READ

Apr 08, 2025
Build Enterprise AI Agents with Advanced Open NVIDIA Llama Nemotron Reasoning Models
This updated post was originally published on March 18, 2025. Organizations are embracing AI agents to enhance productivity and streamline operations. To...
12 MIN READ

Apr 07, 2025
Evaluating and Enhancing RAG Pipeline Performance Using Synthetic Data?
As large language models (LLM) gain popularity in various question-answering systems, retrieval-augmented generation (RAG) pipelines have also become a focal...
8 MIN READ

Apr 07, 2025
Startups Use AI to Deliver Better Maternal and Newborn Care
Nearly 300,000 women across the globe die each year due to complications arising from pregnancy or childbirth. The number of stillborns and babies that die...
4 MIN READ

Apr 07, 2025
Event: HP & NVIDIA Developer Challenge
Join the hackathon to build open-source AI solutions, optimize models, enhance workflows, connect with peers, and win prizes.
1 MIN READ

Apr 05, 2025
NVIDIA Accelerates Inference on Meta Llama 4 Scout and Maverick
The newest generation of the popular Llama AI models is here with Llama 4 Scout and Llama 4 Maverick. Accelerated by NVIDIA open-source software, they can...
4 MIN READ

Apr 02, 2025
NVIDIA Blackwell Delivers Massive Performance Leaps in MLPerf Inference v5.0
The compute demands for large language model (LLM) inference are growing rapidly, fueled by the combination of growing model sizes, real-time latency...
9 MIN READ

Apr 02, 2025
LLM Benchmarking: Fundamental Concepts
The past few years have witnessed the rise in popularity of generative AI and large language models (LLMs), as part of a broad AI revolution. As LLM-based...
14 MIN READ

Mar 26, 2025
Deploying the NVIDIA AI Blueprint for Cost-Efficient LLM Routing
Since the release of ChatGPT in November 2022, the capabilities of large language models (LLMs) have surged, and the number of available models has grown...
7 MIN READ

Mar 26, 2025
Spotlight: Tomorrow.io?Transforms Global Weather Resilience with NVIDIA AI
From hyperlocal forecasts that guide daily operations to planet-scale models illuminating new climate insights, the world is entering a new frontier in weather...
8 MIN READ