Microservices

Jan 06, 2025
Build a Video Search and Summarization Agent with NVIDIA AI Blueprint
This post was originally published July 29, 2024 but has been extensively revised with NVIDIA AI Blueprint information. Traditional video analytics applications...
11 MIN READ

Nov 19, 2024
Building a Generative AI OpenUSD App for Brand-Accurate Marketing Visuals
Today, brands and their creative agencies are under huge strain to create and deliver high-quality, accurate product images at scale, from campaign key visuals...
7 MIN READ

Oct 28, 2024
Creating RAG-Based Question-and-Answer LLM Workflows at NVIDIA
The rapid development of solutions using retrieval augmented generation (RAG) for question-and-answer LLM workflows has led to new types of system...
11 MIN READ

Sep 10, 2024
Streamlining Data Processing for Domain Adaptive Pretraining with NVIDIA NeMo Curator
Domain-adaptive pretraining (DAPT) of large language models (LLMs) is an important step towards building domain-specific models. These models demonstrate...
16 MIN READ

Jul 31, 2024
Curating Custom Datasets for LLM Parameter-Efficient Fine-Tuning with NVIDIA NeMo Curator
In a recent post, we discussed how to use NVIDIA NeMo Curator to curate custom datasets for pretraining or continuous training use cases of large language...
11 MIN READ

Jun 24, 2024
Real-Time Vision AI From Digital Twins to Cloud-Native Deployment with NVIDIA Metropolis Microservices and NVIDIA Isaac Sim
As vision AI complexity increases, streamlined deployment solutions are crucial to optimizing spaces and processes. NVIDIA accelerates development, turning...
13 MIN READ

Jun 18, 2024
Generate Traffic Insights Using YOLOv8 and NVIDIA JetPack 6.0
Intelligent Transportation Systems (ITS) applications are becoming increasingly valuable and prevalent in modern urban environments. The benefits of using ITS...
11 MIN READ

Jun 04, 2024
Power Cloud-Native Microservices at the Edge with NVIDIA JetPack 6.0, Now GA
NVIDIA JetPack SDK powers NVIDIA Jetson modules, offering a comprehensive solution for building end-to-end accelerated AI applications. JetPack 6 expands the...
12 MIN READ

Apr 03, 2024
New Lab: Generative AI Inference with NVIDIA NIM
Get started with NVIDIA NIM for deploying large language models (LLMs). Request access to a free, hands-on lab today.
1 MIN READ

Mar 27, 2024
Scale and Curate High-Quality Datasets for LLM Training with NVIDIA NeMo Curator
Enterprises are using large language models (LLMs) as powerful tools to improve operational efficiency and drive innovation. NVIDIA NeMo microservices aim to...
6 MIN READ

Mar 27, 2024
Fine-Tune and Align LLMs Easily with NVIDIA NeMo Customizer
As large language models (LLMs) continue to gain traction in enterprise AI applications, the demand for custom models that can understand and integrate specific...
5 MIN READ

Mar 27, 2024
Streamline Evaluation of LLMs for Accuracy with NVIDIA NeMo Evaluator
Large language models (LLMs) have demonstrated remarkable capabilities, from tackling complex coding tasks to crafting compelling stories to translating natural...
5 MIN READ

Mar 18, 2024
Simplify Custom Generative AI Development with NVIDIA NeMo Microservices
Across the globe, enterprises are realizing the benefits of generative AI models. They are racing to adopt these models in various applications, such as...
5 MIN READ

Mar 18, 2024
Translate Your Enterprise Data into Actionable Insights with NVIDIA NeMo Retriever
Across every industry, and every job function, generative AI is activating the potential within organizations—turning data into knowledge and empowering...
9 MIN READ

Mar 18, 2024
NVIDIA NIM Offers Optimized Inference Microservices for Deploying AI Models at Scale
The rise in generative AI adoption has been remarkable. Catalyzed by the launch of OpenAI’s ChatGPT in 2022, the new technology amassed over 100M users within...
6 MIN READ