Vinh Nguyen

Vinh Nguyen is a deep learning engineer and data scientist, having published more than 50 scientific articles that collectively attracted more than 5,000 citations. At NVIDIA, his work spans a wide range of deep learning and AI applications, including large language models and multi-modality models.
Avatar photo

Posts by Vinh Nguyen

A larger and smaller cartoon llama on a sunny beach, wearing shirts that say 8B and 4B.
Generative AI

LLM Model Pruning and Knowledge Distillation with NVIDIA NeMo Framework

Model pruning and knowledge distillation are powerful cost-effective strategies for obtaining smaller language models from an initial larger sibling. ... 10 MIN READ
Generative AI

Mistral-NeMo-Minitron 8B Model Delivers Unparalleled Accuracy

This post was originally published August 21, 2024 but has been revised with current data. Recently, NVIDIA and Mistral AI unveiled Mistral NeMo 12B, a leading... 7 MIN READ
Decorative image of two cartoon llamas in sunglasses.
Generative AI

How to Prune and Distill Llama-3.1 8B to an NVIDIA Llama-3.1-Minitron 4B Model

Large language models (LLM) are now a dominant force in natural language processing and understanding, thanks to their effectiveness and versatility. LLMs such... 12 MIN READ
Generative AI

Seamlessly Deploying a Swarm of LoRA Adapters with NVIDIA NIM

The latest state-of-the-art foundation large language models (LLMs) have billions of parameters and are pretrained on trillions of tokens of input text. They... 11 MIN READ
Generative AI

Applying Mixture of Experts in LLM Architectures

Mixture of experts (MoE) large language model (LLM) architectures have recently emerged, both in proprietary LLMs such as GPT-4, as well as in community models... 12 MIN READ
Generative AI

Build Enterprise Retrieval-Augmented Generation Apps with NVIDIA Retrieval QA Embedding Model

Large language models (LLMs) are transforming the AI landscape with their profound grasp of human and programming languages. Essential for next-generation... 12 MIN READ