Posts by Anu Srivastava
Generative AI
Apr 05, 2025
NVIDIA Accelerates Inference on Meta Llama 4 Scout and Maverick
The newest generation of the popular Llama AI models is here with Llama 4 Scout and Llama 4 Maverick. Accelerated by NVIDIA open-source software, they can...
4 MIN READ
Generative AI
Mar 12, 2025
Lightweight, Multimodal, Multilingual Gemma 3 Models Are Streamlined for Performance
Building AI systems with foundation models requires a delicate balancing of resources such as memory, latency, storage, compute, and more. One size does not fit...
3 MIN READ
Models / Libraries / Frameworks
Feb 26, 2025
Latest Multimodal Addition to Microsoft Phi SLMs Trained on NVIDIA GPUs
Large language models (LLMs) have permeated every industry and changed the potential of technology. However, due to their massive size they are not practical...
4 MIN READ
Generative AI
Dec 17, 2024
Boost Llama 3.3 70B Inference Throughput 3x with NVIDIA TensorRT-LLM Speculative Decoding
Meta's Llama collection of open large language models (LLMs) continues to grow with the recent addition of Llama 3.3 70B, a text-only...
8 MIN READ