Posts by Uttara Kumar
Data Center / Cloud
Mar 20, 2025
Boost Llama Model Performance on Microsoft Azure AI Foundry with NVIDIA TensorRT-LLM
Microsoft, in collaboration with NVIDIA, announced transformative performance improvements for the Meta Llama family of models on its Azure AI Foundry platform....
4 MIN READ
Data Center / Cloud
Aug 21, 2024
Google Cloud Run Adds Support for NVIDIA L4 GPUs, NVIDIA NIM, and Serverless AI Inference Deployments at Scale
Deploying AI-enabled applications and services presents enterprises with significant challenges: Performance is critical as it directly shapes user...
6 MIN READ
Data Center / Cloud
May 31, 2023
Protecting Sensitive Data and AI Models with Confidential Computing
Rapid digital transformation has led to an explosion of sensitive data being generated across the enterprise. That data has to be stored and processed in data...
10 MIN READ
Data Center / Cloud
Jul 28, 2022
Building a Speech-Enabled AI Virtual Assistant with NVIDIA Riva on Amazon EC2
Speech AI can assist human agents in contact centers, power virtual assistants and digital avatars, generate live captioning in video conferencing, and much...
12 MIN READ
Data Center / Cloud
Mar 07, 2022
Deploy AI Workloads at Scale with Bottlerocket and NVIDIA-Powered Amazon EC2 Instances
Deploying AI-powered services like voice-based assistants, e-commerce product recommendations, and contact-center automation into production at scale is...
3 MIN READ
Networking / Communications
Nov 29, 2021
AWS Launches First NVIDIA GPU-Accelerated Graviton-Based Instance with Amazon EC2 G5g
Today at AWS re:Invent 2021, AWS announced the general availability of Amazon EC2 G5g instances—bringing the first NVIDIA GPU-accelerated Arm-based instance...
3 MIN READ