Author: Wenhan Tan | NVIDIA Technical Blog

Wenhan Tan

Wenhan Tan is a solutions architect at NVIDIA, assisting customers to adopt NVIDIA AI solutions at large scale. His work focuses on accelerating deep learning applications and addressing inference and training challenges.

Posts by Wenhan Tan

Conversational AI Oct 22, 2024

Scaling LLMs with NVIDIA Triton and NVIDIA TensorRT-LLM Using Kubernetes

Large language models (LLMs) have been widely used for chatbots, content generation, summarization, classification, translation, and more. State-of-the-art LLMs... 16 MIN READ