Wenhan Tan

Wenhan Tan is a solutions architect at NVIDIA who helps customers adopt NVIDIA AI solutions at scale. His work focuses on accelerating deep learning applications and addressing inference and training challenges.

Posts by Wenhan Tan

Conversational AI

Scaling LLMs with NVIDIA Triton and NVIDIA TensorRT-LLM Using Kubernetes

Large language models (LLMs) have been widely used for chatbots, content generation, summarization, classification, translation, and more. State-of-the-art LLMs...

16 MIN READ