Hao Lu

Hao Lu is a principal software engineer on the NVIDIA TensorRT-LLM team, specializing in large language model (LLM) inference. Before joining NVIDIA, Hao co-founded HippoML, a startup focused on generative AI inference. She also played a key role in the development of AITemplate and PyTorch Static Runtime, and contributed to open source projects including PyTorch, Caffe2, Caffe2Go, and QNNPACK during her time at Meta. Hao received her bachelor’s degree from Peking University and master’s degree and PhD from University of Notre Dame.
Avatar photo

Posts by Hao Lu

Generative AI

NVIDIA Blackwell Delivers World-Record DeepSeek-R1 Inference Performance

NVIDIA announced world-record DeepSeek-R1 inference performance at NVIDIA GTC 2025. A single NVIDIA DGX system with eight NVIDIA Blackwell GPUs can achieve over... 14 MIN READ