Khubaib Khubaib

Khubaib is a senior deep learning performance architect at NVIDIA, dedicated to maximizing GPU efficiency and shaping next-generation architectures. His recent work involves evaluating inference performance of large language models and guiding software teams in optimization efforts for high-performance MLPerf Inference submissions. Previously, he worked on performance modeling and simulation of deep learning models to inform GPU architecture design. Before NVIDIA, he contributed to CPU architecture research and development at Apple and Intel. Khubaib holds a master’s degree and PhD in Electrical and Computer Engineering from the University of Texas at Austin.
Avatar photo

Posts by Khubaib Khubaib

Data Center / Cloud

NVIDIA Blackwell Delivers Massive Performance Leaps in MLPerf Inference v5.0

The compute demands for large language model (LLM) inference are growing rapidly, fueled by the combination of growing model sizes, real-time latency... 9 MIN READ