Zhihan Jiang – NVIDIA Technical BlogNews and tutorials for developers, data scientists, and IT admins2025-04-23T19:41:12Zhttp://www.open-lab.net/blog/feed/Zhihan Jiang<![CDATA[NVIDIA Blackwell Delivers Massive Performance Leaps in MLPerf Inference v5.0]]>http://www.open-lab.net/blog/?p=983672025-04-23T19:41:12Z2025-04-02T18:14:48ZThe compute demands for large language model (LLM) inference are growing rapidly, fueled by the combination of growing model sizes, real-time latency...
]]>Zhihan Jiang<![CDATA[NVIDIA Blackwell Platform Sets New LLM Inference Records in MLPerf Inference v4.1]]>http://www.open-lab.net/blog/?p=879572024-09-05T17:57:17Z2024-08-28T15:00:00ZLarge language model (LLM) inference is a full-stack challenge. Powerful GPUs, high-bandwidth GPU-to-GPU interconnects, efficient acceleration libraries, and a...
]]>1Zhihan Jiang<![CDATA[NVIDIA H200 Tensor Core GPUs and NVIDIA TensorRT-LLM Set MLPerf LLM Inference Records]]>http://www.open-lab.net/blog/?p=801972024-11-14T15:53:12Z2024-03-27T15:29:05ZGenerative AI is unlocking new computing applications that greatly augment human capability, enabled by continued model innovation. Generative AI...
]]>Zhihan Jiang<![CDATA[Leading MLPerf Inference v3.1 Results with NVIDIA GH200 Grace Hopper Superchip Debut]]>http://www.open-lab.net/blog/?p=704502023-09-22T16:17:33Z2023-09-09T16:00:00ZAI is transforming computing, and inference is how the capabilities of AI are deployed in the world��s applications. Intelligent chatbots, image and video...
]]>1Zhihan Jiang<![CDATA[Setting New Records in MLPerf Inference v3.0 with Full-Stack Optimizations for AI]]>http://www.open-lab.net/blog/?p=629582023-07-05T19:23:50Z2023-04-05T19:10:55ZThe most exciting computing applications currently rely on training and running inference on complex AI models, often in demanding, real-time deployment...
]]>0Zhihan Jiang<![CDATA[Full-Stack Innovation Fuels Highest MLPerf Inference 2.1 Results for NVIDIA]]>http://www.open-lab.net/blog/?p=546382023-07-05T19:26:31Z2022-09-08T18:10:00ZToday��s AI-powered applications are enabling richer experiences, fueled by both larger and more complex AI models as well as the application of many models in...