Michael Andersch – NVIDIA Technical Blog

News and tutorials for developers, data scientists, and IT admins
Feed: http://www.open-lab.net/blog/feed/ | Updated 2023-10-25T23:51:26Z

NVIDIA Grace Hopper Superchip Architecture In-Depth
By Michael Andersch | http://www.open-lab.net/blog/?p=57192 | Updated 2022-11-18T11:48:05Z | Published 2022-11-10T19:00:00Z

The NVIDIA Grace Hopper Superchip Architecture is the first true heterogeneous accelerated platform for high-performance computing (HPC) and AI workloads. It accelerates applications with the strengths of both GPUs and CPUs while providing the simplest and most productive distributed heterogeneous programming model to date. Scientists and engineers can focus on solving the world’s most important…

NVIDIA Hopper Architecture In-Depth
By Michael Andersch | http://www.open-lab.net/blog/?p=45555 | Updated 2023-10-25T23:51:26Z | Published 2022-03-22T18:00:00Z

Today during the 2022 NVIDIA GTC Keynote address, NVIDIA CEO Jensen Huang introduced the new NVIDIA H100 Tensor Core GPU based on the new NVIDIA Hopper GPU architecture. This post gives you a look inside the new H100 GPU and describes important new features of NVIDIA Hopper architecture GPUs. The NVIDIA H100 Tensor Core GPU is our ninth-generation data center GPU designed to deliver an…

Tips for Optimizing GPU Performance Using Tensor Cores
By Michael Andersch | http://www.open-lab.net/blog/?p=14687 | Updated 2023-07-27T20:01:41Z | Published 2019-06-10T13:00:06Z

Our most popular question is “What can I do to get great GPU performance for deep learning?” We’ve recently published a detailed Deep Learning Performance Guide to help answer this question. The guide explains how GPUs process data and gives tips on how to design networks for better performance. We also take a close look at Tensor Core optimization to help improve performance. This post takes a…

Inference: The Next Step in GPU-Accelerated Deep Learning
By Michael Andersch | http://www.open-lab.net/blog/parallelforall/?p=5777 | Updated 2022-08-21T23:37:37Z | Published 2015-11-11T22:50:00Z

At 45 images/s/W, Jetson TX1 is super efficient at deep learning inference. Read the whitepaper. Deep learning is revolutionizing many areas of machine perception, with the potential to impact the everyday experience of people everywhere. On a high level, working with deep neural networks is a two-stage process: First, a neural network is trained: its parameters are determined using labeled…
