Michael Carilli – NVIDIA Technical Blog News and tutorials for developers, data scientists, and IT admins 2023-02-13T17:46:37Z http://www.open-lab.net/blog/feed/ Michael Carilli <![CDATA[NVIDIA Apex: Tools for Easy Mixed-Precision Training in PyTorch]]> http://www.open-lab.net/blog/?p=12951 2022-08-21T23:39:14Z 2018-12-03T16:00:57Z Most deep learning frameworks, including PyTorch, train using 32-bit floating point (FP32) arithmetic by default. However, using FP32 for all operations is not...]]>

Most deep learning frameworks, including PyTorch, train using 32-bit floating point (FP32) arithmetic by default. However, using FP32 for all operations is not essential to achieve full accuracy for many state-of-the-art deep neural networks (DNNs). In 2017, NVIDIA researchers developed a methodology for mixed-precision training in which a few operations are executed in FP32 while the majority…

Source

]]>
0
Michael Carilli <![CDATA[New Optimizations To Accelerate Deep Learning Training on NVIDIA GPUs]]> http://www.open-lab.net/blog/?p=12964 2023-02-13T17:46:37Z 2018-12-03T16:00:36Z The pace of AI adoption across diverse industries depends on maximizing data scientists�� productivity. NVIDIA releases optimized NGC containers every month...]]>

The pace of AI adoption across diverse industries depends on maximizing data scientists’ productivity. NVIDIA releases optimized NGC containers every month with improved performance for deep learning frameworks and libraries, helping scientists maximize their potential. NVIDIA continuously invests in the full data science stack, including GPU architecture, systems, and software stacks.

Source

]]>
0
���˳���97caoporen����