Jeremy Appleyard – NVIDIA Technical Blog News and tutorials for developers, data scientists, and IT admins 2024-05-17T17:25:34Z http://www.open-lab.net/blog/feed/ Jeremy Appleyard <![CDATA[Programming Tensor Cores in CUDA 9]]> http://www.open-lab.net/blog/parallelforall/?p=8496 2024-05-17T17:25:34Z 2017-10-17T09:29:09Z A defining feature of the new NVIDIA Volta GPU architecture is Tensor Cores, which give the NVIDIA V100 accelerator a peak throughput that is 12x...]]>

A defining feature of the new NVIDIA Volta GPU architecture is Tensor Cores, which give the NVIDIA V100 accelerator a peak throughput that is 12x the 32-bit floating point throughput of the previous-generation NVIDIA P100. Tensor Cores enable you to use mixed precision for higher throughput without sacrificing accuracy. Tensor Cores provide a huge boost to convolutions and matrix operations.

]]>
14
Jeremy Appleyard <![CDATA[Optimizing Recurrent Neural Networks in cuDNN 5]]> http://www.open-lab.net/blog/parallelforall/?p=6577 2022-08-21T23:37:50Z 2016-04-06T21:15:57Z [caption id="attachment_6585" align="alignright" width="300"] Figure 1: cuDNN 5 + Torch speedup vs. Torch-rnn implementation, M40, Intel® Xeon® Processor...]]>

This week at GTC 2016, we announced the latest update to NVIDIA Deep Learning SDK, which now includes cuDNN 5. Version 5 offers new features, improved performance and support for the latest generation NVIDIA Tesla P100 GPU. New features in cuDNN 5 include: One of the new features we’ve added in cuDNN 5 is support for Recurrent Neural Networks (RNN). RNNs are a powerful tool used for…

]]>
17
Jeremy Appleyard <![CDATA[CUDA Pro Tip: Optimize for Pointer Aliasing]]> http://www.open-lab.net/blog/parallelforall/?p=3431 2022-08-21T23:37:07Z 2014-08-08T01:29:25Z Often cited as the main reason that naïve C/C++ code cannot match FORTRAN performance, pointer aliasing is an important topic to understand when considering...]]>

Often cited as the main reason that naïve C/C++ code cannot match FORTRAN performance, pointer aliasing is an important topic to understand when considering optimizations for your C/C++ code. In this tip I will describe what pointer aliasing is and a simple way to alter your code so that it does not harm your application performance. Two pointers alias if the memory to which they point…
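
As a rough illustration of why aliasing matters, here is a minimal sketch in standard C99. It is not taken from the original tip, and the function names `add_twice` and `add_twice_restrict` are hypothetical. In the first function the compiler has to assume `a` and `b` might refer to the same memory, so it must re-read `*b` after the first store. In the second, the `restrict` qualifier is a promise from the programmer that they do not overlap, which lets the compiler reuse the first load. CUDA C++ spells the same qualifier `__restrict__`.

```c
/* Possible aliasing: a and b may point to the same int, so the
   compiler must reload *b after the first store through a. */
void add_twice(int *a, const int *b)
{
    *a += *b;
    *a += *b;   /* *b is re-read: the store above may have changed it */
}

/* No aliasing promised: restrict tells the compiler that a and b do
   not overlap, so it is free to load *b once and reuse the value.
   Calling this with overlapping pointers is undefined behavior. */
void add_twice_restrict(int *restrict a, const int *restrict b)
{
    *a += *b;
    *a += *b;   /* the compiler may treat this pair as *a += 2 * (*b) */
}
```

With distinct arguments holding 1 and 1, both functions leave 3 in `*a`. If `add_twice` is called with the same pointer for both arguments, the value goes from 1 to 2 and then to 4, which is why the compiler cannot merge the two additions without the `restrict` promise.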

]]>
13