Jonathan Bentz – NVIDIA Technical Blog News and tutorials for developers, data scientists, and IT admins 2025-03-10T18:02:30Z http://www.open-lab.net/blog/feed/ Jonathan Bentz <![CDATA[Understanding PTX, the Assembly Language of CUDA GPU Computing]]> http://www.open-lab.net/blog/?p=96891 2025-03-07T23:54:11Z 2025-03-12T18:00:00Z Parallel thread execution (PTX) is a virtual machine instruction set architecture that has been part of CUDA from its beginning. You can think of PTX as the...]]>

Parallel thread execution (PTX) is a virtual machine instruction set architecture that has been part of CUDA from its beginning. You can think of PTX as the assembly language of the NVIDIA CUDA GPU computing platform. In this post, we’ll explain what that means, what PTX is for, and what you need to know about it to make the most of CUDA for your applications. We’ll start by walking through…

Source

]]>
Jonathan Bentz <![CDATA[Optimizing Compile Times for CUDA C++]]> http://www.open-lab.net/blog/?p=96775 2025-03-10T18:02:30Z 2025-03-10T18:02:27Z In modern software development, time is an incredibly valuable resource, especially during the compilation process. For developers working with CUDA C++ on...]]>

In modern software development, time is an incredibly valuable resource, especially during the compilation process. For developers working with CUDA C++ on large-scale GPU-accelerated applications, optimizing compile times can significantly enhance productivity and streamline the entire development cycle. When using the compiler for offline compilation, efficient compilation times enable…

Source

]]>
Jonathan Bentz <![CDATA[CUDA Toolkit Now Available for NVIDIA Blackwell?]]> http://www.open-lab.net/blog/?p=95358 2025-02-06T19:33:53Z 2025-01-31T19:17:12Z The latest release of the CUDA Toolkit, version 12.8, continues to push accelerated computing performance in data sciences, AI, scientific computing, and...]]>

The latest release of the CUDA Toolkit, version 12.8, continues to push accelerated computing performance in data sciences, AI, scientific computing, and computer graphics and simulation, using the latest NVIDIA CPUs and GPUs. This post highlights some of the new features and enhancements included with this release: CUDA Toolkit 12.8 is the first version of the Toolkit to support…

Source

]]>
���˳���97caoporen����