Shelton Dsouza – NVIDIA Technical Blog

Shelton Dsouza – NVIDIA Technical Blog News and tutorials for developers, data scientists, and IT admins 2025-04-23T14:57:41Z http://www.open-lab.net/blog/feed/ Shelton Dsouza <![CDATA[Dynamic Loading in the CUDA Runtime]]> http://www.open-lab.net/blog/?p=93958 2025-04-23T14:57:41Z 2025-01-31T20:03:32Z

Historically, the GPU device code is compiled alongside the application with offline tools such as nvcc. In this case, the GPU device code is managed internally...]]>

Historically, the GPU device code is compiled alongside the application with offline tools such as . In this case, the GPU device code is managed internally to the CUDA runtime. You can then launch kernels using and the CUDA runtime ensures that the invoked kernel is launched. However, in some cases, GPU device code needs to be dynamically compiled and loaded. This post shows a way to…

]]> Shelton Dsouza <![CDATA[CUDA Context-Independent Module Loading]]> http://www.open-lab.net/blog/?p=58232 2024-07-30T21:28:50Z 2022-12-12T20:10:00Z

Most CUDA developers are familiar with the cuModuleLoad API and its counterparts for loading a module containing device code into a CUDA context. In...]]>

Most CUDA developers are familiar with the API and its counterparts for loading a module containing device code into a CUDA context. In most cases, you want to load identical device code on all devices. This requires loading device code into each CUDA context explicitly. Moreover, libraries and frameworks that do not control context creation and destruction must keep track of them to explicitly…

]]> 2 ��˳��97caoporen��