CUDA Pro Tip: The Fast Way to Query Device Properties – NVIDIA Technical Blog

CUDA Pro Tip: The Fast Way to Query Device Properties – NVIDIA Technical Blog News and tutorials for developers, data scientists, and IT admins 2025-03-21T20:30:26Z http://www.open-lab.net/blog/feed/ Thejaswi Rao <![CDATA[CUDA Pro Tip: The Fast Way to Query Device Properties]]> http://www.open-lab.net/blog/?p=15512 2023-05-22T22:00:37Z 2019-08-20T16:10:48Z

CUDA applications often need to know the maximum available shared memory per block or to query the number of multiprocessors in the active GPU. One way to do...]]>

CUDA applications often need to know the maximum available shared memory per block or to query the number of multiprocessors in the active GPU. One way to do...

cuda-12.0-nvJitLink

CUDA applications often need to know the maximum available shared memory per block or to query the number of multiprocessors in the active GPU. One way to do this is by calling . Unfortunately, calling this function inside a performance-critical section of your code lead to huge slowdowns, depending on your code. We found out the hard way when caused a 20x slowdown in the Random Forests algorithm��

]]> 6 ��˳��97caoporen��