CUDA Pro Tip: Minimize the Tail Effect
By Julien Demouth, NVIDIA Technical Blog. Published 2014-06-04, updated 2022-08-21.
http://www.open-lab.net/blog/parallelforall/?p=3275

When I optimize CUDA kernels, I sometimes see a discrepancy between Achieved and Theoretical Occupancy. Theoretical Occupancy is the ratio between the number of threads that may run on each multiprocessor (SM) and the maximum number of executable threads per SM (2048 on the Kepler architecture). This value is estimated from the size of the blocks and the amount of…
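As a rough illustration of the ratio described above, the following Python sketch computes Theoretical Occupancy from a block size and a number of blocks resident on one SM. The function name, the parameter names, and the sample configuration (256-thread blocks, 4 resident blocks per SM) are assumptions made for this example; only the 2048-thread Kepler limit comes from the text. The number of resident blocks is taken as an input here rather than derived.

```python
# Sketch: theoretical occupancy as the ratio of threads resident on an SM
# to the maximum number of threads the SM can hold.

KEPLER_MAX_THREADS_PER_SM = 2048  # figure quoted in the text for Kepler


def theoretical_occupancy(threads_per_block, resident_blocks_per_sm,
                          max_threads_per_sm=KEPLER_MAX_THREADS_PER_SM):
    """Return resident threads per SM divided by the SM's thread capacity.

    All names are hypothetical; this is not an NVIDIA API.
    """
    resident_threads = threads_per_block * resident_blocks_per_sm
    if resident_threads > max_threads_per_sm:
        raise ValueError("resident threads exceed the SM's thread capacity")
    return resident_threads / max_threads_per_sm


# Hypothetical launch: 256-thread blocks, 4 of which are resident on one SM.
occupancy = theoretical_occupancy(256, 4)
print(f"{occupancy:.2f}")  # 1024 / 2048 -> prints 0.50
```

With these assumed numbers, half of the SM's thread slots are occupied. Doubling the resident blocks to 8 would bring the ratio to 1.0.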
