Automating GPU Kernel Generation with DeepSeek-R1 and Inference Time Scaling – NVIDIA Technical Blog News and tutorials for developers, data scientists, and IT admins 2025-04-01T21:14:57Z http://www.open-lab.net/blog/feed/ Terry Chen <![CDATA[Automating GPU Kernel Generation with DeepSeek-R1 and Inference Time Scaling]]> http://www.open-lab.net/blog/?p=95998 2025-02-20T15:56:57Z 2025-02-12T18:00:00Z As AI models extend their capabilities to solve more sophisticated challenges, a new scaling law known as test-time scaling or inference-time scaling is...]]> As AI models extend their capabilities to solve more sophisticated challenges, a new scaling law known as test-time scaling or inference-time scaling is...Mixture of experts icons for attention kernels.

As AI models extend their capabilities to solve more sophisticated challenges, a new scaling law known as test-time scaling or inference-time scaling is emerging. Also known as AI reasoning or long-thinking, this technique improves model performance by allocating additional computational resources during inference to evaluate multiple possible outcomes and then selecting the best one��

Source

]]>
2
���˳���97caoporen����