Stephen Jones – NVIDIA Technical Blog News and tutorials for developers, data scientists, and IT admins 2025-02-03T22:25:21Z http://www.open-lab.net/blog/feed/ Stephen Jones <![CDATA[Dynamic Control Flow in CUDA Graphs with Conditional Nodes]]> http://www.open-lab.net/blog/?p=81012 2025-02-03T22:25:21Z 2024-05-10T18:43:37Z Post updated on February 3, 2025 with details about CUDA 12.8. CUDA Graphs can provide a significant performance increase, as the driver is able to optimize...]]>

Post updated on February 3, 2025 with details about CUDA 12.8. CUDA Graphs can provide a significant performance increase, as the driver is able to optimize execution using the complete description of tasks and dependencies. Graphs provide incredible benefits for static workflows where the overhead of graph creation can be amortized over many successive launches. However…

Source

]]>
2
Stephen Jones <![CDATA[Enabling Dynamic Control Flow in CUDA Graphs with Device Graph Launch]]> http://www.open-lab.net/blog/?p=58258 2024-11-01T14:33:40Z 2022-12-12T20:51:14Z CUDA Graphs significantly reduce the overhead of launching a large batch of user operations by defining them as a task graph, which may be launched in a single...]]>

CUDA Graphs significantly reduce the overhead of launching a large batch of user operations by defining them as a task graph, which may be launched in a single operation. Knowing the workflow upfront enables the CUDA driver to apply various optimizations, which cannot be performed when launching through a stream model. However, this performance comes at the cost of flexibility.

Source

]]>
3
Stephen Jones <![CDATA[NVIDIA Ampere Architecture In-Depth]]> http://www.open-lab.net/blog/?p=17431 2023-05-24T00:05:26Z 2020-05-14T13:00:00Z Today, during the 2020 NVIDIA GTC keynote address, NVIDIA founder and CEO Jensen Huang introduced the new NVIDIA A100 GPU based on the new NVIDIA Ampere GPU...]]>

Today, during the 2020 NVIDIA GTC keynote address, NVIDIA founder and CEO Jensen Huang introduced the new NVIDIA A100 GPU based on the new NVIDIA Ampere GPU architecture. This post gives you a look inside the new A100 GPU, and describes important new features of NVIDIA Ampere architecture GPUs. The diversity of compute-intensive applications running in modern cloud data centers has driven…

Source

]]>
0
���˳���97caoporen����