Pascal – NVIDIA Technical Blog

Pascal – NVIDIA Technical Blog News and tutorials for developers, data scientists, and IT admins 2025-03-21T20:30:26Z http://www.open-lab.net/blog/feed/ Rick Merritt <![CDATA[Explainer: What Is Accelerated Computing?]]> http://www.open-lab.net/blog/?p=54460 2024-06-05T21:58:00Z 2022-11-18T20:00:00Z

Accelerated computing uses parallel processing to speed up work on demanding applications, from AI and data analytics to simulations and visualizations.]]>

Accelerated computing uses parallel processing to speed up work on demanding applications, from AI and data analytics to simulations and visualizations.

accelerated-computing-road-x1283-1280x683

Accelerated computing uses parallel processing to speed up work on demanding applications, from AI and data analytics to simulations and visualizations.

]]> 0 Nikolay Sakharnykh <![CDATA[Maximizing Unified Memory Performance in CUDA]]> http://www.open-lab.net/blog/parallelforall/?p=8603 2022-08-21T23:38:33Z 2017-11-20T03:37:53Z

Many of today's applications process large volumes of data. While GPU architectures have very fast HBM or GDDR memory, they have limited capacity. Making the...]]>

Many of today's applications process large volumes of data. While GPU architectures have very fast HBM or GDDR memory, they have limited capacity. Making the... Unified Memory

Unified Memory

Many of today��s applications process large volumes of data. While GPU architectures have very fast HBM or GDDR memory, they have limited capacity. Making the most of GPU performance requires the data to be as close to the GPU as possible. This is especially important for applications that iterate over the same data multiple times or have a high flops/byte ratio. Many real-world codes have to��

]]> 18 Mark Harris <![CDATA[Unified Memory for CUDA Beginners]]> http://www.open-lab.net/blog/parallelforall/?p=7937 2022-08-21T23:38:11Z 2017-06-20T03:59:57Z

My previous introductory post, "An Even Easier Introduction to CUDA C++", introduced the basics of CUDA programming by showing how to write a simple program...]]>

My previous introductory post, "An Even Easier Introduction to CUDA C++", introduced the basics of CUDA programming by showing how to write a simple program...

CUDA_Cube_1K

]]> 46 Mark Harris <![CDATA[NVIDIA DGX-1: The Fastest Deep Learning System]]> http://www.open-lab.net/blog/parallelforall/?p=7684 2022-08-21T23:38:08Z 2017-04-05T15:00:55Z

[caption id="attachment_7685" align="alignright" width="300"] Figure 1: NVIDIA DGX-1.[/caption] One year ago today, NVIDIA announced the NVIDIA? DGX-1™,...]]>

[caption id="attachment_7685" align="alignright" width="300"] Figure 1: NVIDIA DGX-1.[/caption] One year ago today, NVIDIA announced the NVIDIA? DGX-1™,...

dgx-1-thumb

One year ago today, NVIDIA announced the NVIDIA? DGX-1, an integrated system for deep learning. DGX-1 (shown in Figure 1) features eight Tesla P100 GPU accelerators connected through NVLink, the NVIDIA high-performance GPU interconnect, in a hybrid cube-mesh network. Together with dual socket Intel Xeon CPUs and four 100 Gb InfiniBand network interface cards, DGX-1 provides unprecedented��

]]> 2 Nikolay Sakharnykh <![CDATA[Beyond GPU Memory Limits with Unified Memory on Pascal]]> http://www.open-lab.net/blog/parallelforall/?p=7233 2022-08-21T23:37:59Z 2016-12-14T10:31:50Z

[caption id="attachment_7428" align="alignright" width="300"] Figure 1: Dimethyl ether jet simulations designed to study complex new fuels. Image courtesy of...]]>

[caption id="attachment_7428" align="alignright" width="300"] Figure 1: Dimethyl ether jet simulations designed to study complex new fuels. Image courtesy of...

Figure 1: Dimethyl ether jet simulations designed to study complex new fuels. Image courtesy of the Center for Exascale Simulation of Combustion in Turbulence (ExaCT).

Modern computer architectures have a hierarchy of memories of varying size and performance. GPU architectures are approaching a terabyte per second memory bandwidth that, coupled with high-throughput computational cores, creates an ideal device for data-intensive tasks. However, everybody knows that fast memory is expensive. Modern applications striving to solve larger and larger problems can be��

]]> 15 Mark Harris <![CDATA[New Pascal GPUs Accelerate Inference in the Data Center]]> http://www.open-lab.net/blog/parallelforall/?p=7156 2022-08-21T23:37:57Z 2016-09-13T03:01:32Z

Artificial intelligence is already more ubiquitous than many people realize. Applications of AI abound, many of them powered by complex deep neural networks...]]>

Artificial intelligence is already more ubiquitous than many people realize. Applications of AI abound, many of them powered by complex deep neural networks...

Artificial intelligence is already more ubiquitous than many people realize. Applications of AI abound, many of them powered by complex deep neural networks trained on massive data using GPUs. These applications understand when you talk to them; they can answer questions; and they can help you find information in ways you couldn��t before. Pinterest image search technology allows users to find��

]]> 3 Mark Harris <![CDATA[Inside Pascal: NVIDIA��s Newest Computing Platform]]> http://www.open-lab.net/blog/parallelforall/?p=6535 2022-08-21T23:37:50Z 2016-04-05T17:00:44Z

At the 2016 GPU Technology Conference in San Jose, NVIDIA CEO Jen-Hsun Huang announced the new NVIDIA Tesla P100, the most advanced accelerator ever built....]]>

At the 2016 GPU Technology Conference in San Jose, NVIDIA CEO Jen-Hsun Huang announced the new NVIDIA Tesla P100, the most advanced accelerator ever built....

pascal_key_image

At the 2016 GPU Technology Conference in San Jose, NVIDIA CEO Jen-Hsun Huang announced the new NVIDIA Tesla P100, the most advanced accelerator ever built. Based on the new NVIDIA Pascal GP100 GPU and powered by ground-breaking technologies, Tesla P100 delivers the highest absolute performance for HPC, technical computing, deep learning, and many computationally intensive datacenter workloads.

]]> 51 Mark Harris <![CDATA[CUDA 8 Features Revealed]]> http://www.open-lab.net/blog/parallelforall/?p=6554 2022-08-21T23:37:50Z 2016-04-05T12:00:11Z

Today I'm excited to announce the general availability of CUDA 8, the latest update to NVIDIA's powerful parallel computing?platform and programming model. In...]]>

Today I'm excited to announce the general availability of CUDA 8, the latest update to NVIDIA's powerful parallel computing?platform and programming model. In...

CUDA_Cube_1K

Today I��m excited to announce the general availability of CUDA 8, the latest update to NVIDIA��s powerful parallel computing platform and programming model. In this post I��ll give a quick overview of the major new features of CUDA 8. To learn more you can watch the recording of my talk from GTC 2016, ��CUDA 8 and Beyond��. A crucial goal for CUDA 8 is to provide support for the powerful new��

]]> 51 Denis Foley <![CDATA[NVLink, Pascal and Stacked Memory: Feeding the Appetite for Big Data]]> http://www.open-lab.net/blog/parallelforall/?p=3097 2022-08-21T23:37:04Z 2014-03-25T16:31:41Z

For more recent info on NVLink, check out the?post, "How NVLink Will Enable Faster, Easier Multi-GPU Computing". NVIDIA GPU accelerators have emerged in...]]>

For more recent info on NVLink, check out the?post, "How NVLink Will Enable Faster, Easier Multi-GPU Computing". NVIDIA GPU accelerators have emerged in...

stacked_memory

For more recent info on NVLink, check out the post, ��How NVLink Will Enable Faster, Easier Multi-GPU Computing��. NVIDIA GPU accelerators have emerged in High-Performance Computing as an energy-efficient way to provide significant compute capability. The Green500 supercomputer list makes this clear: the top 10 supercomputers on the list feature NVIDIA GPUs. Today at the 2014 GPU Technology��

]]> 14 ��˳��97caoporen��