Unified Memory

Feb 13, 2025
Simplify System Memory Management with the Latest NVIDIA GH200 NVL2 Enterprise RA
NVIDIA Enterprise Reference Architectures (Enterprise RAs) can reduce the time and cost of deploying AI infrastructure solutions. They provide a streamlined...
8 MIN READ

Oct 05, 2021
Improving GPU Memory Oversubscription Performance
Since its introduction more than 7 years ago, the CUDA Unified Memory programming model has kept gaining popularity among developers. Unified Memory provides a...
16 MIN READ

Sep 08, 2021
Analyzing the RNA-Sequence of 1.3M Mouse Brain Cells with RAPIDS on NVIDIA GPUs
Single-cell genomics research continues to advance drug discovery for disease prevention. For example, it has been pivotal in developing treatments for the...
8 MIN READ

Nov 19, 2017
Maximizing Unified Memory Performance in CUDA
Many of today's applications process large volumes of data. While GPU architectures have very fast HBM or GDDR memory, they have limited capacity. Making the...
18 MIN READ

Jun 19, 2017
Unified Memory for CUDA Beginners
My previous introductory post, "An Even Easier Introduction to CUDA C++", introduced the basics of CUDA programming by showing how to write a simple program...
16 MIN READ

Dec 14, 2016
Beyond GPU Memory Limits with Unified Memory on Pascal
Figure 1: Dimethyl ether jet simulations designed to study complex new fuels. Image courtesy of...
20 MIN READ

Apr 05, 2016
Inside Pascal: NVIDIA's Newest Computing Platform
At the 2016 GPU Technology Conference in San Jose, NVIDIA CEO Jen-Hsun Huang announced the new NVIDIA Tesla P100, the most advanced accelerator ever built....
19 MIN READ

Apr 05, 2016
CUDA 8 Features Revealed
Today I'm excited to announce the general availability of CUDA 8, the latest update to NVIDIA's powerful parallel computing?platform and programming model. In...
19 MIN READ

Feb 23, 2016
High-Performance Geometric Multi-Grid with GPU Acceleration
Linear solvers are probably the most common tool in scientific computing applications. There are two basic classes of methods that can be used to solve an...
16 MIN READ

Sep 16, 2015
Combine OpenACC and Unified Memory for Productivity and Performance
The post Getting Started with OpenACC?covered four steps to progressively accelerate your code with OpenACC. It's often necessary to use OpenACC directives to...
12 MIN READ

Nov 14, 2014
How NVLink Will Enable Faster, Easier Multi-GPU Computing
Accelerated systems have become the new standard for high performance computing (HPC) as GPUs continue to raise the bar for both performance and energy...
8 MIN READ

Aug 13, 2014
Unified Memory: Now for CUDA Fortran Programmers
Unified Memory is a CUDA feature that we've talked a lot about on Parallel Forall. CUDA 6 introduced Unified Memory, which dramatically simplifies GPU...
3 MIN READ

Mar 25, 2014
NVLink, Pascal and Stacked Memory: Feeding the Appetite for Big Data
For more recent info on NVLink, check out the?post, "How NVLink Will Enable Faster, Easier Multi-GPU Computing". NVIDIA GPU accelerators have emerged in...
5 MIN READ

Jan 27, 2014
CUDA Pro Tip: Control GPU Visibility with CUDA_VISIBLE_DEVICES
As a CUDA developer, you will often need to control which devices your application uses. In a short-but-sweet post on the Acceleware blog, Chris Mason writes:...
3 MIN READ

Nov 18, 2013
Unified Memory in CUDA 6
With CUDA 6, NVIDIA introduced one of the most dramatic programming model improvements in the history of the CUDA platform, Unified Memory. In a typical PC or...
12 MIN READ