Improving GPU Memory Oversubscription Performance

Originally published at: https://developer.nvidia.com/blog/improving-gpu-memory-oversubscription-performance/

Since its introduction more than 7 years ago, the CUDA Unified Memory programming model has steadily gained popularity among developers. Unified Memory provides a simple interface for prototyping GPU applications without manually migrating memory between host and device. Starting with the NVIDIA Pascal GPU architecture, Unified Memory enabled applications to use all available CPU and…
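
A minimal Unified Memory sketch (my own illustration, not code from the article) of the programming model described above: cudaMallocManaged returns one pointer that both the CPU and the GPU can dereference, so no explicit cudaMemcpy is needed, and on Pascal and later GPUs pages migrate on demand.

#include <cstdio>
#include <cuda_runtime.h>

// Scale every element in place on the GPU.
__global__ void scale(float *data, int n, float factor) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) data[i] *= factor;
}

int main() {
    const int n = 1 << 20;
    float *data = nullptr;
    cudaMallocManaged(&data, n * sizeof(float));      // single allocation visible to CPU and GPU

    for (int i = 0; i < n; ++i) data[i] = 1.0f;       // touch on the host first

    scale<<<(n + 255) / 256, 256>>>(data, n, 2.0f);   // pages migrate to the GPU on first access
    cudaDeviceSynchronize();

    printf("data[0] = %f\n", data[0]);                // pages migrate back on CPU access
    cudaFree(data);
    return 0;
}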


Where is the code example?

You can find the code example here. Thank you for your interest.


You're welcome. I downloaded and updated to the latest commit, 0754981b37b343474c45222ea487c9667551e854 [0754981], of the master branch.
However, I could not find any project files. How can I debug a .cu file on Windows 10 with VS2019?
Commit: 0754981b37b343474c45222ea487c9667551e854 [0754981]
Parents: 9ad4c010fd, 5a003551a1
Author: Mark Harris mharris@nvidia.com
Date: Tuesday, August 3, 2021 2:44:36 PM
Committer: GitHub
Merge pull request #36 from chirayuG-nvidia/unified_memory

Add Unified Memory oversubscription benchmark

Apologies for the delay in responding.
We don’t have Visual Studio project files for this sample, but it should be straightforward to convert the provided Makefile into a VS project. Worth mentioning that many of the Unified Memory features discussed here, such as on-demand paging and oversubscription, are not available on Windows; these limitations are documented here.
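
As a quick way to check this on a given machine (a small helper of my own, not part of the sample), you can query cudaDevAttrConcurrentManagedAccess; it is reported as 0 on Windows and on pre-Pascal GPUs, meaning GPU page faulting and oversubscription are not available.

#include <cstdio>
#include <cuda_runtime.h>

int main() {
    int dev = 0, concurrent = 0;
    cudaGetDevice(&dev);
    // 1 = full Unified Memory with GPU page faulting (on-demand paging, oversubscription);
    // 0 = limited Unified Memory, as on Windows or pre-Pascal GPUs.
    cudaDeviceGetAttribute(&concurrent, cudaDevAttrConcurrentManagedAccess, dev);
    printf("concurrentManagedAccess = %d\n", concurrent);
    return 0;
}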

The zero-copy performance of this microbenchmark on Grace Hopper is limited to about 50 GB/s, which is roughly 1/10th of the available LPDDR5 bandwidth. What could be the reason for this?

./uvm_oversubs -p 2 -a streaming -m zero_copy

Read,Zero_copy,streaming,2.000000,2MB,blocksize=128,loop_count=3, 3698.625732 ms, 51.100498 GB/s
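
For reference, here is a rough sketch of the zero-copy access pattern that mode measures, as I understand it from the blog post (this is not the benchmark's actual code): the pages are kept resident in host memory and the GPU reads them remotely over the interconnect instead of migrating them.

#include <cuda_runtime.h>

// Read every element without migrating pages to the GPU.
__global__ void read_kernel(const float *data, size_t n, float *sink) {
    size_t i = blockIdx.x * (size_t)blockDim.x + threadIdx.x;
    if (i < n && data[i] < 0.0f) *sink = data[i];  // branch never taken for this data; prevents dead-code elimination
}

int main() {
    const size_t n = 1ull << 28;                   // 2^28 floats = 1 GiB
    int dev = 0;
    cudaGetDevice(&dev);

    float *data = nullptr, *sink = nullptr;
    cudaMallocManaged(&data, n * sizeof(float));
    cudaMallocManaged(&sink, sizeof(float));
    for (size_t i = 0; i < n; ++i) data[i] = 1.0f; // populate on the host

    // Pin the pages to the CPU and map them into the GPU, so kernel accesses are
    // remote (zero-copy) reads over the interconnect rather than page migrations.
    cudaMemAdvise(data, n * sizeof(float), cudaMemAdviseSetPreferredLocation, cudaCpuDeviceId);
    cudaMemAdvise(data, n * sizeof(float), cudaMemAdviseSetAccessedBy, dev);

    read_kernel<<<(unsigned)((n + 255) / 256), 256>>>(data, n, sink);
    cudaDeviceSynchronize();

    cudaFree(sink);
    cudaFree(data);
    return 0;
}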