cudaMemPrefetchAsync returns cudaErrorInvalidDevice

rob.lewis · September 20, 2018, 10:17pm

OK, it’s now September, I’m using 9.2, and just ran into this problem.
Why are nVidia sooo Windows belligerent?

Yuki_Ni · November 15, 2021, 7:22am

Thanks for reporting this. After checking with our engineers, we believe this is expected behavior. Please see CUDA Runtime API :: CUDA Toolkit Documentation:
Passing in cudaCpuDeviceId for dstDevice will prefetch the data to host memory. If dstDevice is a GPU, then the device attribute cudaDevAttrConcurrentManagedAccess must be non-zero. Additionally, stream must be associated with a device that has a non-zero value for the device attribute cudaDevAttrConcurrentManagedAccess.

Doing cudaMemPrefetchAsync on managed memory requires support for “cudaDevAttrConcurrentManagedAccess”.

Currently, this is supported only on Linux. We suggest checking for the above device attribute before prefetching managed memory, like this:

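int* data = nullptr;
size_t len = 10 * sizeof(int);  // size in bytes: room for 10 ints
int featureSupported = 0;

// CHECK_RT is assumed to be an error-checking macro defined elsewhere.
CHECK_RT(cudaMallocManaged(reinterpret_cast<void **>(&data), len, cudaMemAttachGlobal));
// Ask device 0 whether it supports concurrent managed access.
CHECK_RT(cudaDeviceGetAttribute(&featureSupported, cudaDevAttrConcurrentManagedAccess, 0));

if (featureSupported) {
    // Prefetch to device 0 on the default stream only when supported.
    CHECK_RT(cudaMemPrefetchAsync(data, len, 0, 0));
}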
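
For anyone who wants to try this end to end, here is a minimal self-contained sketch of the same check. It rests on a few assumptions that are not from the post: the CHECK_RT definition is a hypothetical stand-in (the post does not define the macro), the increment kernel and the buffer size are made up for illustration, and the four-argument cudaMemPrefetchAsync form shown in the post is used, which newer toolkits may have changed. When the attribute is zero, the prefetch is skipped and the kernel still runs, with the driver migrating the managed memory as needed.

```cuda
#include <cstdio>
#include <cstdlib>
#include <cuda_runtime.h>

// Hypothetical stand-in for CHECK_RT, which the post uses but does not define:
// print the CUDA error string and exit on any failing runtime call.
#define CHECK_RT(call)                                                   \
    do {                                                                 \
        cudaError_t err_ = (call);                                       \
        if (err_ != cudaSuccess) {                                       \
            std::fprintf(stderr, "%s:%d: %s\n", __FILE__, __LINE__,      \
                         cudaGetErrorString(err_));                      \
            std::exit(EXIT_FAILURE);                                     \
        }                                                                \
    } while (0)

// Illustrative kernel (not from the post): add 1 to every element.
__global__ void increment(int* data, size_t n) {
    size_t i = blockIdx.x * static_cast<size_t>(blockDim.x) + threadIdx.x;
    if (i < n) {
        data[i] += 1;
    }
}

int main() {
    const int device = 0;                    // assumed target GPU
    const size_t n = 1 << 20;                // illustrative element count
    const size_t bytes = n * sizeof(int);    // sizes are given in bytes
    int* data = nullptr;
    int concurrentAccess = 0;

    CHECK_RT(cudaSetDevice(device));
    CHECK_RT(cudaMallocManaged(&data, bytes, cudaMemAttachGlobal));

    // Initialize on the host before any kernel touches the buffer.
    for (size_t i = 0; i < n; ++i) {
        data[i] = 0;
    }

    CHECK_RT(cudaDeviceGetAttribute(&concurrentAccess,
                                    cudaDevAttrConcurrentManagedAccess,
                                    device));

    if (concurrentAccess) {
        // Supported: move the pages to the GPU ahead of the kernel launch.
        CHECK_RT(cudaMemPrefetchAsync(data, bytes, device, 0));
    } else {
        // Not supported: skip the prefetch instead of hitting
        // cudaErrorInvalidDevice.
        std::printf("Concurrent managed access not supported; "
                    "skipping prefetch.\n");
    }

    const unsigned int threads = 256;
    const unsigned int blocks =
        static_cast<unsigned int>((n + threads - 1) / threads);
    increment<<<blocks, threads>>>(data, n);
    CHECK_RT(cudaGetLastError());
    CHECK_RT(cudaDeviceSynchronize());  // required before the host reads data

    std::printf("data[0] = %d\n", data[0]);
    CHECK_RT(cudaFree(data));
    return 0;
}
```

Treating the prefetch as an optional hint keeps the same source building and running whether or not the platform reports the attribute.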

Hope this answers your question.

Best,
Yuki
