CUDA non-default stream synchronization

Robert_Crovella · October 28, 2024, 6:24pm

I believe striker159 has addressed your item a. For your item b, my suggestion would be to check the concurrentManagedAccess device property, before trying to access a managed allocation from host code, without an intervening cudaDeviceSynchronize, after 1 or more kernel launches. If the concurrent managed access property is false, then what you are trying to do is expected to seg fault. For discrete GPUs, this would typically be true on linux for a pascal or newer GPU, but false on maxwell or on windows. Jetson devices have a somewhat different footprint, I believe.

Topic		Replies	Views
Fail to sync the cudaMemcpyAsync using the cudaEvent in two streams CUDA Programming and Performance	4	244	April 1, 2024
cudaMemcpyAsync, unexpected behaviour while using cudaStreamNonBlocking? CUDA Programming and Performance	6	2069	May 29, 2018
CUDA streams and error handling CUDA Programming and Performance	11	2400	December 14, 2023
Streams Problem CUDA Programming and Performance	2	4658	December 7, 2008
cudaDeviceSynchronize needed between kernel launch and cudaMemcpy ? CUDA Programming and Performance	15	16266	September 29, 2017
Problem regarding data transfer overlap between multiple asynchronous streams CUDA Programming and Performance	8	799	September 11, 2016
performance problem CUDA Programming and Performance	2	609	July 16, 2018
Problems with streams CUDA Programming and Performance	4	1077	September 30, 2010
Cannot get any stream parallelism. CUDA Programming and Performance	13	1282	December 31, 2019
QUIT CUDA? Kernel and pinned memory gives strange results CUDA Programming and Performance	6	6717	September 22, 2011

CUDA non-default stream synchronization

Related topics