Problem when launching many CUDA processes at the same time

laurenluckiez · October 22, 2019, 9:58pm

Hi,

I am trying to launch many CUDA processes at the same time (I know they cannot actually run concurrently).

What I observe is that with many processes (for example, 32 launched at once), 25 of them run correctly (they return the correct result), but the other 7 finish almost immediately and return a wrong result.

I do not expect the 32 CUDA processes to run concurrently or quickly, but why do 7 of the 32 fail to run correctly? Whenever I launch the 32 processes together, it is always 7 of them that do not run correctly.

Could you please explain why?
Thanks in advance!

Robert_Crovella · October 22, 2019, 10:47pm

You may be running out of memory. The GPU concurrency model in DEFAULT compute mode is to allow all processes to run, but kernel activity must be serialized/time-sliced/context-switched. However, all processes may attempt to allocate GPU memory, and if any such allocation fails, the process will likely produce incorrect results.
There is a limit to the number of concurrent processes that can be run on a GPU, which is basically limited by the number of concurrent contexts. AFAIK this limit is unpublished.

https://devtalk.nvidia.com/default/topic/1030080/is-there-a-maximum-number-of-contexts-per-gpu-encoded-into-the-driver-/
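
One way to test the out-of-memory hypothesis is to have each process report how much device memory is free just before it allocates. The following is only a minimal sketch: the 256 MiB request size is a hypothetical placeholder for whatever the real application allocates, and the free figure is a snapshot that other processes can change at any moment.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

int main() {
    // Ask the runtime how much device memory is free right now.
    size_t free_bytes = 0, total_bytes = 0;
    cudaError_t err = cudaMemGetInfo(&free_bytes, &total_bytes);
    if (err != cudaSuccess) {
        fprintf(stderr, "cudaMemGetInfo failed: %s\n", cudaGetErrorString(err));
        return 1;
    }
    printf("free: %zu MiB, total: %zu MiB\n", free_bytes >> 20, total_bytes >> 20);

    // Hypothetical request size; substitute the application's real allocation.
    const size_t request = size_t(256) << 20;
    void* buf = nullptr;
    err = cudaMalloc(&buf, request);
    if (err != cudaSuccess) {
        // With many processes sharing one GPU, this is where an
        // out-of-memory condition would be reported.
        fprintf(stderr, "cudaMalloc of %zu bytes failed: %s\n",
                request, cudaGetErrorString(err));
        return 1;
    }

    cudaFree(buf);
    return 0;
}
```

If the failing processes print an allocation error here while the others succeed, memory exhaustion is the likely cause. Running `nvidia-smi` while the processes are active gives a similar per-process view from outside the application.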

If any of this is happening, and you are unaware of it, it means you are not doing proper CUDA error checking. I always encourage people to do proper CUDA error checking, especially when having trouble with a CUDA code, preferably before asking others for help.
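
As a rough illustration of what proper CUDA error checking can look like, here is a minimal sketch. The macro name `CUDA_CHECK` and the `scale` kernel are hypothetical and chosen only for this example; the runtime calls themselves (`cudaGetLastError`, `cudaDeviceSynchronize`, `cudaGetErrorString`, and so on) are standard CUDA runtime API functions.

```cuda
#include <cstdio>
#include <cstdlib>
#include <cuda_runtime.h>

// Hypothetical helper macro (not part of the CUDA toolkit): evaluates a
// runtime API call, and on failure prints the error with its location
// and exits with a nonzero status.
#define CUDA_CHECK(call)                                                  \
    do {                                                                  \
        cudaError_t err_ = (call);                                        \
        if (err_ != cudaSuccess) {                                        \
            fprintf(stderr, "CUDA error %s at %s:%d: %s\n",               \
                    cudaGetErrorName(err_), __FILE__, __LINE__,           \
                    cudaGetErrorString(err_));                            \
            exit(EXIT_FAILURE);                                           \
        }                                                                 \
    } while (0)

// Placeholder kernel: multiplies each element by a factor.
__global__ void scale(float* data, int n, float factor) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) data[i] *= factor;
}

int main() {
    const int n = 1 << 20;
    float* d_data = nullptr;

    // Allocation is checked, so an out-of-memory failure is reported
    // instead of leaving d_data as a null pointer.
    CUDA_CHECK(cudaMalloc(&d_data, n * sizeof(float)));
    CUDA_CHECK(cudaMemset(d_data, 0, n * sizeof(float)));

    scale<<<(n + 255) / 256, 256>>>(d_data, n, 2.0f);
    CUDA_CHECK(cudaGetLastError());       // catches launch errors
    CUDA_CHECK(cudaDeviceSynchronize());  // catches errors during execution

    CUDA_CHECK(cudaFree(d_data));
    printf("all CUDA calls succeeded\n");
    return 0;
}
```

With checks like these in place, a process that cannot allocate memory or create a context stops with an explicit message and a nonzero exit code, rather than finishing quickly and returning a wrong result.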
