Why does the CUDA context consume more GPU memory in a child process (started by execl) than in the parent process (e.g. 186 MB vs. 108 MB)?

Why does the CUDA context consume more GPU memory in a child process (started by execl) than in the parent process (e.g. 186 MB vs. 108 MB)?

The amount of device memory the CUDA context requires is unspecified and can differ between GPUs.


And it also depends on the versions of CUDA and the driver?

Like any other internal implementation artifact, yes.

Thanks! But why does the CUDA context consume more GPU memory in a child process (started by execl) than in the parent process?
////////////////////////////////////////////////////

Parent process (initializes CUDA): this main process alone consumes 108 MB of GPU memory.

////////////////////////////////////////////////////

Parent process
|
child process 1 (initializes CUDA): this child process (created via fork + execl) consumes 186 MB of GPU memory.

Why?

Note the “internal” designation. Companies, in any kind of business, are not in the habit of telegraphing internal design details to the world. For software, programmers get to rely on whatever is promised in the documentation, modulo documentation bugs. Everything else is a design artifact that can change at any time without notice.

How many experiments have you run to conclude that this is always the case? The amount of memory used by any kind of complex software stack is often a function of a largish number of configuration parameters. Until these experiments provide comprehensive data, I guess the strongest statement we can make at this time is that the above statement holds in one particular software context, on one particular operating system, with one particular CUDA version, with one particular GPU.


Yes, but I want to predict the GPU memory footprint of my CUDA app before it starts.
There seems to be no documentation about the GPU memory consumed by the CUDA context itself.
Anyway, I am still curious why the CUDA context consumes so much more GPU memory when CUDA is used only in the child process than when it is used only in the parent process.
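Since the context size is undocumented, one practical alternative to predicting it is measuring it empirically at startup. The sketch below is an assumption about how one might do that: it uses NVML (which needs no CUDA context) to read free device memory before and after `cuCtxCreate`. It requires an NVIDIA GPU and driver, and error checking is abbreviated:

```c
/* Rough empirical measurement of CUDA context overhead.
 * Compile (paths may vary): gcc measure_ctx.c -lcuda -lnvidia-ml */
#include <stdio.h>
#include <cuda.h>
#include <nvml.h>

int main(void) {
    nvmlInit();
    nvmlDevice_t nvml_dev;
    nvmlDeviceGetHandleByIndex(0, &nvml_dev);

    nvmlMemory_t before, after;
    /* Free device memory before this process owns a CUDA context. */
    nvmlDeviceGetMemoryInfo(nvml_dev, &before);

    cuInit(0);
    CUdevice dev;
    cuDeviceGet(&dev, 0);
    CUcontext ctx;
    cuCtxCreate(&ctx, 0, dev);   /* context creation is what allocates */

    nvmlDeviceGetMemoryInfo(nvml_dev, &after);
    printf("approx. context overhead: %llu bytes\n",
           (unsigned long long)(before.free - after.free));

    cuCtxDestroy(ctx);
    nvmlShutdown();
    return 0;
}
```

Note that the result is only approximate: other processes may allocate or free device memory between the two NVML queries, and the overhead varies with GPU, driver, and CUDA version, as discussed above.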

Thanks!