I’m working on a real-time system, which means I need control over the memory that gets allocated, thread priorities, and thread affinities. I’ve run into a few issues that maybe you guys have seen.
I’m running on the Red Hat 5.8 real-time distribution.
libcuda.so spawns one worker thread per GPU I use. Does anybody have any idea how to set affinities/priorities on these worker threads? Is there something in the API I am missing?
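Right now the only workaround I can think of is to walk /proc/self/task after CUDA has initialized and pin anything I didn’t create myself. Here’s a rough sketch of what I mean (the my_tids bookkeeping and the choice of CPU/priority are placeholders, and this assumes g++ on Linux so the CPU_SET macros are available) -- I’d love to hear if there’s a proper API for this instead:

#include <sys/types.h>
#include <dirent.h>
#include <sched.h>
#include <cstdlib>
#include <set>

// Pin every thread in this process that we did not create ourselves to 'cpu'
// and give it a low real-time priority. 'my_tids' holds the TIDs of the
// threads the application created (recorded via syscall(SYS_gettid) at
// thread start) -- that bookkeeping is up to the application.
void pin_unknown_threads( const std::set<pid_t>& my_tids, int cpu )
{
    DIR* dir = opendir( "/proc/self/task" );
    if ( !dir ) return;

    while ( dirent* entry = readdir( dir ) )
    {
        if ( entry->d_name[0] == '.' ) continue;            // skip "." and ".."
        pid_t tid = static_cast<pid_t>( std::atol( entry->d_name ) );
        if ( my_tids.count( tid ) ) continue;                // leave our own threads alone

        cpu_set_t mask;
        CPU_ZERO( &mask );
        CPU_SET( cpu, &mask );
        sched_setaffinity( tid, sizeof( mask ), &mask );     // confine the worker thread

        sched_param param;
        param.sched_priority = 1;                            // lowest SCHED_FIFO priority
        sched_setscheduler( tid, SCHED_FIFO, &param );       // per-thread on Linux
    }
    closedir( dir );
}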
When I call cudaSetDevice( 0 ), for example, my OS reports that my process has 100+ GB of virtual memory reserved! That seems like a lot, and since I’m using multiple devices it tends to climb to 200 or 300 GB. I’ve never seen it actually paged into physical memory, but it is frightening that some worst-case scenario could trigger that without me knowing about it. Has anyone seen this, or does anyone know why it happens?
#include <iostream>
#include "cuda_runtime_api.h"

int main( int argc, char* argv[] )
{
    // Initializing the runtime on device 0 is all it takes to trigger the
    // large virtual address reservation.
    cudaError_t err = cudaSetDevice( 0 );
    std::cout << "cudaSetDevice returned: " << cudaGetErrorString( err ) << std::endl;
    std::cout << "Break here and check the OS-reported memory usage" << std::endl;
    return 0;
}
Before cudaSetDevice I have in the neighborhood of 300 MB of virtual memory allocated; after cudaSetDevice, 107 GB is reported.
This is outside my area of expertise, but as far as I am aware, in order to create a unified virtual address space across all CPUs and all GPUs in the system, the driver needs to reserve enough virtual address space to map all of the host’s system memory plus the memory of all attached GPUs.
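If you want to sanity-check that theory on your box, something like this rough sketch (just /proc parsing plus cudaGetDeviceProperties; the exact accounting is an assumption on my part) puts VmSize and “host RAM + total GPU memory” side by side. The numbers won’t match exactly, but they should be in the same ballpark:

#include <cstdio>
#include <cstring>
#include <iostream>
#include "cuda_runtime_api.h"

// Read a field like "VmSize:" or "MemTotal:" from a /proc file (value in kB).
static long read_kb_field( const char* path, const char* field )
{
    FILE* f = std::fopen( path, "r" );
    if ( !f ) return -1;
    char line[256];
    long kb = -1;
    while ( std::fgets( line, sizeof( line ), f ) )
    {
        if ( std::strncmp( line, field, std::strlen( field ) ) == 0 )
        {
            std::sscanf( line + std::strlen( field ), "%ld", &kb );
            break;
        }
    }
    std::fclose( f );
    return kb;
}

int main()
{
    cudaSetDevice( 0 );   // triggers driver initialization and the big reservation

    long vm_kb  = read_kb_field( "/proc/self/status", "VmSize:" );
    long ram_kb = read_kb_field( "/proc/meminfo",     "MemTotal:" );

    int count = 0;
    cudaGetDeviceCount( &count );
    long gpu_kb = 0;
    for ( int i = 0; i < count; ++i )
    {
        cudaDeviceProp prop;
        cudaGetDeviceProperties( &prop, i );
        gpu_kb += static_cast<long>( prop.totalGlobalMem / 1024 );
    }

    std::cout << "VmSize:             " << vm_kb / 1024 / 1024 << " GB" << std::endl;
    std::cout << "Host RAM + GPU mem: " << ( ram_kb + gpu_kb ) / 1024 / 1024 << " GB" << std::endl;
    return 0;
}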
That never even crossed my mind! That’s a great thought. I wonder, is there a way to turn that “off”? I know UVA hasn’t been around forever in CUDA, so maybe there’s a knob somewhere that I can find. I guess it’s time to do some documentation spelunking.
I don’t know of a way to turn this off, nor do I think this would make much sense. UVA was the first step in creating a seamless heterogeneous computing platform, and has been around for at least 3.5 years:
You’re right, of course. I believe I rely on UVA to allow P2P transfers, so turning it off would make no sense for me anyway.
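For context, this is roughly the dependency I mean; the device IDs 0 and 1 are just placeholders for my actual topology:

#include <iostream>
#include "cuda_runtime_api.h"

int main()
{
    // P2P copies go through the unified address space, so check that UVA is
    // active on the device and that the two devices can reach each other.
    cudaDeviceProp prop;
    cudaGetDeviceProperties( &prop, 0 );
    std::cout << "Device 0 UVA enabled: " << prop.unifiedAddressing << std::endl;

    int canAccess = 0;
    cudaDeviceCanAccessPeer( &canAccess, 0, 1 );
    if ( canAccess )
    {
        cudaSetDevice( 0 );
        cudaDeviceEnablePeerAccess( 1, 0 );   // enable 0 -> 1; flags must be 0
        // From here, cudaMemcpyPeer() (or a plain cudaMemcpy between the two
        // devices' allocations) can go directly over the bus.
    }
    return 0;
}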
As for why it matters: I, the GPU developer, understand that it probably doesn’t. My customer, on the other hand, only sees a massive amount of memory being allocated and fears a worst-case scenario in which the OS somehow tries to bring all of it into physical memory, causing swap thrashing and blowing the timing of the computation. I think the best course of action is customer education.
I agree that customer education is probably the best course of action here. BTW, sorry for copy/pasting the wrong link into my previous response and creating a circular reference. I have fixed the link; it now points to a thread about the large virtual allocation that includes a response from a relevant NVIDIA engineer shortly after UVA was first deployed.