Finding Idle GPU in Multi-GPU System

ertyu · December 19, 2007, 8:56pm

A couple issues I’m trying to solve.

A) If I want to use the full resources of a multi-gpu system, is there any way to determine dynamically which GPUs I’m already using without explicitly tracking device id?

i.e. start one thread, it picks the first available device and executes, a second thread starts, how can it pick the next gpu

Ideally it would be nice to launch a thread and have it scheduled to an idle device.

B) Similar idea, can I track which device is being used as a primary display and avoid using it?

jimh · December 20, 2007, 6:30pm

AFAIK, you’ll need to write your own code to track GPU use and allocation. I’ve been designing something to do just this - it’s not too tough.

Create an array (one element per GPU) that tracks GPU use. An array of booleans would be enough. Provide “getGPU()” and “releaseGPU()” functions that your threads can call as needed. The functions update the array and call cudaSetDevice() and cudaThreadExit(). Make sure the functions are protected by a common mutex to avoid race conditions.

You need to think about how to handle the condition when all GPUs are in use. My design blocks the calling thread until a GPU is available, but your code may have different needs.

I’m not sure how you’d determine which device is the primary display - I’m working on a headless system. Maybe someone else can help you there.

tachyon_john · December 21, 2007, 5:18am

Hi,

I filed a CUDA feature request (# 298834) along these lines many moons ago, but I’m not sure where it stands. It’s undoubtedly one of hundreds of such feature requests. I’m sure the NVIDIA staff will get to this at some point when they’ve dealt with more pressing issues and more highly-requested features.

Cheers,

John Stone

Topic		Replies	Views
Multiple GPUs: finding one that's not busy CUDA Programming and Performance	3	1986	September 3, 2008
Dynamically find the next available GPU during run-time? CUDA Programming and Performance	3	439	October 12, 2021
Multithread - determine GPU currently in use CUDA Programming and Performance	1	4890	January 28, 2012
How to do GPU allocation in N GPU + M process env CUDA Programming and Performance	6	7608	October 10, 2008
best way to tell which GPU is free in a Multi-GPU server? CUDA Programming and Performance	1	699	February 16, 2015
device in use How to detect the device is in use CUDA Programming and Performance	2	4076	June 16, 2010
Different index definition in nvml & CUDA runtime? CUDA Programming and Performance	6	2652	March 5, 2015
MultiGPU Thread Dependent? CUDA Programming and Performance	3	2310	February 12, 2010
How to get fastest free GPU cutGetMaxGflopsDeviceId always same device CUDA Programming and Performance	3	3222	June 22, 2012
cudaSetDevice : overrides the auto select free gpu feature in cuda CUDA Programming and Performance	2	1139	February 6, 2015

Finding Idle GPU in Multi-GPU System

Related topics