Can I use GPU IDs enumerated from NVAPI to create a CUDA device? I am hoping I can use the result of NvAPI_EnumPhysicalGPUs to get a device handle via cuDeviceGet. I need to be able to target a specific GPU for computation while not in SLI mode.
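Roughly, this is the kind of pairing I was hoping for (just a sketch of the intent; it assumes the NVAPI enumeration order and the CUDA device ordinals line up, which is exactly what I am asking about):

```cpp
#include <cstdio>
#include <cuda.h>   // CUDA driver API: cuInit, cuDeviceGet, ...
#include <nvapi.h>  // NVAPI (Windows): NvAPI_EnumPhysicalGPUs, ...

int main()
{
    // Enumerate physical GPUs through NVAPI.
    NvAPI_Initialize();
    NvPhysicalGpuHandle gpus[NVAPI_MAX_PHYSICAL_GPUS];
    NvU32 gpuCount = 0;
    NvAPI_EnumPhysicalGPUs(gpus, &gpuCount);

    // Initialize the CUDA driver API.
    cuInit(0);

    // Hoped-for (but not guaranteed) pairing: NVAPI index i == CUDA ordinal i.
    for (NvU32 i = 0; i < gpuCount; ++i) {
        CUdevice dev;
        if (cuDeviceGet(&dev, (int)i) == CUDA_SUCCESS) {
            char name[256];
            cuDeviceGetName(name, sizeof(name), dev);
            printf("NVAPI GPU %u -> CUDA device %d (%s)?\n",
                   (unsigned)i, (int)dev, name);
        }
    }
    return 0;
}
```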
I do not believe this is the case; CUDA device IDs are CUDA-specific because we reserve the right to reorder them at our sole discretion (as we did with 2.1).
“Frame Rendering - Ability to control Video and DX rendering not available in DX runtime.”
What’s that?
“GPU Topology - Ability to enable SLI and Hybrid GPU topologies.”
Can that be used by a CUDA app to temporarily control SLI?
“GPU Management - Enumeration of physical and logical GPUs. Thermal and Cooling controls.”
I suppose this is very useful for monitoring/controlling temperatures and fans to ensure reliable computation. (High temps == bit errors.) Does such a thing also exist on Linux? Does it work with Tesla?
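For reference, on Windows I would expect the thermal query to look roughly like this (a sketch based on my reading of the NVAPI headers; the exact struct fields and the NVAPI_THERMAL_TARGET_ALL constant are assumptions to check against the SDK version you have):

```cpp
#include <cstdio>
#include <nvapi.h>  // NVAPI (Windows only)

int main()
{
    NvAPI_Initialize();

    NvPhysicalGpuHandle gpus[NVAPI_MAX_PHYSICAL_GPUS];
    NvU32 gpuCount = 0;
    NvAPI_EnumPhysicalGPUs(gpus, &gpuCount);

    for (NvU32 i = 0; i < gpuCount; ++i) {
        // Ask for all thermal sensors on this physical GPU.
        NV_GPU_THERMAL_SETTINGS thermal = { 0 };
        thermal.version = NV_GPU_THERMAL_SETTINGS_VER;
        if (NvAPI_GPU_GetThermalSettings(gpus[i], NVAPI_THERMAL_TARGET_ALL,
                                         &thermal) == NVAPI_OK) {
            for (NvU32 s = 0; s < thermal.count; ++s)
                printf("GPU %u sensor %u: %d C\n",
                       (unsigned)i, (unsigned)s,
                       (int)thermal.sensor[s].currentTemp);
        }
    }
    return 0;
}
```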
How does high temperature influence bit error probability?
Regarding CUDA device enumeration:
I can use cuD3D9GetDevice() to retrieve the CUDA device number, as long as an adapter identifier for the device exists. So far this approach works for video cards; Tesla cards don't have these identifiers.
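For context, the Direct3D lookup I'm using today looks roughly like this (error handling trimmed; it assumes the D3D9 adapter's DeviceName string, e.g. "\\.\DISPLAY1", is what cuD3D9GetDevice expects, which is how I read the docs):

```cpp
#include <cstdio>
#include <d3d9.h>
#include <cuda.h>
#include <cudaD3D9.h>  // driver API D3D9 interop: cuD3D9GetDevice

int main()
{
    cuInit(0);

    IDirect3D9* d3d = Direct3DCreate9(D3D_SDK_VERSION);
    UINT adapterCount = d3d->GetAdapterCount();

    for (UINT i = 0; i < adapterCount; ++i) {
        D3DADAPTER_IDENTIFIER9 id;
        if (FAILED(d3d->GetAdapterIdentifier(i, 0, &id)))
            continue;

        // Map the D3D9 adapter (identified by its device name)
        // to a CUDA device ordinal.
        CUdevice cuDev;
        if (cuD3D9GetDevice(&cuDev, id.DeviceName) == CUDA_SUCCESS)
            printf("D3D9 adapter %u (%s) -> CUDA device %d\n",
                   i, id.DeviceName, (int)cuDev);
        else
            printf("D3D9 adapter %u (%s) has no CUDA device\n",
                   i, id.DeviceName);
    }

    d3d->Release();
    return 0;
}
```

With a Tesla card there is no display adapter to enumerate, which is where this falls apart.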
Have you come across any techniques for correlating PCI device information with a CUDA device?
Very simply. The higher the temperature, the higher the probability of a bit error. (Remember people who use liquid nitrogen for overclocking? By lowering the temperature significantly, they suppress the very high error rates that extreme overclocking would induce. It’s a direct relationship.)
Incidentally, the problem with consumer cards is that they’re tuned to be quiet, and don’t spin up their fans until the card gets very hot. This is why I find this API very significant. Is there really no equivalent on Linux?
Btw, why is the normal CUDA device enumeration API insufficient for you?
To my knowledge, the nvidia-settings tool is open-source (and I know it displays the temperature), so you can just look at what it does (or otherwise use strace etc. to find out).
I am quite sure that the API it uses is public and documented, and nvclock uses it to change clock frequencies, so maybe look at the nvclock source, too.
I think it might only work when an X server is running; as I understand it, the so-called “NV-CONTROL” interface is an X protocol extension.
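If you want to poke at it directly rather than scraping nvidia-settings output, my understanding is that the NV-CONTROL client library (libXNVCtrl, shipped with the nvidia-settings sources) is used something like this; treat the header paths and the attribute name as assumptions to check against NVCtrl.h, and note that it does need a running X server:

```cpp
#include <cstdio>
#include <X11/Xlib.h>
#include <NVCtrl/NVCtrl.h>     // attribute constants (from nvidia-settings)
#include <NVCtrl/NVCtrlLib.h>  // XNVCTRLQueryAttribute()
// link with: -lXNVCtrl -lX11

int main()
{
    // NV-CONTROL is an X protocol extension, so a display connection is required.
    Display* dpy = XOpenDisplay(NULL);
    if (!dpy) {
        fprintf(stderr, "cannot open X display\n");
        return 1;
    }

    int screen = DefaultScreen(dpy);
    int temp = 0;

    // Query the GPU core temperature (degrees C) for this X screen.
    if (XNVCTRLQueryAttribute(dpy, screen, 0,
                              NV_CTRL_GPU_CORE_TEMPERATURE, &temp))
        printf("GPU core temperature: %d C\n", temp);
    else
        fprintf(stderr, "NV_CTRL_GPU_CORE_TEMPERATURE query failed\n");

    XCloseDisplay(dpy);
    return 0;
}
```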
I am writing a non-interactive GPU-computing program on top of an architecture that enumerates video cards along with information such as PCI IDs, display IDs, bus ID, slot ID, etc. The user of the application runs a GPU computation by selecting a video device provided by that architecture. With video cards I can find the corresponding CUDA device through a Direct3D device, because I have a display ID that identifies the card uniquely. Unfortunately this approach does not work with Tesla cards. Other than going the Direct3D route, I can't find any unique device information in the CUDA enumeration API.
Is this still true? No correlation between CUDA device ID and NVAPI stuff?
How fixed are CUDA device IDs vs. NVAPI IDs? I'm thinking one could do a once-off mapping on a given machine (some sort of GPU burn, maybe) to figure out which device is which, and reuse that mapping afterwards.
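Here is a minimal sketch of the once-off mapping idea using PCI bus numbers instead of a GPU burn. It assumes a driver API new enough to expose the PCI location through cuDeviceGetAttribute, and that NvAPI_GPU_GetBusId reports a matching bus number; both are assumptions worth verifying, since whether such a correlation is supported is exactly the open question here:

```cpp
#include <cstdio>
#include <cuda.h>
#include <nvapi.h>

int main()
{
    cuInit(0);
    NvAPI_Initialize();

    NvPhysicalGpuHandle gpus[NVAPI_MAX_PHYSICAL_GPUS];
    NvU32 gpuCount = 0;
    NvAPI_EnumPhysicalGPUs(gpus, &gpuCount);

    int cudaCount = 0;
    cuDeviceGetCount(&cudaCount);

    // Build the once-off map: for each NVAPI GPU, find the CUDA ordinal
    // whose PCI bus number matches.
    for (NvU32 i = 0; i < gpuCount; ++i) {
        NvU32 nvBusId = 0;
        if (NvAPI_GPU_GetBusId(gpus[i], &nvBusId) != NVAPI_OK)
            continue;

        for (int d = 0; d < cudaCount; ++d) {
            CUdevice dev;
            cuDeviceGet(&dev, d);

            int cudaBusId = -1;
            cuDeviceGetAttribute(&cudaBusId,
                                 CU_DEVICE_ATTRIBUTE_PCI_BUS_ID, dev);

            if ((NvU32)cudaBusId == nvBusId) {
                printf("NVAPI GPU %u (bus %u) -> CUDA device %d\n",
                       (unsigned)i, (unsigned)nvBusId, d);
                break;
            }
        }
    }
    return 0;
}
```

If that attribute isn't available on the toolkit in question, a cruder key such as device name plus total memory could serve, but it breaks down when identical boards are installed.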