Why "all CUDA-capable devices are busy or unavailable"?

luoyi · July 8, 2010, 4:03pm

Thinkpad T410 RT9

Core i5 520M + 4G Memory + NVS 3100M

ArchLinux x86_64

cuda-sdk & cuda-toolkit installed

I can execute deviceQuery to get the following info:

[a@b vectorAdd]$ /usr/share/cuda-sdk/C/bin/deviceQuery

/usr/share/cuda-sdk/C/bin/deviceQuery Starting...

CUDA Device Query (Runtime API) version (CUDART static linking)

There is 1 device supporting CUDA

Device 0: "NVS 3100M"

CUDA Driver Version: 3.10

CUDA Runtime Version: 3.10

CUDA Capability Major revision number: 1

CUDA Capability Minor revision number: 2

Total amount of global memory: 267714560 bytes

Number of multiprocessors: 2

Number of cores: 16

Total amount of constant memory: 65536 bytes

Total amount of shared memory per block: 16384 bytes

Total number of registers available per block: 16384

Warp size: 32

Maximum number of threads per block: 512

Maximum sizes of each dimension of a block: 512 x 512 x 64

Maximum sizes of each dimension of a grid: 65535 x 65535 x 1

Maximum memory pitch: 2147483647 bytes

Texture alignment: 256 bytes

Clock rate: 1.47 GHz

Concurrent copy and execution: Yes

Run time limit on kernels: Yes

Integrated: No

Support host page-locked memory mapping: Yes

Compute mode: Default (multiple host threads can use this device simultaneously)

Concurrent kernel execution: No

Device has ECC support enabled: No

deviceQuery, CUDA Driver = CUDART, CUDA Driver Version = 3.10, CUDA Runtime Version = 3.10, NumDevs = 1, Device = NVS 3100M

But when I run a CUDA program, it fails with the error message in the title.

Can anybody help with this?

Simon_Green · July 8, 2010, 4:43pm

Have you set the device to prohibited or exclusive mode with nvidia-smi for some reason?

CUDA Toolkit Documentation: http://developer.download.nvidia.com/compu...6292f1ffbe.html

luoyi · July 9, 2010, 2:31am

The deviceQuery program tells me:

Compute mode: Default (multiple host threads can use this device simultaneously)

You can see this in my original post.

Or do you think deviceQuery's result is not reliable?
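
For anyone comparing their own setup against the output above, here is a minimal Python sketch that pulls the compute mode and the driver/runtime versions out of saved deviceQuery text. The function name `parse_device_query` and the sample string are hypothetical; the field labels are copied from the output posted in this thread and may differ in other SDK versions. It only reads what deviceQuery printed, so it cannot tell whether a CUDA program will actually be able to use the device.

```python
import re

def parse_device_query(text):
    """Extract selected 'label: value' fields from deviceQuery output."""
    wanted = ("CUDA Driver Version", "CUDA Runtime Version", "Compute mode")
    fields = {}
    for line in text.splitlines():
        # Property lines look like "  Label:   value"; anything else is skipped.
        match = re.match(r"\s*([^:]+):\s*(.+?)\s*$", line)
        if match and match.group(1).strip() in wanted:
            fields[match.group(1).strip()] = match.group(2)
    return fields

# Sample lines copied from the deviceQuery output in the first post.
sample = """\
CUDA Driver Version: 3.10
CUDA Runtime Version: 3.10
Compute mode: Default (multiple host threads can use this device simultaneously)
"""
info = parse_device_query(sample)
print(info["Compute mode"])
print(info["CUDA Driver Version"] == info["CUDA Runtime Version"])
```

A "Default" compute mode together with matching driver and runtime versions, as in this thread, suggests the error is not caused by an exclusive or prohibited mode setting.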

luoyi · July 9, 2010, 2:19pm

After I restarted my machine, the app ran normally. Looks like a bug in the driver?

llukas · July 17, 2010, 6:47pm

If you use KDE, try suspending and resuming compositing in KWin.

gpufg · November 11, 2010, 11:10pm

I am experiencing this same problem. Installation of the 260.19.14 driver and cudatoolkit_3.2.12_linux_64_rhel5.5 and gpucomputingsdk_3.2.12_linux all seems fine under RHEL 5.5. deviceQuery properly shows my GTX 285, yet any application from the SDK gives an error as follows. In my case, I have restarted and reinstalled several times, and still have the same problem.

The error appears:

Running on...

Device 0: GeForce GTX 285

 Quick Mode

bandwidthTest.cu(598) : cudaSafeCall() Runtime API error : all CUDA-capable devices are busy or unavailable.

deviceQuery shows:

./deviceQuery Starting...

CUDA Device Query (Runtime API) version (CUDART static linking)

There is 1 device supporting CUDA

Device 0: "GeForce GTX 285"

  CUDA Driver Version:                           3.20

  CUDA Runtime Version:                          3.20

  CUDA Capability Major/Minor version number:    1.3

  Total amount of global memory:                 1073020928 bytes

  Multiprocessors x Cores/MP = Cores:            30 (MP) x 8 (Cores/MP) = 240 (Cores)

  Total amount of constant memory:               65536 bytes

  Total amount of shared memory per block:       16384 bytes

  Total number of registers available per block: 16384

  Warp size:                                     32

  Maximum number of threads per block:           512

  Maximum sizes of each dimension of a block:    512 x 512 x 64

  Maximum sizes of each dimension of a grid:     65535 x 65535 x 1

  Maximum memory pitch:                          2147483647 bytes

  Texture alignment:                             256 bytes

  Clock rate:                                    1.48 GHz

  Concurrent copy and execution:                 Yes

  Run time limit on kernels:                     Yes

  Integrated:                                    No

  Support host page-locked memory mapping:       Yes

  Compute mode:                                  Default (multiple host threads can use this device simultaneously)

  Concurrent kernel execution:                   No

  Device has ECC support enabled:                No

  Device is using TCC driver mode:               No

deviceQuery, CUDA Driver = CUDART, CUDA Driver Version = 3.20, CUDA Runtime Version = 3.20, NumDevs = 1, Device = GeForce GTX 285

PASSED

And nvidia-smi shows:

nvidia-smi -a

==============NVSMI LOG==============

Timestamp                       : Thu Nov 11 15:08:03 2010

Driver Version                  : 260.19.14

GPU 0:

        Product Name            : GeForce GTX 285

        PCI Device/Vendor ID    : 5e310de

        PCI Location ID         : 0:3:0

        Display                 : Connected

        Temperature             : 45 C

        Fan Speed               : 40%

        Utilization

            GPU                 : 0%

            Memory              : 1%

Any suggestions?

Thanks,

G

gpufg · November 11, 2010, 11:49pm

I have now uninstalled the 3.2 driver, toolkit, and SDK and installed the 3.0 versions, which I have been using successfully on another similar machine with a GTX285. Interestingly, the results are basically identical. deviceQuery still shows the card, as does nvidia-smi, but programs still fail with:

Running on...

Device 0: GeForce GTX 285

 Quick Mode

bandwidthTest.cu(600) : cudaSafeCall() Runtime API error : no CUDA-capable device is available.

gpufg · November 18, 2010, 7:24pm

This issue was finally solved by updating RHEL and installing driver 260.19.21 and toolkit 3.2.16. Not sure if one or the other or both was needed. Disappointed by lack of response from NVIDIA on this issue, but glad to have it resolved.

h_corey · November 23, 2010, 2:53am

With the update to the .21 driver and .16 toolkit, I am seeing this result:

cudaSafeCall() Runtime API error : all CUDA-capable devices are busy or unavailable.

Any ideas?

Mitsu_DunDee · November 23, 2010, 4:33pm

Same here (HP EliteBook 8730w, Quadro 2700M, Fedora Core 13, NVIDIA proprietary dev driver, .21 driver, .16 toolkit),

one launch and it is over :p

nuntius · November 24, 2010, 8:55pm

I just fought through this error. The following procedure seemed to fix it; I suspect the problem starts when contexts don’t close properly (e.g. a program segfaults, kill -9, etc.).

nvidia-smi -g 0 -c 1

nvidia-smi -g 0 -c 0

You may need to run some cuda code between the commands (I did, but haven’t replicated the problem yet to test whether it was necessary). I suspect the “-c 1” (exclusive mode) has some cleanup for contexts. Switching in and out of “-c 2” had no effect (other than to quickly abort all cuda code).
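
The toggle described above can be scripted. This is a minimal Python sketch, not a confirmed fix: the helper names `mode_toggle_commands` and `run_mode_toggle` are made up for illustration, and the `-g`/`-c` flags are copied verbatim from the commands in this post (other nvidia-smi releases may spell them differently). By default it only prints the commands; actually changing the compute mode typically needs root.

```python
import subprocess

def mode_toggle_commands(gpu_index=0):
    """Build the two nvidia-smi calls from the post: exclusive (1), then default (0)."""
    return [["nvidia-smi", "-g", str(gpu_index), "-c", str(mode)] for mode in (1, 0)]

def run_mode_toggle(gpu_index=0, dry_run=True):
    """Print each command; execute it only when dry_run is False."""
    for cmd in mode_toggle_commands(gpu_index):
        print(" ".join(cmd))
        if not dry_run:
            # check=True raises CalledProcessError if nvidia-smi exits non-zero.
            subprocess.run(cmd, check=True)

run_mode_toggle()  # dry run: only prints the two commands
```

Pass `dry_run=False` to execute the commands, and change `gpu_index` if the card is not GPU 0.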

h_corey · November 27, 2010, 5:05am

No dice. Changing the mode doesn’t help, whether or not I run CUDA code in between.

I did notice that when I run

nvidia-smi -L

it comes back blank. Is that normal?

I am on Ubuntu 10.04 with a 9600 GT. I got one launch, fully functional. I really don’t want to reinstall the driver every time I boot.

nuntius · November 27, 2010, 6:41am

Well, it seemed to fix my card; so I was hoping it was repeatable.

nvidia-smi -L may be blank, but nvidia-smi -a -L should show your card. If it doesn’t say “GPU 0” (unlikely), then you need to change the number after the -g.
