I am running an old (2013) CUDA-based Windows command line tool for research. Here are the recommended settings according to the manual:
===== Recommended platform, software version and hardware for GBOOST 2.0=====================
CPU: Intel(R) Core™ i5-4570 CPU 3.20 GHz
Main Memory: 4 GB or above
Graphics Card: Nvidia GTX 580
Operation System: Windows 10 x64
CUDA Driver: 7.5 or above
I am running on a Nvidia GTX 1050 Ti. Running this program (GBOOST 2.0), I receive the following error:
There is no device supporting CUDA 1.x.
Unable to initialize CUDA
I have done the following to troubleshoot this error:
-
First install latest driver for the graphics card (v442.50), then install CUDA while unchecking the box for driver installation, as demonstrated in this thread (https://devtalk.nvidia.com/default/topic/1045793/cuda-setup-and-installation/cuda-9-0-installation-failed-/)
-
Build and run deviceQuery. This appears to produce a normal output.
C:\ProgramData\NVIDIA Corporation\CUDA Samples\v10.2\bin\win64\Debug>deviceQuery.exe
deviceQuery.exe Starting...
CUDA Device Query (Runtime API) version (CUDART static linking)
Detected 1 CUDA Capable device(s)
Device 0: "GeForce GTX 1050 Ti"
CUDA Driver Version / Runtime Version 10.2 / 10.2
CUDA Capability Major/Minor version number: 6.1
Total amount of global memory: 4096 MBytes (4294967296 bytes)
( 6) Multiprocessors, (128) CUDA Cores/MP: 768 CUDA Cores
GPU Max Clock rate: 1392 MHz (1.39 GHz)
Memory Clock rate: 3504 Mhz
Memory Bus Width: 128-bit
L2 Cache Size: 1048576 bytes
Maximum Texture Dimension Size (x,y,z) 1D=(131072), 2D=(131072, 65536), 3D=(16384, 16384, 16384)
Maximum Layered 1D Texture Size, (num) layers 1D=(32768), 2048 layers
Maximum Layered 2D Texture Size, (num) layers 2D=(32768, 32768), 2048 layers
Total amount of constant memory: zu bytes
Total amount of shared memory per block: zu bytes
Total number of registers available per block: 65536
Warp size: 32
Maximum number of threads per multiprocessor: 2048
Maximum number of threads per block: 1024
Max dimension size of a thread block (x,y,z): (1024, 1024, 64)
Max dimension size of a grid size (x,y,z): (2147483647, 65535, 65535)
Maximum memory pitch: zu bytes
Texture alignment: zu bytes
Concurrent copy and kernel execution: Yes with 2 copy engine(s)
Run time limit on kernels: Yes
Integrated GPU sharing Host Memory: No
Support host page-locked memory mapping: Yes
Alignment requirement for Surfaces: Yes
Device has ECC support: Disabled
CUDA Device Driver Mode (TCC or WDDM): WDDM (Windows Display Driver Model)
Device supports Unified Addressing (UVA): Yes
Device supports Compute Preemption: Yes
Supports Cooperative Kernel Launch: No
Supports MultiDevice Co-op Kernel Launch: No
Device PCI Domain ID / Bus ID / location ID: 0 / 1 / 0
Compute Mode:
< Default (multiple host threads can use ::cudaSetDevice() with device simultaneously) >
deviceQuery, CUDA Driver = CUDART, CUDA Driver Version = 10.2, CUDA Runtime Version = 10.2, NumDevs = 1
Result = PASS
What should I try next? I am confused by the error referring to CUDA 1.x when the recommended settings for the software are CUDA 7.5 or higher. Why does the error message refer to CUDA 1.x and is this something wrong with my setup? If not, then what is wrong? Thank you for the help!