I am running an old (2013) CUDA-based Windows command line tool for research. Here are the recommended settings according to the manual:
===== Recommended platform, software version and hardware for GBOOST 2.0=====================
CPU: Intel® Core™ i5-4570 CPU 3.20 GHz
Main Memory: 4 GB or above
Graphics Card: Nvidia GTX 580
Operation System: Windows 10 x64
CUDA Driver: 7.5 or above
I am running on an Nvidia GTX 1050 Ti. When I run the program (GBOOST 2.0), I receive the following error:
There is no device supporting CUDA 1.x. Unable to initialize CUDA
I have done the following to troubleshoot this error:
First, I installed the latest driver for the graphics card (v442.50), then installed CUDA with the driver-installation box unchecked, as demonstrated in this thread (https://devtalk.nvidia.com/default/topic/1045793/cuda-setup-and-installation/cuda-9-0-installation-failed-/)
Next, I built and ran deviceQuery from the CUDA samples. This appears to produce normal output:
C:\ProgramData\NVIDIA Corporation\CUDA Samples\v10.2\bin\win64\Debug>deviceQuery.exe
deviceQuery.exe Starting...

 CUDA Device Query (Runtime API) version (CUDART static linking)

Detected 1 CUDA Capable device(s)

Device 0: "GeForce GTX 1050 Ti"
  CUDA Driver Version / Runtime Version 10.2 / 10.2
  CUDA Capability Major/Minor version number: 6.1
  Total amount of global memory: 4096 MBytes (4294967296 bytes)
  ( 6) Multiprocessors, (128) CUDA Cores/MP: 768 CUDA Cores
  GPU Max Clock rate: 1392 MHz (1.39 GHz)
  Memory Clock rate: 3504 Mhz
  Memory Bus Width: 128-bit
  L2 Cache Size: 1048576 bytes
  Maximum Texture Dimension Size (x,y,z) 1D=(131072), 2D=(131072, 65536), 3D=(16384, 16384, 16384)
  Maximum Layered 1D Texture Size, (num) layers 1D=(32768), 2048 layers
  Maximum Layered 2D Texture Size, (num) layers 2D=(32768, 32768), 2048 layers
  Total amount of constant memory: zu bytes
  Total amount of shared memory per block: zu bytes
  Total number of registers available per block: 65536
  Warp size: 32
  Maximum number of threads per multiprocessor: 2048
  Maximum number of threads per block: 1024
  Max dimension size of a thread block (x,y,z): (1024, 1024, 64)
  Max dimension size of a grid size (x,y,z): (2147483647, 65535, 65535)
  Maximum memory pitch: zu bytes
  Texture alignment: zu bytes
  Concurrent copy and kernel execution: Yes with 2 copy engine(s)
  Run time limit on kernels: Yes
  Integrated GPU sharing Host Memory: No
  Support host page-locked memory mapping: Yes
  Alignment requirement for Surfaces: Yes
  Device has ECC support: Disabled
  CUDA Device Driver Mode (TCC or WDDM): WDDM (Windows Display Driver Model)
  Device supports Unified Addressing (UVA): Yes
  Device supports Compute Preemption: Yes
  Supports Cooperative Kernel Launch: No
  Supports MultiDevice Co-op Kernel Launch: No
  Device PCI Domain ID / Bus ID / location ID: 0 / 1 / 0
  Compute Mode:
     < Default (multiple host threads can use ::cudaSetDevice() with device simultaneously) >

deviceQuery, CUDA Driver = CUDART, CUDA Driver Version = 10.2, CUDA Runtime Version = 10.2, NumDevs = 1
Result = PASS
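The output above reports two separate numbers: the driver/runtime version (10.2) and the compute capability of the card (6.1). To compare them against the manual more easily, the relevant fields can be pulled out of the text. This is only a minimal sketch in Python, assuming the deviceQuery output has been captured as a string; `parse_device_query` is a hypothetical helper written for illustration and is not part of the CUDA samples or of GBOOST.

```python
import re


def parse_device_query(text):
    """Extract a few key fields from captured deviceQuery output.

    Hypothetical helper for illustration; returns None for any field
    that is not found in the text.
    """
    patterns = {
        # Device name appears in double quotes after "Device N:"
        "device": r'Device \d+: "([^"]+)"',
        # Driver and runtime versions share one line, separated by "/"
        "driver_version": r"CUDA Driver Version / Runtime Version\s+([\d.]+)\s*/\s*[\d.]+",
        "runtime_version": r"CUDA Driver Version / Runtime Version\s+[\d.]+\s*/\s*([\d.]+)",
        # Compute capability is a different numbering from the toolkit version
        "compute_capability": r"CUDA Capability Major/Minor version number:\s+([\d.]+)",
    }
    result = {}
    for key, pattern in patterns.items():
        match = re.search(pattern, text)
        result[key] = match.group(1) if match else None
    return result


# A shortened sample taken from the output shown above
sample = (
    'Device 0: "GeForce GTX 1050 Ti" '
    "CUDA Driver Version / Runtime Version 10.2 / 10.2 "
    "CUDA Capability Major/Minor version number: 6.1"
)
info = parse_device_query(sample)
print(info)
```

On my machine the full output could be captured with something like `deviceQuery.exe > out.txt` and the contents of that file passed to the function.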
What should I try next? I am confused that the error refers to CUDA 1.x when the recommended settings for the software list CUDA 7.5 or higher. Why does the error message refer to CUDA 1.x, and does this indicate something wrong with my setup? If not, then what is wrong? Thank you for the help!