Running Windows 7
I have followed the quick start instructions at
file:///C:/Program%20Files/NVIDIA%20GPU%20Computing%20Toolkit/CUDA/v9.2/doc/html/cuda-quick-start-guide/index.html
When I rebuild, I see errors.
In release mode, the errors are of the form:
Error 31 error LNK2038: mismatch detected for '_MSC_VER': value '1600' doesn't match value '1800' in cdp_lu.cu.obj C:\ProgramData\NVIDIA Corporation\CUDA Samples\v9.2\6_Advanced\cdpLUDecomposition\cublas_device.lib(sgemmEx.obj) cdpLUDecomposition
In debug mode, the errors are of the form:
14>cublas_device.lib(sgemmEx.obj) : error LNK2038: mismatch detected for '_MSC_VER': value '1600' doesn't match value '1800' in cdp_lu.cu.obj
14>cublas_device.lib(sgemmEx.obj) : error LNK2038: mismatch detected for '_ITERATOR_DEBUG_LEVEL': value '0' doesn't match value '2'
Do I need to rebuild cublas_device.lib and, if so, how?
Howard Weiss
Running on Windows 10, using Visual Studio 2015 Update 3. Same problem here. It seems cublas_device.lib was compiled with _MSC_VER=1600 (Visual Studio 2010)? This is a weird bug.
Does anyone know how to resolve this? Otherwise, we cannot call cuBLAS from kernels.
Thanks,
Kaiwen
NVIDIA is aware of this issue.
It will not be fixed.
The cuBLAS device functionality is deprecated in the CUDA 9.2 toolkit and will be removed in a future toolkit release.
[url]https://docs.nvidia.com/cuda/cuda-toolkit-release-notes/index.html#deprecated-features[/url]
It's recommended that you begin modifying your codes to remove the dependency on this functionality if you want to maintain them with future toolkits. It will not be possible to use cuBLAS device functionality with future toolkits.
If you don't wish to do that, then it's suggested that you revert to CUDA 9.1, or switch to VS 2010.
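To illustrate the kind of migration meant here, below is a minimal sketch of moving an SGEMM from the deprecated device-side cuBLAS library to the host-side cuBLAS API. The matrix size and variable names are illustrative, not taken from the cdpLUDecomposition sample:

```cuda
// Sketch: host-side cuBLAS SGEMM replacing a device-side cublas call.
// Build without -rdc=true once device-side cuBLAS calls are gone:
//   nvcc sgemm_host.cu -lcublas
#include <cublas_v2.h>
#include <cuda_runtime.h>

int main() {
    const int n = 4;                       // illustrative size
    const float alpha = 1.0f, beta = 0.0f;
    float *dA, *dB, *dC;
    cudaMalloc(&dA, n * n * sizeof(float));
    cudaMalloc(&dB, n * n * sizeof(float));
    cudaMalloc(&dC, n * n * sizeof(float));

    cublasHandle_t handle;
    cublasCreate(&handle);

    // Previously this GEMM might have been issued from inside a kernel via
    // cublas_device.lib; with the host API the call is made here, and the
    // GPU work is still asynchronous with respect to the host.
    cublasSgemm(handle, CUBLAS_OP_N, CUBLAS_OP_N,
                n, n, n, &alpha, dA, n, dB, n, &beta, dC, n);

    cudaDeviceSynchronize();
    cublasDestroy(handle);
    cudaFree(dA); cudaFree(dB); cudaFree(dC);
    return 0;
}
```

The main cost of this migration is that control logic which used to run on the device (deciding what to multiply next) moves back to the host, with a kernel-launch/sync boundary at each step.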
Thank you for the reply, txbob.
At first glance, dynamic parallelism sounds like a perfect way to move as much control logic as possible to the GPU, eliminating host-launched kernels and GPU-CPU synchronization. In theory, we can move any single-threaded critical path into CUDA by launching it with <<<1, 1>>> and having that path launch second-level, data-parallel kernels. In practice, however, we found that after enabling rdc, the second-level kernels become much slower. I am curious why this happens, and what the main difficulty is behind this otherwise appealing story.
Thanks,
Kaiwen
Relocatable device code (rdc) prevents the compiler from making certain optimizations it might otherwise make.
It's not uncommon for code to run slower with rdc enabled.
Beyond that, it would be necessary to inspect a specific case.
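For reference, the pattern under discussion can be sketched as follows: a single-thread control kernel launched with <<<1, 1>>> that itself launches a data-parallel child kernel. Dynamic parallelism requires -rdc=true, which is exactly the configuration where the slowdown described above would be observed. Kernel names and sizes are illustrative:

```cuda
// Sketch of host-free control flow via dynamic parallelism.
// Build (compute capability 3.5+ required):
//   nvcc -arch=sm_35 -rdc=true dp_sketch.cu -lcudadevrt
#include <cuda_runtime.h>

__global__ void childKernel(float *data, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) data[i] *= 2.0f;        // data-parallel work
}

__global__ void parentKernel(float *data, int n) {
    // Single-thread critical path running on the device; it launches the
    // next level of work without returning control to the host.
    int threads = 256;
    int blocks = (n + threads - 1) / threads;
    childKernel<<<blocks, threads>>>(data, n);
    cudaDeviceSynchronize();           // device-side wait on the child grid
}

int main() {
    const int n = 1 << 20;
    float *d;
    cudaMalloc(&d, n * sizeof(float));
    parentKernel<<<1, 1>>>(d, n);      // one thread drives the control path
    cudaDeviceSynchronize();
    cudaFree(d);
    return 0;
}
```

With -rdc=true, device functions can no longer be assumed fully visible to the compiler at each call site, which is one concrete reason inlining and related optimizations are inhibited.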