call cublas in cuda kernel and use static link

siri · July 25, 2019, 9:38am

In my project I need to call cuBLAS from inside a CUDA kernel and link against cuBLAS statically. I wrote a test program to verify the idea, but it fails to compile:

#include <cuda_runtime.h>
#include <cublas_v2.h>

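extern "C" {
// Test kernel: creates a cuBLAS handle on the device and issues one SGEMM.
__global__ void testcublas(float *d_B, float *d_A, float *d_refC) {
    cublasHandle_t cb_handle = NULL;
    cublasStatus_t status = cublasCreate(&cb_handle);
    if (status != CUBLAS_STATUS_SUCCESS) return;  // handle creation failed
    int M = 512, N = 512, K = 512;
    float alpha = 1.0f, beta = 1.0f;
    // d_refC = alpha * d_B * d_A + beta * d_refC (column-major)
    cublasSgemm(
                cb_handle, CUBLAS_OP_N, CUBLAS_OP_N,
                N, M, K,
                &alpha, d_B, N, d_A, K,
                &beta,  d_refC, N
               );
}
}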

Compile command:

nvcc -arch=sm_70 -lcublas_static -lculibos --relocatable-device-code=true cublas_call.cu -o cublas.cubin

but I get these errors:
nvlink error : Undefined reference to 'cublasCreate_v2' in '/tmp/tmpxft_0001d286_00000000-10_cublas_call.o'
nvlink error : Undefined reference to 'cublasSgemm_v2' in '/tmp/tmpxft_0001d286_00000000-10_cublas_call.o'

Any suggestions would be very helpful!
Thanks in advance.

Robert_Crovella · July 25, 2019, 1:15pm

You need to be using CUDA 9.2 or earlier. This capability was removed and is no longer available in CUDA 10.0 and later. On Windows I recommend CUDA 9.1 or earlier.

You need to link against the cuBLAS device library.
You need to link against the device runtime library.

-lcublas_device -lcudadevrt

If you have CUDA 9.2 or before, study the Makefile for simpleDevLibCUBLAS sample code.
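
For toolkits where the device library is not available (CUDA 10.0 and later, as noted above), one possible alternative is to issue the same GEMM from host code instead of from inside a kernel. The following is only a minimal sketch under that assumption, not the method from the sample code: the helper name `sgemm_on_host` and the square `n x n` shapes are hypothetical, chosen to keep the example short.

```cuda
#include <cuda_runtime.h>
#include <cublas_v2.h>

// Hypothetical helper (not part of cuBLAS): computes C = A*B + C for
// n x n column-major matrices in host memory by calling cuBLAS from the host.
// Returns true only if every CUDA and cuBLAS call succeeded.
bool sgemm_on_host(int n, const float *hA, const float *hB, float *hC) {
    const size_t bytes = sizeof(float) * n * n;
    float *dA = nullptr, *dB = nullptr, *dC = nullptr;
    cublasHandle_t handle = nullptr;
    bool ok = false;

    // Allocate device buffers, create the handle, and upload the inputs.
    if (cudaMalloc(&dA, bytes) == cudaSuccess &&
        cudaMalloc(&dB, bytes) == cudaSuccess &&
        cudaMalloc(&dC, bytes) == cudaSuccess &&
        cublasCreate(&handle) == CUBLAS_STATUS_SUCCESS &&
        cudaMemcpy(dA, hA, bytes, cudaMemcpyHostToDevice) == cudaSuccess &&
        cudaMemcpy(dB, hB, bytes, cudaMemcpyHostToDevice) == cudaSuccess &&
        cudaMemcpy(dC, hC, bytes, cudaMemcpyHostToDevice) == cudaSuccess) {
        const float alpha = 1.0f, beta = 1.0f;
        // Host-side call: alpha and beta live in host memory (default pointer mode).
        cublasStatus_t st = cublasSgemm(handle, CUBLAS_OP_N, CUBLAS_OP_N,
                                        n, n, n,
                                        &alpha, dA, n, dB, n,
                                        &beta, dC, n);
        // The blocking copy back also waits for the GEMM to finish.
        ok = st == CUBLAS_STATUS_SUCCESS &&
             cudaMemcpy(hC, dC, bytes, cudaMemcpyDeviceToHost) == cudaSuccess;
    }

    // Release resources; cudaFree accepts null pointers.
    if (handle) cublasDestroy(handle);
    cudaFree(dA);
    cudaFree(dB);
    cudaFree(dC);
    return ok;
}
```

A file containing this helper could probably be linked statically with the same libraries as in the original command (`-lcublas_static -lculibos`). Newer toolkits may need additional static libraries, so check the cuBLAS documentation for the exact link line for your version.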

siri · July 26, 2019, 8:32am

Thanks Robert!

By the way, could you explain why that capability was removed in CUDA 10.0? Does CUDA 10.0 offer a better solution for this case?
