Cublas function calls inside kernel code

pedro.leite · October 23, 2007, 1:49pm

Can I call cublas functions inside kernel code?

I’m trying to call cublasSgemm inside kernel code, but in emulation mode (I currently don’t have a enabled graphics card) the kernel function hangs at that point.

__global__ void kernel_function(float* mA, float* mB, float* mC, float* fA, float* fB) {

   printf("calling cublas function\n");

    cublasSgemm('n', 'n', 3, 1, 3, 1.0f, fA, 3, mA, 3, 0.0f, mC, 3);

    cublasSgemm('n', 'n', 3, 1, 3, 1.0f, fB, 3, mB, 3, 1.0f, mC, 3);

    printf("leaving kernel code\n");

}

I know that such code isn’t perfect, but it serves as an example. When the first function call to cublasSgemm happens, the executions hangs and doesn’t return.

What can I do?

Chirality · October 23, 2007, 7:16pm

Can I call cublas functions inside kernel code?

I’m trying to call cublasSgemm inside kernel code, but in emulation mode (I currently don’t have a enabled graphics card) the kernel function hangs at that point.
__global__ void kernel_function(float* mA, float* mB, float* mC, float* fA, float* fB) {

   printf("calling cublas function\n");

    cublasSgemm('n', 'n', 3, 1, 3, 1.0f, fA, 3, mA, 3, 0.0f, mC, 3);

    cublasSgemm('n', 'n', 3, 1, 3, 1.0f, fB, 3, mB, 3, 1.0f, mC, 3);

    printf("leaving kernel code\n");

}
I know that such code isn’t perfect, but it serves as an example. When the first function call to cublasSgemm happens, the executions hangs and doesn’t return.

What can I do?

[snapback]268738[/snapback]

Cublas functions are basically kernels unto themselves. Just do this in int main(), compiled with g++ (for example).

int main(){

cublasinit()

// error handler

cublasStatus stat;

// create and alloc device memory, e.g.

float* host_B = (float*) malloc(mem_size_B);

// create and alloc device memory, e.g.

float* device_B;

stat = cublasAlloc(number_of_elements, sizeof(float), (void**)&device_B);

if(stat!=CUBLAS_STATUS_SUCCESS) printf(“memory allocation failed”);

// copy data from host to device

stat = cublasSetMatrix(num_rows,num_cols,sizeof(float),host_B,num_rows,device_B,num_rows);

//do cublass matrix-matrix operation

cublasSgemm(…);

// return result

stat = cublasGetMatrix(…);

// free memory

cublasFree(host_B)

…

cublasShutdown()

return 0;

}

Basically, calling a cublas function from a kernel makes no sense.

pedro.leite · October 23, 2007, 7:33pm

Thanks Chirality, that’s what I argued with my friends here at work, except the fact that i wasn’t sure about that. It’s like a kernel calling another one.

Topic		Replies	Views
Cublas function call from within the kernel ? is it possible ? CUDA Programming and Performance	4	2725	April 2, 2008
Help required with CUBLAS CUDA Programming and Performance	2	1461	March 26, 2009
Calling CUBLAS routines from insdie a kernel Legacy PGI Compilers	1	4854	June 28, 2011
Cublas within kernel CUDA Programming and Performance	1	1528	July 28, 2009
Combining cuBlas and Kernel code CUDA Programming and Performance	14	6650	April 1, 2017
Multiple Cublas functions on single GPU CUDA Programming and Performance	5	1791	August 8, 2010
CUBLAS functions in a kernel CUDA Programming and Performance	5	7048	June 4, 2008
Can a cuda kernel call CUBLAS function or how to call a cublas functions from Python ? CUDA Programming and Performance	7	3775	November 15, 2019
Call cublas API from kernel GPU-Accelerated Libraries	3	5117	December 8, 2015
cal cuBLAS with in global kernel CUDA Programming and Performance	1	4907	June 29, 2011

Cublas function calls inside kernel code

Related topics