CUBLAS_STATUS_EXECUTION_FAILED cublasDscal calls

bobtown · May 25, 2009, 8:06am

I seem to get this error if I am calling cublasDscal on a vector that isn’t very large (<200 elements).

I work around it by allocation more memory than necessary, but I would like to avoid this if possible.

Has anyone else noticed this issue and is there a solution?

and yes, my drivers are all up to date.

Thanks

bobtown · May 26, 2009, 3:38pm

Actually, it seems to happen with a lot of the scalar multiply and copy functions. cutilCheckMsg() just gives an “unknown error.”

Anyone???

mfatica · May 26, 2009, 3:55pm

Is the GPU you are using, double precision capable?

Your report is too generic, unless you post a small repro case, you will not get a meaningful response.

bobtown · June 1, 2009, 8:57am

KERNEL CALL:

[codebox]extern “C” void

actDouble(double* f, unsigned int len)

{

cudaThreadSynchronize();

unsigned int num_threads = 256;

unsigned int blocks = (len/num_threads) + 1;		

dim3 grid(blocks, 1);

dim3 threads(num_threads, 1);     

actFuncDouble<<< grid, threads >>>(f, len);     

cudaThreadSynchronize();

}[/codebox]

KERNEL:

[codebox]global void

actFuncDouble( double* d_data, unsigned int len )

{

const unsigned int tid = blockIdx.x*blockDim.x + threadIdx.x;

if(tid<len) {

	double d = d_data[tid];	

	d_data[tid] = 1/(   1+expf(-d)       );

}

}[/codebox]

I have this simple function mixed in with numerous CUBLAS calls. Is there a “warm-up” that needs to happen this first time I call a run-time API (assuming CUBLAS init was called and successful)?

avidday · June 1, 2009, 9:13am

I thought you were complaining that CUBLAS calls were failing, but I don’t see any CUBLAS calls in your sample code at all.

As an aside, I presume you are aware that your kernel as written, despite taking double precision arguments, is doing the computations using a mixture of integers cast to single precision and a single precision transcendental function, and simply casting the result back to double afterwards.

bobtown · June 1, 2009, 12:43pm

Yes, I am aware of the expf call. I was just trying everything I could think of to isolate the errors.

I presume you are aware your response was worthless.

avidday · June 1, 2009, 8:23pm

Whatever those might be…

You’re welcome. Best of luck.

Topic		Replies	Views
CUBLAS + Kernels + double precision + "for loop" = strange behavior CUDA Programming and Performance	17	4478	September 25, 2009
cublasCscal CUDA Programming and Performance	2	2723	May 28, 2009
CUBLAS 2.2 Index Problem (Bug?) A test program on scalar multiplication cublasSscal won't comput CUDA Programming and Performance	2	1494	May 15, 2009
Cublas_getError() retrieved unknown error! CUDA Programming and Performance	4	3508	January 16, 2009
cudaErrorUnknown CUDA Programming and Performance	4	4107	June 1, 2009
Issue when calling cublasDdot from within kernel GPU-Accelerated Libraries	7	1028	March 21, 2018
passing a scalar to a cublas_v2 rise a segfault CUDA Programming and Performance	0	693	July 26, 2011
Is it correct that my Pascal card is calling Maxwell_gemm kernels through cublas? And if so, why is cublas unusably slow for me? CUDA Programming and Performance	6	1044	August 23, 2018
cublas call from kernel ( not getting right results ) CUDA Programming and Performance	0	586	October 17, 2014
Weird problem of cublasZscal CUDA Programming and Performance	0	702	August 24, 2009

CUBLAS_STATUS_EXECUTION_FAILED cublasDscal calls

Related topics