Bug in cuda-memcheck?

Shura58 · March 13, 2013, 4:48pm

Hi All,
I have a code with structs in shared memory with member variables initialized in a function called from the main kernel. Regular execution produces no problems, but cuda-memcheck throws an “Address out of bounds error”. If I move the initialization to the main kernel, cuda-memcheck doesn’t complain. Is this a “feature” or a bug or is there something I’m missing?
Below is the simplest reproducible code I could come up with. I use CUDA 5.0 on a CentOS 6.3 x64 and driver 304.54.
Thanks

//main.cpp
include “gpu.h”

int main(int argc, char* argv){
run_kernel();
return 0;
}

//gpu.h
#ifndef GPU_H_
define GPU_H_

void run_kernel();

struct cuda_model{
int size;
};
endif /* GPU_H_ */

//gpu.cu
include <stdio.h>
include “gpu.h”

define FUNCTION 0 //use 1 to select in-function initialization

device shared cuda_model model;

device void init(cuda_model* model){
if (threadIdx.x == 0){
model->size = 5;
}
__syncthreads();
}

global void compute(){

if (FUNCTION){
    init(&model);
}
else{
    if (threadIdx.x == 0) model.size = 5;
}

}

host void run_kernel(){
compute<<<1,1>>>();
cudaDeviceSynchronize();
}

[sasha@gpudev Debug]$ cuda-memcheck ./Test
========= CUDA-MEMCHECK
========= Invalid global write of size 4
========= at 0x00000088 in /home/sasha/cuda-workspace/Test/Debug/../gpu.cu:11:init(cuda_model*)
========= by thread (0,0,0) in block (0,0,0)
========= Address 0x01000000 is out of bounds
========= Device Frame:/home/sasha/cuda-workspace/Test/Debug/../gpu.cu:19:compute(void) (compute(void) : 0x100)
========= Saved host backtrace up to driver entry point at kernel launch time
========= Host Frame:/usr/lib64/libcuda.so (cuLaunchKernel + 0x3dc) [0xc9edc]
========= Host Frame:/usr/local/cuda-5.0/lib64/libcudart.so.5.0 [0x11d54]
========= Host Frame:/usr/local/cuda-5.0/lib64/libcudart.so.5.0 (cudaLaunch + 0x182) [0x38152]
========= Host Frame:./Test [0x92a]
========= Host Frame:./Test [0x9b9]
========= Host Frame:/lib64/libc.so.6 (__libc_start_main + 0xfd) [0x1ecdd]
========= Host Frame:./Test [0x749]

========= Program hit error 4 on CUDA API call to cudaDeviceSynchronize
========= Saved host backtrace up to driver entry point at error
========= Host Frame:/usr/lib64/libcuda.so [0x26a180]
========= Host Frame:/usr/local/cuda-5.0/lib64/libcudart.so.5.0 (cudaDeviceSynchronize + 0x1dd) [0x412dd]
========= Host Frame:./Test [0x9b9]
========= Host Frame:/lib64/libc.so.6 (__libc_start_main + 0xfd) [0x1ecdd]
========= Host Frame:./Test [0x749]

========= ERROR SUMMARY: 2 errors

vyas · March 14, 2013, 4:47pm

Hi Shura58. Thanks for reporting this issue. Can you try the latest publicly available driver from www.nvidia.com and see if your issue still presents ? I believe the latest driver for Linux64 is 310.40.

Shura58 · March 14, 2013, 5:31pm

Thanks, Vyas. The updated driver actually fixed the issue.

Topic		Replies	Views
In-function initialization of a shared struct produces a memcheck error CUDA Programming and Performance	0	757	March 13, 2013
cuda-memcheck error: Address is out of bounds. CUDA Programming and Performance	2	5869	November 12, 2012
bug in cuda-memcheck CUDA-MEMCHECK	1	3095	January 17, 2013
cuda-memcheck dilemma cuda-memcheck problem with in-kernel allocations CUDA Programming and Performance	3	831	April 4, 2011
Memcheck CUDA Programming and Performance	2	584	July 20, 2017
Memory checker bug? CUDA Programming and Performance	1	663	September 4, 2017
Using cudaMemCheck Legacy PGI Compilers	3	6540	November 21, 2013
What is wrong? please help me,thanks CUDA Programming and Performance	2	1732	December 13, 2017
Cuda-memcheck can't report the invalid write CUDA-MEMCHECK cuda	3	595	June 8, 2023
Potential Bug, cuda-memcheck can someone verify? Program crashing on GPU initialisation with cuda-me CUDA Programming and Performance	11	3584	April 24, 2020

Bug in cuda-memcheck?

Related topics