Memcheck error, dynamic parallelism and address

kissrei · September 22, 2023, 9:44am

Win10, GTX 1050 Ti, CUDA 12.1, VS 2017
part of error messages I post below
my questions are:

the 481th line of codebook.cu is loop << <grid_size, REDUCE_ADD_WIDTH, REDUCE_ADD_WIDTH * sizeof(uint64_t) >> >, so the invalid global write located inside the global function “loop”?
“Address 0x180080bd60 is out of bounds” means the program tried to write to 0x180080bd60? If so, how to find who tried to write to the address
As for error 719 from cudaMemcpy and cudaFree, it only occures in memcheck, otherwise no error is reported. So is this really an error, or something about tdrdelay?

========= Invalid __global__ write of size 4 bytes
=========     at 0x13c8 in G:/cuda/codebook.cu:481:solution(etc1s_optimizer_state_tag *, int, unsigned int, const rgba *, const unsigned int *, etc1s_optimizer_solution_coordinates_tag, etc1s_optimizer_potential_solution_tag *, etc1s_optimizer_potential_solution_tag *)
=========     by thread (2,0,0) in block (36,0,0)
=========     Address 0x180080bd60 is out of bounds
=========     and is 47,969 bytes after the nearest allocation at 0x1800800000 of size 512 bytes
=========     Device Frame:G:/cuda/codebook.cu:667:fit(unsigned int, etc1s_optimizer_state_tag *, encode_etc1s_param_struct_tag, unsigned int, const rgba *, const unsigned int *) [0xf90]
=========     Device Frame:G:/cuda/codebook.cu:740:cluster(encode_etc1s_param_struct_tag, const pixel_cluster_tag *, const rgba *, const unsigned int *, etc_block_tag *, int) [0x978]
=========     Saved host backtrace up to driver entry point at kernel launch time
=========     Host Frame:cuEventRecordWithFlags [0x7ff972c54db8]
=========                in C:\Windows\system32\DriverStore\FileRepository\nv_dispui.inf_amd64_c3352d3df1cf4d8c\nvcuda64.dll
......
......
========= Program hit cudaErrorLaunchFailure (error 719) due to "unspecified launch failure" on CUDA API call to cudaMemcpy.
......
......
========= Program hit cudaErrorLaunchFailure (error 719) due to "unspecified launch failure" on CUDA API call to cudaFree.

really thanks.

achartiernv · September 28, 2023, 8:07pm

Compiling the code with -G or -lineinfo might provide a more precise location.
Yes. See the third line for the thread/block causing the illegal access
This is indirectly caused by the tool causing the kernel to abort as a result of the detected error.

kissrei · October 15, 2023, 1:17pm

I’ve gotten around the “Invalid write” problem in a few ways, and will probably ask for help again, thank you very much for your help!

veraj · November 16, 2023, 8:26am

Thanks for letting us know！ If there is new issue, please file a new topic and we will do our best to help !

Topic		Replies	Views
What is wrong? please help me,thanks CUDA Programming and Performance	2	1738	December 13, 2017
Invalid __global__ write how to determine the wriight line? CUDA Programming and Performance	1	786	October 31, 2011
Invalid __global__ write how to determine the wriight line? CUDA Programming and Performance	0	2533	October 31, 2011
Tracking Invalid read size and illegal memory access CUDA Programming and Performance	3	7826	May 24, 2016
Invalid __global__ write of size 4 Error on kernel Launch CUDA Programming and Performance	7	6250	March 18, 2013
Cuda-memcheck can't report the invalid write CUDA-MEMCHECK cuda	2	609	June 8, 2023
Memcheck CUDA Programming and Performance	2	593	July 20, 2017
Can't write to memory after doing malloc - Invalid __global__ write Compute Sanitizer cuda	1	67	March 18, 2026
Invalid __global__ write of size 4. Need help with debugging CUDA Programming and Performance cuda	2	1096	October 26, 2020
CUDA Address out of bounds error - help! CUDA Programming and Performance	0	910	August 8, 2019

Memcheck error, dynamic parallelism and address

Related topics