I wrote a kernel that uses a lot of registers. After checking, I confirmed that there are no errors such as array out-of-bounds in the kernel.
But when I was debugging, the program stayed at the entrance, and Nsight’s warp info indicated that the current status was InLineBreakpoint.
When I continue to debug, Nsight’s warp info prompts that the current status is OutOfRangeAddress.
I used cuda-memcheck again to help me find errors, but cuda-memcheck can only give an “Unknown Error” prompt.
I don’t know why this problem occurs. I have been programming with CUDA for two or three years. This is the first time I have encountered this problem. I hope someone can help me, thank you