cuda context monitor && nvvm opt=0 problem

yefu.chen · March 14, 2018, 5:00am

Hi, I face a very tough problem, so I hope I would get some ideas from here.
this is the background of my problem:
In the project, I use llvm to generate the IR code dynamically, then I chose the nvvm as the backend to generate the ptx code, after that I use cuModuleLoadDataEx to load the ptx and do the kernel launch.
my problem is very weired:
If when I use nvvm to compile the IR to PTX, if I chose the opt=3, I seem every thing is right, and there is no problem,
however, if the set opt=0, the kernel function seem to fall into a dead loop, after launch the kernel function several time.
so, these are my question

Is there any tool to let me monitor the cuda context, because my kernel function failed after kernel launch several times, so I want to know what’s the context difference between each kernel launch.
Does anyone know is there any problem with nvvm opt=0.
looking forward to any reply.
thanks

Topic		Replies	Views
debug nvvm ir CUDA Programming and Performance	0	400	March 16, 2018
what's the status of libNVVM? CUDA Programming and Performance	0	702	November 8, 2014
CUDA to NVVM CUDA Programming and Performance	1	1901	February 12, 2014
Ncu does not detect kernels, ==ERROR== The application returned an error code (11) Nsight Compute kernel , profiling	5	2279	December 13, 2023
Generate line tables and optimize with libnvvm CUDA Programming and Performance	2	398	October 24, 2021
debug question CUDA Programming and Performance	1	637	March 15, 2016
How to get the cuda "first-call overhead" to happen only once for cuda called from dll? CUDA Programming and Performance	50	1260	November 25, 2024
Strange Validation Error OptiX	8	1579	September 30, 2022
[Solved] Weird ray generation hang (really simple code) OptiX	23	5920	June 19, 2014
ERR_NVGPUCTRPERM when using nv-nsight-cu-cli for profilling CUDA Setup and Installation	0	415	May 22, 2022

cuda context monitor && nvvm opt=0 problem

Related topics