You might be hitting this. In a nutshell, try running your app on CUDA 12.2 with
CUDA_MODULE_LOADING=EAGER ./my_app
to see if the behavior changes. (Replace my_app above with the name of your actual compiled executable.)
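If it helps to compare the two loading modes side by side, here is a minimal sketch of a shell helper. It assumes a POSIX shell; run_with_mode and the log file names are hypothetical names chosen for illustration, and my_app is still a placeholder for your own executable. LAZY and EAGER are values that CUDA_MODULE_LOADING accepts.

```shell
#!/bin/sh
# Hypothetical helper (not part of CUDA): run a command with
# CUDA_MODULE_LOADING set to the given mode for that one invocation only.
# The helper's exit status is the exit status of the command it ran.
run_with_mode() {
    mode="$1"
    shift
    CUDA_MODULE_LOADING="$mode" "$@"
}

# Example usage (my_app is a placeholder for your executable):
#   run_with_mode LAZY  ./my_app > lazy.log  2>&1; echo "LAZY exit: $?"
#   run_with_mode EAGER ./my_app > eager.log 2>&1; echo "EAGER exit: $?"
#   diff lazy.log eager.log
```

If the output or exit status differs between the two runs, that is worth mentioning when you post your reproducer.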
Anyone can file a bug with NVIDIA using the bug reporting portal. The instructions are linked from a sticky post at the top of this sub-forum. However, what you have posted here is not sufficient for a bug report. At a minimum, if you filed such a bug, the QA team would ask you for a minimal reproducer (short but complete code that demonstrates the issue), along with details such as your compile command line, the GPU you are running on, and possibly other things.
If you intend to do that (you can do as you wish, of course), you also have the option to post such a short, complete example and instructions here, and the community will generally have a look at it. They may spot something. If they do, that would be a better path than filing a bug, for a number of reasons.