@waitting33
Thank you for the logs. We are investigating the issue. Could you also share the cuda-gdb console output from attaching to the application on WSL?
In the meantime, could you try running the application with the CUDA_MODULE_LOADING=EAGER environment variable set?
I found that the program wasn't stuck; it was just executing very slowly. The process I started yesterday had finished by today and produced its output normally.
Here is the cuda-gdb console output.
(sphere) zxy@DESKTOP-0TU3RE2:/mnt/e/jlu/SphereFormer$ cuda-gdb -p 4743
NVIDIA (R) CUDA Debugger
CUDA Toolkit 12.1 release
Portions Copyright (C) 2007-2023 NVIDIA Corporation
GNU gdb (GDB) 12.1
Copyright (C) 2022 Free Software Foundation, Inc.
License GPLv3+: GNU GPL version 3 or later <http://gnu.org/licenses/gpl.html>
This is free software: you are free to change and redistribute it.
--Type <RET> for more, q to quit, c to continue without paging--
There is NO WARRANTY, to the extent permitted by law.
Type "show copying" and "show warranty" for details.
This GDB was configured as "x86_64-pc-linux-gnu".
Type "show configuration" for configuration details.
For bug reporting instructions, please see:
<https://www.gnu.org/software/gdb/bugs/>.
Find the GDB manual and other documentation resources online at:
--Type <RET> for more, q to quit, c to continue without paging--
<http://www.gnu.org/software/gdb/documentation/>.
For help, type "help".
Type "apropos word" to search for commands related to "word".
Attaching to process 4743
Reading symbols from /home/zxy/mambaforge/envs/sphere/bin/python3.7...
Reading symbols from /lib/x86_64-linux-gnu/libpthread.so.0...
(No debugging symbols found in /lib/x86_64-linux-gnu/libpthread.so.0)
Reading symbols from /lib/x86_64-linux-gnu/libdl.so.2...
(No debugging symbols found in /lib/x86_64-linux-gnu/libdl.so.2)
Reading symbols from /lib/x86_64-linux-gnu/libutil.so.1...
(No debugging symbols found in /lib/x86_64-linux-gnu/libutil.so.1)
Reading symbols from /lib/x86_64-linux-gnu/librt.so.1...
(No debugging symbols found in /lib/x86_64-linux-gnu/librt.so.1)
Reading symbols from /lib/x86_64-linux-gnu/libm.so.6...
(No debugging symbols found in /lib/x86_64-linux-gnu/libm.so.6)
Reading symbols from /lib/x86_64-linux-gnu/libc.so.6...
(No debugging symbols found in /lib/x86_64-linux-gnu/libc.so.6)
Reading symbols from /lib64/ld-linux-x86-64.so.2...
(No debugging symbols found in /lib64/ld-linux-x86-64.so.2)
Reading symbols from /home/zxy/mambaforge/envs/sphere/lib/python3.7/lib-dynload/_heapq.cpython-37m-x86_64-linux-gnu.so...
Reading symbols from /home/zxy/mambaforge/envs/sphere/lib/python3.7/lib-dynload/readline.cpython-37m-x86_64-linux-gnu.so...
Reading symbols from /home/zxy/mambaforge/envs/sphere/lib/python3.7/lib-dynload/../../libreadline.so.8...
(No debugging symbols found in /home/zxy/mambaforge/envs/sphere/lib/python3.7/lib-dynload/../../libreadline.so.8)
Reading symbols from /home/zxy/mambaforge/envs/sphere/lib/python3.7/lib-dynload/../.././libtinfo.so.6...
(No debugging symbols found in /home/zxy/mambaforge/envs/sphere/lib/python3.7/lib-dynload/../.././libtinfo.so.6)
[Thread debugging using libthread_db enabled]
Using host libthread_db library "/lib/x86_64-linux-gnu/libthread_db.so.1".
0x00007f1d89690cd7 in select () from /lib/x86_64-linux-gnu/libc.so.6
(cuda-gdb) continue
Continuing.
[Detaching after fork from child process 4896]
[New Thread 0x7f1c9b2ad700 (LWP 4914)]
[New Thread 0x7f1c9aaac700 (LWP 4915)]
[New Thread 0x7f1c982ab700 (LWP 4916)]
[New Thread 0x7f1c93aaa700 (LWP 4917)]
[New Thread 0x7f1c912a9700 (LWP 4918)]
[New Thread 0x7f1c8eaa8700 (LWP 4919)]
[New Thread 0x7f1c8c2a7700 (LWP 4920)]
[New Thread 0x7f1c89aa6700 (LWP 4921)]
[New Thread 0x7f1c872a5700 (LWP 4922)]
[New Thread 0x7f1c86aa4700 (LWP 4923)]
[New Thread 0x7f1c842a3700 (LWP 4924)]
[New Thread 0x7f1c81aa2700 (LWP 4925)]
[New Thread 0x7f1c7d2a1700 (LWP 4926)]
[New Thread 0x7f1c7aaa0700 (LWP 4927)]
[New Thread 0x7f1c7829f700 (LWP 4928)]
[New Thread 0x7f1c75a9e700 (LWP 4929)]
[New Thread 0x7f1c7529d700 (LWP 4930)]
[New Thread 0x7f1c70a9c700 (LWP 4931)]
[New Thread 0x7f1c7029b700 (LWP 4932)]
[New Thread 0x7f1c669e5700 (LWP 4937)]
[New Thread 0x7f1c65700700 (LWP 4938)]
[Thread 0x7f1c86aa4700 (LWP 4923) exited]
[Thread 0x7f1c89aa6700 (LWP 4921) exited]
[Thread 0x7f1c8c2a7700 (LWP 4920) exited]
[Thread 0x7f1c93aaa700 (LWP 4917) exited]
[Thread 0x7f1c982ab700 (LWP 4916) exited]
[Thread 0x7f1c9aaac700 (LWP 4915) exited]
[Thread 0x7f1c912a9700 (LWP 4918) exited]
[Thread 0x7f1c9b2ad700 (LWP 4914) exited]
[Thread 0x7f1c75a9e700 (LWP 4929) exited]
[Thread 0x7f1c70a9c700 (LWP 4931) exited]
[Thread 0x7f1c7529d700 (LWP 4930) exited]
[Thread 0x7f1c7aaa0700 (LWP 4927) exited]
[Thread 0x7f1c7029b700 (LWP 4932) exited]
[Thread 0x7f1c7d2a1700 (LWP 4926) exited]
[Thread 0x7f1c842a3700 (LWP 4924) exited]
[Thread 0x7f1c872a5700 (LWP 4922) exited]
[Thread 0x7f1c8eaa8700 (LWP 4919) exited]
[Thread 0x7f1c81aa2700 (LWP 4925) exited]
[Thread 0x7f1c7829f700 (LWP 4928) exited]
[Detaching after fork from child process 4939]
[New Thread 0x7f1c7029b700 (LWP 4948)]
[Thread 0x7f1c7029b700 (LWP 4948) exited]
[New Thread 0x7f1c7029b700 (LWP 4949)]
Hi @waitting33
Thank you for the update! Would you be able to try the same scenario, but launch the app with the CUDA_MODULE_LOADING=EAGER environment variable set?
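In case it helps, here is a minimal sketch of one way to make sure the variable is actually set for the Python process, assuming the app is started as a Python script. The helper names `eager_env` and `launch` and the script name `train.py` are hypothetical and only for illustration; the only thing taken from this thread is `CUDA_MODULE_LOADING=EAGER`. The variable has to be in the environment before CUDA is initialized, so the sketch sets it on a child process rather than changing it after the fact.

```python
import os
import subprocess
import sys


def eager_env(base=None):
    """Return a copy of an environment with eager CUDA module loading requested.

    `base` defaults to the current process environment; the original mapping
    is not modified.
    """
    env = dict(os.environ if base is None else base)
    env["CUDA_MODULE_LOADING"] = "EAGER"
    return env


def launch(script_args, env=None):
    """Run the current Python interpreter on `script_args` with the eager setting.

    `script_args` is a hypothetical argument list such as ["train.py"].
    """
    child_env = eager_env() if env is None else env
    return subprocess.run([sys.executable, *script_args], env=child_env, check=False)


if __name__ == "__main__":
    # Usage (hypothetical file name): python run_eager.py train.py
    if len(sys.argv) > 1:
        raise SystemExit(launch(sys.argv[1:]).returncode)
```

Prefixing the usual launch command in the shell with `CUDA_MODULE_LOADING=EAGER` should have the same effect; the wrapper is only useful when the app is started from an IDE or another script where the shell environment is not obviously inherited.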
Hello, I tried CUDA_MODULE_LOADING=EAGER, but the program still executes very slowly. It has been running for ten minutes and has not finished.
Sorry, I don't really know how to reproduce this one; I don't have any other apps open except PyCharm. This is the log captured with CUDA_MODULE_LOADING=EAGER: debugger.zip (12.8 MB)