Hi all,
All my tensorRT threads hang at cudaStreamSynchronize ()
This is the part of my bt in gdb:
Thread 35 (Thread 0xfffe590b5500 (LWP 3604513)):
#0 0x0000ffff88069938 in ioctl () from target:/lib/aarch64-linux-gnu/libc.so.6
#1 0x0000fffeef8d5e98 in ?? () from target:/lib/libnvrm_host1x.so
#2 0x0000fffeefdec0ac in ?? () from target:/lib/libcuda.so.1
#3 0x0000fffeefca2de8 in ?? () from target:/lib/libcuda.so.1
#4 0x0000fffeefcbae34 in ?? () from target:/lib/libcuda.so.1
#5 0x0000fffeefd16f54 in ?? () from target:/lib/libcuda.so.1
#6 0x0000fffeefd220d8 in ?? () from target:/lib/libcuda.so.1
#7 0x0000fffeefd35298 in ?? () from target:/lib/libcuda.so.1
#8 0x0000ffff841b1568 in ?? () from target:/apollo/bazel-bin/modules/rs_perception/trafficlight/tfl_nn/component/…/…/…/…/…/_solib_local/_U@local_Uconfig_Ucuda_S_Scuda_Ccudart___Uexternal_Slocal_Uconfig_Ucuda_Scuda_Scuda_Slib/libcudart.so.11.0
#9 0x0000ffff8420c974 in cudaStreamSynchronize () from target:/apollo/bazel-bin/modules/rs_perception/trafficlight/tfl_nn/component/…/…/…/…/…/_solib_local/_U@local_Uconfig_Ucuda_S_Scuda_Ccudart___Uexternal_Slocal_Uconfig_Ucuda_Scuda_Scuda_Slib/libcudart.so.11.0
…
…
Thread 33 (Thread 0xfffe5b1db500 (LWP 3604511)):
#0 0x0000ffff88069938 in ioctl () from target:/lib/aarch64-linux-gnu/libc.so.6
#1 0x0000fffeef8d5e98 in ?? () from target:/lib/libnvrm_host1x.so
#2 0x0000fffeefdec0ac in ?? () from target:/lib/libcuda.so.1
#3 0x0000fffeefca2de8 in ?? () from target:/lib/libcuda.so.1
#4 0x0000fffeefcbae34 in ?? () from target:/lib/libcuda.so.1
#5 0x0000fffeefd16f54 in ?? () from target:/lib/libcuda.so.1
#6 0x0000fffeefd220d8 in ?? () from target:/lib/libcuda.so.1
#7 0x0000fffeefd35298 in ?? () from target:/lib/libcuda.so.1
#8 0x0000ffff841b1568 in ?? () from target:/apollo/bazel-bin/modules/rs_perception/trafficlight/tfl_nn/component/…/…/…/…/…/_solib_local/_U@local_Uconfig_Ucuda_S_Scuda_Ccudart___Uexternal_Slocal_Uconfig_Ucuda_Scuda_Scuda_Slib/libcudart.so.11.0
#9 0x0000ffff8420c974 in cudaStreamSynchronize () from target:/apollo/bazel-bin/modules/rs_perception/trafficlight/tfl_nn/component/…/…/…/…/…/_solib_local/_U@local_Uconfig_Ucuda_S_Scuda_Ccudart___Uexternal_Slocal_Uconfig_Ucuda_Scuda_Scuda_Slib/libcudart.so.11.0
…
…
I have 8 stream and Running in parallel in 8 threads,all the 8 threads hang at cudaStreamSynchronize (), and they have same backtrace.
How can I fix it?
thanks