Yolov3 to TensorRT - Segmentation fault on inference

meetdave06 · April 3, 2019, 5:00pm

Hi,

I am currently using the following repository to convert Yolo v3 to TensorRT.
https://github.com/xuwanqi/yolov3-tensorrt

The same repository is present in the NGC container of TensorRT 5.1.

I can successfully convert YOLO to .trt file but getting a segmentation error on inference.

TensorRT version : 5.1.2.2
CUDA version : 10.1
cuDNN version : 7.4.2
GPU : V100 (AWS)

Error Dump :-

Loading ONNX file from path yolov3.onnx...
Beginning ONNX file parsing
Completed parsing of ONNX file
Building an engine from file yolov3.onnx; this may take a while...
Completed creating Engine
Running inference on image dog.jpg...
Fatal Python error: Segmentation fault

Current thread 0x00007fbb1850b700 (most recent call first):
  File "/workspace/tensorrt/samples/python/yolov3_onnx/../common.py", line 145 in do_inference
  File "onnx_to_tensorrt.py", line 160 in main
  File "onnx_to_tensorrt.py", line 183 in <module>
Segmentation fault (core dumped)

Below is the function where it throws the error:-

def do_inference(context, bindings, inputs, outputs, stream, batch_size=1):
    start = time.time()
    # Transfer input data to the GPU.
    [cuda.memcpy_htod_async(inp.device, inp.host, stream) for inp in inputs]
    # Run inference.
    context.execute_async(batch_size=batch_size, bindings=bindings, stream_handle=stream.handle)
    # Transfer predictions back from the GPU.
    [cuda.memcpy_dtoh_async(out.host, out.device, stream) for out in outputs]
    # Synchronize the stream
    stream.synchronize()
    # Return only the host outputs.
    print("=> time: %.4f" %(time.time()-start))
    return [out.host for out in outputs]

Any help is appreciated.

adit_bhrgv · April 30, 2019, 12:01pm

Hello @meetdave06 ,

Are you able to resolve this issue?
I am also encountering the similar problem
Thanks

meetdave06 · April 30, 2019, 12:49pm

Yes.

Do not build the python module if using TensorRT5 while installing onnx-tensorrt.

https://github.com/onnx/onnx-tensorrt#python-modules

adit_bhrgv · April 30, 2019, 12:56pm

I didn’t build the python modules while installing onnx-tensorrt.
I just followed these steps:

mkdir build
cd build
cmake … -DTENSORRT_ROOT=/opt/tensorrt
make -j8
sudo make install

Yes,I am using TensorRT version 5.1.2.2

meetdave06 · April 30, 2019, 1:05pm

I used to do this.

cmake -DCUDA_INCLUDE_DIRS=/usr/local/cuda-10.0/include -DTENSORRT_ROOT=/opt/tensorrt …

github.com/onnx/onnx-tensorrt

fatal error: driver_types.h: No such file or directory

opened 02:57PM - 18 Dec 18 UTC

closed 09:13AM - 23 Jan 19 UTC

bpinaya

Hi there, this issue is related to #81 so I'll also tag @goldgeisser . After I …fixed the symlink I was still having that `fatal error: driver_types.h: No such file or directory`, the complete log is: ``` [ 2%] Running gen_proto.py on onnx/onnx.in.proto Processing /workspace/onnx-tensorrt/third_party/onnx/onnx/onnx.in.proto Writing /workspace/onnx-tensorrt/build/third_party/onnx/onnx/onnx_onnx2trt_onnx.proto Writing /workspace/onnx-tensorrt/build/third_party/onnx/onnx/onnx_onnx2trt_onnx.proto3 Writing /workspace/onnx-tensorrt/build/third_party/onnx/onnx/onnx.pb.h generating /workspace/onnx-tensorrt/build/third_party/onnx/onnx/onnx_pb.py [ 4%] Running C++ protocol buffer compiler on /workspace/onnx-tensorrt/build/third_party/onnx/onnx/onnx_onnx2trt_onnx.proto [ 4%] Built target gen_onnx_proto [ 6%] Running gen_proto.py on onnx/onnx-operators.in.proto Processing /workspace/onnx-tensorrt/third_party/onnx/onnx/onnx-operators.in.proto Writing /workspace/onnx-tensorrt/build/third_party/onnx/onnx/onnx-operators_onnx2trt_onnx.proto Writing /workspace/onnx-tensorrt/build/third_party/onnx/onnx/onnx-operators_onnx2trt_onnx.proto3 Writing /workspace/onnx-tensorrt/build/third_party/onnx/onnx/onnx-operators.pb.h generating /workspace/onnx-tensorrt/build/third_party/onnx/onnx/onnx_operators_pb.py [ 8%] Running C++ protocol buffer compiler on /workspace/onnx-tensorrt/build/third_party/onnx/onnx/onnx-operators_onnx2trt_onnx.proto Scanning dependencies of target onnx_proto [ 11%] Building CXX object third_party/onnx/CMakeFiles/onnx_proto.dir/onnx/onnx_onnx2trt_onnx.pb.cc.o /workspace/onnx-tensorrt/build/third_party/onnx/onnx/onnx_onnx2trt_onnx.pb.cc:598:13: warning: 'dynamic_init_dummy_onnx_2fonnx_5fonnx2trt_5fonnx_2eproto' defined but not used [-Wunused-variable] static bool dynamic_init_dummy_onnx_2fonnx_5fonnx2trt_5fonnx_2eproto = []() { AddDescriptors_onnx_2fonnx_5fonnx2trt_5fonnx_2eproto(); return true; }(); ^ [ 13%] Building CXX object third_party/onnx/CMakeFiles/onnx_proto.dir/onnx/onnx-operators_onnx2trt_onnx.pb.cc.o /workspace/onnx-tensorrt/build/third_party/onnx/onnx/onnx-operators_onnx2trt_onnx.pb.cc:204:13: warning: 'dynamic_init_dummy_onnx_2fonnx_2doperators_5fonnx2trt_5fonnx_2eproto' defined but not used [-Wunused-variable] static bool dynamic_init_dummy_onnx_2fonnx_2doperators_5fonnx2trt_5fonnx_2eproto = []() { AddDescriptors_onnx_2fonnx_2doperators_5fonnx2trt_5fonnx_2eproto(); return true; }(); ^ [ 15%] Linking CXX static library libonnx_proto.a [ 20%] Built target onnx_proto [ 22%] Building CUDA object CMakeFiles/nvonnxparser_plugin.dir/FancyActivation.cu.o [ 24%] Building CUDA object CMakeFiles/nvonnxparser_plugin.dir/ResizeNearest.cu.o [ 26%] Building CUDA object CMakeFiles/nvonnxparser_plugin.dir/Split.cu.o [ 28%] Building CXX object CMakeFiles/nvonnxparser_plugin.dir/InstanceNormalization.cpp.o In file included from /workspace/onnx-tensorrt/InstanceNormalization.hpp:27:0, from /workspace/onnx-tensorrt/InstanceNormalization.cpp:23: /usr/include/cudnn.h:63:26: fatal error: driver_types.h: No such file or directory compilation terminated. CMakeFiles/nvonnxparser_plugin.dir/build.make:101: recipe for target 'CMakeFiles/nvonnxparser_plugin.dir/InstanceNormalization.cpp.o' failed make[2]: *** [CMakeFiles/nvonnxparser_plugin.dir/InstanceNormalization.cpp.o] Error 1 CMakeFiles/Makefile2:185: recipe for target 'CMakeFiles/nvonnxparser_plugin.dir/all' failed make[1]: *** [CMakeFiles/nvonnxparser_plugin.dir/all] Error 2 Makefile:151: recipe for target 'all' failed make: *** [all] Error 2 ``` To rule out installation problems I might have commited I decided to reproduce it in a container. I'm pulling this container (from `nvidia-docker` at https://ngc.nvidia.com) ``` docker pull nvcr.io/nvidia/tensorrt:18.11-py3 ``` And running with: ``` nvidia-docker run -it --rm nvcr.io/nvidia/tensorrt:18.11-py3 ``` so the versions are the following: - cmake `cmake --version`: ``` cmake version 3.12.1 CMake suite maintained and supported by Kitware (kitware.com/cmake). ``` - gcc `gcc --version`: ``` gcc (Ubuntu 5.4.0-6ubuntu1~16.04.10) 5.4.0 20160609 Copyright (C) 2015 Free Software Foundation, Inc. This is free software; see the source for copying conditions. There is NO warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. ``` - nvidia-smi: ``` Tue Dec 18 14:39:10 2018 +-----------------------------------------------------------------------------+ | NVIDIA-SMI 410.48 Driver Version: 410.48 | |-------------------------------+----------------------+----------------------+ | GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC | | Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. | |===============================+======================+======================| | 0 GeForce GTX 108... Off | 00000000:03:00.0 Off | N/A | | 28% 25C P8 8W / 250W | 2MiB / 11178MiB | 0% Default | +-------------------------------+----------------------+----------------------+ | 1 GeForce GTX 108... Off | 00000000:04:00.0 On | N/A | | 23% 40C P8 12W / 250W | 1099MiB / 11175MiB | 0% Default | +-------------------------------+----------------------+----------------------+ +-----------------------------------------------------------------------------+ | Processes: GPU Memory | | GPU PID Type Process name Usage | |=============================================================================| +-----------------------------------------------------------------------------+ ``` - nvcc `nvcc --version`: ``` nvcc: NVIDIA (R) Cuda compiler driver Copyright (c) 2005-2018 NVIDIA Corporation Built on Sat_Aug_25_21:08:01_CDT_2018 Cuda compilation tools, release 10.0, V10.0.130 ``` - tensorrt `dpkg -l | grep -i tensorrt` ``` ii libnvinfer-dev 5.0.2-1+cuda10.0 amd64 TensorRT development libraries and headers ii libnvinfer-samples 5.0.2-1+cuda10.0 all TensorRT samples and documentation ii libnvinfer5 5.0.2-1+cuda10.0 amd64 TensorRT runtime libraries ii python3-libnvinfer 5.0.2-1+cuda10.0 amd64 Python 3 bindings for TensorRT ii python3-libnvinfer-dev 5.0.2-1+cuda10.0 amd64 Python 3 development package for TensorRT ii tensorrt 5.0.2.6-1+cuda10.0 amd64 Meta package of TensorRT ``` - protobuf (latest version installed from [repo](https://github.com/protocolbuffers/protobuf)) Also the output of `locate driver_types.h` is empty but the symlink for cuda seems to be there. Since the output of `ll /usr/local/` is: ``` total 64 drwxr-xr-x 1 root root 4096 Nov 3 01:57 ./ drwxr-xr-x 1 1000 1000 4096 Nov 27 2017 ../ drwxr-xr-x 1 root root 4096 Dec 18 14:24 bin/ lrwxrwxrwx 1 root root 9 Nov 3 01:33 cuda -> cuda-10.0/ drwxr-xr-x 1 root root 4096 Nov 3 01:45 cuda-10.0/ drwxr-xr-x 3 root root 4096 Nov 3 01:57 doc/ drwxr-xr-x 2 root root 4096 Oct 5 18:03 etc/ drwxr-xr-x 2 root root 4096 Oct 5 18:03 games/ drwxr-xr-x 1 root root 4096 Dec 18 14:24 include/ drwxr-xr-x 1 root root 4096 Dec 18 14:24 lib/ lrwxrwxrwx 1 root root 9 Oct 5 18:03 man -> share/man/ drwxr-xr-x 7 root root 4096 Nov 3 01:42 mpi/ drwxr-xr-x 2 root root 4096 Oct 5 18:03 sbin/ drwxr-xr-x 1 root root 4096 Nov 3 01:57 share/ drwxr-xr-x 2 root root 4096 Oct 5 18:03 src/ ``` Only after passing the cuda include dir variable to cmake I was able to solve that: ``` cmake -DCUDA_INCLUDE_DIRS=/usr/local/cuda-10.0/include -DTENSORRT_ROOT=/opt/tensorrt .. ``` What I found weird is that even if the symlink variable seems to be pointing to the correct location I couldn't get it to build without passing that variable. Maybe a note in the Readme would suffice. Or could it be some cmake shenanigans? I'll close the issue after I get some feedback since it's easily fixable, just wanted it to be here if anyone encounters something similar so they can have some insight. Also, maybe some CI would be nice, I volunteer to set it up in either travis, circle or maybe even jenkins if Nvida (@benbarsdell) can provide a gpu enabled container. @yinghai I could also set up code formatting and some linting so it'll be easier to contribute, maybe an image from ngc?. I'm planning on submitting some PRs I did for some layers.

Also, I switched to TensorRT 5.0

adit_bhrgv · April 30, 2019, 1:42pm

I switched back to TensorRT 5.0.2.6 but with CUDA 9.0 and CUDNN 7.5, it still doesn’t work.
Maybe I should switch to cuda10.0 like yours !

melinda.hung · June 12, 2019, 9:14am

I encounter the same problem, and my tensorrt version is
4.0.3.0-1+cuda10.0 arm64
Is there any methods to upgrade tensorrt from 4.0 to 5.0. I had tried to download version 5.0 from official website → [b]https://developer.nvidia.com/nvidia-tensorrt-5x-download[/b]
But it seems not have package for arm embedded system…

sanpreetsingh · July 5, 2019, 2:50pm

Hello, I am also using the repository GitHub - xuwanqi/yolov3-tensorrt to convert yolov3.weights to yolov3.onnx and then yolov3.onnx to yolov3.trt.

I am successfully in converting yolov3.weights to yolov3.onnx. Please see the below screenshot from the below link to see success at this point.

Now when I run the below file to build the .trt file from .onnx

python2 onnx_to_tensorrt.py

I got the below error

Loading ONNX file from path ./yolov3.onnx...
Beginning ONNX file parsing
Completed parsing of ONNX file
Building an engine from file ./yolov3.onnx; this may take a while...
Completed creating Engine
Running inference on image dog.jpg...
Segmentation fault (core dumped)

What I have in my system

ii  graphsurgeon-tf                                             5.1.5-1+cuda10.0                             amd64        GraphSurgeon for TensorRT package
ii  libnvinfer-dev                                              5.1.5-1+cuda10.0                             amd64        TensorRT development libraries and headers
ii  libnvinfer-samples                                          5.1.5-1+cuda10.0                             all          TensorRT samples and documentation
ii  libnvinfer5                                                 5.1.5-1+cuda10.0                             amd64        TensorRT runtime libraries
ii  python-libnvinfer                                           5.1.5-1+cuda10.0                             amd64        Python bindings for TensorRT
ii  python-libnvinfer-dev                                       5.1.5-1+cuda10.0                             amd64        Python development package for TensorRT
ii  python3-libnvinfer                                          5.1.5-1+cuda10.0                             amd64        Python 3 bindings for TensorRT
ii  python3-libnvinfer-dev                                      5.1.5-1+cuda10.0                             amd64        Python 3 development package for TensorRT
ii  tensorrt                                                    5.1.5.0-1+cuda10.0                           amd64        Meta package of TensorRT
ii  uff-converter-tf                                            5.1.5-1+cuda10.0                             amd64        UFF converter for TensorRT package
onnx                                                            1.1.1

I have successfully build ONNX-TensorRT using the repository GitHub - onnx/onnx-tensorrt: ONNX-TensorRT: TensorRT backend for ONNX, All the steps are successfully. I donot know why I am getting the error Segmentation fault (core dumped)

Note: Please notice that while building ONNX-TensorRT, I used the below command

cmake .. -DTENSORRT_ROOT=/usr/src/tensorrt

Can anyone explain what should I do know to overcome this error.

sanpreetsingh · July 6, 2019, 11:36am

I have solved the above error by instead of building the yolo tensorrt model from this github repository, I tried with the sample placed at the following location

/usr/src/tensorrt/samples/python/yolov3_onnx

While building the code I was stuck in the way and got the error. Before showing the error as well as how I overcome it, let me write the steps I have implemented for building the code

python2 -m pip install -r requirements.txt
# code to convert yolo weights to yolo.onxx
python2 yolov3_to_onnx.py
# code to convert yolo.onxx to yolo.trt
python2 onnx_to_tensorrt.py

Error I got after executing the command

python2 yolov3_to_onnx.py

Traceback (most recent call last):
  File "yolov3_to_onnx.py", line 812, in <module>
    main()
  File "yolov3_to_onnx.py", line 805, in main
    onnx.checker.check_model(yolov3_model_def)
  File "/home/gpu/.local/lib/python2.7/site-packages/onnx/checker.py", line 82, in check_model
    C.check_model(model.SerializeToString())
onnx.onnx_cpp2py_export.checker.ValidationError: Input size 2 not in range [min=1, max=1].

==> Context: Bad node spec: input: "085_convolutional_lrelu" input: "086_upsample_scale" output: "086_upsample" name: "086_upsample" op_type: "Upsample" attribute { name: "mode" s: "nearest" type: STRING }

How did I resolved this

I got the solution from https://devtalk.nvidia.com/default/topic/1052153/jetson-nano/tensorrt-backend-for-onnx-on-jetson-nano/1 by answer given by sojohans to upgrade onnx to onnx==1.4.1. Please execute the below command

pip2 uninstall onnx
pip2 install onnx==1.4.1 --user

zimenglan · August 9, 2019, 2:47am

hi sanpreetsingh

i use the onnx=1.4.1 but still meet this problem. any help is appreciated.

markus.kaukonen · August 22, 2019, 2:37pm

Is there some good reason to use python2 ?

br. Markus

Topic		Replies	Views
Unable to perform inference using yolov3 in tensorrt samples TensorRT	1	818	October 14, 2019
C++ Yolov3_ONNX output parsing TensorRT	5	1387	January 6, 2025
Error while building TensorRT OSS 8.0.1 TensorRT	29	3344	July 16, 2021
How to interpret yolox output tensor? TensorRT cudnn	2	1015	January 23, 2024
Yolov8 Model Crashing TensorRT TensorRT tensorrt , cuda , yolo , cudnn	1	956	January 29, 2024
Segmentation fault (core dumped) after run IExecutionContext.execute_async_v3() TensorRT cudnn	2	31	March 31, 2025
Error in converting caffe xilinx yolo v3 model to Tensorrt TensorRT tensorrt	7	804	October 1, 2021
Yolo V4 on Jetson Nano with JP4.6 Jetson Nano yolo	2	2026	May 3, 2022
TensorRT backend for ONNX on jetson nano Jetson Nano tensorrt	31	10827	October 15, 2021
Segmentation fault (core dumped) when running yoloV8 onnx model with Deepstream-YOLO DeepStream SDK yolo , onnx	10	2169	August 27, 2023

Yolov3 to TensorRT - Segmentation fault on inference

Related topics