Hi! I am trying to build yolov7 by compiling it and saving the serialzed trt engine.
However, the process is too slow. Takes 1hour for 256*256 resolution.
Is there anyway to speed up?
Environment
TensorRT Version: 8.2.4.2 GPU Type: RTX3080 12GB Nvidia Driver Version: 515.48 CUDA Version: 11.4 CUDNN Version: 8.2.4 Operating System + Version: Ubuntu 20.04 Python Version (if applicable): 3.8 TensorFlow Version (if applicable): PyTorch Version (if applicable): 1.12.0 Baremetal or Container (if container which image + tag):
Hi,
Request you to share the ONNX model and the script if not shared already so that we can assist you better.
Alongside you can try few things:
validating your model with the below snippet
check_model.py
import sys
import onnx
filename = yourONNXmodel
model = onnx.load(filename)
onnx.checker.check_model(model).
2) Try running your model with trtexec command.
In case you are still facing issue, request you to share the trtexec “”–verbose"" log for further debugging
Thanks!
Also, we recommend you to please try on the latest TensorRT version 8.4 GA Update 1 and if you still face this issue share with us verbose logs and command to try from our end for better debugging. https://developer.nvidia.com/nvidia-tensorrt-8x-download
Looks like you’re not using trtexec, could you please share with us the issue repro and if possible trtexec --verbose logs.
Also, we recommend you to please try official TensorRT samples for better performance.
Yes, I’m experiencing the same thing, running trtexec on yolov7, and it just takes absolutely forever. In fact it’s been running and hasn’t finish after >1hr. This is on a Jetson Xavier dev kit.
Hmm, looks like stopping and reloading the trtexec process worked for me, and it rendered an engine 1280x768 in about 20-30 minutes. Still feels a lot slower than older versions, in the previous Jetpack release (4.4), it would take maybe 1-2 minutes to create an engine.
Hello,I meet the similiar problem.I use the version8.5 of TensorRT to convert the yolov4 onnx to engine model.It still takes 42mins and does not decline.
So I want to know how did you decrease the time of converting the model?
I haven’t solved it yet.
What is certain is that the build speed was faster on the gtx 1060.
After replacing the GPU(1060->3080), it became several times slower.