I have run SSD on my Jetson TX2, but the maximum speed was only 3.5 FPS, which is too slow. Is there any way to speed it up?
I want to use NVIDIA TensorRT to accelerate SSD, and I have installed TensorRT 2.1 on the Jetson TX2, but I don’t know how to start.
Is there any documentation about this?
I would appreciate it if someone could help me. Thanks!
Sure!
But it may disappoint you that I’m a beginner in this area. Even so, I’m working hard.
As for how to accelerate SSD using TensorRT, I have searched the internet,
but I didn’t find any working solution.
As AastaLLL said, “SSD contains some TensorRT non-supported layer. It will require you to implement these layers.” Implementing these layers may be complex, and so far I don’t know how to do it.
Hi, AastaLLL
Could you share your Makefile.config?
When I compiled my caffe-ssd, I got the following error at the “make runtest” step, but it didn’t affect the object detection test.
I built caffe-ssd according to your instructions, but I still got the same error and the speed was still 3.5 FPS.
I don’t know why.
I flashed the TX2 with JetPack 3.0, then used JetPack 3.1 to replace cuDNN, CUDA and TensorRT with the latest versions; the other components are still the ones from JetPack 3.0.
Could this be the cause of the error or of such a low speed?
Do you think I need to re-flash the TX2 and install all components with JetPack 3.1?
JetPack 3.1 uses TensorRT 2.1 and cuDNN v6, which is 2x faster than JetPack 3.0.
We measured this with the TensorRT engine; I’m not sure how much acceleration Caffe can get.
I got 8-9 FPS with the ssd_pascal_video.py script. Which example did you use?
When I built SSD according to your instructions, I always got the error that I posted in #8, but I could still run the examples.
I tested the examples “ssd_pascal_video.py” and “ssd_pascal_webcam.py”, and neither exceeded 4 FPS without the TensorRT engine. Is this normal performance?
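To compare numbers like 3.5 FPS and 8-9 FPS across setups, it helps to measure them the same way each time. Here is a minimal sketch, assuming a hypothetical `process_frame` callable that stands in for one detection step (it is not part of the SSD example scripts) and Python 3 for `time.perf_counter`:

```python
import time


def measure_fps(process_frame, frames, warmup=5):
    """Return the average frames per second of process_frame over frames.

    process_frame is a hypothetical stand-in for one detection step.
    The first `warmup` frames are processed but not timed, so one-time
    setup cost does not skew the average.
    """
    frames = list(frames)
    for frame in frames[:warmup]:
        process_frame(frame)

    timed = frames[warmup:]
    if not timed:
        raise ValueError("need more frames than warmup")

    start = time.perf_counter()
    for frame in timed:
        process_frame(frame)
    elapsed = time.perf_counter() - start
    return len(timed) / elapsed
```

Calling `measure_fps(my_detect, my_frames)` with your own detection function and a list of frames gives one number per setup that can be compared directly.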
By the way, could you please provide sample code for SSD using TensorRT? I tried to deploy SSD with TensorRT following the TensorRT User Guide you supplied, but I haven’t had any success so far.
I just wanted to explain that JetPack 3.1 can give you 2x acceleration, but we measured this with TensorRT.
It looks like Caffe also gets 2x acceleration with JetPack 3.1. (Cool!)
So please re-flash your device with JetPack 3.1, and you can get 8-9 FPS.
Please note that the packages in JetPack 3.1 are for branch rel-28.1.
Installing them on a device flashed with JetPack 3.0, which is rel-27.1, may cause errors.
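One way to check which release the board was actually flashed with is to read the L4T release file. This is a minimal sketch, assuming the file is `/etc/nv_tegra_release` and that its first line looks like `# R28 (release), REVISION: 1.0, ...`. Both the path and the line format are assumptions here, so check them on your own device. The function names are made up for illustration.

```python
import re


def parse_l4t_release(text):
    """Extract (major, revision) from an L4T release line.

    Assumed line format: "# R28 (release), REVISION: 1.0, ..."
    Returns None if the text does not match that format.
    """
    match = re.search(r"R(\d+)\s*\(release\),\s*REVISION:\s*([\d.]+)", text)
    if match is None:
        return None
    return int(match.group(1)), match.group(2)


def read_l4t_release(path="/etc/nv_tegra_release"):
    """Read the first line of the (assumed) release file and parse it."""
    try:
        with open(path) as f:
            return parse_l4t_release(f.readline())
    except (IOError, OSError):
        # File missing or unreadable, e.g. when not running on a Jetson.
        return None
```

On the device, `print(read_l4t_release())` shows the base release. If it reports R27 while the CUDA, cuDNN and TensorRT packages came from JetPack 3.1, that would be the mismatch described above.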
For SSD-TensorRT, there are some unsupported layers.
You need to implement these layers with the custom API.
Hello, dongsheng_wang.
I didn’t do anything. This error didn’t affect the object detection test,
and it didn’t affect the use of Caffe.
I don’t know how to solve it. The source file for this error is “test_bbox_util.cpp”, which was written by SSD’s author. I think that checking the source code might help solve it.