Cannot run model tested on GPU on TX2 - garbage returned.

The story:

  1. I have a Jetson TX2 to run a model on (to detect things in images)
  2. I'm given a model, which is converted to UFF.
  3. I use bin2c.py to produce C-like code to include in my sampleUffMNIST.cpp (a heavily patched sample file)
  4. I use sample build setup to produce the binary out of that sampleUffMNIST.cpp (right on that Jetson)
  5. I run that binary and it waits on AF_UNIX socket...
  6. I run feed.py, which grabs the images and feeds them to sampleUffMNIST.cpp over that local socket, and
  7. feed.py fetches the output from sampleUffMNIST.cpp (which is expected to be a matrix of "probabilities" of a pixel belonging to an object), then
  8. the matrix is applied to the source image, and we get the image with everything masked out except the objects found.
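The feed.py side of steps 6–7 can be sketched as follows. This is a minimal illustration, assuming a simple length-prefixed framing over the AF_UNIX socket; the actual protocol in the linked source may differ, and the helper names are mine:

```python
import socket
import struct

def send_frame(sock, payload: bytes) -> None:
    """Send a 4-byte big-endian length header followed by the payload."""
    sock.sendall(struct.pack(">I", len(payload)) + payload)

def recv_frame(sock) -> bytes:
    """Read the length header, then exactly that many payload bytes."""
    (length,) = struct.unpack(">I", _recv_exact(sock, 4))
    return _recv_exact(sock, length)

def _recv_exact(sock, n: int) -> bytes:
    buf = b""
    while len(buf) < n:
        chunk = sock.recv(n - len(buf))
        if not chunk:
            raise ConnectionError("socket closed mid-frame")
        buf += chunk
    return buf

if __name__ == "__main__":
    # Demonstrate the framing with a connected AF_UNIX socket pair.
    # In the real setup, feed.py would connect() to the path that the
    # patched sampleUffMNIST.cpp binary is listening on.
    a, b = socket.socketpair(socket.AF_UNIX, socket.SOCK_STREAM)
    send_frame(a, b"fake image bytes")
    print(recv_frame(b))
```

A length prefix like this avoids one classic source of "garbage" output: treating a partial recv() as a complete probability matrix.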

I have a mockup in Python 3 (the same original sampleUffMNIST.cpp, just rewritten) on a PC with a GPU, and it works fine!
But I have no TensorRT Python binding on Jetson.
Hence, I have to code in C++ (which is not my favorite by any means) to run the engine.

This doesn’t work. I get garbage on the output.

Here we are.

The details:

  • Jetson TX2 running tegra-ubuntu 4.4.38-tegra
  • tensorrt 4.0.2.0-1+cuda9.0 package on TX2
  • tensorrt 5.0.0.10-1+cuda9.0 package on PC/GPU
  • the source code for mentioned files is available here

Any help would be appreciated!

A few words more: the GPU is a 1080 (not a “Ti”); both FP16 and FP32 modes were attempted; the 1080 prefers FP32, the TX2 prefers FP16…

Even more: if I build the same patched sampleUffMNIST.cpp on PC/GPU and run the routine described above… it works!

And fails on Jetson.

Why??

Let’s try the low-hanging fruit first. Can you try updating to JetPack 4.1, which contains TRT5 (matching your desktop configuration), and see if the results improve?

Well, I’ll try to.

But it’s said that 4.1 is for Xavier only…

Yes, it is.

I mean, no way (the “Next” button fails here): https://i.imgur.com/wWja5v4.png (I have no right to use the img tag)

This fruit hangs high enough.

An attempt to forcibly upgrade (apt-get install --only-upgrade tensorrt) fails too:

Reading state information... Done
tensorrt is already the newest version (4.0.2.0-1+cuda9.0).

A few more details: an older version of the model (for 256x256 source images) looks OK on both platforms.
The new one (for 512x512 source images) does not.
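One hypothesis worth checking (not confirmed from the code, and all names below are illustrative): a host or device buffer, or a reshape on the feed.py side, still sized for the old 256x256 model while the engine now emits 512x512 output. The arithmetic makes the mismatch obvious:

```python
# Output element counts per image for the two model versions.
OLD_DIM = 256
NEW_DIM = 512
CHANNELS = 1  # illustrative; the real model's channel count may differ

old_elems = OLD_DIM * OLD_DIM * CHANNELS
new_elems = NEW_DIM * NEW_DIM * CHANNELS

# The 512x512 model produces 4x as many output elements. A buffer
# allocated (or a matrix reshaped) with the old dimensions would cover
# only a quarter of the output and leave the rest as uninitialized
# garbage, which can easily look like the symptom described above.
print(new_elems // old_elems)  # 4
```

In the C++ port it is safer to size buffers from the engine's own binding dimensions at runtime rather than from hard-coded constants, so a model swap cannot silently desynchronize them.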

I don’t understand how to map this fact to a possible reason or fix…