Output tensor values differ between DeepStream and the Keras / Python TensorRT implementations

We are checking this internally.
Will share more information with you later.

sure, thanks


We can run your source and reproduce the RMS issue internally.
Will share more information with you once we find anything.


sure, thank you

any update?


Thanks for your patience.

We can get the expected output from Deepstream with the source as below.
Would you mind checking this on your side as well?

    l_frame = batch_meta.frame_meta_list
    while l_frame is not None:
        try:
            frame_meta = pyds.NvDsFrameMeta.cast(l_frame.data)
        except StopIteration:
            break
        l_user = frame_meta.frame_user_meta_list
        while l_user is not None:
            try:
                user_meta = pyds.NvDsUserMeta.cast(l_user.data)
            except StopIteration:
                break
            meta = pyds.NvDsInferTensorMeta.cast(user_meta.user_meta_data)
            layer = pyds.get_nvds_LayerInfo(meta, 0)

            # View the raw layer buffer as a float CHW tensor
            ptr = ctypes.cast(pyds.get_ptr(layer.buffer), ctypes.POINTER(ctypes.c_float))
            v = np.ctypeslib.as_array(ptr, shape=(3, 224, 224))
            v = v.transpose((1, 2, 0))  # CHW -> HWC
            v = cv2.cvtColor(v, cv2.COLOR_RGB2BGR)

            try:
                l_user = l_user.next
            except StopIteration:
                break
        try:
            l_frame = l_frame.next
        except StopIteration:
            break


@AastaLLL thank you will implement and get back soon

@AastaLLL the code works fine, I am getting the right output. But how is it done for a tensor in my case, the 4 output logits of the MobileNetV2 model?


It should follow the NCHW rule.
Do you get the expected result with the following arrangement?

    ptr = ctypes.cast(pyds.get_ptr(layer.buffer), ctypes.POINTER(ctypes.c_float))
    v = np.ctypeslib.as_array(ptr, shape=(1, 4))
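Once the (1, 4) logits are in a NumPy array, a softmax turns them into class probabilities. A minimal sketch (the logit values below are illustrative, not taken from the model):

```python
import numpy as np

def softmax(logits):
    # Subtract the row max before exponentiating for numerical stability
    z = logits - np.max(logits, axis=-1, keepdims=True)
    e = np.exp(z)
    return e / np.sum(e, axis=-1, keepdims=True)

# Illustrative 4-class logits, shaped like the (1, 4) MobileNetV2 output
v = np.array([[2.0, 1.0, 0.1, -1.0]], dtype=np.float32)
probs = softmax(v)
pred = probs.argmax(axis=-1)  # index of the most likely class per row
```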


will check and let you know


Do you get the expected output with the update?

hey @AastaLLL didn’t get to check it. will do in a day or two and get back asap.

thank you

checked with the change and the RMSE is still high!

hey @AastaLLL can you replicate the issue on the Jetson Nano, run inference, and check whether the problem is only on my end?


Sure. We are checking this to see the status of our environment.
Will share more information with you later.


Sorry for the late reply.

Based on your scripts, we are able to get very close output between TensorRT and DeepStream.
Since the video decoder is different, some minor differences are expected, and they won't hurt the accuracy.

The experiment was tested with the TensorRT default ResNet-50 model.
Please check the detailed steps below:

Generate data

    $ /usr/src/tensorrt/bin/trtexec --onnx=model.onnx --saveEngine=model.trt
    $ ffmpeg -loop 1 -i /usr/src/tensorrt/data/resnet50/tabby_tiger_cat.jpg -vframes 1000 -vcodec libx264 -crf 25 -pix_fmt yuv420p -t 5 test.h264

Generate output and compare

TRTInferance.py (2.3 KB)
deepstream_test_1.py (7.1 KB)
dstest1_pgie_config.txt (2.9 KB)
rmsmobinet1.py (585 Bytes)

    $ python3 TRTInferance.py
    $ python3 deepstream_test_1.py test.h264
    $ python3 rmsmobinet1.py


    [3.6179506e-07 1.9918819e-01 4.8616490e-01 2.8783941e-05 5.0460119e-07
     3.1102151e-01 8.4696003e-06 1.3424859e-03 7.8613736e-04 1.4913004e-04]
    [4.4984480e-07 8.2435668e-02 5.1276660e-01 2.0303991e-05 2.6858103e-07
     3.8613626e-01 4.7418266e-06 1.7026100e-02 9.7259018e-04 3.3475558e-04]
    average rms :
    max rms :
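The rmsmobinet1.py comparison script is not reproduced in this thread; a minimal sketch of what such an RMS check does, using stand-in arrays in place of the real saved TensorRT and DeepStream outputs (in practice these would be loaded, e.g. from dumped .npy files):

```python
import numpy as np

def rms_error(a, b):
    # Root-mean-square difference between two equally shaped tensors
    return float(np.sqrt(np.mean((a - b) ** 2)))

# Stand-in per-frame outputs; the real script would compare the dumped
# TensorRT tensors against the DeepStream tensors for the same frames
rng = np.random.default_rng(0)
trt_out = rng.random((10, 1000), dtype=np.float32)
ds_out = trt_out + rng.normal(0.0, 1e-3, size=trt_out.shape).astype(np.float32)

per_frame = [rms_error(t, d) for t, d in zip(trt_out, ds_out)]
print("average rms :", np.mean(per_frame))
print("max rms :", np.max(per_frame))
```

With only decoder-level differences between the pipelines, both numbers should stay very small; a large RMS points at a mismatch in preprocessing or in the model itself.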


hey @AastaLLL thanks for the experiment. I will try to reproduce the same on my side with the script you used, and will also try it with MobileNetV2.

thank you

hey @AastaLLL I replicated the same on my device, and it works fine with the ResNet-50 example model from TensorRT, but it gives a high RMSE when I try it with the MobileNetV2 model.

thank you

Can you please verify whether the Keras to ONNX conversion is right? And can you reproduce the same with the MobileNetV2 model that I have shared (with the ONNX & TRT conversion)?


Actually, we wanted to try the model, but it has been removed from the link (post on Nov 9).
So we used ResNet-50 instead as an example.
Would you mind sharing the model and the script to run it with TensorRT again?

The issue should not be related to tf2onnx, since both TensorRT and DeepStream are based on the same ONNX file.
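That said, the Keras to ONNX conversion can be verified independently by feeding the same input to both models and comparing outputs. A sketch, assuming tensorflow and onnxruntime are installed; the file paths and function names are hypothetical:

```python
import numpy as np

def outputs_match(a, b, tol=1e-4):
    # True when the two output tensors agree within tol (max abs difference)
    diff = np.abs(np.asarray(a, dtype=np.float32) - np.asarray(b, dtype=np.float32))
    return bool(np.max(diff) < tol)

def verify_conversion(h5_path, onnx_path):
    # Imports deferred so the helper above stays usable without these packages
    import onnxruntime as ort
    import tensorflow as tf

    model = tf.keras.models.load_model(h5_path)
    sess = ort.InferenceSession(onnx_path)

    # Build a random input matching the ONNX input shape
    # (dynamic dimensions are replaced with 1)
    inp = sess.get_inputs()[0]
    shape = [d if isinstance(d, int) else 1 for d in inp.shape]
    x = np.random.rand(*shape).astype(np.float32)

    keras_out = model.predict(x)
    onnx_out = sess.run(None, {inp.name: x})[0]
    return outputs_match(keras_out, onnx_out)
```

Calling `verify_conversion("model.h5", "model.onnx")` on the shared MobileNetV2 files would show whether the mismatch already exists at the ONNX stage or only appears later in the DeepStream pipeline.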