DNN Samples are not working on host

Software Version
DRIVE OS Linux 5.2.0 and DriveWorks 3.5

Target Operating System
Linux

Hardware Platform
NVIDIA DRIVE™ AGX Pegasus DevKit (E3550)

SDK Manager Version
1.4.0.7363

Host Machine Version
native Ubuntu 18.04 with RTX 5000

Hello, I upgraded my software from DriveWorks 3.0 to 3.5. In version 3.0, the DNN samples worked, and so did my custom implementations with .bin files. Now I have a problem:
none of the provided DNN samples work. Both my models and the official examples throw an exception:
“Driveworks exception thrown: DW_INTERNAL_ERROR: DNN: Unable to load model.”

What could be the problem here? How can we solve this?

Best regards.

Hi @kaltinok,

As you can see in the Perception module documentation, it states "These modules are available in NVIDIA DRIVE Software releases only." You will need to wait for the upcoming DRIVE Software 11.0. Thanks!

Thank you! Another thing:
The DNN module documentation linked here says "This module is available in both NVIDIA DriveWorks and NVIDIA DRIVE Software releases."
So can I expect my custom .bin files to work with just DRIVE OS and DriveWorks, or should I go back to DRIVE Software 10? I ask because the dwDNN_initializeTensorRTFromFile method also throws the same exception with our custom models.
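
For reference, this is the shape of the failing call (a minimal sketch; the signature is taken from the DriveWorks 3.5 dnn/DNN.h header, and the null plugin configuration, DW_PROCESSOR_TYPE_GPU, and the helper name loadModel are assumptions to verify against your installation):

    #include <dw/dnn/DNN.h>
    #include <cstdio>

    // Hypothetical helper: `ctx` is a dwContextHandle_t initialized elsewhere;
    // `binPath` is the model produced by the tensorRT_optimization tool.
    dwStatus loadModel(dwDNNHandle_t* dnn, const char* binPath, dwContextHandle_t ctx)
    {
        // pluginConfiguration may be nullptr when the model has no custom layers;
        // DW_PROCESSOR_TYPE_GPU selects the GPU inference path.
        dwStatus status = dwDNN_initializeTensorRTFromFile(dnn, binPath, nullptr,
                                                           DW_PROCESSOR_TYPE_GPU, ctx);
        if (status != DW_SUCCESS)
            std::fprintf(stderr, "dwDNN_initializeTensorRTFromFile failed: %s\n",
                         dwGetStatusName(status));
        return status;
    }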

Sorry for my misunderstanding. I thought you were talking about the "Perception Samples".

Could you share your command and the output messages from running any of the "Deep Neural Network (DNN) Framework Samples"?

Here you can see the outputs. The first image is from the official samples, and the second is from our custom implementation.

[First screenshot: output of the official sample]

[Second screenshot: output of our custom implementation]

Both give errors.

Also, I tested my models as TRT engines with trtexec and they pass correctly.
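
For context, passing the engine to trtexec exercises roughly the same path as this standalone TensorRT 6.x check (a sketch; the plan file name model.engine and the minimal logger are placeholders):

    #include <NvInfer.h>
    #include <fstream>
    #include <iostream>
    #include <vector>

    // Minimal logger: TensorRT requires an ILogger implementation.
    class Logger : public nvinfer1::ILogger
    {
        void log(Severity severity, const char* msg) override
        {
            if (severity <= Severity::kWARNING)
                std::cerr << msg << std::endl;
        }
    } gLogger;

    int main()
    {
        // Read the serialized plan file into memory.
        std::ifstream file("model.engine", std::ios::binary | std::ios::ate);
        if (!file) { std::cerr << "cannot open plan file\n"; return 1; }
        std::vector<char> blob(static_cast<size_t>(file.tellg()));
        file.seekg(0);
        file.read(blob.data(), blob.size());

        // Deserialize with the standard runtime on the current GPU; this is
        // where a device mismatch between build and run would surface.
        nvinfer1::IRuntime* runtime = nvinfer1::createInferRuntime(gLogger);
        nvinfer1::ICudaEngine* engine =
            runtime->deserializeCudaEngine(blob.data(), blob.size(), nullptr);
        std::cout << (engine ? "engine deserialized OK" : "deserialization failed")
                  << std::endl;
        // TensorRT 6 objects are released with destroy(), not delete.
        bool ok = engine != nullptr;
        if (engine) engine->destroy();
        runtime->destroy();
        return ok ? 0 : 1;
    }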

Please take a look at "Q: If I build the engine on one GPU and run the engine on another GPU, will this work?" in "Chapter 14. Troubleshooting" of the "NVIDIA DRIVE OS 5.2.0.0 TensorRT 6.3.1 Developer Guide" (at ~/nvidia/nvidia_sdk/DRIVE_OS_5.2.0_SDK_Linux_OS_DDPX/documentations/drive_os_documentation/NVIDIA_DRIVE_OS_5.2_For_TensorRT_6.3.1_Developer_Guide.pdf on the host system).

The problem is that the standard runtime of TensorRT 6.3.1 wrongly treats as an error a warning that should apply only to the proxy/safety runtime. We have already fixed this in TensorRT 6.4.

On TensorRT 6.3.1, you need to generate and deserialize your plan file on the same GPU. Thanks!
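
A quick way to confirm which device the plan is being built and deserialized on is to print the current GPU's name and compute capability with the CUDA runtime API (a sketch; device 0 is assumed):

    #include <cuda_runtime_api.h>
    #include <cstdio>

    int main()
    {
        // Report the device this process would build/deserialize engines on.
        int device = 0;
        cudaGetDevice(&device);
        cudaDeviceProp prop;
        cudaGetDeviceProperties(&prop, device);
        std::printf("GPU %d: %s (SM %d.%d)\n", device, prop.name, prop.major, prop.minor);
        return 0;
    }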

Actually, I use the same GPU and still get those errors. I switched back to DRIVE Software 10, and in that version the errors are gone.
Do you plan to upgrade the TensorRT version to 6.4 in DRIVE OS 5.2? DRIVE Software 10 ships TensorRT 5.1, which is not suitable for me.

Before talking about an upgrade, let's clarify the issue you are seeing first. Do you mean you see the error even when using a plan file generated with the RTX 5000 and TensorRT 6.3.1? Thanks.

Yes, exactly as you asked.

Please share the steps for generating your model file on the host system with the RTX 5000 and TensorRT 6.3.1. Thanks.

Sure;

  • I froze the TF 1.14 model and got a .pb file.
  • Converted the .pb file to ONNX format with tf2onnx.
  • Created the .bin file with the TensorRT optimization tool that comes with DriveWorks. In addition, I also saved the model as a TRT engine.
  • Ran the TRT engine with trtexec to test it; the model passes correctly.
  • Finally, I passed the .bin file to the initializeTensorRTFromFile method and got the errors shown above.
    When I switch back to the previous release, there is no error in that method. I suspect the initializeTensorRTFromFile method cannot read the .bin path or something similar (see the sanity-check sketch after this list).
    Thank you.
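
A minimal sanity check for that hypothesis, using only the standard library (the helper name binFileReadable is hypothetical):

    #include <fstream>
    #include <iostream>

    // Returns true if the model file exists, opens, and is non-empty -- rules
    // out a bad path before blaming the deserialization itself.
    bool binFileReadable(const char* path)
    {
        std::ifstream f(path, std::ios::binary | std::ios::ate);
        if (!f) { std::cerr << path << ": cannot open\n"; return false; }
        if (f.tellg() <= 0) { std::cerr << path << ": empty file\n"; return false; }
        return true;
    }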

Dear @kaltinok ,
Could you please share your ONNX model file so that we can reproduce the issue on our end?

Here is the bug.
https://nvbugs/3175027 [VCC-SPA2-Zenuity] TensorRT 6.3.1 - ERROR: Using an engine plan file across different models of devices is not recommended

Here you can find a sample: mnist.pb, and mnist.onnx converted from the .pb file.
mnist.onnx (40.5 KB) mnist.pb (48.3 KB)

Dear @kaltinok ,
I could load the model successfully. To verify model loading, I added a print statement and exit(0) after the dwDNN_initializeTensorRTFromFile() call in sample_dnn_tensor.
These are the steps I followed:

  • Generate the DW-compatible model using the TensorRT_Optimization tool. It generates optimized.bin in the current directory:
    /usr/local/driveworks-3.5/tools/dnn/tensorRT_optimization --modelType=onnx --onnxFile=/path/to/onnxmodel

  • Load the sample with the new model:
    ./sample_dnn_tensor --tensorRT_model=/path/to/optimized.bin
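
The verification described above amounts to something like the following fragment inside the sample (a sketch; m_dnn, modelPath, and m_sdk are assumed names in the style of the sample and may differ in your source tree):

    #include <cstdio>
    #include <cstdlib>

    // Immediately after the load call in sample_dnn_tensor: print and exit so
    // that a successful model load is unambiguous.
    dwStatus status = dwDNN_initializeTensorRTFromFile(&m_dnn, modelPath.c_str(),
                                                       nullptr, DW_PROCESSOR_TYPE_GPU,
                                                       m_sdk);
    if (status == DW_SUCCESS)
    {
        std::printf("Model loaded successfully\n");
        std::exit(0);
    }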