I noticed that in the YOLOv4 config from here, the engine is built in INT8 precision, and layer-device-precision is included to specify that some layers should use FP32 instead. Does the layer-device-precision property only affect inference, or does it also change how the engine is built?
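For reference, the documented format of that property is a semicolon-separated list of <layer-name>:<precision>:<device-type> entries, so the relevant part of such a config looks roughly like this (the layer names below are illustrative placeholders, not copied from the actual YOLOv4 config):

```
[property]
# network-mode=1 selects INT8 for the network as a whole
network-mode=1
# force the listed layers to FP32 on the GPU
layer-device-precision=cls/Sigmoid:fp32:gpu;box/Concat:fp32:gpu
```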
I want to build the engine in advance instead of waiting for a DeepStream app to build it the first time I run it. If I use an external tool such as tao-deploy or tao-converter to build the engine, how do I generate an engine that incorporates the layer-device-precision property that DeepStream supports?
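As far as I can tell, tao-converter only exposes a network-wide precision flag, something like the following (flag names as I remember them from the TAO docs, so verify with tao-converter -h; the key, dims, and file names are placeholders):

```
# -t sets a single precision for the whole network (fp32/fp16/int8);
# I don't see a per-layer precision flag.
tao-converter -k $ENCRYPTION_KEY \
  -d 3,544,960 \
  -t int8 \
  -c calibration.bin \
  -e yolov4_int8.engine \
  yolov4.etlt
```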
I don't know about tao-deploy or tao-converter, but you can use the DeepStream app itself to generate the engine:

1. The first time you run the app, you will see the message "Trying to create engine from model files". DeepStream will attempt to build the engine and save it alongside the model, for example: /opt/nvidia/deepstream/deepstream-5.0/samples/models/Primary_Detector/resnet10.caffemodel_b1_gpu0_fp16.engine.
2. If the app has write permission to that directory (for example, when run with sudo), the build succeeds and you will see the message "Serialized CUDA engine to file: /opt/nvidia/deepstream/deepstream-5.0/samples/models/Primary_Detector/resnet10.caffemodel_b1_gpu0_fp16.engine successfully".
3. Edit the config file to point the model-engine-file field at that engine. On subsequent runs, the engine is loaded from the file instead of being regenerated.
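Assuming a standard nvinfer config file, the change in step 3 looks like this (using the engine path from the example above):

```
[property]
model-engine-file=/opt/nvidia/deepstream/deepstream-5.0/samples/models/Primary_Detector/resnet10.caffemodel_b1_gpu0_fp16.engine
```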
I'm aware that I can make DeepStream re-use the engine by changing the config; I have done what you described before, but it's very inconvenient when working with multiple models, say 10. tao-deploy and tao-converter are from the NVIDIA TAO Toolkit; they let me build engines in advance, and DeepStream can use those engines directly. My goal is that the first time I run a DeepStream app, all the engines it needs are already built, so I don't have to wait an hour and then go back and change every config file to stop DeepStream from rebuilding the engines on the next run.
The layer-device-precision property was added in DeepStream 6.1.1, and the documentation doesn't specify whether DeepStream applies it during inference or during engine building. If it is applied during engine building, then I need to find a way to replicate the setting in tao-deploy / tao-converter. If I can't do that, I will have to use DeepStream itself to build the mixed-precision engine.
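For context, in the raw TensorRT API per-layer precision is a build-time setting, so whatever tool builds the engine would have to apply it while constructing the network. A minimal Python sketch of the idea (assuming the TensorRT 8.x Python API and an ONNX model; the layer names and file paths are placeholders, and this is not how tao-converter works internally):

```python
import tensorrt as trt

TRT_LOGGER = trt.Logger(trt.Logger.INFO)

# Layers that should stay in FP32 inside an otherwise-INT8 engine
# (placeholder names; use the real layer names from your network).
FP32_LAYERS = {"cls/Sigmoid", "box/Concat"}

builder = trt.Builder(TRT_LOGGER)
network = builder.create_network(
    1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH))
parser = trt.OnnxParser(network, TRT_LOGGER)

with open("yolov4.onnx", "rb") as f:  # placeholder model file
    assert parser.parse(f.read()), parser.get_error(0)

config = builder.create_builder_config()
config.set_flag(trt.BuilderFlag.INT8)  # network-wide INT8
# Make TensorRT honor the per-layer constraints set below.
config.set_flag(trt.BuilderFlag.OBEY_PRECISION_CONSTRAINTS)
# A real INT8 build also needs a calibrator (config.int8_calibrator)
# or explicit dynamic ranges; omitted to keep the sketch short.

for i in range(network.num_layers):
    layer = network.get_layer(i)
    if layer.name in FP32_LAYERS:
        layer.precision = trt.float32  # force this layer to FP32

engine_bytes = builder.build_serialized_network(network, config)
with open("yolov4_mixed.engine", "wb") as f:
    f.write(engine_bytes)
```

If layer-device-precision works like this under the hood, it would be baked into the serialized engine, which is exactly why I'd need the external tool to support it too.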