Triton ensemble model outputs wrong values when retrieving results from buffer through Python bindings

Please provide complete information as applicable to your setup.

• Hardware Platform (Jetson / GPU): GPU
• DeepStream Version: 6.2
• JetPack Version (valid for Jetson only): N/A
• TensorRT Version: 8.5.2.2
• NVIDIA GPU Driver Version (valid for GPU only): 525.85.12
• Issue Type (questions, new requirements, bugs): bug
• How to reproduce the issue? (This is for bugs. Include which sample app is used, the configuration file contents, the command line, and other details for reproducing.)

I have a custom ensemble model on Triton. I am trying to write a custom parser like the one in the deepstream_ssd_parser example. My ensemble model outputs two layers, [-1, 7] and [-1], where -1 is the number of detected objects and depends on the count computed in postprocessing; it is usually around [30, 7].
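For context, here is a minimal sketch of the probe-side parsing I'm attempting, modeled on the deepstream_ssd_parser example. The layer name "detections" and the per-object width of 7 floats are assumptions based on my shapes above, not names from the sample:

```python
import pyds

def layer_finder(output_layer_info, name):
    # Find an output layer by name, as done in deepstream_ssd_parser.
    for i in range(output_layer_info.num_output_layers):
        layer = pyds.get_nvds_LayerInfo(output_layer_info, i)
        if layer.layerName == name:
            return layer
    return None

def parse_objects(tensor_meta, num_objects):
    # "detections" is a hypothetical layer name; each object row is 7 values.
    det_layer = layer_finder(tensor_meta, "detections")
    objects = []
    for i in range(num_objects):
        # pyds.get_detections() reads one float from the buffer at the given index.
        row = [pyds.get_detections(det_layer.buffer, i * 7 + k) for k in range(7)]
        objects.append(row)
    return objects
```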

When running DeepStream with Triton Inference Server, I can confirm that Triton is returning the correct values by printing the output from my Triton Python model script.
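The verification is just a print inside the ensemble's Python backend model before the response is built, roughly like this (the tensor name and the postprocess placeholder are hypothetical):

```python
import numpy as np
import triton_python_backend_utils as pb_utils

class TritonPythonModel:
    def postprocess(self, request):
        # Placeholder: the real model derives detections from upstream tensors.
        return np.zeros((30, 7), dtype=np.float16)

    def execute(self, requests):
        responses = []
        for request in requests:
            dets = self.postprocess(request)
            # Values printed here match what I expect from the model.
            print("dets:", dets.dtype, dets.shape, dets[:3])
            out = pb_utils.Tensor("detections", dets)
            responses.append(pb_utils.InferenceResponse(output_tensors=[out]))
        return responses
```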

However, when I try to read the same output from the output tensor meta in DeepStream through the Python bindings function pyds.get_detections(), I get very high/low values that do not match my Triton model output. The wrong values are similar to the numbers in this topic:

Wrong values in model output using gRPC. There it was mentioned that this is a bug to be fixed in the next release.

• Requirement details (This is for new requirements. Include the module name, i.e. which plugin or which sample application, and the function description): N/A

What is the data type of the two output layers of your model?

Hi Fiona, thanks for getting back to me.

One layer is FP16, the other is UINT16. I can read the correct UINT16 values from DeepStream (verified to match the Triton output), but the FP16 layer gives erroneous values.
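My guess is that pyds.get_detections() interprets the buffer as FP32, so an FP16 buffer read through it comes out scrambled. Here is an untested sketch of reading the raw buffer as FP16 instead; it assumes pyds.get_ptr() can turn layer.buffer into a raw C address:

```python
import ctypes
import numpy as np
import pyds

def read_fp16_layer(layer, num_elements):
    # Reinterpret the raw layer buffer as FP16 instead of letting
    # pyds.get_detections() read it as FP32 (which scrambles the values).
    # Assumption: pyds.get_ptr() returns the address of the buffer capsule.
    addr = pyds.get_ptr(layer.buffer)
    ptr = ctypes.cast(addr, ctypes.POINTER(ctypes.c_uint16))
    raw = np.ctypeslib.as_array(ptr, shape=(num_elements,))
    return raw.view(np.float16).copy()  # copy so the data outlives the buffer
```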

I solved the problem by changing the output layer's data type to FP32 instead of FP16. Thanks for pointing me in that direction.

It is unexpected because, in my custom Triton ensemble postprocessing, I was working with NumPy and ensured the array was FP16 by calling astype(np.float16) on it right before sending it as the inference output.
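For anyone hitting the same issue, the change that fixed it on my side was just the dtype of the array handed to pb_utils.Tensor (the tensor name is hypothetical); the output data_type in config.pbtxt also has to be updated to TYPE_FP32 to match:

```python
import numpy as np
import triton_python_backend_utils as pb_utils

dets = np.random.rand(30, 7)  # stand-in for the real postprocess output

# Before: sending FP16 produced the wrong values through pyds.get_detections()
# dets = dets.astype(np.float16)
dets = dets.astype(np.float32)  # FP32 is read back correctly by the bindings
out = pb_utils.Tensor("detections", dets)  # dtype must match config.pbtxt (TYPE_FP32)
```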
