DeepStream 5.0 nvinferserver how to use upstream tensor meta as a model input

• Hardware Platform: Tesla T4
• DeepStream Version 5.0
• TensorRT V7.0
• NVIDIA GPU Driver Version 450.57

I have a face alignment custom model deployed successfully to Triton Inference Server, with 2 inputs:

  1. a 112x112x3 face image
  2. a 5 point landmark of that face image

The output of this model is an aligned face image.

I’m trying to deploy this custom model to the nvinferserver element of DeepStream 5, where the upstream element is a primary face detection model that also outputs landmarks.

The problem is that I don’t know how to pass the face landmarks (in the form of NvDsInferTensorMeta from the upstream face detection model) as the second input to this Triton custom model.

The Gst-nvinferserver File Configuration Specifications do not seem to mention how to map upstream tensor meta to a Triton model’s inputs.

Please give me advice. Thanks.

Hi

I think the way to go is to customize nvinfer so that the primary element attaches the landmarks as NvDsUserMeta to the object meta, and then have the face alignment stage extract and parse that meta. I haven’t used user meta myself, but it seems it was added for cases like this one.
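A rough sketch of what that attachment could look like, in a pad probe downstream of the primary detector. This is untested and the names `NVDS_USER_FACE_LANDMARK_META`, `FaceLandmarks`, and `attach_landmarks` are my own illustrations, not part of the SDK; only the `nvds_*` calls are real DeepStream APIs:

```c
/* Sketch (untested): attach per-object 5-point landmarks as NvDsUserMeta.
 * Requires the DeepStream SDK headers; type/function names below marked
 * as illustrative are assumptions, not SDK symbols. */
#include "gstnvdsmeta.h"

/* Illustrative custom meta type registered via the SDK helper. */
#define NVDS_USER_FACE_LANDMARK_META \
    (nvds_get_user_meta_type ("NVIDIA.NVINFER.USER_META_FACE_LANDMARK"))

/* Illustrative payload: 5-point landmarks for one face. */
typedef struct { float x[5], y[5]; } FaceLandmarks;

static gpointer copy_landmarks (gpointer data, gpointer user_data)
{
  NvDsUserMeta *user_meta = (NvDsUserMeta *) data;
  /* Deep-copy the payload so each surface owns its landmarks. */
  return g_memdup (user_meta->user_meta_data, sizeof (FaceLandmarks));
}

static void release_landmarks (gpointer data, gpointer user_data)
{
  NvDsUserMeta *user_meta = (NvDsUserMeta *) data;
  g_free (user_meta->user_meta_data);
  user_meta->user_meta_data = NULL;
}

/* Illustrative helper: call once per detected face object. */
static void
attach_landmarks (NvDsBatchMeta *batch_meta, NvDsObjectMeta *obj_meta,
    const FaceLandmarks *lm)
{
  NvDsUserMeta *user_meta = nvds_acquire_user_meta_from_pool (batch_meta);
  user_meta->user_meta_data = g_memdup (lm, sizeof (FaceLandmarks));
  user_meta->base_meta.meta_type = NVDS_USER_FACE_LANDMARK_META;
  user_meta->base_meta.copy_func = copy_landmarks;
  user_meta->base_meta.release_func = release_landmarks;
  nvds_add_user_meta_to_obj (obj_meta, user_meta);
}
```

The downstream element would then iterate `obj_meta->obj_user_meta_list`, match on the meta type, and read the landmarks back out.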

Thank you for your answer, but this is not the solution I’m looking for.

I had high hopes when the DeepStream 5.0 Developer Preview announced Triton Inference Server integration, because it makes DeepStream so much more flexible.

But it seems the nvinferserver element falls short of that expectation, since it only supports Detection, Classification and Segmentation. There are many types of machine learning models that don’t fit into these three categories.
I still hope to get feedback from NVIDIA.

So the face alignment custom model needs 2 input layers? And what are the landmarks used for — are they independent of the model or not?

Yes, the face alignment custom model needs 2 input layers: one for the face image and one for the corresponding face landmarks. Each face has its own landmarks.
The face landmarks are one of the outputs of the upstream face detection model, as in the RetinaFace model.

OK, currently nvinfer/nvinferserver don’t support models with 2 or more input layers, and we also don’t yet support customizing the preprocess the way the postprocess can be customized; we may add this support in a later release.
One solution here:
Combine your 2 models (face detection and face alignment) in one inferserver plugin, add a Triton custom backend model between the two to handle the postprocess and preprocess, then add an ensemble model to connect the 3 models. The pipeline is as follows:
modelA (face detection) -> custom Triton backend (postprocess for A + preprocess for B) -> modelB (face alignment)
For ensemble models, refer to https://docs.nvidia.com/deeplearning/triton-inference-server/user-guide/docs/models_and_schedulers.html#ensemble-models
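A minimal sketch of what the ensemble’s config.pbtxt could look like for this pipeline. All model names (`face_detection`, `pre_post_custom_backend`, `face_alignment`) and tensor names are placeholders you would replace with your own; shapes are illustrative:

```
# Sketch of an ensemble config.pbtxt; names/shapes are assumptions.
name: "face_pipeline"
platform: "ensemble"
max_batch_size: 16
input [
  { name: "IMAGE", data_type: TYPE_FP32, dims: [ 3, -1, -1 ] }
]
output [
  { name: "ALIGNED_FACE", data_type: TYPE_FP32, dims: [ 3, 112, 112 ] }
]
ensemble_scheduling {
  step [
    {
      # Step 1: detector produces boxes + 5-point landmarks.
      model_name: "face_detection"
      model_version: -1
      input_map  { key: "INPUT"     value: "IMAGE" }
      output_map { key: "BBOXES"    value: "det_bboxes" }
      output_map { key: "LANDMARKS" value: "det_landmarks" }
    },
    {
      # Step 2: custom backend does postprocess for A + preprocess for B
      # (crop the face, select its landmarks).
      model_name: "pre_post_custom_backend"
      model_version: -1
      input_map  { key: "IMAGE"          value: "IMAGE" }
      input_map  { key: "BBOXES"         value: "det_bboxes" }
      input_map  { key: "LANDMARKS"      value: "det_landmarks" }
      output_map { key: "FACE_CROP"      value: "face_crop" }
      output_map { key: "FACE_LANDMARKS" value: "face_landmarks" }
    },
    {
      # Step 3: alignment model consumes both tensors.
      model_name: "face_alignment"
      model_version: -1
      input_map  { key: "FACE_IMAGE" value: "face_crop" }
      input_map  { key: "LANDMARKS"  value: "face_landmarks" }
      output_map { key: "ALIGNED"    value: "ALIGNED_FACE" }
    }
  ]
}
```

The `input_map`/`output_map` entries are what wire the intermediate tensors (`det_landmarks`, `face_crop`, …) between steps, so to nvinferserver the ensemble looks like a single model with one image input.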

Thank you for your solution.

I hope that in the next release of DeepStream we will have an nvinferserver element with a CUSTOM mode that supports multiple NvDsInferTensorMeta inputs and maps those tensors to the corresponding Triton model inputs.
There are many machine learning models that simply don’t operate on images in PROCESS_MODE_FULL_FRAME or PROCESS_MODE_CLIP_OBJECTS mode; for example, an action recognition model operates on a time series of human skeletons, a graph structure.

Supporting this feature would make DeepStream much more flexible, widely extending the possible uses of the SDK.
I accept this as a workaround solution.