Image pre-processing between PGIE and SGIE

Hello. I need to implement the same kind of pipeline as below, but with an affine transform applied to every box, each with its own affine transform matrix:

  1. Input
  2. PGIE
  3. Affine transform on every box
  4. SGIE

Can I implement such a pipeline without touching the gst-nvinfer code?
Thanks.


Would you mind explaining your requirement in more detail?

I have a classifier that requires aligned images, and a detector that returns landmarks along with bounding boxes, but in a separate network output. Based on these landmarks I need to calculate an affine transformation matrix and then align the detected region, so that the classifier gets an aligned image.
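
To compute the matrix I plan to fit the detected landmarks to a canonical template, roughly like this (an OpenCV-style sketch; the landmark variables and template points are only illustrative):

// Rough sketch: fit a 2x3 affine matrix mapping three detected landmarks
// (e.g. eyes and nose tip from the PGIE output) onto a fixed template.
#include <opencv2/imgproc.hpp>

cv::Point2f detected[3]  = { leftEye, rightEye, noseTip };                 // from the PGIE landmarks
cv::Point2f canonical[3] = { {38.f, 52.f}, {74.f, 52.f}, {56.f, 72.f} };   // illustrative template
cv::Mat M = cv::getAffineTransform (detected, canonical);                  // 2x3 CV_64F matrix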

In the gst-nvinfer plugin sources I see that the input buffers for the classifier are generated right before inference, so I can't align each of them without changing gst-nvinfer… or can I?

Today I tried to align each detected region in the buffer passed from the streammux, but that buffer is also used for display, so this solution doesn't suit me.
There is another problem: it's difficult to correctly match a bounding box taken from the object metadata with the landmarks taken from the raw tensor.

Please point me in the right direction on how to implement such a pipeline.

Gentle reminder.
Maybe I can split the buffers passed to the SGIE and nvosd? Then the transformed image would go to the SGIE and the original to nvosd. Can you advise how to do that?

Checking internally; will give you a response ASAP.

We suggest installing a probe on the PGIE src pad, doing the affine transform on every box there if possible, and changing/updating the metadata; then the SGIE will work on this data. A rough sketch follows below.
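
A minimal sketch of attaching such a probe (an illustration only; "pgie" is assumed to be the nvinfer element created elsewhere, and the warp itself is left as a TODO):

/* Hypothetical sketch: buffer probe on the PGIE src pad. */
static GstPadProbeReturn
pgie_src_pad_probe (GstPad *pad, GstPadProbeInfo *info, gpointer user_data)
{
  GstBuffer *buf = GST_PAD_PROBE_INFO_BUFFER (info);
  /* TODO: map the NvBufSurface, read the landmarks from the tensor meta,
     and warp each detected region here. */
  return GST_PAD_PROBE_OK;
}

GstPad *src_pad = gst_element_get_static_pad (pgie, "src");
gst_pad_add_probe (src_pad, GST_PAD_PROBE_TYPE_BUFFER,
    pgie_src_pad_probe, NULL, NULL);
gst_object_unref (src_pad);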

Maybe I can split the buffers passed to the SGIE and nvosd? Then the transformed image would go to the SGIE and the original to nvosd. Can you advise how to do that?
You can add a tee after the PGIE and then connect both the SGIE and nvosd branches to the tee's src pads, for example as sketched below.
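
A rough gst-launch-style sketch of that split (element names and sinks are illustrative, assuming DeepStream 4.x; the config file names are placeholders):

... ! nvinfer config-file-path=pgie_config.txt ! tee name=t \
    t. ! queue ! nvinfer config-file-path=sgie_config.txt ! fakesink \
    t. ! queue ! nvvideoconvert ! nvdsosd ! nveglglessink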

Thanks for your answer.

I have checked everything again and noticed that the affine transform function (nppiWarpAffine) I want to use can't work with NV12 buffers.

As mentioned in this thread, I can do an NV12 → RGBA → NV12 conversion to perform the affine transform on an RGBA buffer.

But after that, the NV12 → RGBA (RGB/BGR) conversion will be applied again inside the gst-nvinfer plugin. That looks redundant; maybe there is another way to transform the buffer?
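
For reference, the RGBA warp I have in mind looks roughly like this (a sketch; the pointers, pitches, sizes, and coefficient matrix are assumed to come from the surrounding code):

// Rough sketch: warp one 8-bit, 4-channel RGBA region with NPP.
#include <nppi_geometry_transforms.h>

NppiSize srcSize = { srcWidth, srcHeight };
NppiRect srcROI  = { 0, 0, srcWidth, srcHeight };
NppiRect dstROI  = { 0, 0, dstWidth, dstHeight };
double   coeffs[2][3] = { {1.0, 0.0, 0.0},
                          {0.0, 1.0, 0.0} };   // placeholder; real values come from the landmarks

NppStatus st = nppiWarpAffine_8u_C4R (pSrc, srcSize, srcStep, srcROI,
                                      pDst, dstStep, dstROI,
                                      coeffs, NPPI_INTER_LINEAR);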

Some sort of pre-process buffer callback in gst-nvinfer would be a very good solution that could solve my issue and similar ones. Do you have plans to implement such a feature?

Could you do the WarpAffine processing on the input image (RGB planar) that will be fed into nvinfer? Then you wouldn't need to do the WarpAffine between the PGIE and SGIE.
If that works for you, you could use one of the APIs below.

Three-channel planar 32-bit floating-point affine warp:

NppStatus
nppiWarpAffine_32f_P3R(const Npp32f * pSrc[3], NppiSize oSrcSize, int nSrcStep, NppiRect oSrcROI,
                       Npp32f * pDst[3], int nDstStep, NppiRect oDstROI,
                       const double aCoeffs[2][3], int eInterpolation);

Three-channel packed 32-bit floating-point affine warp:

NppStatus
nppiWarpAffine_32f_C3R(const Npp32f * pSrc, NppiSize oSrcSize, int nSrcStep, NppiRect oSrcROI,
                       Npp32f * pDst, int nDstStep, NppiRect oDstROI,
                       const double aCoeffs[2][3], int eInterpolation);
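
For the planar case (which matches the RGB-planar tensor nvinfer consumes), a call could look roughly like this (a sketch; the plane pointers, step, and dimensions are assumptions standing in for whatever the surrounding code provides):

// Rough sketch: warp a planar float RGB image (three separate planes).
const Npp32f *srcPlanes[3] = { srcR, srcG, srcB };   // assumed device pointers
Npp32f       *dstPlanes[3] = { dstR, dstG, dstB };
NppiSize srcSize = { width, height };
NppiRect roi     = { 0, 0, width, height };
double   coeffs[2][3] = { {1.0, 0.0, 0.0},
                          {0.0, 1.0, 0.0} };   // placeholder; computed from the landmarks

NppStatus st = nppiWarpAffine_32f_P3R (srcPlanes, srcSize, planeStep, roi,
                                       dstPlanes, planeStep, roi,
                                       coeffs, NPPI_INTER_LINEAR);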

Could you do the WarpAffine processing on the input image (RGB planar) that will be fed into nvinfer? Then you wouldn't need to do the WarpAffine between the PGIE and SGIE.

I hope I understand you correctly.
The affine transformation coefficients are calculated from the PGIE output (the additional output tensor with landmarks), so I think it's not possible to perform the warp affine transform before nvinfer.

Sorry! Please ignore my comment; it would not work.

I think the solution bcao suggested in comment #6 should work.

In PGIE/nvinfer, the current data processing flow is: NV12 → (conversion / CUDA kernel) → RGB planar → TensorRT inference → parser.

So you could just add the WarpAffine between "RGB planar" and "TensorRT inference"?

Could you give some advice on how to implement that in gst-nvinfer?

In the gst-nvinfer sources I see the get_converted_buffer function, which prepares the buffer for transformation (crops, scales, and converts to RGB planar), and convert_batch_and_push_to_input_thread, which transforms the buffer and pushes it to inference.

I assume the WarpAffine should be called in convert_batch_and_push_to_input_thread, somewhere before the process_lock mutex is taken. Is that right?

Hi

In file: /opt/nvidia/deepstream/deepstream-4.0/sources/libs/nvdsinfer/nvdsinfer_context_impl.cpp

The code below calls the TensorRT enqueue() API (see TensorRT: nvinfer1::IExecutionContext Class Reference):

/* Queue the bound buffers for inferencing. */
if (!m_InferExecutionContext->enqueue(enqueueBatchSize, bindingBuffers,
        m_InferStream, &m_InputConsumedEvent))
{
    printError("Failed to enqueue inference batch");
    status = NVDSINFER_TENSORRT_ERROR;
    goto error;
}

As described in the API reference, bindingBuffers is an array of pointers to the input and output buffers of the network, so you can get the input CUDA buffer pointer from this array and then apply the WarpAffine to it. Would that work for your case?
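
For illustration, a hook just before enqueue() could look roughly like this (the binding index, the planar float RGB layout, and netWidth/netHeight are assumptions; the real values come from the engine bindings and m_NetworkInfo):

/* Hypothetical sketch: warp the bound network input before enqueue(). */
Npp32f *inputBuf = (Npp32f *) bindingBuffers[0];   /* assumes binding 0 is the input */
nppSetStream (m_InferStream);                      /* run NPP on the inference stream */

const Npp32f *srcPlanes[3] = { inputBuf,
                               inputBuf + netWidth * netHeight,
                               inputBuf + 2 * netWidth * netHeight };
/* dstPlanes would point into a scratch device buffer of the same size; then
   call nppiWarpAffine_32f_P3R() as in the earlier sketch and copy the warped
   planes back (or bind the scratch buffer as the input) before enqueue(). */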

Thanks.

How do I access the tensor metadata where the landmarks are stored from nvdsinfer?
And where can I check that this is the PGIE model?

Hi Alexdefsen,
Sorry for the delay!
You can use the "initParams.uniqueID" field in nvdsinfer to identify the GIE; the uniqueID value comes from the "gie-unique-id" property in the GIE config file.
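
For example, with a PGIE config that sets (the file contents and value are illustrative):

[property]
gie-unique-id=1

you could check inside nvdsinfer with something like:

if (initParams.uniqueID == 1) {
    /* this context belongs to the PGIE */
}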

How do I access the tensor metadata where the landmarks are stored from nvdsinfer?
Sorry! I don't understand which tensor metadata you mean.

Thanks!

I mean the NVDSINFER_TENSOR_OUTPUT_META from the NvDsUserMeta passed with the buffer (GstBuffer). I need to access the additional PGIE output where the landmarks are stored, but I can't find any mention of GstBuffer in nvdsinfer_context_impl.cpp.
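
Downstream of nvinfer I can read it roughly like this (a sketch following the SDK's tensor-meta sample; it requires the "output-tensor-meta" property enabled on nvinfer), but inside nvdsinfer there is no such buffer:

// Rough sketch: read the PGIE's raw tensor output from a pad-probe buffer.
NvDsBatchMeta *batch_meta = gst_buffer_get_nvds_batch_meta (buf);
for (NvDsMetaList *l_frame = batch_meta->frame_meta_list; l_frame; l_frame = l_frame->next) {
  NvDsFrameMeta *frame_meta = (NvDsFrameMeta *) l_frame->data;
  for (NvDsMetaList *l_user = frame_meta->frame_user_meta_list; l_user; l_user = l_user->next) {
    NvDsUserMeta *user_meta = (NvDsUserMeta *) l_user->data;
    if (user_meta->base_meta.meta_type != NVDSINFER_TENSOR_OUTPUT_META)
      continue;
    NvDsInferTensorMeta *tensor_meta = (NvDsInferTensorMeta *) user_meta->user_meta_data;
    // tensor_meta->out_buf_ptrs_host[i] holds the i-th output layer,
    // e.g. the landmark tensor, matched to boxes by detection order.
  }
}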

Hi Defsen,
Really sorry about the delay!

Please refer to the NVIDIA DeepStream SDK API Reference: NvDsInferTensorMeta Struct Reference. Could you append your additional output in priv_data?

Thanks!

Sorry, I'm a newbie.
How can I install a probe on the PGIE src pad? I'm deploying face recognition, and I need some facial landmark points to build a WarpAffine on the detected face image before feeding it into the face embedding model. Another question: can I do it with facial landmarks?
(Sorry if my English is not good; I come from a country where English is not the main language.)


Hi bcao. I have the same question as in #6: I need to transpose the image before passing it to the SGIE. How can I do that?