Output-tensor-meta: access the raw model output with the batch dimension

I'm not using the custom parsing function (the one configurable in the config file) because from inside it I could not find a way to get the device buffers of NvDsInferTensorMeta, or any other GPU/device buffer; only host buffers are provided.
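For reference, this is roughly the parser hook I mean. The struct definitions below are trimmed stand-ins for the real `nvdsinfer.h` types so the sketch compiles on its own (the actual header has more fields and parameters); the point is that `NvDsInferLayerInfo::buffer` is a host pointer, with no device pointer alongside it:

```cpp
#include <cassert>
#include <vector>

// Trimmed stand-ins for the nvdsinfer.h types (illustration only;
// the real headers define many more fields and a richer signature).
struct NvDsInferLayerInfo {
  const char *layerName;
  void *buffer;             // host memory only; no device pointer is exposed here
  unsigned int numElements;
};
struct NvDsInferObjectDetectionInfo {
  float left, top, width, height, detectionConfidence;
};

// Shape of a custom bbox parser as wired up via parse-bbox-func-name:
// everything it receives already lives on the host, so GPU post-processing
// would first require copying the tensors back to the device.
extern "C" bool NvDsInferParseCustomSketch(
    std::vector<NvDsInferLayerInfo> const &outputLayersInfo,
    std::vector<NvDsInferObjectDetectionInfo> &objectList) {
  for (auto const &layer : outputLayersInfo) {
    const float *host = static_cast<const float *>(layer.buffer);
    // ... decode boxes on the CPU from `host` ...
    (void)host;
  }
  (void)objectList;
  return true;
}
```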
I want to do post-processing directly on the GPU, using batched kernels for non-maximum suppression and ROI alignment for mask creation. I tried modifying the model's output head instead, but performance was poor because TensorRT's INMSLayer synchronizes with the host on every call. I couldn't find a solution for that either, and the forum topic has received no answers:
https://forums.developer.nvidia.com/t/inmslayer-cuda-graph-invalidation-devicetoshapehostcopy/338025/6

And since my kernels need relatively large processing buffers, I don't want to allocate them on every single call, but once at initialization, sized for the maximum expected batch.
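What I have in mind is an allocate-once workspace sized for the maximum batch. In this self-contained sketch, plain `new[]`/`delete[]` stand in for `cudaMalloc`/`cudaFree`; the names and sizing scheme are my own, not from any DeepStream API:

```cpp
#include <cstddef>
#include <stdexcept>

// Allocate-once workspace for post-processing buffers.
// new[]/delete[] stand in for cudaMalloc/cudaFree in this sketch.
class Workspace {
 public:
  Workspace(std::size_t maxBatch, std::size_t bytesPerItem)
      : capacity_(maxBatch * bytesPerItem),
        buf_(new unsigned char[maxBatch * bytesPerItem]) {}
  ~Workspace() { delete[] buf_; }
  Workspace(const Workspace &) = delete;
  Workspace &operator=(const Workspace &) = delete;

  // Hand out the preallocated buffer for the current batch;
  // no per-call allocation happens here.
  unsigned char *get(std::size_t batch, std::size_t bytesPerItem) {
    if (batch * bytesPerItem > capacity_)
      throw std::runtime_error("batch exceeds preallocated capacity");
    return buf_;
  }

 private:
  std::size_t capacity_;
  unsigned char *buf_;
};
```

The workspace is built once at initialization and every inference call just reuses it, which is exactly what I can't do if the buffers have to live inside a per-call parsing function.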

I also don't understand some of the design decisions here: why is the originally contiguous output buffer split up per frame? Or is this due to internals of how TensorRT handles batched inference?