Incorrect Bounding Box Decoding with YOLOv8 TensorRT Engine in DeepStream (Output Shape [5, 8400])


Description:

I have exported a YOLOv8 model (a face-detection variant) to ONNX, built a TensorRT engine from it, and integrated it into DeepStream 7.0 using a custom output parser. The output layer is named output0, with shape [5, 8400] — interpreted per box as:

[x_center, y_center, width, height, confidence]

However, DeepStream shows incorrect bounding box locations — the boxes are not aligning with actual objects in the video (faces).


Details:

DeepStream Version: 7.0

TensorRT Version: 8.x

YOLOv8 Exported via: yolo export model=yolov8n_face.pt format=onnx opset=17

Engine Built With: trtexec

Input resolution: 640×640

Output Layer Shape: [5, 8400]

Classes: 1 (face only)


Observed Output Logging:

Raw Output Sample:

Layer Name: output0
Dims: (5, 8400)
Box 0: 9.61096 5.68841 17.7715 12.5132 0.400749
Box 1: 10.81 5.52857 16.5815 11.1546 0.372961

Interpretation Attempt:

float x_center = output[0 * num_boxes + i];
float y_center = output[1 * num_boxes + i];
float width = output[2 * num_boxes + i];
float height = output[3 * num_boxes + i];
float conf = output[4 * num_boxes + i];

These values are then converted to [left, top, width, height] and clipped to the frame bounds.


Parser Code:

extern "C"
bool NvDsInferParseCustomYoloV8(
std::vector<NvDsInferLayerInfo> const &outputLayersInfo,
NvDsInferNetworkInfo const &networkInfo,
NvDsInferParseDetectionParams const &detectionParams,
std::vector<NvDsInferObjectDetectionInfo> &objectList)
{
const NvDsInferLayerInfo &layer = outputLayersInfo[0];
const float *output = reinterpret_cast<const float *>(layer.buffer);
int num_attrs = layer.dims.d[0]; // 5
int num_boxes = layer.dims.d[1]; // 8400

for (int i = 0; i < num_boxes; ++i) {
    float x_center = output[0 * num_boxes + i];
    float y_center = output[1 * num_boxes + i];
    float width    = output[2 * num_boxes + i];
    float height   = output[3 * num_boxes + i];
    float conf     = output[4 * num_boxes + i];

    if (conf < detectionParams.perClassThreshold[0]) continue;

    float left = std::max(x_center - width / 2.0f, 0.0f);
    float top  = std::max(y_center - height / 2.0f, 0.0f);

    NvDsInferObjectDetectionInfo obj;
    obj.classId = 0;
    obj.left = left;
    obj.top = top;
    obj.width = std::min(width, networkInfo.width - left);
    obj.height = std::min(height, networkInfo.height - top);
    obj.detectionConfidence = conf;
    objectList.push_back(obj);
}

return true;

}


Problem:

Despite proper shape parsing and decoding, detections appear completely misplaced on screen. We verified that:

Data buffer is correct (floats match across frames)

Output is not normalized (values like x_center = 10, width = 20)

Detection boxes drawn do not align with faces


Request:

Could NVIDIA clarify the expected output format for YOLOv8 exported to ONNX and then to TensorRT for DeepStream?

Is there a transform (e.g., normalization or anchor/grid decode) missing from this setup?

Is there any official sample for YOLOv8 with DeepStream?

Why the subtraction here? It’s already the width and height.

Please refer to the official sample for YOLOv8.

There has been no update from you for a while, so we assume this is no longer an issue and are closing this topic. If you need further support, please open a new one. Thanks.

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.