Using batch-size > 1, nvinferserver on gRPC doesn't return metadata for each stream; it's mixed and flattened

Please provide complete information as applicable to your setup.

• Hardware Platform (Jetson / GPU): T4
• DeepStream Version: 6.0.1-triton
• NVIDIA GPU Driver Version (valid for GPU only): 510.47.03

I am using multiple file sources and running inference with nvinferserver against Triton over gRPC.
With batch-size greater than 1, the NvDsInferLayerInfo buffer for stream index 0 contains garbage values at positions 0 and 1, whereas for stream 1 the returned tensor is at positions 0 and 1.

When batch-size == 1, inference works as expected.

infer_config {
  unique_id: 1
  gpu_ids: 0
  max_batch_size: 30
  backend {
    inputs: [ {
      name: "INPUT"
    }]
    outputs: [
      {name: "OUTPUT"}
    ]
    triton {
      model_name: "wsframe-sd-default-ensemble"
      version: -1
      grpc {
        url: "10.54.18.225:8001"
      }
    }
  }

  preprocess {
    network_format: IMAGE_FORMAT_RGB
    tensor_order: TENSOR_ORDER_NHWC
    tensor_name: "INPUT"
    maintain_aspect_ratio: 0
    frame_scaling_hw: FRAME_SCALING_HW_DEFAULT
    frame_scaling_filter: 1
    normalize { scale_factor: 1.0 }
  }

  postprocess {
    other {}
  }

  extra {
    copy_input_to_host_buffers: false
    output_buffer_pool_size: 6
  }
}
input_control {
  process_mode: PROCESS_MODE_FULL_FRAME
  operate_on_gie_id: -1
  interval: 2
}

output_control {
  output_tensor_meta: true
}

When batch-size > 1, the returned tensor for stream 0 does not start at index 0 of the NvDsInferLayerInfo buffer; indices 0 and 1 hold zeros instead.

When batch-size == 1, the returned tensor starts at index 0 of the NvDsInferLayerInfo buffer, which is the desired behaviour.

My output tensor from Triton has shape [-1, 2].
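If the batch really is being flattened into a single buffer, the values for the frame with batch_id N would sit at offset N * 2 rather than at offset 0 of that frame's own buffer. A minimal sketch of that indexing, under my assumption about the layout (not confirmed behaviour):

# Hypothetical layout if one flat [batch, 2] buffer were attached to every frame:
# [f0_v0, f0_v1, f1_v0, f1_v1, ...]
VALUES_PER_FRAME = 2  # the Triton output shape is [-1, 2]

def frame_values(flat_buffer, batch_id):
    # Values for the frame at batch_id under the flattened-layout guess.
    start = batch_id * VALUES_PER_FRAME
    return flat_buffer[start:start + VALUES_PER_FRAME]

I mention this only as a guess at why the per-stream buffers look shifted.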

When using batch-size > 1 with nvinferserver over gRPC, the buffer pointer for stream id 0 points to a garbage value. This is how I read it:
import ctypes

import numpy as np
import pyds

# Inside the frame-user-meta loop of a pad probe:
tensor_meta = pyds.NvDsInferTensorMeta.cast(um_frame_meta.user_meta_data)
layer = pyds.get_nvds_LayerInfo(tensor_meta, 0)
ptr = ctypes.cast(pyds.get_ptr(layer.buffer), ctypes.POINTER(ctypes.c_float))
v = np.ctypeslib.as_array(ptr, shape=(2,))

For stream id 0, v contains garbage values. Any help will be highly appreciated.
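For context, here is a fuller sketch of the probe around that fragment, following the tensor-meta pattern in the deepstream_python_apps samples (the probe name and where I attach it are from my app, not from a DeepStream sample):

import ctypes

import gi
import numpy as np
import pyds

gi.require_version("Gst", "1.0")
from gi.repository import Gst

def tensor_probe(pad, info, u_data):
    # Walk every frame in the batch and print the first two floats of the
    # first output layer together with the frame's batch_id.
    batch_meta = pyds.gst_buffer_get_nvds_batch_meta(hash(info.get_buffer()))
    l_frame = batch_meta.frame_meta_list
    while l_frame is not None:
        frame_meta = pyds.NvDsFrameMeta.cast(l_frame.data)
        l_user = frame_meta.frame_user_meta_list
        while l_user is not None:
            user_meta = pyds.NvDsUserMeta.cast(l_user.data)
            if user_meta.base_meta.meta_type == pyds.NvDsMetaType.NVDSINFER_TENSOR_OUTPUT_META:
                tensor_meta = pyds.NvDsInferTensorMeta.cast(user_meta.user_meta_data)
                layer = pyds.get_nvds_LayerInfo(tensor_meta, 0)
                ptr = ctypes.cast(pyds.get_ptr(layer.buffer),
                                  ctypes.POINTER(ctypes.c_float))
                v = np.ctypeslib.as_array(ptr, shape=(2,))
                print(f"batch_id={frame_meta.batch_id} values={v}")
            try:
                l_user = l_user.next
            except StopIteration:
                break
        try:
            l_frame = l_frame.next
        except StopIteration:
            break
    return Gst.PadProbeReturn.OK

I attach it to the src pad of nvinferserver with pad.add_probe(Gst.PadProbeType.BUFFER, tensor_probe, 0). With batch-size == 1 every batch_id prints sensible values; with batch-size > 1 the batch_id == 0 line prints garbage.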

Which batch-size do you mean?

I am using nvinferserver with a remote gRPC URL.

It is skipping inference on the first 2 frames in a batch and returning inference on the rest. I am using interval=0.

nvstreammux: batch-size=10, batched-push-timeout=100 ms
nvinferserver (gRPC): interval=0, batch-size=30

I am using 10 filesrc elements; the result skips the first two frames in a batch, and inference is returned on the rest of the frames in the batch.

I also tried another scenario where I used the same files as 2 filesrc elements, and set:
nvstreammux: batch-size=2, batched-push-timeout=100 ms
nvinferserver (gRPC): interval=0, batch-size=2

Then it returned classifier meta only for frames with batch_id == 1; there was no classifier meta for batch_id == 0.
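To check which frames actually receive classifier meta, I dump the labels per batch_id in a probe; a sketch following the deepstream_python_apps pattern, assuming the classifier output lands on the objects' classifier_meta_list as in those samples (with full-frame processing it may land elsewhere; this is my assumption):

import pyds

def nvds_next(l):
    # pyds list nodes can raise StopIteration past the end of the list.
    try:
        return l.next
    except StopIteration:
        return None

def print_labels_per_frame(batch_meta):
    l_frame = batch_meta.frame_meta_list
    while l_frame is not None:
        frame_meta = pyds.NvDsFrameMeta.cast(l_frame.data)
        labels = []
        l_obj = frame_meta.obj_meta_list
        while l_obj is not None:
            obj_meta = pyds.NvDsObjectMeta.cast(l_obj.data)
            l_cls = obj_meta.classifier_meta_list
            while l_cls is not None:
                cls_meta = pyds.NvDsClassifierMeta.cast(l_cls.data)
                l_label = cls_meta.label_info_list
                while l_label is not None:
                    label_info = pyds.NvDsLabelInfo.cast(l_label.data)
                    labels.append(label_info.result_label)
                    l_label = nvds_next(l_label)
                l_cls = nvds_next(l_cls)
            l_obj = nvds_next(l_obj)
        # An empty list here means the frame got no classifier meta at all.
        print(f"batch_id={frame_meta.batch_id} labels={labels}")
        l_frame = nvds_next(l_frame)

In my runs with the two-source setup above, only batch_id == 1 ever prints any labels.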
Please help, am I missing something?

They won't help you; I have been waiting a month for their valuable response.


• Hardware Platform (Jetson / GPU): T4
• DeepStream Version: 6.0.1-triton
• NVIDIA GPU Driver Version (valid for GPU only): 510.47.03
• Issue Type (questions, new requirements, bugs): Bug
• How to reproduce the issue?: Using this pipeline:
gst-launch-1.0 \
  filesrc location=left.h264 ! h264parse ! nvv4l2decoder name=c102 \
  filesrc location=right.h264 ! h264parse ! nvv4l2decoder name=c104 \
  c102. ! m.sink_0 nvstreammux name=m batch-size=2 width=1920 height=1080 \
  c104. ! m.sink_1 \
  m. ! nvinferserver config-file-path=config.txt ! fakesink

The config.txt is the one posted earlier in this thread.

batch-size is 2 for nvstreammux.

Classifier meta is returned only for frames with batch_id == 1; there is no classifier meta for batch_id == 0.
It also returns garbage values in the confidence for some frames.

Yes, I also got corrupted images in frames from inference.

Sorry for the late response. Can you share your model and config files so we can reproduce your problem?