How to do Batch inference on the Xavier

Hey all,

I’ve been trying to get my Xavier to perform inference with a batch size > 1. I tried adapting the code from here: GitHub - NVIDIA-AI-IOT/tf_to_trt_image_classification: Image classification with NVIDIA TensorRT from TensorFlow models, but it always fails to classify ImageNet images with vgg_16 when I have a batch size > 1. I have set and enabled a batch size of 2 on both the plan and the engine prior to execution, so I think I am doing the I/O for the CUDA buffers wrong. I have tried using a vector, an array, and an allocated block of memory.

What is the correct/best way to store multiple images in the input buffer for execution?

Hi,

The best way is to extend the input/output tensor from 1xCxHxW to NxCxHxW.

Please note that if your output layer has a 1000x1x1 dimension, inference with batch size = 2 will return a 2x1000x1x1 tensor, and the probabilities for the second image are at indices 1000-1999.
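For example, to pick out the result for the second image you can index into the flat output buffer like this (just a rough sketch; the helper name and the 1000-class size are only assumptions for illustration):

#include <algorithm>

// Sketch: top-1 class for image n, given a flat output buffer laid out as
// N x numClasses (image n occupies indices n*numClasses .. (n+1)*numClasses - 1)
int argmaxForImage(const float* output, int n, int numClasses)
{
    const float* probs = output + n * numClasses;
    return std::max_element(probs, probs + numClasses) - probs;
}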

Thanks.

Hey Aasta,

I understand theoretically how I’m supposed to extend my input/output tensor, but I fail to get it to work in practice. I’m doing this in C++, and I’m a bit new to the language; what is the best data structure to store the tensor in? The example code above uses a float pointer for CxHxW, and my attempts to extend this as an array of float pointers to form the NxCxHxW have largely failed. What am I missing in the formatting?

Thanks,

Hi,

You can do this by allocating an N times larger buffer and putting each image at the k*size position.
Here is a related sample for your reference:
https://github.com/dusty-nv/jetson-inference/blob/master/tensorNet.cpp#L849
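Something like the following (a minimal sketch only, assuming the images are already preprocessed into planar CHW float arrays; the function and variable names are placeholders):

#include <cuda_runtime.h>
#include <cstring>

// Sketch: pack batchSize preprocessed CHW float images into one page-locked,
// device-mapped buffer, with image k starting at offset k * imageSize
float* makeBatchBuffer(float** images, int batchSize, size_t imageSize)
{
    float* batchBuffer = nullptr;
    cudaHostAlloc((void**)&batchBuffer,
                  batchSize * imageSize * sizeof(float),
                  cudaHostAllocMapped);

    for (int k = 0; k < batchSize; k++)
        std::memcpy(batchBuffer + k * imageSize, images[k],
                    imageSize * sizeof(float));

    return batchBuffer;
}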

Thanks.

Hey Aasta,

That actually appears to be what I’m doing in my code below:

const size_t height = image_vect[0].rows;
const size_t width = image_vect[0].cols;
const size_t channels = image_vect[0].channels();
const size_t numel = height * width * channels * batchsize;    // total elements for the whole batch

const size_t stridesCv[3] = { width * channels, channels, 1 }; // HWC strides of the source cv::Mat
const size_t strides[3] = { height * width, width, 1 };        // CHW strides of the target tensor

float * tensor;

// page-locked, device-mapped host buffer large enough for all batchsize images
cudaHostAlloc((void**)&tensor, numel * sizeof(float), cudaHostAllocMapped);

However, when I do this, the output class I get for both images when I run inference is wrong. This leads me to believe that my data is being stored incorrectly.

// copy each HWC uchar image from OpenCV into the (intended) batched CHW float tensor
for (int x = 1; x < batchsize + 1; x++) {
  for (int i = 0; i < height; i++)
  {
    for (int j = 0; j < width; j++)
    {
      for (int k = 0; k < channels; k++)
      {
        const size_t offsetCv = i * stridesCv[0] + j * stridesCv[1] + k * stridesCv[2];
        const size_t offset = x * (k * strides[0] + i * strides[1] + j * strides[2]);
        tensor[offset] = (float) image_vect[x-1].data[offsetCv];
      }
    }
  }
}

This is the code I use to store the images into the allocated data space; do you see anything wrong with this implementation? image_vect is just a vector of cv::Mat structures containing the images I read in as inputs.

Let me know what you think. Thanks for sticking with me.

Hi,

Would you mind checking whether the output is in NHWC format?
If the model includes an NHWC-dependent operation, TensorRT will automatically add a format converter to ensure the output is correct.
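If it helps, you can dump the binding dimensions of the deserialized engine to verify the layout (rough sketch; the function name is mine, and it assumes you already have an ICudaEngine pointer):

#include <NvInfer.h>
#include <iostream>

// Sketch: print every binding's dimensions (e.g. 3x224x224 vs 224x224x3)
// so the expected layout of the input/output tensors can be confirmed
void printBindings(nvinfer1::ICudaEngine* engine)
{
    for (int i = 0; i < engine->getNbBindings(); i++)
    {
        nvinfer1::Dims d = engine->getBindingDimensions(i);
        std::cout << (engine->bindingIsInput(i) ? "input  " : "output ")
                  << engine->getBindingName(i) << ": ";
        for (int j = 0; j < d.nbDims; j++)
            std::cout << d.d[j] << (j + 1 < d.nbDims ? "x" : "\n");
    }
}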

Thanks.