Multi-batch inference results on the DLA differ for the same input, and only the first batch is wrong

JetPack Version : 4.4

A single-batch model running on the DLA in fp16 always produces a wrong result. With multiple batches, the results differ even for the same input, and only the first batch is wrong.
The same model runs correctly on the GPU with one or more batches.
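
For reference, here is a minimal sketch of how I build the engine with the TensorRT Python API shipped in JetPack 4.4 (TensorRT 7.1). This is not my exact code; the file path and workspace size are placeholders:

```python
import tensorrt as trt

TRT_LOGGER = trt.Logger(trt.Logger.WARNING)

def build_dla_engine(onnx_path):
    # ONNX models in TensorRT 7.x require an explicit-batch network,
    # so the batch size comes from the input shape in the ONNX file.
    flags = 1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH)
    builder = trt.Builder(TRT_LOGGER)
    network = builder.create_network(flags)
    parser = trt.OnnxParser(network, TRT_LOGGER)

    with open(onnx_path, "rb") as f:
        if not parser.parse(f.read()):
            for i in range(parser.num_errors):
                print(parser.get_error(i))
            return None

    config = builder.create_builder_config()
    config.max_workspace_size = 1 << 28  # placeholder workspace size
    # The DLA only runs fp16/int8; unsupported layers fall back to the GPU.
    config.set_flag(trt.BuilderFlag.FP16)
    config.set_flag(trt.BuilderFlag.GPU_FALLBACK)
    config.default_device_type = trt.DeviceType.DLA
    config.DLA_core = 0

    return builder.build_engine(network, config)
```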

The ONNX model is attached below:

Hi,

Could you also share the inference source code with us so we can check?
Thanks.

Hi,
The code and model are attached below:
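
For readers without the attachment, the check that exposes the issue is roughly the following: the same input is replicated across the batch dimension and the DLA outputs are compared per batch against the GPU run. This is only an illustrative sketch, not the attached code; names and tolerance are assumptions:

```python
import numpy as np

def report_batch_mismatches(dla_out, gpu_out, atol=1e-2):
    # dla_out / gpu_out: arrays shaped (batch, ...) produced from the
    # same input replicated across the batch dimension, so every batch
    # should match the GPU reference within tolerance.
    for i in range(dla_out.shape[0]):
        ok = np.allclose(dla_out[i], gpu_out[i], atol=atol)
        print("batch %d: %s" % (i, "OK" if ok else "MISMATCH"))
```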

Hi,

Thanks for sharing.
We will let you know if there is any progress.

Thanks.

OK, waiting for your reply.
Thanks.

Hi,

We can reproduce this issue in our environment.
We are passing it to our internal team for further investigation.

We will share more information once we get feedback.

OK.
Thanks.

Hi,

This issue will be fixed in a future release.
We will update this thread once it is released.

Thanks.

OK.
Hope you release it soon.
Thanks.