TensorRT optimization for caffe-yolov3 failed

2844932152 · April 4, 2019, 8:18am

The sampleINT8 demo in the TensorRT package works fine for caffe-yolov3 optimization in FP32 mode. However, INT8 calibration always breaks down, so INT8 optimization cannot be achieved. The errors reported are as follows:
[W] [TRT] TensorRT was compiled against cuDNN 7.5.0 but is linked against cuDNN 7.3.1. This mismatch may potentially cause undefined behavior.
[I] Top1: 0, Top5: 0
[I] Processing 4 images averaged 19.0239 ms/image and 19.0239 ms/batch.
[I] FP16 run:4 batches of size 1 starting at 1
[I] Spcified precision is not natively support
[E] [TRT] engine.cpp (570) - Cuda Error in commonEmitTensor: 11 (invalid argument)
[E] [TRT] Failure while trying to emit debug blob.
engine.cpp (570) - Cuda Error in commonEmitTensor: 11 (invalid argument)
[E] [TRT] cuda/customWinogradConvActLayer.cpp (342) - Cuda Error in execute: 11 (invalid argument)
[E] [TRT] cuda/customWinogradConvActLayer.cpp (342) - Cuda Error in execute: 11 (invalid argument)

Could you provide me with a calibration table file for caffe_yolov3?
The configuration of my computer is:
CUDA 10.0
cuDNN 7.3.1
TensorRT 5.1.2.2
GPU: P4
Driver: 410
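
The first warning in the log says TensorRT was compiled against cuDNN 7.5.0 but is linked against cuDNN 7.3.1, which matches the cuDNN version listed in the configuration. Below is a minimal sketch for pulling that mismatch out of a saved log. The function name `find_cudnn_mismatch` is hypothetical, and the regex assumes the exact warning wording quoted in this post:

```python
import re

# Pattern assumes the exact wording of the warning quoted in this post.
_MISMATCH = re.compile(
    r"compiled against cuDNN (\d+(?:\.\d+)*) but is linked against cuDNN (\d+(?:\.\d+)*)"
)

def find_cudnn_mismatch(log_text):
    """Return (compiled, linked) version tuples, or None if no warning is found."""
    m = _MISMATCH.search(log_text)
    if m is None:
        return None
    compiled = tuple(int(p) for p in m.group(1).split("."))
    linked = tuple(int(p) for p in m.group(2).split("."))
    return compiled, linked

line = ("[W] [TRT] TensorRT was compiled against cuDNN 7.5.0 but is linked "
        "against cuDNN 7.3.1. This mismatch may potentially cause undefined behavior.")
result = find_cudnn_mismatch(line)
if result is not None:
    compiled, linked = result
    if linked < compiled:
        print("linked cuDNN", linked, "is older than the build-time cuDNN", compiled)
```

Whether this mismatch is what causes the INT8 failure is not established in the thread; the warning itself only says the behavior may be undefined.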

2844932152 · April 4, 2019, 9:09am

When I try to use googlenet for INT8 optimization, it also fails with a similar error:

@root0-W780-G20:~/software/TensorRT-5.0.2.6/bin$ ./sample_int8 googlenet

FP32 run:1 batches of size 1 starting at 20
pass one
jell

Top1: 0, Top5: 0
Processing 1 images averaged 1.8903 ms/image and 1.8903 ms/batch.

FP16 run:1 batches of size 1 starting at 20
Engine could not be created at this precision

INT8 run:1 batches of size 1 starting at 20
ERROR: engine.cpp (404) - Cuda Error in commonEmitTensor: 11
ERROR: Failure while trying to emit debug blob.
engine.cpp (404) - Cuda Error in commonEmitTensor: 11
ERROR: cuda/customWinogradConvActLayer.cpp (319) - Cuda Error in execute: 11
ERROR: cuda/customWinogradConvActLayer.cpp (319) - Cuda Error in execute: 11
Cuda failure: 77Aborted (core dumped)
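
Both logs report numeric CUDA error codes. As a reading aid, here is a small sketch that maps the two codes seen in this thread to their names. The table is an assumption based on the `cudaError` enum of CUDA runtimes up to 10.0 (the numbering changed in later toolkits), and `describe_cuda_errors` is a hypothetical helper, not part of TensorRT:

```python
import re

# Assumed subset of the cudaError enum for CUDA runtimes up to 10.0.
CUDA_ERRORS = {
    11: ("cudaErrorInvalidValue", "invalid argument"),
    77: ("cudaErrorIllegalAddress", "an illegal memory access was encountered"),
}

# Matches both "Cuda Error in <function>: 11" and "Cuda failure: 77".
_CODE = re.compile(r"Cuda (?:Error in \w+|failure): (\d+)")

def describe_cuda_errors(log_text):
    """Return the distinct error codes in order of first appearance, with names."""
    seen = []
    for match in _CODE.finditer(log_text):
        code = int(match.group(1))
        if code not in seen:
            seen.append(code)
    return [(code, *CUDA_ERRORS.get(code, ("unknown", "not in this table"))) for code in seen]

log = """ERROR: engine.cpp (404) - Cuda Error in commonEmitTensor: 11
ERROR: cuda/customWinogradConvActLayer.cpp (319) - Cuda Error in execute: 11
Cuda failure: 77Aborted (core dumped)"""

for code, name, text in describe_cuda_errors(log):
    print(code, name, "-", text)
```

Read this way, the run first hits an invalid-argument error inside a convolution layer and then aborts on an illegal memory access. The thread does not establish the root cause.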

dusty_nv · April 5, 2019, 2:20pm

Moving this thread to the TensorRT forum.
