Alexnet using INT8

subarukun · July 18, 2017, 2:40pm

Hi,

I have built AlexNet in TensorRT using the network definition API rather than parsing a Caffe model. The trained weights and biases for AlexNet are in FLOAT format.

My question: is there any way for TensorRT to quantize the weights and biases to half precision or UINT8, so that I can verify AlexNet in half precision or UINT8? That would be very useful. If so, what settings need to be made?

Thanks in advance

subarukun · July 21, 2017, 6:48am

Hi,

I have found that sample_INT8 exists and shows an example for MNIST.
Can you let me know whether my understanding of sample_INT8 is correct?

1) What is the purpose of BatchStream calibrationStream(CAL_BATCH_SIZE, NB_CAL_BATCHES);?
2) I assume Int8EntropyCalibrator calibrator(calibrationStream, FIRST_CAL_BATCH); does the actual calibration. But how does the data need to be fed to the calibrator? I assume a batches folder needs to be created, then batch0, batch1… need to be created in it and the data copied into them. That is a tedious task.
3) scoreModel(batchSize, firstScoreBatch, nbScoreBatches, &calibrator);
I assume this is the call that does the actual scoring and tells us whether INT8 weights are usable for our inference use case. Where are the converted INT8 weights placed?
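
To make the INT8 conversion in question 3 concrete: as far as I understand it, INT8 inference maps each FP32 value to an 8-bit integer through a scale factor. The sketch below is a generic illustration of symmetric linear quantization, not TensorRT code. The helper names `quantize` and `dequantize` and the saturation range of ±127 are assumptions made for the example.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstdint>

// Hypothetical helper (not a TensorRT API): map an FP32 value to INT8.
// `scale` is the FP32 distance covered by one integer step,
// e.g. scale = threshold / 127 for a chosen saturation threshold.
inline std::int8_t quantize(float x, float scale)
{
    assert(scale > 0.0f);
    float q = std::round(x / scale);
    // Saturate anything beyond the representable range to +/-127.
    q = std::max(-127.0f, std::min(127.0f, q));
    return static_cast<std::int8_t>(q);
}

// Hypothetical helper: recover an approximate FP32 value from INT8.
inline float dequantize(std::int8_t q, float scale)
{
    return static_cast<float>(q) * scale;
}
```

With a scale of 0.5, the value 1.0 becomes the integer 2 and maps back to 1.0 exactly, while 100.0 saturates to 127 and maps back to 63.5. That saturation is why the choice of scale matters for accuracy.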

These questions may seem too basic. If there is any documentation on this, it would be very helpful for newbies…

Thanks in advance…

acohen8 · July 21, 2017, 8:04pm

I would also like to see more documentation on these - I’m currently trying to suss out what actually happens within these same steps, and the docs on the Int8EntropyCalibrator are particularly lacking. I’m interested to know what exactly the calibrator does with the calibration stream.
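
As a rough mental model of what a calibrator does with the calibration stream: it looks at FP32 values from a set of representative batches and derives a scale for INT8 conversion. The sketch below uses the simplest possible rule, the maximum absolute value seen across all batches. This is a stand-in and not the entropy-based method; as I understand it, the entropy calibrator instead chooses a saturation threshold that minimizes information loss. The class name `MaxAbsCalibrator` and its methods are hypothetical.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <vector>

// Hypothetical max-abs calibrator for illustration only.
// It is NOT TensorRT's Int8EntropyCalibrator.
class MaxAbsCalibrator
{
public:
    // Feed one calibration batch of FP32 values.
    void observe(const std::vector<float>& batch)
    {
        for (float v : batch)
            maxAbs_ = std::max(maxAbs_, std::fabs(v));
    }

    // Scale chosen so that +/-maxAbs maps to +/-127.
    float scale() const
    {
        assert(maxAbs_ > 0.0f);  // at least one non-zero value must be observed
        return maxAbs_ / 127.0f;
    }

private:
    float maxAbs_ = 0.0f;  // largest magnitude seen so far
};
```

Calling `observe` once per batch and then `scale()` mirrors the overall flow of streaming batches through a calibrator before the INT8 run. A max-abs rule is sensitive to outliers, which is one motivation for a threshold-based approach.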

adit_bhrgv · July 29, 2017, 10:19pm

I would also like to know more about the calibrator and how the quantization method works in TensorRT. If you have figured anything out by now, please share. Thanks!

adit_bhrgv · August 24, 2017, 7:12am

Sorry, this comment was moved to another place

adit_bhrgv · August 29, 2017, 1:20am

Hi subarukun,

Were you able to run AlexNet/ImageNet with INT8 in TensorRT?
In my case, I am using only 2 classes (dogs vs. cats) in Caffe on a GTX 1080 Ti.

Can you please share your results?

I got the error below:

TensorRT-2.1.2/bin> ./sample_int8 imagenet

INT8 run:4 batches of size 10 starting at 10
cudnnEngine.cpp (357) - Cuda Error in execute: 77
