with the reference of [url]https://devblogs.nvidia.com/int8-inference-autonomous-vehicles-tensorrt/[/url], we managed to use the calibrator from python interface to convert the MNIST caffemodel to INT8 inference engine. we can dump the weight from the original caffemodel in FP32 through the protobuf utility. is there a way that we can dump the INT8 engine weights?
Related topics
| Topic | Replies | Views | Activity | |
|---|---|---|---|---|
| Int8 weight extraction | 2 | 595 | April 27, 2020 | |
| Is there any method to build model with int8 weight in tensorrt? | 1 | 1313 | July 29, 2021 | |
| TensorRT 2 sample INT8 | 3 | 1234 | August 7, 2017 | |
| Data process about TensorRT INT8 and FP16 inference Engine | 4 | 2099 | October 18, 2021 | |
| Acceleration with INT8 precision using TensorRT | 6 | 1011 | February 13, 2021 | |
| TensorRT 8-bit Quantization questions | 7 | 5009 | April 26, 2018 | |
| How to generate int8 calilb table for trtexec engine generation | 7 | 4740 | October 12, 2021 | |
| TensorRT | 1 | 385 | October 27, 2021 | |
| sampleINT8 crash | 4 | 1102 | August 1, 2017 | |
| TensorRT 3: INT8 on GTX 1060 | 0 | 1287 | February 7, 2018 |