There is an optional data type argument (fp16, fp32, int8) in both tlt-export and tlt-convert. How do they differ?
They are different data types.
Please see https://devtalk.nvidia.com/default/topic/1065558/transfer-learning-toolkit/trt-engine-deployment/ for more info.
What I mean is: why do we need to specify the data type in both tlt-export and tlt-convert? Say I set the data type to fp16 in tlt-export and fp32 in tlt-convert; what will happen then, and vice versa?
tlt-export generates an .etlt model. In INT8 mode, a calibration table is also generated.
tlt-convert generates a TensorRT engine.
- For tlt-export, regardless of whether FP32, FP16, or INT8 is specified, the generated .etlt model is exactly the same. Its weights are always stored as FP32.
- For tlt-export, if set to INT8, it additionally runs calibration and generates the INT8 calibration table needed for deployment.
- FP16 tlt-export + FP32 tlt-convert behaves exactly like FP32 all the way: it will generate an FP32 engine.
- FP32 tlt-export + FP16 tlt-convert will generate an FP16 engine.
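In other words, the data type passed to tlt-convert is what actually determines the precision of the TensorRT engine. A minimal sketch of the two steps is below; the exact flag names (`--data_type`, `--cal_cache_file`, `-t`, `-e`) are taken from my reading of the TLT docs and may differ between toolkit versions, and the paths and `$KEY` are placeholders:

```shell
# Step 1: export the trained .tlt model to .etlt.
# The .etlt weights are FP32 regardless of --data_type;
# int8 mode additionally writes a calibration table.
tlt-export -m model.tlt -k $KEY -o model.etlt \
    --data_type int8 --cal_cache_file calibration.bin

# Step 2: build the TensorRT engine on the deployment machine.
# Here -t decides the engine precision (fp32 / fp16 / int8).
tlt-converter model.etlt -k $KEY -t fp16 -e model_fp16.engine
```

So an fp16 setting in step 1 has no effect on the exported weights; only the `-t` value in step 2 (plus, for int8, the calibration table from step 1) changes the resulting engine.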
Thanks for the reply.