when running the provided script for optimization, the file sizes of the respective graphs seem to increse.
root@2979b9610c8a:/workspace/tftrt_sample# ls -l
total 987852
-rwxrwxr-x 1 root root 294 Mar 24 2018 README
-rw-rw-r-- 1 root root 61098 Mar 24 2018 grace_hopper.jpg
-rwxrwxr-x 1 root root 31587 Mar 24 2018 labellist.json
-rw-rw-r-- 1 root root 218641975 Mar 1 10:16 resnetV150_TRTFP16.pb
-rw-rw-r-- 1 root root 280046959 Mar 1 10:15 resnetV150_TRTFP32.pb
-rw-rw-r-- 1 root root 205066441 Mar 1 10:19 resnetV150_TRTINT8.pb
-rw-rw-r-- 1 root root 205062671 Mar 1 10:16 resnetV150_TRTINT8Calib.pb
-rw-r--r-- 1 root root 102591940 Mar 24 2018 resnetV150_frozen.pb
-rwxrwxr-x 1 root root 542 Mar 24 2018 run_all.sh
-rw-r--r-- 1 root root 12503 Mar 24 2018 tftrt_sample.py
the trt graph with fp32 is alsmost 3 times the size as the unoptimized.
apart from that, optimization seems to work fine with inference times of:
~7ms unoptimized, ~5ms FP32, ~3ms FP16, ~3ms INT8