original PB size: 1.7M
optimized PB size: 1.1G
After looking into the files, I found that there are no files under the variables directory. My best guess is that all the variables were merged into the optimized PB file.
original: variables/variables.data-00000-of-00001 856MB
optimized: no variables/variables.data-00000-of-00001 file
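To make the comparison above easy to reproduce, here is a minimal sketch that lists every file in a model directory with its size. It uses only the Python standard library. The function names and the example paths are hypothetical, and it assumes the usual SavedModel layout (a `saved_model.pb` file next to a `variables/` directory), which may not match your export exactly.

```python
import os


def saved_model_sizes(model_dir):
    """Return {relative_path: size_in_bytes} for every file under model_dir."""
    sizes = {}
    for root, _dirs, files in os.walk(model_dir):
        for name in files:
            path = os.path.join(root, name)
            sizes[os.path.relpath(path, model_dir)] = os.path.getsize(path)
    return sizes


def print_report(label, model_dir):
    """Print per-file sizes and the total for one model directory."""
    sizes = saved_model_sizes(model_dir)
    print(label)
    for rel_path, size in sorted(sizes.items()):
        print(f"  {rel_path}: {size / 2**20:.1f} MiB")
    print(f"  total: {sum(sizes.values()) / 2**20:.1f} MiB")


# Example usage (hypothetical paths, replace with your own):
# print_report("original", "models/original/1")
# print_report("optimized", "models/optimized/1")
```

If the totals of the two directories are close while the PB file has grown by roughly the size of the missing variables file, that would be consistent with the variables having been folded into the graph as constants.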
Because of the PB size, I can't load the optimized model into tf-serving; there is a 1 GB limit for the PB file.
Is there a known issue, or did I use the wrong option when optimizing?
Could you please share the script and model file so we can help better?
Also, can you provide details on the platforms you are using:
o Linux distro and version
o GPU type
o Nvidia driver version
o CUDA version
o cuDNN version
o Python version [if using python]
o TensorFlow and PyTorch version
o TensorRT version
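If it helps, most of the details above can be gathered in one go. The following is only a sketch: the function names are made up for illustration, the package names `tensorflow`, `torch` and `tensorrt` are assumptions about how the packages were installed (a differently named wheel will show as "not installed"), and it does not report the CUDA or cuDNN versions or the exact Linux distro, which still need to be checked separately.

```python
import platform
import shutil
import subprocess
from importlib import metadata


def package_version(dist_name):
    """Return the installed version of a pip distribution, if any."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return "not installed"


def gpu_and_driver():
    """Ask nvidia-smi for the GPU name and driver version, if it is on PATH."""
    if shutil.which("nvidia-smi") is None:
        return "nvidia-smi not found"
    try:
        result = subprocess.run(
            ["nvidia-smi", "--query-gpu=name,driver_version",
             "--format=csv,noheader"],
            capture_output=True, text=True, check=False, timeout=30,
        )
    except (OSError, subprocess.TimeoutExpired) as exc:
        return f"nvidia-smi failed: {exc}"
    return result.stdout.strip() or result.stderr.strip()


def collect_env():
    """Collect the platform details that can be read without extra tools."""
    return {
        "os": platform.platform(),
        "python": platform.python_version(),
        "gpu, driver": gpu_and_driver(),
        "tensorflow": package_version("tensorflow"),
        "torch": package_version("torch"),
        "tensorrt": package_version("tensorrt"),
    }


if __name__ == "__main__":
    for key, value in collect_env().items():
        print(f"{key}: {value}")
```

Pasting the printed output into your reply, together with the CUDA and cuDNN versions, should cover the list.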
Meanwhile, please try generating the model in static mode or low precision.