My prediction is segmentation result with size(976x208), when transport result to cpu, it could lost 60ms in xavier. My tensorrt version is 126.96.36.199, is there some way to solve this problem? Thank you very much.
Please try with highest performance setting to see if get improved.
sudo nvpmodel -m 0 sudo jetson_clocks