Hello,
I was exporting some YOLO models to TensorRT and had a question about precisions. I am first exporting to ONNX via Ultralytics and then building the TensorRT engine myself on a Jetson Nano 2 GB. This is the code I am using
yolo export model=yolov8n-seg.pt format=onnx opset=12 imgsz=512
/usr/src/tensorrt/bin/trtexec \
--onnx=yolov8n-seg.onnx \
--saveEngine=yolov8n-seg.engine \
--workspace=512 \
--fp16
My understanding is that this exports .pt (which doesn’t have a specific precision) to FP32 ONNX, and then converts to FP16 when building the TRT engine. Is there a practical difference between this and exporting to FP16 ONNX and then converting to FP16 when building?
Thank you!