Description
Failed to build TensorRT plan file on Ampere GPU (test on both RTX 6000 and RTX 3090) using FP16 mode with TensorRT 8.0.1.6.
Environment
TensorRT Version: 8.0.1.6
NVIDIA GPU: RTX A6000 and RTX 3090
NVIDIA Driver Version: 460.84
CUDA Version: 11.0
CUDNN Version: 8.0.4
Operating System: ubuntu 18.04
Python Version (if applicable): 3.6.9
Tensorflow Version (if applicable): None
PyTorch Version (if applicable): None
Baremetal or Container (if so, version): docker version: 20.10.6, image built using official docker file.
Relevant Files
onnx model: model.onnx - Google Drive
Steps To Reproduce
run with the following command using trtexec
trtexec --onnx=model.onnx --workspace=4096 --device=0 --saveEngine=model.engine --fp16
logs: click TensorRT logs to see logs
however, if I omit the --fp16
option, the onnx file can be built successfully, but the TensorRT library still print these WARNINGs:
.
.
.
[W] [TRT] Skipping tactic 0 due to kernel generation error.
nvrtc: error: invalid value for --gpu-architecture (-arch)
[W] [TRT] Skipping tactic 1 due to kernel generation error.
nvrtc: error: invalid value for --gpu-architecture (-arch)
[W] [TRT] Skipping tactic 2 due to kernel generation error.
nvrtc: error: invalid value for --gpu-architecture (-arch)
[W] [TRT] Skipping tactic 3 due to kernel generation error.
nvrtc: error: invalid value for --gpu-architecture (-arch)
[W] [TRT] Skipping tactic 4 due to kernel generation error.
nvrtc: error: invalid value for --gpu-architecture (-arch)
.
.
.