PyTorch YOLOv5 is failing on A100 GPU

Description

Hello,

Running the following command:

python detect.py --weights yolov5s.pt --img 640 --conf 0.25 --source data/images/

on an A100 GPU gives the following error: "RuntimeError: CUDA error: no kernel image is available for execution on the device".

Running the same Python command in the same conda environment on the same OS, but on an RTX 5000 GPU, is successful.

What is the reason for the failure on the A100?

Thank you.
Inga

Environment

TensorRT Version :
GPU Type : A100-PCIE-40GB
Nvidia Driver Version : 450.51.06
CUDA Version : 11.0
CUDNN Version :
Operating System + Version : Red Hat Enterprise Linux release 8.3
Python Version (if applicable) : 3.9.1
TensorFlow Version (if applicable) : N/A
PyTorch Version (if applicable) : torch → 1.8.1, torchvision → 0.9.1
Baremetal or Container (if container which image + tag) : N/A
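
To complement the versions listed above, here is a minimal diagnostic sketch that could be run in the same conda environment on both machines. It compares the GPU's compute capability with the architectures the installed PyTorch build was compiled for, which is one possible (not confirmed) source of this kind of error. The helper names `parse_arch` and `build_supports` are hypothetical and written only for this illustration; `torch.cuda.get_device_capability`, `torch.cuda.get_arch_list` and `torch.version.cuda` are existing PyTorch calls.

```python
def parse_arch(arch):
    """Turn an architecture string such as 'sm_80' or 'compute_80' into (8, 0)."""
    digits = "".join(c for c in arch.split("_")[1] if c.isdigit())
    return int(digits[:-1]), int(digits[-1])


def build_supports(capability, arch_list):
    """Return True if a build compiled for arch_list can run on a GPU
    with the given (major, minor) compute capability."""
    major, minor = capability
    for arch in arch_list:
        a_major, a_minor = parse_arch(arch)
        if arch.startswith("sm_"):
            # Binary kernels run only on the same major version,
            # with an equal or higher minor version.
            if a_major == major and a_minor <= minor:
                return True
        elif arch.startswith("compute_"):
            # PTX can be JIT-compiled for any equal or newer architecture.
            if (a_major, a_minor) <= (major, minor):
                return True
    return False


if __name__ == "__main__":
    try:
        import torch
    except ImportError:
        torch = None

    if torch is not None and torch.cuda.is_available():
        capability = torch.cuda.get_device_capability(0)
        arch_list = torch.cuda.get_arch_list()
        print("PyTorch:", torch.__version__, "built for CUDA", torch.version.cuda)
        print("GPU:", torch.cuda.get_device_name(0), "capability", capability)
        print("Compiled architectures:", arch_list)
        print("Build supports this GPU:", build_supports(capability, arch_list))
    else:
        print("PyTorch with a visible CUDA device is required for this check.")
```

The A100 reports compute capability 8.0 and the RTX 5000 reports 7.5, so a build whose architecture list stops at `sm_70` or `sm_75` would pass this check on the RTX 5000 and fail it on the A100. Whether that applies here depends on the output on the A100 machine.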

Relevant Files

Please attach or include links to any models, data, files, or scripts necessary to reproduce your issue. (Github repo, Google Drive, Dropbox, etc.)

Steps To Reproduce

Please include:

  • Exact steps/commands to build your repro
  • Exact steps/commands to run your repro
  • Full traceback of errors encountered