Description
Hello,
running the next code:
python detect.py --weights yolov5s.pt --img 640 --conf 0.25 --source data/images/
on A100 GPU is giving the next error: "RunTimeError: CUDA error: no kernel is availible for execution on the drive.
the running the same conda environment & the same python command & the same OS on RTX 5000 GPU is succesfull.
What is the reson for failure in A100?
Thank you.
Inga
Environment
TensorRT Version :
GPU Type : A100-PCIE-40GB
Nvidia Driver Version : 450.51.06
CUDA Version : 11.0
CUDNN Version :
Operating System + Version : Red Hat Enterprise Linux release 8.3
Python Version (if applicable) : 3.9.1
TensorFlow Version (if applicable) : N/A
PyTorch Version (if applicable) : torch → 1.8.1, torchvision → 0.9.1
Baremetal or Container (if container which image + tag) : N/A
Relevant Files
Please attach or include links to any models, data, files, or scripts necessary to reproduce your issue. (Github repo, Google Drive, Dropbox, etc.)
Steps To Reproduce
Please include:
- Exact steps/commands to build your repro
- Exact steps/commands to run your repro
- Full traceback of errors encountered