running the next code:
python detect.py --weights yolov5s.pt --img 640 --conf 0.25 --source data/images/
on A100 GPU is giving the next error: "RunTimeError: CUDA error: no kernel is availible for execution on the drive.
the running the same conda environment & the same python command & the same OS on RTX 5000 GPU is succesfull.
What is the reson for failure in A100?
GPU Type: A100-PCIE-40GB
Nvidia Driver Version: 450.51.06
CUDA Version: 11.0
Operating System + Version: Red Hat Enterprise Linux release 8.3
Python Version (if applicable): 3.9.1
TensorFlow Version (if applicable): N/A
PyTorch Version (if applicable): torch → 1.8.1, torchvision → 0.9.1
Baremetal or Container (if container which image + tag): N/A
Please attach or include links to any models, data, files, or scripts necessary to reproduce your issue. (Github repo, Google Drive, Dropbox, etc.)
- Exact steps/commands to build your repro
- Exact steps/commands to run your repro
- Full traceback of errors encountered