I am using Jetpack 6.2 with cuda12.4 on a Jetson orin agx developer kit. I am able to put tensors to the device, but for some reason I am getting an error when trying to do the !=
operation.
I am using python 3.11 and I installed torch with the following
pip3 install --pre torch --index-url https://download.pytorch.org/whl/nightly/cu124
If I run nvcc --version
I get the following
eisenbnt@ubuntu:~$ nvcc --version
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2024 NVIDIA Corporation
Built on Tue_Feb_27_16:18:46_PST_2024
Cuda compilation tools, release 12.4, V12.4.99
Build cuda_12.4.r12.4/compiler.33961263_0
Here is some python that gives the error.
>>> import torch
>>> x = torch.tensor([0, 1, 1]).to(0)
>>> print(x.device)
cuda:0
>>>
>>>
>>> x != 0
Traceback (most recent call last):
File "<stdin>", line 1, in <module>
RuntimeError: CUDA error: no kernel image is available for execution on the device
CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect.
For debugging consider passing CUDA_LAUNCH_BLOCKING=1
Compile with `TORCH_USE_CUDA_DSA` to enable device-side assertions.
>>>
>>>
>>> x != torch.tensor(0).to(0)
Traceback (most recent call last):
File "<stdin>", line 1, in <module>
RuntimeError: CUDA error: no kernel image is available for execution on the device
CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect.
For debugging consider passing CUDA_LAUNCH_BLOCKING=1
Compile with `TORCH_USE_CUDA_DSA` to enable device-side assertions.
>>>
>>>
>>> x != torch.tensor([0, 0, 0]).to(0)
Traceback (most recent call last):
File "<stdin>", line 1, in <module>
RuntimeError: CUDA error: no kernel image is available for execution on the device
CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect.
For debugging consider passing CUDA_LAUNCH_BLOCKING=1
Compile with `TORCH_USE_CUDA_DSA` to enable device-side assertions.
>>>
>>> torch.version.cuda
'12.4'
>>>
>>> torch.cuda.is_available()
True