PyTorch compatibility issues (torch 2.0.0+nv23.5 && torchvision 0.15.1)

Hi there,

I’ve got an Orin NX and am really struggling to work out how to use the GPU for running yolov7. I’ve installed the Jetson PyTorch build from this NVIDIA link:

https://docs.nvidia.com/deeplearning/frameworks/install-pytorch-jetson-platform/index.html

I can see that PyTorch is installed via pip and reports torch version 2.0.0+nv23.5. But when I run YOLO on the GPU I get an incompatibility error saying that my PyTorch and torchvision versions aren’t compatible. According to PyTorch’s compatibility matrix, I believe I have installed the right torchvision version, 0.15.1.

Here is some info about my Orin:
P-Number: p3767-0000
Module: NVIDIA Jetson Orin NX (16GB RAM)
JetPack 5.1 [L4T 35.2.1]
CUDA 11.4.315
cuDNN 8.6.0.166
TensorRT: 5.1
VPI 2.2.4
Vulkan 1.3.204
OpenCV 4.5.4 - with CUDA: NO
torch 2.0.0+nv23.5
torchvision 0.15.1

Is there a specific torchvision build for Jetson products? Any suggestions would be helpful.

Thanks,

Simon

Hi @simonnv, are you able to build torchvision as shown under the Installation section of this post:

Alternatively, you can try using the l4t-pytorch or l4t-ml containers which come with PyTorch/torchvision pre-installed.

Hi @dusty_nv, I appreciate the prompt reply!

From the forum you mentioned, I downloaded & installed the following:
"pip install torch-2.0.0+nv23.05-cp38-cp38-linux_aarch64.whl"

According to the same forum:
PyTorch v2.0.0 - torchvision v0.15.1

So I ran the command:
"pip install torchvision==0.15.1"
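A quick sanity check like the one below (assuming the default Python 3.8 environment on JetPack) shows which torch/torchvision builds are actually active and whether CUDA is visible; on a correctly matched install it should print something like 2.0.0+nv23.05 0.15.1 True:

python3 -c "import torch, torchvision; print(torch.__version__, torchvision.__version__, torch.cuda.is_available())"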

Ignore the weight/model issue below; I don’t think it’s relevant or the cause of this problem.

The console output is as follows:

rebotnix@rebotnix:~/Documents/yolov7$ ls -l ~/Pictures/peopleonbeach.webp
-rw-rw-r-- 1 rebotnix rebotnix 71178 Jun 9 12:50 /home/rebotnix/Pictures/peopleonbeach.webp

rebotnix@rebotnix:~/Documents/yolov7$ ls -l ~/Documents/weights/yolov7/
total 1101276
-rw-rw-r-- 1 rebotnix rebotnix 268261125 Jun 1 15:15 yolov7-d6.pt
-rw-rw-r-- 1 rebotnix rebotnix 304425133 Jun 1 15:16 yolov7-e6e.pt
-rw-rw-r-- 1 rebotnix rebotnix 195040397 Jun 1 15:14 yolov7-e6.pt
-rw-rw-r-- 1 rebotnix rebotnix 75587165 Jun 1 15:10 yolov7.pt
-rw-rw-r-- 1 rebotnix rebotnix 141255765 Jun 1 15:13 yolov7-w6.pt
-rw-rw-r-- 1 rebotnix rebotnix 143099649 Jun 1 15:13 yolov7x.pt

rebotnix@rebotnix:~/Documents/yolov7$ python detect.py --weights ~/Documents/weights/yolov7/yolov7.pt --conf 0.25 --img-size 640 --device 0 --source ~/Pictures/peopleonbeach.webp

/home/rebotnix/.local/lib/python3.8/site-packages/torchvision/io/image.py:13: UserWarning: Failed to load image Python extension: '/home/rebotnix/.local/lib/python3.8/site-packages/torchvision/image.so: undefined symbol: _ZN5torch3jit17parseSchemaOrNameERKSs' If you don't plan on using image functionality from torchvision.io, you can ignore this warning. Otherwise, there might be something wrong with your environment. Did you have libjpeg or libpng installed before building torchvision from source?
warn(
Namespace(agnostic_nms=False, augment=False, classes=None, conf_thres=0.25, device='0', exist_ok=False, img_size=640, iou_thres=0.45, name='exp', no_trace=False, nosave=False, project='runs/detect', save_conf=False, save_txt=False, source='/home/rebotnix/Pictures/peopleonbeach.webp', update=False, view_img=False, weights=['/home/rebotnix/Documents/weights/yolov7/yolov7.pt'])
YOLOR 🚀 v0.1-126-g84932d7 torch 2.0.0+nv23.05 CUDA:0 (Orin, 14485.75390625MB)

Downloading https://github.com/WongKinYiu/yolov7/releases/download/v0.1/yolov7.pt to /home/rebotnix/documents/weights/yolov7/yolov7.pt…
Download error: [Errno 2] No such file or directory: '/home/rebotnix/documents/weights/yolov7/tmp3rvxjn7c'
ERROR: Download failure: /home/rebotnix/documents/weights/yolov7/yolov7.pt missing, try downloading from https://github.com/WongKinYiu/yolov7/releases

Fusing layers…
RepConv.fuse_repvgg_block
RepConv.fuse_repvgg_block
RepConv.fuse_repvgg_block
Model Summary: 306 layers, 36905341 parameters, 6652669 gradients
Convert model to Traced-model…
traced_script_module saved!
model is traced!

/home/rebotnix/.local/lib/python3.8/site-packages/torch/functional.py:504: UserWarning: torch.meshgrid: in an upcoming release, it will be required to pass the indexing argument. (Triggered internally at /opt/pytorch/pytorch/aten/src/ATen/native/TensorShape.cpp:3483.)
return _VF.meshgrid(tensors, **kwargs) # type: ignore[attr-defined]
Traceback (most recent call last):
File "detect.py", line 196, in <module>
detect()
File "detect.py", line 92, in detect
pred = non_max_suppression(pred, opt.conf_thres, opt.iou_thres, classes=opt.classes, agnostic=opt.agnostic_nms)
File "/home/rebotnix/Documents/yolov7/utils/general.py", line 684, in non_max_suppression
i = torchvision.ops.nms(boxes, scores, iou_thres) # NMS
File "/home/rebotnix/.local/lib/python3.8/site-packages/torchvision/ops/boxes.py", line 40, in nms
_assert_has_ops()
File "/home/rebotnix/.local/lib/python3.8/site-packages/torchvision/extension.py", line 48, in _assert_has_ops
raise RuntimeError(
RuntimeError: Couldn't load custom C++ ops. This can happen if your PyTorch and torchvision versions are incompatible, or if you had errors while compiling torchvision from source. For further information on the compatible versions, check https://github.com/pytorch/vision for the compatibility matrix. Please check your PyTorch version with torch.__version__ and your torchvision version with torchvision.__version__ and verify if they are compatible, and if not please reinstall torchvision so that it matches your PyTorch install.

What do you think?

All the best,

Simon


Installing torchvision from pip won’t have CUDA enabled and may run into other errors, as the torchvision wheels on PyPI for the aarch64 architecture are built for generic ARM platforms, not JetPack. Instead, uninstall it and try building it from source again (linked to above).
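For reference, a rough sketch of that from-source build (adapted from the linked post; the exact apt dependencies and branch should be double-checked there, with v0.15.1 being the branch that matches the torch 2.0.0 wheel above):

pip3 uninstall torchvision
sudo apt-get install libjpeg-dev zlib1g-dev libpython3-dev libopenblas-dev libavcodec-dev libavformat-dev libswscale-dev
git clone --branch v0.15.1 https://github.com/pytorch/vision torchvision
cd torchvision
export BUILD_VERSION=0.15.1
python3 setup.py install --user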

Or if you keep having problems, you can use one of the containers and simply mount in your YOLO project code.
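For example, something along these lines starts the l4t-pytorch container with the project mounted inside it (the tag is an example; pick the one matching your L4T release from NGC, e.g. r35.2.1 for JetPack 5.1):

sudo docker run -it --rm --runtime nvidia --network host -v /home/rebotnix/Documents/yolov7:/yolov7 nvcr.io/nvidia/l4t-pytorch:r35.2.1-pth2.0-py3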


Thanks @dusty_nv,

That has done the job and the model is running. I’m getting around 50 ms inference, so it will be interesting to compare with what other people are getting.

I am also porting my yolov5 project to the Jetson Orin NX 16GB developer kit platform. Based on the pytorch.org instructions for installing previous PyTorch versions, I used the following command to install torch and torchvision for CUDA 11.3:

conda install pytorch==1.12.1 torchvision==0.13.1 cudatoolkit=11.3  -c pytorch -c conda-forge

The installation succeeds, but torch.cuda.is_available() returns False, so inference does not perform as expected. Monitoring with jtop also shows zero GPU utilization. Hence, this approach installs the CPU-only build.
Here is the output of collect_env.py, the environment-collection utility provided by PyTorch:

PyTorch version: 1.12.1
Is debug build: False
CUDA used to build PyTorch: Could not collect
ROCM used to build PyTorch: N/A

OS: Ubuntu 20.04.6 LTS (aarch64)
GCC version: (Ubuntu 9.4.0-1ubuntu1~20.04.1) 9.4.0
Clang version: Could not collect
CMake version: version 3.16.3
Libc version: glibc-2.31  

Python version: 3.8.16 (default, Mar  2 2023, 03:16:31)  [GCC 11.2.0] 
(64-bit runtime)
Python platform: Linux-5.10.104-tegra-aarch64-with-glibc2.26
Is CUDA available: False
CUDA runtime version: 11.4.315
CUDA_MODULE_LOADING set to: N/A
GPU models and configuration: Could not collect
Nvidia driver version: Could not collect
cuDNN version: Probably one of the following:
/usr/lib/aarch64-linux-gnu/libcudnn.so.8.6.0
/usr/lib/aarch64-linux-gnu/libcudnn_adv_infer.so.8.6.0
/usr/lib/aarch64-linux-gnu/libcudnn_adv_train.so.8.6.0
/usr/lib/aarch64-linux-gnu/libcudnn_cnn_infer.so.8.6.0
/usr/lib/aarch64-linux-gnu/libcudnn_cnn_train.so.8.6.0
/usr/lib/aarch64-linux-gnu/libcudnn_ops_infer.so.8.6.0
/usr/lib/aarch64-linux-gnu/libcudnn_ops_train.so.8.6.0
HIP runtime version: N/A 
MIOpen runtime version: N/A
Is XNNPACK available: True

CPU:
Architecture:                    aarch64
CPU op-mode(s):                  32-bit, 64-bit
Byte Order:                      Little Endian
CPU(s):                          8
On-line CPU(s) list:             0-7
Thread(s) per core:              1
Core(s) per socket:              4
Socket(s):                       2
Vendor ID:                       ARM
Model:                           1
Model name:                      ARMv8 Processor rev 1 (v8l)
Stepping:                        r0p1
CPU max MHz:                     1984.0000
CPU min MHz:                     115.2000
BogoMIPS:                        62.50
L1d cache:                       512 KiB
L1i cache:                       512 KiB
L2 cache:                        2 MiB
L3 cache:                        4 MiB
Vulnerability Itlb multihit:     Not affected
Vulnerability L1tf:              Not affected
Vulnerability Mds:               Not affected
Vulnerability Meltdown:          Not affected
Vulnerability Spec store bypass: Mitigation; Speculative Store Bypass disabled via prctl
Vulnerability Spectre v1:        Mitigation; __user pointer sanitization
Vulnerability Spectre v2:        Not affected
Vulnerability Srbds:             Not affected
Vulnerability Tsx async abort:   Not affected
Flags:                           fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm lrcpc dcpop asimddp uscat ilrcpc flagm

Versions of relevant libraries:
[pip3] numpy==1.21.0
[pip3] torch==1.12.1
[pip3] torchvision==0.13.1a0
[conda] cudatoolkit               11.3.1              h89cd5c7_10    conda-forge
[conda] numpy                     1.21.3           py38hb9da153_0    conda-forge
[conda] pytorch                   1.12.1          cpu_py38ha09e9da_1
[conda] torchvision               0.13.1          cpu_py38heb4ea19_0

Based on the previous discussion, besides building from source or using a container, is there any successful way to install via conda?

Hi,

Could you send a link to how you installed Conda? I didn’t realise it was supported for aarch64 systems and I think it’s the best way to manage environments.

You will need to check which version of JetPack you have to find the PyTorch version needed:
https://docs.nvidia.com/deeplearning/frameworks/install-pytorch-jetson-platform-release-notes/pytorch-jetson-rel.html#pytorch-jetson-rel

However, I found building PyTorch from source to be a lot easier than anticipated, and it was working straight away after following these instructions:
https://docs.nvidia.com/deeplearning/frameworks/install-pytorch-jetson-platform/index.html

One thing I would say is that I am getting roughly 50 ms inference when running in max power mode, and I was hoping for faster speeds. Some people in other forums suggest using TensorRT; I will be trying that at some point to see if I can get faster speeds.
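From what I’ve read so far, the rough flow is to export the model to ONNX and then build an engine with trtexec, which ships with JetPack; something like the following, although I haven’t verified the exact export.py flags in the yolov7 repo yet:

python export.py --weights yolov7.pt --grid --simplify --img-size 640 640
/usr/src/tensorrt/bin/trtexec --onnx=yolov7.onnx --saveEngine=yolov7_fp16.engine --fp16

FP16 is supposed to give a good speedup on Orin with little accuracy loss, so that is what I will try first.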

All the best,

Simon

@user101843 the upstream PyTorch wheels that these commands install from PyPI (i.e. via pip install torch or conda install torch) are not built for JetPack or with CUDA support. Instead, please install the PyTorch wheels from the link that @simonnv provided or from this post:

Unlike those on PyPI, these PyTorch wheels were built for JetPack with CUDA/cuDNN enabled.
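In practice that means removing the CPU-only packages first and then installing the NVIDIA-built wheel (the filename below is the JetPack 5.1 / Python 3.8 wheel mentioned earlier in the thread; use the one matching your JetPack release), with torchvision then built from source against it as described above:

pip3 uninstall torch torchvision
pip3 install torch-2.0.0+nv23.05-cp38-cp38-linux_aarch64.whl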

Update on my yolov5 test results on the Jetson Orin NX 16GB with JetPack 5.1.1.
Following the instructions on the NVIDIA developer forums, I selected the pre-built PyTorch 1.13.0 and torchvision 0.13.0 wheels. Apart from small modifications to torch’s upsampling.py and torchvision’s qnms_kernel.cpp, yolov5 runs with the expected performance on the Orin NX 16GB and torch.cuda.is_available() returns True.
Thanks @dusty_nv @simonnv
P.S. My yolov5 project is free-space vehicle license plate detection (LPD) and recognition (LPR). Before deploying with TensorRT, LPD on a 1920x1080 frame takes about 0.16 seconds and LPR about 0.056 seconds in my case.


Thanks for letting us know @user101843, glad you were able to get it working!

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.