Segmentation fault in JetPack 5.1 container when using CUDA device in PyTorch

Platform: Xavier NX with JetPack 5.1 (rev. 1) installed with SDK Manager.
I pulled the latest (5.1) Docker image from here:

Inside the container (which I run following the official instructions in the link above, with --runtime=nvidia), the following code throws a Segmentation fault (core dumped) error:
Code:

import torch
import torch.nn as nn

print(f'PyTorch version: {torch.__version__}')
print(f'CUDA available: {torch.cuda.is_available()}')
print(f'CUDA version: {torch.version.cuda}')
device = torch.device('cuda')
m = nn.Conv1d(7, 64, 1, bias=False, device=device)
input = torch.randn(100, 7, 32).to(device)
print('start')
output = m(input)
print('end')
print(output.shape)

Output:

PyTorch version: 2.0.0a0+ec3941ad.nv23.02
CUDA available: True
CUDA version: 11.4
start
Segmentation fault (core dumped)

However, if I set device = torch.device('cpu'), it works fine:
Output:

PyTorch version: 2.0.0a0+ec3941ad.nv23.02
CUDA available: True
CUDA version: 11.4
start
end
torch.Size([100, 64, 32])

Why does CUDA throw a segmentation fault? How can I fix this?
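In case it helps with debugging, one general way to see where such a crash happens (a sketch using only the Python standard library, not specific to JetPack) is to enable faulthandler before the CUDA calls; on a segfault it prints the Python traceback of each thread instead of just "Segmentation fault (core dumped)":

```python
import faulthandler

# Install handlers for SIGSEGV/SIGFPE/SIGABRT/SIGBUS so that a native
# crash dumps the Python traceback of every thread to stderr.
faulthandler.enable()

print(f"faulthandler enabled: {faulthandler.is_enabled()}")

# ... the CUDA code goes here; if it segfaults now, the last Python
# line executed (e.g. the Conv1d forward call) appears in the dump.
```

The same behavior can be turned on without changing the script by running it as `python3 -X faulthandler script.py`.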

Hi,

We will try to reproduce this issue and share more information with you later.

Thanks.

Additional info about my setup:

  • In SDK Manager I chose to install Jetson Linux and Jetson Runtime Components, but not Jetson SDK Components (the 16 GB of internal eMMC memory was not enough for them).
  • I configured Docker to store its images and containers on an external SD card, for the same reason: 16 GB is too small. The external SD card (SanDisk Extreme 128 GB) is attached to the NX carrier board via a USB port, using a USB-to-SD adapter.
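For reference, relocating Docker's storage as described above is done with the `data-root` key in `/etc/docker/daemon.json`, keeping the `nvidia` runtime entry that JetPack already registers there (the mount path below is just an example, not my actual path):

```json
{
    "runtimes": {
        "nvidia": {
            "path": "nvidia-container-runtime",
            "runtimeArgs": []
        }
    },
    "data-root": "/mnt/sdcard/docker"
}
```

After editing the file, the daemon has to be restarted (e.g. `sudo systemctl restart docker`) for the new storage location to take effect.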

Hi,

We have tested the l4t-pytorch:r35.2.1-pth2.0-py3 container on a Xavier NX.
The sample runs correctly:

...
>>> print('start')
start
>>> output = m(input)
>>> print('end')
end
>>> print(output.shape)
torch.Size([100, 64, 32])

There may have been an issue when setting up the environment.
Could you reflash the system and try it again?

Thanks.

I flashed again with the same parameters and I still get a segmentation fault. I suspect it could be caused by the Docker root directory being on an external drive. What does your setup look like? Are your OS, Docker, and the Docker root directory on the same drive?

I moved both the rootfs and the Docker root directory to the SD card and tried again, and I still get a segmentation fault when trying to use CUDA in PyTorch inside the container. I have no idea what else to try…

From your code snippet it is not clear whether you used the GPU or the CPU. On the CPU my code also works fine, as I described in my question.

Hi,

We ran it on the GPU, and our container is also stored on an external SSD:

device = torch.device('cuda')

Could you check whether CUDA works in your environment (inside the container)?
Please download the CUDA samples below and run the deviceQuery example.
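As a quick sanity check while the samples are building, a minimal Python sketch can also query the CUDA runtime directly via ctypes (this is not the official deviceQuery sample; the library name and a default CUDA runtime install are assumptions):

```python
import ctypes


def cuda_device_count(lib_name="libcudart.so"):
    """Ask the CUDA runtime how many devices it can see.

    Returns (status, count): status 0 means cudaSuccess,
    status None means the runtime library could not be loaded at all.
    """
    try:
        lib = ctypes.CDLL(lib_name)
    except OSError:
        return None, 0  # no CUDA runtime on this machine/container
    count = ctypes.c_int(0)
    # int cudaGetDeviceCount(int *count) from the CUDA Runtime API
    status = lib.cudaGetDeviceCount(ctypes.byref(count))
    return status, count.value


if __name__ == "__main__":
    status, count = cuda_device_count()
    if status is None:
        print("CUDA runtime library not found")
    elif status != 0:
        print(f"cudaGetDeviceCount failed with error {status}")
    else:
        print(f"CUDA devices visible: {count}")
```

If this already fails or reports zero devices inside the container, the problem is below PyTorch, in the container or CUDA setup itself.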

Thanks.