Overview
We are trying to create and start a pod that includes a container that accesses the GPU, using podman. However, an error occurs at the "Error occurrence" step below. Please tell me how to resolve it.
Host machine settings
- Install CUDA Toolkit 12.2
- Install NVIDIA Container Toolkit
- Generate the CDI Specification file for Podman
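For reference, generating the CDI specification typically looks like the following (a sketch based on the NVIDIA Container Toolkit documentation; the output path is the common default and may differ on other systems):

```shell
# Generate a CDI specification describing the installed NVIDIA devices
# (default CDI path; adjust if your distribution uses a different directory).
sudo nvidia-ctk cdi generate --output=/etc/cdi/nvidia.yaml

# List the device names the generated spec exposes,
# e.g. nvidia.com/gpu=0 and nvidia.com/gpu=all.
nvidia-ctk cdi list
```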
Creating the containers and starting them with podman
- Create 6 types of containers using Dockerfiles. These containers include applications that use the GPU, and they have a track record of running in a k8s environment.
- Create a .yaml file to start them with podman
Here is a sample of the YAML. All 6 types use almost the same YAML file.
apiVersion: v1
kind: Pod
metadata:
  name: test-pod
spec:
  restartPolicy: OnFailure
  containers:
  - name: test
    image: localhost/test:latest
    securityContext:
      privileged: true
    volumeMounts:
    - name: key-shm
      mountPath: /dev/shm/
    device:
    - nvidia-gpu
  volumes:
  - name: key-shm
    hostPath:
      path: /dev/shm/
      type: Directory
- Start with the podman command
I started it using the following command: podman play kube test.yaml
- Error occurrence
The error output varies depending on which container is created. Here is one example:
Pod:
3e9afb45b9047c9a0f6b0d511b59e8d480794fcf4909804c93e9df72e8d4fd06
Container:
106d1afed2092c329aa8329a6ceeb6ccb2e48a1fb58052e2cf21b39bfe9d3a4d
error starting container 106d1afed2092c329aa8329a6ceeb6ccb2e48a1fb58052e2cf21b39bfe9d3a4d: container_linux.go:380: starting container process caused: process_linux.go:545: container init caused: Running hook #0:: error running hook: exit status 1, stdout: , stderr: Auto-detected mode as 'legacy'
nvidia-container-cli: requirement error: invalid expression: OCI runtime error
./test: error while loading shared libraries: libcuda.so.1: cannot open shared object file: No such file or directory
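To isolate the problem, my understanding is that CDI injection can also be checked outside of kube play with a plain run (a sketch; `nvidia.com/gpu=all` assumes the generated spec exposes that device name, and the files injected by the spec normally include `nvidia-smi` and `libcuda.so.1`):

```shell
# If this prints the GPU table, the CDI spec and driver injection work,
# and the failure is specific to podman play kube / the YAML above.
# If libcuda.so.1 is still missing here, the CDI setup itself is the problem.
podman run --rm --device nvidia.com/gpu=all localhost/test:latest nvidia-smi
```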