K3s device plugin is in CrashLoopBackOff

We are using docker as container engine (25.0.3) and configured container runtime according to document GitHub - NVIDIA/k8s-device-plugin: NVIDIA device plugin for Kubernetes and deployed helm char after configuring the runclass but we are getting below error for device plugin pod in CrashLoopBackOff, please help us here to resolve the issue.

Error: 2024/04/08 11:30:00 Retreiving plugins. 2024/04/08 11:30:00 Detected non-NVML platform: could not load NVML: libnvidia-ml.so.1: cannot open shared object file: No such file or directory 2024/04/08 11:30:00 Detected non-Tegra platform: /sys/devices/soc0/family file not found 2024/04/08 11:30:00 Incompatible platform detected 2024/04/08 11:30:00 If this is a GPU node, did you configure the NVIDIA Container Toolkit? 2024/04/08 11:30:00 You can check the prerequisites at: GitHub - NVIDIA/k8s-device-plugin: NVIDIA device plugin for Kubernetes 2024/04/08 11:30:00 You can learn how to set the runtime at: GitHub - NVIDIA/k8s-device-plugin: NVIDIA device plugin for Kubernetes 2024/04/08 11:30:00 If this is not a GPU node, you should set up a toleration or nodeSelector to only deploy this plugin on GPU nodes 2024/04/08 11:30:00 Error: error starting plugins: error getting plugins: unable to load resource managers to manage plugin devices: platform detection failed

1 Like

Hi @All,
Please let us know if any solution for this.