OS: Ubuntu 20.04.1 LTS
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2022 NVIDIA Corporation
Built on Wed_Sep_21_10:33:58_PDT_2022
Cuda compilation tools, release 11.8, V11.8.89
Build cuda_11.8.r11.8/compiler.31833905_0
GPUs: 4 x RTX 4090
Last week, Docker builds worked without any issues. Two days later, however, containers failed to start with the error “driver/library version mismatch: unknown”. Many people online suggest that a reboot solves the problem, but we need our application to be stable, so rebooting each time is not an acceptable fix. I would like to understand what exactly happened and how to avoid such incidents entirely.
Here’s more information:
Attaching to test-lora-sd-webui1, train-lora1
Error response from daemon: failed to create task for container: failed to create shim task: OCI runtime create failed: runc create failed: unable to start container process: error during container init: error running hook #0: error running hook: exit status 1, stdout: , stderr: Auto-detected mode as 'legacy'
nvidia-container-cli: initialization error: nvml error: driver/library version mismatch: unknown
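To narrow down what the message refers to, one check that might help is comparing the version of the NVIDIA kernel module that is currently loaded with the version of the user-space NVML library installed on disk. The script below is only a minimal sketch under assumptions: it assumes the loaded module reports itself in `/proc/driver/nvidia/version` and that `libnvidia-ml.so.<version>` lives under `/usr/lib/x86_64-linux-gnu/` (typical for Ubuntu on x86_64, but not verified on this machine). The function names are hypothetical helpers, not part of any NVIDIA tool.

```python
import glob
import re
from pathlib import Path

# Matches version strings such as "525.105.17" or "535.54".
VERSION_RE = re.compile(r"\b(\d+\.\d+(?:\.\d+)?)\b")

# Assumed locations; adjust for other distributions or architectures.
PROC_VERSION_FILE = Path("/proc/driver/nvidia/version")
LIBRARY_GLOB = "/usr/lib/x86_64-linux-gnu/libnvidia-ml.so.*"


def kernel_module_version(proc_text):
    """Return the driver version found on the 'NVRM version:' line, or None."""
    for line in proc_text.splitlines():
        if line.startswith("NVRM version:"):
            match = VERSION_RE.search(line)
            return match.group(1) if match else None
    return None


def library_versions(filenames):
    """Return the set of versions encoded in libnvidia-ml.so.<version> names.

    Symlinks such as libnvidia-ml.so.1 carry no dotted version and are skipped.
    """
    versions = set()
    for name in filenames:
        match = re.search(r"libnvidia-ml\.so\.(\d+\.\d+(?:\.\d+)?)$", name)
        if match:
            versions.add(match.group(1))
    return versions


def main():
    loaded = None
    if PROC_VERSION_FILE.exists():
        loaded = kernel_module_version(PROC_VERSION_FILE.read_text())
    installed = library_versions(glob.glob(LIBRARY_GLOB))

    print(f"loaded kernel module : {loaded}")
    print(f"installed NVML libs  : {sorted(installed)}")
    if loaded is None or not installed:
        print("Could not determine one of the versions; check the assumed paths.")
    elif loaded in installed:
        print("Versions agree.")
    else:
        print("Versions differ: the loaded module and the library do not match.")


if __name__ == "__main__":
    main()
```

If the two versions differ, that would be consistent with the user-space driver packages having been updated on disk (for example by an automatic package upgrade) while the older kernel module stayed loaded, but this is a guess on my part and not something confirmed by the logs above.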
Additionally, here is the output of nvidia-bug-report.sh:
nvidia-bug-report.log.gz (1.3 MB)