On a DGX-Station (V100) I’m trying to upgrade from OS 5 (Ubuntu 20.04-based) to OS 6 (22.04 based). According to https://docs.nvidia.com/dgx/dgx-os-5-user-guide/upgrading.html
(I’ve previously successfully updated the same way from OS 4 to OS 5.)
Now at the nvidia-release-upgrade
step I always run into this error:
Reading cache
Checking package manager
Reading package lists... Done
Building dependency tree
Reading state information... Done
Get:1 http://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/x86_64 InRelease [1’581 B]
Hit http://security.ubuntu.com/ubuntu focal-security InRelease
Hit https://repo.download.nvidia.com/baseos/ubuntu/focal/x86_64 focal InRelease
Hit https://repo.download.nvidia.com/baseos/ubuntu/focal/x86_64 focal-updates InRelease
Hit https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2004/x86_64 InRelease
Get:2 https://repo.download.nvidia.com/baseos/ubuntu/jammy/x86_64 jammy InRelease [18.2 kB]
Get:3 https://repo.download.nvidia.com/baseos/ubuntu/jammy/x86_64 jammy-updates InRelease [18.1 kB]
Get:4 http://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/x86_64 Packages [408 kB]
Hit http://archive.ubuntu.com/ubuntu focal InRelease
Hit http://archive.ubuntu.com/ubuntu focal-updates InRelease
Get:5 https://repo.download.nvidia.com/baseos/ubuntu/jammy/x86_64 jammy/common amd64 Packages [21.0 kB]
Get:6 https://repo.download.nvidia.com/baseos/ubuntu/jammy/x86_64 jammy/dgx amd64 Packages [5’208 B]
Get:7 https://repo.download.nvidia.com/baseos/ubuntu/jammy/x86_64 jammy-updates/common amd64 Packages [343 B]
Get:8 https://repo.download.nvidia.com/baseos/ubuntu/jammy/x86_64 jammy-updates/dgx amd64 Packages [1’517 B]
Fetched 474 kB in 0s (0 B/s)
Reading package lists... Done
Building dependency tree
Reading state information... Done
Traceback (most recent call last):
File "/tmp/ubuntu-release-upgrader-k6sgsjyx/jammy", line 8, in <module>
sys.exit(main())
File "/tmp/ubuntu-release-upgrader-k6sgsjyx/DistUpgrade/DistUpgradeMain.py", line 241, in main
if app.run():
File "/tmp/ubuntu-release-upgrader-k6sgsjyx/DistUpgrade/DistUpgradeController.py", line 2042, in run
return self.fullUpgrade()
File "/tmp/ubuntu-release-upgrader-k6sgsjyx/DistUpgrade/DistUpgradeController.py", line 1873, in fullUpgrade
if not self.doPostInitialUpdate():
File "/tmp/ubuntu-release-upgrader-k6sgsjyx/DistUpgrade/DistUpgradeController.py", line 906, in doPostInitialUpdate
self.quirks.run("PostInitialUpdate")
File "/tmp/ubuntu-release-upgrader-k6sgsjyx/DistUpgrade/DistUpgradeQuirks.py", line 99, in run
func()
File "/tmp/ubuntu-release-upgrader-k6sgsjyx/DistUpgrade/DistUpgradeQuirks.py", line 126, in jammyPostInitialUpdate
self._fix_missing_nvidia_metapackage()
File "/tmp/ubuntu-release-upgrader-k6sgsjyx/DistUpgrade/DistUpgradeQuirks.py", line 1499, in _fix_missing_nvidia_metapackage
nv = NvidiaDetection()
File "/tmp/ubuntu-release-upgrader-k6sgsjyx/NvidiaDetector/nvidiadetector.py", line 79, in __init__
print(self.selectDriver())
File "/tmp/ubuntu-release-upgrader-k6sgsjyx/NvidiaDetector/nvidiadetector.py", line 325, in selectDriver
choice = occurrences[0]
IndexError: list index out of range
As far as I understand nvidiadetector.py
goes though all the apt-packages in the cache, tries to find for each driver the compatible devices. But it does not find the V100 (in the script it is named 10de:1db2
), hence the empty occurrences
list.
Does anyone know how to alleviate this issue to enable the update?