I can't install Kubernetes via Bright View or cm-kubernetes-setup.
Both fail with the following errors:
- Command '/bin/sh -c "DEBIAN_FRONTEND=noninteractive apt-get install --yes -o Dpkg::Options::="--force-confdef" -o Dpkg::Options::="--force-confold" nvidia-container-toolkit cm-kubernetes126 < /dev/null"' returned exit code 100 while expected (<ExitCodes.EX_OK: 0>,)
- Failed to install packages cm-kubernetes126,nvidia-container-toolkit into root=/cm/images/dgxa100-new
- Package install failed with error: Command '/bin/sh -c "DEBIAN_FRONTEND=noninteractive apt-get install --yes -o Dpkg::Options::="--force-confdef" -o Dpkg::Options::="--force-confold" nvidia-container-toolkit cm-kubernetes126 < /dev/null"' returned exit code 100 while expected (<ExitCodes.EX_OK: 0>,)
  STDOUT:
    Reading package lists...
    Building dependency tree...
    Reading state information...
    The following additional packages will be installed:
      libnvidia-container-tools libnvidia-container1 nvidia-container-toolkit-base
    The following NEW packages will be installed:
      cm-kubernetes126 libnvidia-container-tools libnvidia-container1 nvidia-container-toolkit nvidia-container-toolkit-base
    0 upgraded, 5 newly installed, 0 to remove and 0 not upgraded.
    Need to get 0 B/178 MB of archives.
    After this operation, 931 MB of additional disk space will be used.
    Selecting previously unselected package cm-kubernetes126.
    (Reading database ... 234744 files and directories currently installed.)
    Preparing to unpack .../cm-kubernetes126_1.26.3-100264-cm9.2-6748854839_amd64.deb ...
    Unpacking cm-kubernetes126 (1.26.3-100264-cm9.2-6748854839) ...
    Preparing to unpack .../libnvidia-container1_1.12.0-1_amd64.deb ...
    Unpacking libnvidia-container1:amd64 (1.12.0-1) ...
    Preparing to unpack .../libnvidia-container-tools_1.12.0-1_amd64.deb ...
    Unpacking libnvidia-container-tools (1.12.0-1) ...
    Preparing to unpack .../nvidia-container-toolkit-base_1.12.0-1_amd64.deb ...
    Unpacking nvidia-container-toolkit-base (1.12.0-1) ...
    Preparing to unpack .../nvidia-container-toolkit_1.12.0-1_amd64.deb ...
    Unpacking nvidia-container-toolkit (1.12.0-1) ...
  STDERR:
    E: Can not write log (Is /dev/pts mounted?) - posix_openpt (19: No such device)
    dpkg: error processing archive /var/cache/apt/archives/libnvidia-container1_1.12.0-1_amd64.deb (--unpack):
     trying to overwrite '/usr/share/doc/libnvidia-container1/changelog.Debian.gz', which is also in package cm-nvidia-container-toolkit 3.7.0-100053-cm9.2-000d5eaaf1
    dpkg: error processing archive /var/cache/apt/archives/libnvidia-container-tools_1.12.0-1_amd64.deb (--unpack):
     trying to overwrite '/usr/bin/nvidia-container-cli', which is also in package cm-nvidia-container-toolkit 3.7.0-100053-cm9.2-000d5eaaf1
    dpkg-deb (subprocess): decompressing archive member: lzma write error: Broken pipe
    dpkg-deb: error: subprocess returned error exit status 2
    dpkg: error processing archive /var/cache/apt/archives/nvidia-container-toolkit-base_1.12.0-1_amd64.deb (--unpack):
     trying to overwrite '/etc/nvidia-container-runtime/config.toml', which is also in package cm-nvidia-container-toolkit 3.7.0-100053-cm9.2-000d5eaaf1
    dpkg-deb (subprocess): decompressing archive member: lzma write error: Broken pipe
    dpkg-deb: error: subprocess returned error exit status 2
    dpkg-deb (subprocess): cannot copy archive member from '/var/cache/apt/archives/nvidia-container-toolkit-base_1.12.0-1_amd64.deb' to decompressor pipe: failed to write (Broken pipe)
    dpkg: error processing archive /var/cache/apt/archives/nvidia-container-toolkit_1.12.0-1_amd64.deb (--unpack):
     trying to overwrite '/usr/share/doc/nvidia-container-toolkit/changelog.Debian.gz', which is also in package cm-nvidia-container-toolkit 3.7.0-100053-cm9.2-000d5eaaf1
    No apport report written because MaxReports is reached already
    Errors were encountered while processing:
     /var/cache/apt/archives/libnvidia-container1_1.12.0-1_amd64.deb
     /var/cache/apt/archives/libnvidia-container-tools_1.12.0-1_amd64.deb
     /var/cache/apt/archives/nvidia-container-toolkit-base_1.12.0-1_amd64.deb
     /var/cache/apt/archives/nvidia-container-toolkit_1.12.0-1_amd64.deb
    E: Sub-process /usr/bin/dpkg returned an error code (1)
- Nodes missing in kubectl get nodes output are: DGXA100-Station
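To make the file conflicts in the dpkg output easier to see, here is a minimal Python sketch that extracts the "trying to overwrite" errors from a pasted log and groups them by the package that already owns the file. The function name find_overwrite_conflicts is my own hypothetical helper, not part of Bright or cm-kubernetes-setup, and the regular expression only assumes the message wording that appears in the log above.

```python
import re
from collections import defaultdict

# Matches dpkg's file-conflict message. Both straight and curly quotes are
# accepted because pasted logs often have their quotes converted.
OVERWRITE_RE = re.compile(
    r"error processing archive (?P<archive>\S+\.deb)\s*\([^)]*\):\s*"
    r"trying to overwrite ['‘’](?P<path>[^'‘’]+)['‘’], "
    r"which is also in package (?P<owner>\S+)"
)


def find_overwrite_conflicts(log_text):
    """Return {owning_package: [(archive, conflicting_path), ...]}."""
    conflicts = defaultdict(list)
    for match in OVERWRITE_RE.finditer(log_text):
        conflicts[match.group("owner")].append(
            (match.group("archive"), match.group("path"))
        )
    return dict(conflicts)


if __name__ == "__main__":
    # One entry copied from the STDERR above, used as sample input.
    sample = (
        "dpkg: error processing archive "
        "/var/cache/apt/archives/libnvidia-container-tools_1.12.0-1_amd64.deb (--unpack):\n"
        " trying to overwrite '/usr/bin/nvidia-container-cli', which is also in "
        "package cm-nvidia-container-toolkit 3.7.0-100053-cm9.2-000d5eaaf1\n"
    )
    for owner, hits in find_overwrite_conflicts(sample).items():
        print(f"already installed: {owner}")
        for archive, path in hits:
            print(f"  {archive} wants to write {path}")
```

In the STDERR above, all four failing archives conflict with files owned by the already installed cm-nvidia-container-toolkit package.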
If I skip the failed step, the setup continues until progress 80 and stops at:
#### stage: kubernetes: Wait Until Ingress Controller Ready
At that point the pods look like this:
root@bright92:~# kubectl get pods --all-namespaces
NAMESPACE       NAME                                         READY   STATUS              RESTARTS   AGE
ingress-nginx   ingress-nginx-admission-create-1-5-1-lc6kp   0/1     Pending             0          17m
ingress-nginx   ingress-nginx-admission-patch-8sqzj          0/1     Pending             0          17m
ingress-nginx   ingress-nginx-controller-6db97f6465-qmh9b    0/1     ContainerCreating   0          17m
kube-system     calico-kube-controllers-7bdbfc669-4gf6s      0/1     Pending             0          17m
kube-system     calico-node-5rzdw                            1/1     Running             0          17m
kube-system     coredns-7958c64b9d-czhhk                     0/1     Pending             0          17m
kube-system     coredns-7958c64b9d-qbm29                     1/1     Running             0          17m
How can I fix this problem?