Running Isaac Kit in Kubernetes on Google VM gives fatal error, but not on minikube with --driver=none locally

Running the identical Docker container on minikube with the --driver=none option is 100% perfect

It seems that we need nested virtualisation

but alas we cannot set that in

Is there something else that is causing this error?

2022-06-09 05:45:34,828 ERROR (render-files): __main__: Render error: 2022-06-09 05:45:34 [382ms] [Error] [carb.windowing-glfw.plugin] GLFW initialization failed.
2022-06-09 05:45:34,828 ERROR (render-files): __main__: Render error: 2022-06-09 05:45:34 [382ms] [Error] [carb] Failed to startup plugin carb.windowing-glfw.plugin (interfaces: [carb::windowing::IGLContext v0.1],[carb::windowing::IWindowing v1.1]) (impl: carb.windowing-glfw.plugin)
2022-06-09 05:45:34,828 ERROR (render-files): __main__: Render error: 2022-06-09 05:45:34 [666ms] [Fatal] [carb.crashreporter-breakpad.plugin] libcarb.events.plugin.so!carbOnPluginStartup
2022-06-09 05:45:34,828 ERROR (render-files): __main__: Render error: 2022-06-09 05:45:34 [668ms] [Fatal] [carb.crashreporter-breakpad.plugin] libcarb.tasking.plugin.so!std::thread::_State_impl<std::thread::_Invoker<std::tuple<void (carb::tasking::Scheduler::*)(unsigned int, int, carb::cpp20::latch*), carb::tasking::Scheduler*, unsigned int, int, carb::cpp20::latch*> > >::_M_run()
2022-06-09 05:45:34,828 ERROR (render-files): __main__: Render error: 2022-06-09 05:45:34 [670ms] [Fatal] [carb.crashreporter-breakpad.plugin] libpthread.so.0!funlockfile
2022-06-09 05:45:34,829 ERROR (render-files): __main__: Render error: 2022-06-09 05:45:34 [672ms] [Fatal] [carb.crashreporter-breakpad.plugin] libc.so.6!explicit_bzero
2022-06-09 05:45:34,829 ERROR (render-files): __main__: Render error: 2022-06-09 05:45:34 [672ms] [Fatal] [carb.crashreporter-breakpad.plugin] libGLX_nvidia.so.0!vk_icdGetInstanceProcAddr

seems like the only work around is this:

Did not need nested virtualization, just need to make sure k8s has the right kernel for the containers to access the hardware for the Vulcan 470. 1.2 CUDA 11.4 base ubi 8 image

See

Specifically shut down all pods using GPU uninstall

Helm chart for GPU daemonset and then put this on

kubectl apply -f https://raw.githubusercontent.com/GoogleCloudPlatform/container-engine-accelerators/master/nvidia-driver-installer/cos/daemonset-preloaded-latest.yaml