It runs on 8xL40S since this 3rd configuration was added recently.
That instance is in launchpad so strangely ‘sudo microk8s’ has a lot of issues but running straight with helm and kubectl worked fine.
Because the 8xL40S can run normally, it can be confirmed that the problem before is caused by the insufficient resources.
The LLM occupies about half of all the resources, so that everything is OK after taking out that. Could you try to configure-the-nims to setup some keys for the model to verify the connection refused and unauthorized issue?
Sure. Because the forum you selected is not correct, we did not track it in time. You can choose the visual-ai-agent forum to file the VSS related topic later.