I’m trying to run the alphafold2 nim on a DGX that’s in an HPC. There are no docker modules, instead i pulled the container with singularity/apptainer:
export APPTAINER_DOCKER_USERNAME='$oauthtoken'
export APPTAINER_DOCKER_PASSWORD='nvapi-*****************************'
apptainer pull af2_nim-2-1-0.sif docker://nvcr.io/nim/deepmind/alphafold2:latest
Trying to run NIM on the DGX within an interactive terminal or with SLURM sbatch using the following commands in apptainer throws an authentication error:
(base) [user@dgx09 home]$ apptainer run --nv --bind $LOCAL_NIM_CACHE:/opt/nim/.cache --env NGC_API_KEY=$NGC_API_KEY af2_nim-2-1-0.sif
INFO: underlay of /usr/bin/nvidia-smi required more than 50 (480) bind mounts
WARNING: group: unknown groupid 2190919
15:4: not a valid test operator:
15:4: not a valid test operator: 12.6
21:4: not a valid test operator: (
21:4: not a valid test operator: 550.54.15
"timestamp": "2024-12-11 17:04:36,147", "level": "INFO", "message": "Starting nimlib 0.3.7"
"timestamp": "2024-12-11 17:04:36,147", "level": "INFO", "message": "NIM VERSION:"
1.0.0
"timestamp": "2024-12-11 17:04:36,148", "level": "INFO", "message": "NIM LICENSE:"
***LISCENCE***
"timestamp": "2024-12-11 17:04:36,148", "level": "INFO", "message": "NIM banner.txt:"
AlphaFold2 was originally developed by Google DeepMind.
This NIM has been packaged and optimized by NVIDIA to support sequence-to-structure, sequence-to-msa, and msa-to-structure inference.
"timestamp": "2024-12-11 17:04:36,175", "level": "INFO", "message": "Matched profile_id in manifest from env NIM_MODEL_PROFILE to: 4df59ce8aa1ff5a0d88e808316d0c83ea0c3bde259e9da3a12a1cdc7a54e2635"
"timestamp": "2024-12-11 17:04:36,175", "level": "INFO", "message": "Using the profile selected by the profile selector: 4df59ce8aa1ff5a0d88e808316d0c83ea0c3bde259e9da3a12a1cdc7a54e2635"
"timestamp": "2024-12-11 17:04:36,175", "level": "INFO", "message": "Using the workspace specified during init: /opt/nim/workspace"
"timestamp": "2024-12-11 17:04:36,175", "level": "INFO", "message": "Downloading manifest profile: 4df59ce8aa1ff5a0d88e808316d0c83ea0c3bde259e9da3a12a1cdc7a54e2635"
[12-11 17:04:42.397 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:128] One or more errors fetching files:
[12-11 17:04:42.397 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.397 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.397 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.397 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.397 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.397 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.397 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.397 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.397 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.397 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.397 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.397 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.397 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.397 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.397 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.397 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.397 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.397 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.397 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.397 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.397 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.397 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.397 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.397 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.397 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.397 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.397 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.398 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.398 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.398 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.398 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.398 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.398 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.398 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.398 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.398 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.398 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.398 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.398 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.398 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.398 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.398 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.398 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.398 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.398 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.398 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.398 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.398 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.398 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.398 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.398 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.398 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.398 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.398 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.398 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.398 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.398 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.398 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.398 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.398 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.398 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.398 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.398 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.398 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.398 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.398 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.398 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.398 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.398 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.398 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.398 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.398 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
[12-11 17:04:42.398 ERROR nim_sdk::hub::repo rust/nim-sdk/src/hub/repo.rs:130] Authentication Error
"timestamp": "2024-12-11 17:04:42,399", "level": "ERROR", "message": "Error downloading models, ignoring and continuing startup"
Traceback (most recent call last):
File "/usr/local/lib/python3.10/dist-packages/nimlib/nim_sdk.py", line 313, in download_models
cache = repo.get_all() # download model artifacts to cache
Exception: Authentication Error
The above exception was the direct cause of the following exception:
Traceback (most recent call last):
File "/usr/local/lib/python3.10/dist-packages/nimlib/nimutils.py", line 45, in download_models
model_manifest.download_models()
File "/usr/local/lib/python3.10/dist-packages/nimlib/nim_sdk.py", line 339, in download_models
raise ManifestDownloadError(
nimlib.exceptions.ManifestDownloadError: Error downloading manifest to cache: /opt/nim/workspace error: Authentication Error
Traceback (most recent call last):
File "/usr/local/lib/python3.10/dist-packages/nimlib/nim_sdk.py", line 313, in download_models
cache = repo.get_all() # download model artifacts to cache
Exception: Authentication Error
The above exception was the direct cause of the following exception:
Traceback (most recent call last):
File "/usr/local/bin/start_server", line 8, in <module>
sys.exit(main())
File "/usr/local/lib/python3.10/dist-packages/nimlib/start_server.py", line 73, in main
nimutils.download_models(model_manifest)
File "/usr/local/lib/python3.10/dist-packages/nimlib/nimutils.py", line 45, in download_models
model_manifest.download_models()
File "/usr/local/lib/python3.10/dist-packages/nimlib/nim_sdk.py", line 339, in download_models
raise ManifestDownloadError(
nimlib.exceptions.ManifestDownloadError: Error downloading manifest to cache: /opt/nim/workspace error: Authentication Error
I am passing in the right environment flags but I’ve tried different generated api keys and personal keys but nothing works. Did something break in the apptainer conversion or is there an additional authentication key I’m missing to get this NIM to start?