I have an issue when installing NVIDIA NIM on laptop

ariuskudar · September 6, 2024, 2:01pm

Hi Team,

I am trying to install NVIDIA NIM on my personal laptop following the procedure in this page:

Can NVIDIA NIM be installed on my personal laptop which has Intel GPU?
Also if I have to eventually rent a VM from AWS to test NIM, what config of VM should I look for?

Thank you

TomNVIDIA · September 6, 2024, 2:17pm

Hello,

If you do not have GPU infrastructure to self-host NIM, please check out LaunchPad.

NVIDIA LaunchPad provides free access to enterprise NVIDIA hardware and software through an internet browser. Users can experience the power of AI with end-to-end solutions through guided hands-on labs or as a development sandbox. Test, prototype, and deploy your own applications and models against the latest and greatest that NVIDIA has to offer.

ariuskudar · September 6, 2024, 2:30pm

Thanks.
I am asked to install NIM myself not using launchpad.
If I can’t install it on Intel GPU, should I rent an machine from AWS?

ariuskudar · September 6, 2024, 4:11pm

I am following the commands in this page on “docker” tab to do the installation:

The commands given are the followings which I can’t run on windows even if I have WSL2 and the docker with Ubuntu 22.04 installed.
So I am wondering where I need exactly to execute these commands if I am working with Windows11.
Should I execute them thru WSL or CMD?

$ docker login nvcr.io
Username: $oauthtoken
Password: <Your Key>

Get API Key

Copy Code

Pull and run the NVIDIA NIM with the command below. This will download the optimized model for your infrastructure.

export NGC_API_KEY=<PASTE_API_KEY_HERE>
export LOCAL_NIM_CACHE=~/.cache/nim
mkdir -p "$LOCAL_NIM_CACHE"
docker run -it --rm \
    --gpus all \
    --shm-size=16GB \
    -e NGC_API_KEY \
    -v "$LOCAL_NIM_CACHE:/opt/nim/.cache" \
    -u $(id -u) \
    -p 8000:8000 \
    nvcr.io/nim/meta/llama-3.1-8b-instruct:1.1.2

Copy Code

You can now make a local API call using this curl command:

curl -X 'POST' \
'http://0.0.0.0:8000/v1/chat/completions' \
-H 'accept: application/json' \
-H 'Content-Type: application/json' \
-d '{
    "model": "meta/llama-3.1-8b-instruct",
    "messages": [{"role":"user", "content":"Write a limerick about the wonders of GPU computing."}],
    "max_tokens": 64
}'

ariuskudar · September 7, 2024, 2:13am

I have tried to get an instance from Google cloud with an A100 GPU to be able to test NIM, but my request is declined.
Is there a chance I run NIM anywhere else for free?
I also tried loading docker on Oracle cloud and got the GPU compatibility error, because that VM uses an AMD GPU.
Also in this link, GitHub - NVIDIA/nim-anywhere: Accelerate your Gen AI with NVIDIA NIM and NVIDIA AI Workbench, it says to use AI workbench and when I try to install it on my personal laptop I get docker installation failure. Do I have to use this AI workbench on a VM that has NVIDIA GPU on it?

neal.vaidya · September 9, 2024, 1:25am

Hi @ariuskudar – to run NIM you need a machine with an NVIDIA GPU and with the ability to launch docker containers. You can use AI Workbench to connect to a server that has an NVIDIA GPU but either way you’ll have to reserve the cloud VM.

Any cloud VM with an NVIDIA GPU with Compute Capability > 7.0 should work with NIM. So on AWS, that would be P5, P4, P3, G6e, G6, G5g, G5, or G4dn instances.

In terms of trying things for free – the hosted APIs on build.nvidia.com are the same as what you would get from deploying NIM yourself. If you need to deploy NIM yourself, I’d recommend taking another look at the Launchpad NVIDIA NIM for deploying large language models (LLMs).

ariuskudar · September 18, 2024, 8:24pm

I finally could get things going and run the codes in the project notebook here:

but I am getting the following error that some nemo modules are not installed, can you please instruct:

Traceback (most recent call last): File "[/NeMo/examples/nlp/language_modeling/tuning/megatron_gpt_finetuning.py", line 18](http://localhost:8887/NeMo/examples/nlp/language_modeling/tuning/megatron_gpt_finetuning.py#line=17), in <module> from nemo.collections.nlp.models.language_modeling.megatron_gpt_sft_model import MegatronGPTSFTModel ModuleNotFoundError: No module named 'nemo'

I tried this code block, and it says Cython is not installed, although my terminal shows that it is:

apt-get update && apt-get install -y libsndfile1 ffmpeg
pip install Cython packaging
pip install nemo_toolkit[‘all’]

error is:

line 318, in run_setup
          exec(code, locals())
        File "<string>", line 5, in <module>
      ModuleNotFoundError: No module named 'Cython'
      [end of output]
  
  note: This error originates from a subprocess, and is likely not a problem with pip.
error: subprocess-exited-with-error

× Getting requirements to build wheel did not run successfully.
│ exit code: 1
╰─> See above for output.

note: This error originates from a subprocess, and is likely not a problem with pip.

Topic		Replies	Views
Getting Started With NVIDIA NIM Tutorial Issues with NGC Registry Access/Accounts ubuntu , nim , llm , llama3-8b-instruct	7	635	July 24, 2024
NVIDIA NIM Container with CUDA out of Memory Problem Docker and NVIDIA Docker cuda , ubuntu , docker , nim , llama3-8b-instruct	2	314	September 20, 2024
A Simple Guide to Deploying Generative AI with NVIDIA NIM Technical Blog nim	9	554	September 8, 2024
/opt/nim/start-server.sh: line 61: 32 Killed python3 -m vllm_nvext.entrypoints.openai.api_server Container: CUDA	0	197	July 9, 2024
NIM nim/meta/llama3-8b-instruct - no API key is detected NGC GPU Cloud	2	521	July 23, 2024
NIM API key not Found Models nim , llama-31-8b-instruct , llama	4	209	September 21, 2024
Unable to Run NIM on H100 GPU Due to Profile Compatibility Issue Despite Sufficient GPU Resources Models nim , llama-31-8b-instruct , llama	1	66	November 12, 2024
Managing AI Inference Pipelines on Kubernetes with NVIDIA NIM Operator Technical Blog nim	4	81	October 17, 2024
Model says there is a compatible profile but fails on data type Models nim , mistral-7b-instruct-v03	4	342	August 21, 2024
RTX 4090 shows as "non-free GPU" when running NIM model in docker AI Foundation Models and Endpoints nim	8	1278	October 21, 2024

I have an issue when installing NVIDIA NIM on laptop

Related topics