Support for vision models after enterprise 4000 credits are exhausted - onboarding on paid subscription

tushars1 · October 23, 2024, 4:30am

Hello NVidia folks,
I am using the following Vision instruct models using the enterprise credits (and python code**).

I am not able to figure out how to continue using it when my credit tokens are exhausted.

Some forums suggest using docker/helm charts or using HuggingFace Enterprise subscription.
But it looks too complex to setup. I would like to continue using the API keys and python code** on NVIDIA cloud.

ht tps://forums.developer.nvidia.com/t/nim-api-credits/305703
ht tps://docs.nvidia.com/nim/large-language-models/latest/deploy-helm.html
ht tps://build.nvidia.com/meta/llama-3_1-70b-instruct?snippet_tab=Docker

Could someone please point me in the right direction?

python code**

import requests, base64

invoke_url = "https://ai.api.nvidia.com/v1/gr/meta/llama-3.2-90b-vision-instruct/chat/completions"
stream = True

with open("image.png", "rb") as f:
  image_b64 = base64.b64encode(f.read()).decode()

assert len(image_b64) < 180_000, \
  "To upload larger images, use the assets API (see docs)"
  

headers = {
  "Authorization": "Bearer $API_KEY_REQUIRED_IF_EXECUTING_OUTSIDE_NGC",
  "Accept": "text/event-stream" if stream else "application/json"
}

payload = {
  "model": 'meta/llama-3.2-90b-vision-instruct',
  "messages": [
    {
      "role": "user",
      "content": f'What is in this image? <img src="data:image/png;base64,{image_b64}" />'
    }
  ],
  "max_tokens": 512,
  "temperature": 1.00,
  "top_p": 1.00,
  "stream": stream
}

response = requests.post(invoke_url, headers=headers, json=payload)

if stream:
    for line in response.iter_lines():
        if line:
            print(line.decode("utf-8"))
else:
    print(response.json())

Topic		Replies	Views
Audio to face Credit Access/Accounts api	4	92	March 4, 2025
Need more credits for NIM cloud API Access/Accounts nim	3	1438	April 16, 2025
What do I do once I run out of credits for using bees? Access/Accounts audio2face	2	146	January 13, 2025
Getting Started With NVIDIA NIM Tutorial Issues with NGC Registry Access/Accounts ubuntu , nim , llm , llama3-8b-instruct	7	1414	July 24, 2024
Unable to Run NIM on H100 GPU Due to Profile Compatibility Issue Despite Sufficient GPU Resources Models nim , llama-31-8b-instruct , llama	1	193	November 12, 2024
A Simple Guide to Deploying Generative AI with NVIDIA NIM Technical Blog nim	9	717	September 8, 2024
API Credit balance Models nim	5	367	January 7, 2025
NVIDIA NIM Container with CUDA out of Memory Problem Docker and NVIDIA Docker cuda , ubuntu , docker , nim , llama3-8b-instruct	2	501	September 20, 2024
NVIDIA NIM API / openai.API: Error code: 402,Cloud credits expired - Please contact NVIDIA representatives Models nim , llama-31-405b-instruct , llama	8	407	January 19, 2025
/opt/nim/start-server.sh: line 61: 32 Killed python3 -m vllm_nvext.entrypoints.openai.api_server Container: CUDA	0	261	July 9, 2024

Support for vision models after enterprise 4000 credits are exhausted - onboarding on paid subscription

Related topics