What image do I need to run the "nvidia/llama/mistral-7b-int4-chat:1.2" model?

I’ve downloaded the mistral-7b-int4-chat_v1.2 model onto a PVC and would like to run the model in KServe. What image should I be using for this model?

Thank you.

@rigoberto.corujo we don’t support int4 models in NIM at the moment, but you can deploy mistral 7B in fp8 or fp16 using the following container:

nvcr.io/nim/mistralai/mistral-7b-instruct-v03:1.0.0
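Outside of KServe, a quick way to sanity-check the container is to launch it directly on a GPU host. The following is a sketch based on the usual NIM run pattern; the cache path, port, and key handling are assumptions, so check the NIM documentation for your release:

```shell
# Sketch: run the Mistral 7B Instruct v0.3 NIM locally.
# Assumes a GPU host with the NVIDIA Container Toolkit installed and a
# valid NGC API key; the cache path and port are illustrative defaults.
export NGC_API_KEY="<your NGC API key>"   # placeholder
echo "$NGC_API_KEY" | docker login nvcr.io -u '$oauthtoken' --password-stdin

docker run -it --rm --gpus all \
  -e NGC_API_KEY \
  -v "$HOME/.cache/nim:/opt/nim/.cache" \
  -p 8000:8000 \
  nvcr.io/nim/mistralai/mistral-7b-instruct-v03:1.0.0
```

On first start the container downloads the model profile into the mounted cache, then serves an OpenAI-compatible API on the published port.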

I believe the specific model you are referring to was built for use with the ChatRTX app.

Thank you. I have my model in a PVC. What argument do I need to pass to the container so that it runs the model?

% kubectl exec -it pod/model-store-pod -- ls -l /mnt/models/mistral-7b-int4-chat_v1.2/
total 4123108
-rw-r--r-- 1  504 staff         63 Jul 24 21:26 README.txt
-rw-r--r-- 1  504 staff        891 Jul 24 21:26 config.json
-rw-r--r-- 1  504 staff        143 Jul 24 21:26 license.txt
drwxr-xr-x 2 root root        4096 Jul 24 21:26 mistral7b_hf_tokenizer
drwxr-xr-x 2 root root        4096 Jul 24 21:26 mistral_kv_int8_scales
-rw-r--r-- 1  504 staff 4222035384 Jul 24 21:31 rank0.safetensors

Hi @rigoberto.corujo, you can’t run this model with NIM.

Thank you. Perhaps I picked the wrong model. To run the Mistral-7B-v0.1 model from a PVC, what would be the image for that?

Thank you.

We technically don’t support Mistral-7B-v0.1 for deployment with NIM, only the Mistral-7B-v0.3 model.

You can deploy the mistral-7b-instruct-v03 model with the nvcr.io/nim/mistralai/mistral-7b-instruct-v03:1.0.0 container – check out the KServe deployment instructions in the kserve directory of the NVIDIA/nim-deploy repository on GitHub.
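For reference, an InferenceService pointing a NIM container at a PVC typically looks something like the sketch below. The resource names, the storageUri path, and the serving-runtime wiring here are placeholders, not the exact manifests from nim-deploy – follow the repository’s instructions for the supported ClusterServingRuntime and model format:

```yaml
# Hypothetical sketch only: names (mistral-nim, model-store-pvc) and the
# storageUri path are assumptions; use the manifests from nim-deploy/kserve.
apiVersion: serving.kserve.io/v1beta1
kind: InferenceService
metadata:
  name: mistral-nim
spec:
  predictor:
    model:
      modelFormat:
        name: nim              # matches the NIM ServingRuntime's modelFormat
      storageUri: pvc://model-store-pvc/mistral-7b-instruct-v03
      resources:
        limits:
          nvidia.com/gpu: "1"
```

KServe mounts the PVC contents into the predictor pod, so the container picks the model up from the mounted path rather than downloading it from NGC.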
