I would like to use llama-3.2-nv-embedqa-1b-v2 with pgvector, but the default 2048-dimensional output exceeds pgvector's 2,000-dimension index limit. The documentation says the output can be configured to a smaller value (e.g., 1024). Is this a value that gets passed in the request, or is it configured some other way?
Output: Model outputs embedding vectors of maximum dimension 2048 for each text string (can be configured to 384, 512, 768, 1024, or 2048).
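For context, here is a minimal sketch of how I understand the request would look if the dimension is set per-request. It assumes the model's OpenAI-compatible embeddings endpoint accepts a `dimensions` field alongside `input` and `input_type`; the endpoint URL and field names should be checked against the actual NIM documentation, and the key is a placeholder:

```python
import json
import urllib.request

# Assumed endpoint and placeholder credentials -- verify against your deployment.
API_URL = "https://integrate.api.nvidia.com/v1/embeddings"
API_KEY = "nvapi-..."  # placeholder

# Sketch of a payload requesting a 1024-dim embedding so it fits a
# pgvector vector(1024) column. "dimensions" is the assumed knob here.
payload = {
    "model": "nvidia/llama-3.2-nv-embedqa-1b-v2",
    "input": ["What is pgvector?"],
    "input_type": "query",   # this model distinguishes "query" vs "passage"
    "dimensions": 1024,      # one of the documented sizes: 384/512/768/1024/2048
}

def embed(payload: dict) -> list:
    """Send the embeddings request and return the first embedding vector."""
    req = urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode(),
        headers={
            "Authorization": f"Bearer {API_KEY}",
            "Content-Type": "application/json",
        },
    )
    with urllib.request.urlopen(req) as resp:
        body = json.load(resp)
    # If "dimensions" is honored, len(vector) should equal payload["dimensions"].
    return body["data"][0]["embedding"]
```

If this is right, the returned vector could be stored directly in a `vector(1024)` column; I'd just like confirmation that `dimensions` (or whatever the field is called) is the intended mechanism rather than a deployment-time setting.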