Assistance Required for API Call Error: Prompt Length Exceeds Maximum Input Length in TRTGptModel

Context:
Error Details: The error below indicates that the prompt length (1514 tokens) exceeds the maximum input length (255 tokens), and it suggests setting the log level to info and checking the TRTGptModel logs to see how the maximum input length is determined.
Error during inference of request chat-6abeade2e0bc48f6bd02486825f35ed1 – Encountered an error when fetching new request: Prompt length (1514) exceeds maximum input length (255). Set log level to info and check TRTGptModel logs for how maximum input length is set (/home/jenkins/agent/workspace/LLM/release-0.12/L0_PostMerge/llm/cpp/include/tensorrt_llm/batch_manager/llmRequest.h:249)

Attempted Fixes: I’ve tried the following commands, all without success:
sudo docker run -it --runtime=nvidia --gpus all --shm-size=16GB --memory="40g" --memory-swap="50g" --cpus=8 -e PROFILE_ID="193649a2eb95e821309d6023a2cabb31489d3b690a9973c7ab5d1ff58b0aa7eb" -e NGC_API_KEY="$NGC_API_KEY" -v "$LOCAL_NIM_CACHE:/opt/nim/.cache" -p 8000:8000 nvcr.io/nim/mistralai/mistral-7b-instruct-v0.3:latest
sudo docker run -it --runtime=nvidia --gpus all --shm-size=16GB --memory="40g" --memory-swap="50g" --cpus=8 -e MAX_PROMPT_LENGTH=4096 -e NGC_API_KEY="$NGC_API_KEY" -v "$LOCAL_NIM_CACHE:/opt/nim/.cache" -p 8000:8000 nvcr.io/nim/mistralai/mistral-7b-instruct-v0.3:latest
sudo docker run -it --runtime=nvidia --gpus all --shm-size=16GB --memory="40g" --memory-swap="50g" --cpus=8 -e NIM_MAX_MODEL_LEN=4096 -e MAX_INPUT_LENGTH=4096 -e NGC_API_KEY="$NGC_API_KEY" -v "$LOCAL_NIM_CACHE:/opt/nim/.cache" -p 8000:8000 nvcr.io/nim/mistralai/mistral-7b-instruct-v0.3:latest
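Since the error message itself recommends raising the log level to see how the maximum input length is being set, one more run worth trying combines the length override with info-level logging. This is only a sketch: it assumes the image honors NIM_LOG_LEVEL and NIM_MAX_MODEL_LEN (I have not confirmed either for this specific container), and any other differences from the commands above are unintentional.

```shell
# Assumption: NIM_LOG_LEVEL=INFO makes the TRTGptModel startup logs
# report the effective maximum input length, and NIM_MAX_MODEL_LEN
# requests a 4096-token context. Both variable names are assumptions.
sudo docker run -it --runtime=nvidia --gpus all --shm-size=16GB \
  --memory="40g" --memory-swap="50g" --cpus=8 \
  -e NIM_LOG_LEVEL=INFO \
  -e NIM_MAX_MODEL_LEN=4096 \
  -e NGC_API_KEY="$NGC_API_KEY" \
  -v "$LOCAL_NIM_CACHE:/opt/nim/.cache" \
  -p 8000:8000 \
  nvcr.io/nim/mistralai/mistral-7b-instruct-v0.3:latest
```

If the variables are honored, the startup output should state the configured maximum input length, which would show whether the 255-token limit comes from the engine profile rather than from these environment variables.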

I would greatly appreciate assistance in resolving this issue.

Thank you,