DeepSeek API usage reports extremely inflated token counts and leaks DSML tool call markers

haidydev · April 26, 2026, 2:39am

model: deepseek-ai/deepseek-v4-flash

I’m using DeepSeek through the NVIDIA API endpoint, and I noticed that the usage field returned by the backend contains extremely inflated token counts compared with the actual character count.

Example response:

"usage": {
  "inputCharacters": 21864,
  "inputTokens": 878912,
  "outputCharacters": 84,
  "outputTokens": 1914,
  "totalTokens": 880826
}

The reported inputTokens value works out to about 40 tokens per input character (878912 / 21864 ≈ 40.2), which seems unreasonable. With around 21k input characters, I would expect a token count well below the character count, not close to 879k tokens.
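For anyone who wants to run the same sanity check on their own responses, here is a minimal sketch in Python. It only uses the field names from the usage block quoted above. The helper names and the 1.0 tokens-per-character threshold are my own assumptions, not anything documented by NVIDIA or DeepSeek. For ordinary text I would expect fewer tokens than characters, but some inputs (emoji-heavy text, for example) could legitimately exceed that.

```python
# Usage block copied from the response quoted in this post.
usage = {
    "inputCharacters": 21864,
    "inputTokens": 878912,
    "outputCharacters": 84,
    "outputTokens": 1914,
    "totalTokens": 880826,
}


def tokens_per_char(tokens: int, chars: int) -> float:
    """Return tokens per character, or 0.0 when there are no characters."""
    return tokens / chars if chars else 0.0


def looks_inflated(tokens: int, chars: int, max_ratio: float = 1.0) -> bool:
    """Flag a count whose tokens-per-character ratio exceeds max_ratio.

    max_ratio=1.0 is an assumed rule of thumb, not a documented limit.
    """
    return tokens_per_char(tokens, chars) > max_ratio


input_ratio = tokens_per_char(usage["inputTokens"], usage["inputCharacters"])
output_ratio = tokens_per_char(usage["outputTokens"], usage["outputCharacters"])

print(f"input:  {input_ratio:.1f} tokens/char")   # input:  40.2 tokens/char
print(f"output: {output_ratio:.1f} tokens/char")  # output: 22.8 tokens/char
print("input inflated:", looks_inflated(usage["inputTokens"], usage["inputCharacters"]))
```

Note that the output side is off as well: 1914 tokens for 84 characters is roughly 22.8 tokens per character, so the inflation is not limited to the prompt.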

In addition, the model sometimes prints raw DeepSeek tool-calling markers in the response content, such as:

<｜DSML｜tool_calls

This makes me suspect that the DeepSeek tool-calling format may not be fully parsed or converted correctly by the NVIDIA compatibility layer. It may also be related to the inflated token usage if tool call chunks, hidden tool-call markup, or streaming chunks are being counted repeatedly.
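Until this is confirmed one way or the other, a client-side guard can at least detect the leak. The sketch below is a workaround under stated assumptions, not a fix: it searches the response content for the marker prefix exactly as it appears in the leaked text above and splits the content there. The function name is hypothetical, and I am not assuming anything about the rest of the DSML format, so the leaked tail is returned unparsed.

```python
# Marker prefix copied from the leaked text quoted above. The full DSML
# grammar is not known here, so only this prefix is matched.
DSML_MARKER = "<｜DSML｜"


def split_leaked_content(content: str) -> tuple[str, str]:
    """Split content into (clean_text, leaked_tail) at the first DSML marker.

    Returns (content, "") when no marker is present. The leaked tail is
    returned as-is; no attempt is made to parse it into tool calls.
    """
    idx = content.find(DSML_MARKER)
    if idx == -1:
        return content, ""
    return content[:idx].rstrip(), content[idx:]


clean, leaked = split_leaked_content("Let me look that up.\n<｜DSML｜tool_calls")
if leaked:
    print("DSML markup leaked into content:", repr(leaked))
print("visible text:", repr(clean))
```

Logging the leaked tail alongside the usage field for the same response might also help show whether the leaks and the inflated counts occur together.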

Could you please help confirm:

1. Is the usage.inputTokens value returned by the NVIDIA API expected to represent the actual billable token count?
2. Is NVIDIA currently parsing DeepSeek DSML tool calls into OpenAI-compatible tool_calls fields?
3. Could this be a bug in token accounting, especially when tools or streaming are enabled?
4. Is there any recommended request format to avoid DSML markers leaking into content?

This issue makes it difficult to estimate cost and context usage accurately.

Thanks.

firejack200 · April 30, 2026, 6:02am

Oh, it doesn’t even work past 262144 tokens now, at least if my 484000-token dataset and the resulting error are to be believed.
