Hosted Integrate /v1/responses returns 404 across multiple models while /v1/models and /v1/chat/completions work

I ran a hosted compatibility repro against https://integrate.api.nvidia.com/v1 and collected the artifacts in a repro repo.

This question is narrower than “Codex support” or “NIM support” in general: I’m trying to pin down the current hosted Integrate contract for /v1/responses.
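
For concreteness, this is the shape of the call in question. It is a minimal sketch, assuming an OpenAI-style Responses payload and an API key in a hypothetical NVIDIA_API_KEY environment variable; the exact request bodies are in the repro artifacts.

```python
# Minimal /v1/responses probe against hosted Integrate.
# Assumptions: OpenAI-style Responses body, key in NVIDIA_API_KEY (name illustrative).
import os
import requests

resp = requests.post(
    "https://integrate.api.nvidia.com/v1/responses",
    headers={"Authorization": f"Bearer {os.environ['NVIDIA_API_KEY']}"},
    json={"model": "meta/llama-3.3-70b-instruct", "input": "ping"},
)
print(resp.status_code)  # 404 in the repro
print(resp.text)         # "404 page not found" in the repro
```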

Exact results:

  • direct GET /v1/models returned 200
  • direct POST /v1/chat/completions returned 200
  • direct POST /v1/responses returned 404 with a plain “404 page not found” body
  • the same /v1/responses 404 also appeared in a widened six-model matrix on the same hosted Integrate surface, spanning NVIDIA, Meta, and Mistral instruct models (see the matrix probe sketch after this list):
    • nvidia/nemotron-3-super-120b-a12b
    • nvidia/llama-3.3-nemotron-super-49b-v1
    • nvidia/llama-3.1-nemotron-70b-instruct
    • nvidia/nemotron-mini-4b-instruct
    • meta/llama-3.3-70b-instruct
    • mistralai/mistral-large
  • NVIDIA’s current LLM NIM release notes describe experimental Responses API support, which is why I’m asking whether this hosted result is expected
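
For reference, here is a sketch of the widened matrix probe, under the same assumptions as above (hypothetical NVIDIA_API_KEY, minimal OpenAI-style payloads); the actual per-model request/response captures live in the repro artifacts.

```python
# Sketch of the six-model matrix probe: for each model, POST the same
# minimal bodies to /v1/chat/completions (200 in the repro) and
# /v1/responses (404 in the repro). Payload shapes are assumptions,
# not necessarily the exact repro requests.
import os
import requests

BASE = "https://integrate.api.nvidia.com/v1"
HEADERS = {"Authorization": f"Bearer {os.environ['NVIDIA_API_KEY']}"}

MODELS = [
    "nvidia/nemotron-3-super-120b-a12b",
    "nvidia/llama-3.3-nemotron-super-49b-v1",
    "nvidia/llama-3.1-nemotron-70b-instruct",
    "nvidia/nemotron-mini-4b-instruct",
    "meta/llama-3.3-70b-instruct",
    "mistralai/mistral-large",
]

# /v1/models takes no model id, so check it once up front.
print("GET /v1/models ->", requests.get(f"{BASE}/models", headers=HEADERS).status_code)

for model in MODELS:
    chat = requests.post(
        f"{BASE}/chat/completions",
        headers=HEADERS,
        json={
            "model": model,
            "messages": [{"role": "user", "content": "ping"}],
            "max_tokens": 8,
        },
    )
    responses = requests.post(
        f"{BASE}/responses",
        headers=HEADERS,
        json={"model": model, "input": "ping"},
    )
    print(f"{model}: chat/completions={chat.status_code} responses={responses.status_code}")
```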

The narrow questions are:

  • is /v1/responses expected to work today on the hosted Integrate surface?
  • if yes, is it limited to a different subset of models, accounts, or endpoint variants than the ones tested here?

If it helps, I can also provide the exact per-model request/response captures from the repro repo artifacts.