Model returns "<|return|>" and missing reasoning_content with openai/gpt-oss-120b

huxiangkun2003 · October 4, 2025, 4:35pm

Model outputs contain the literal control token string <|return|> inside returned content.

The returned completion object does not include a reasoning_content field despite reasoning_effort being set.

When reasoning_effort is provided, the response should include a reasoning_content field (or documented equivalent) containing structured reasoning or chain-of-thought, or an explicit documented indication that reasoning output is unavailable.

No internal tokens like <|return|> should appear in user-visible output.

from openai import OpenAI

client = OpenAI(
  base_url="https://integrate.api.nvidia.com/v1",
  api_key=""
)

completion = client.chat.completions.create(
  model="openai/gpt-oss-120b",
  reasoning_effort="medium",
  messages=[{"role":"user","content":"who are you?"}],
  temperature=0.6,
  top_p=0.9,
  max_tokens=4096,
  response_format={"type": "json_object"}
)

Topic		Replies	Views
Build intelligent chatbots, enhance search engines, and develop educational tools with Llama 3-ChatQA Technical Blog	1	112	June 26, 2024
Hope, dream NVIDIA Nemotron	0	247	February 29, 2024
Generate Text Responses from Visual and Text Inputs with Google's New PaliGemma Model Technical Blog	1	168	May 15, 2024
Noob, It doesn't work for some reason. need help for clarity Models	1	20	December 18, 2025
Open AI API Compatible Models llama-31-405b-instruct , llama	1	219	November 14, 2025
Issues on deepseek models, Models	1	66	November 14, 2025
Deepseek: Extract Reasoning Only NVIDIA Nemotron nim	1	539	February 18, 2025
ChatWithRTX sentence_tranformer help NVIDIA Nemotron	0	243	March 28, 2024
LLM based Multimodal AI w/ Azure Open AI & NVIDIA Jetson Jetson Projects	0	554	August 22, 2023
AI Chatbot General Topics and Other SDKs	0	446	February 1, 2022

Model returns "<|return|>" and missing reasoning_content with openai/gpt-oss-120b

Related topics