I see vids/GIFs of GPT OSS being agentic, using the CLI, reading/writing files, etc.
So in VS Codium, I tried the Cline extension. I also tried the Void IDE, a VSC fork. Neither seem to be able to decipher GPT OSS’s tool use output attempts for some reason.
This is the command I’m using:
docker run -it --gpus all --ipc=host --platform "linux/arm64" \
--ulimit memlock=-1 --ulimit stack=67108864 -p 8000:8000 \
nvcr.io/nvidia/vllm:25.12.post1-py3 vllm serve "openai/gpt-oss-120b" \
--gpu-memory-utilization=0.8 --enable-auto-tool-choice \
--tool-call-parser openai
I tried without the two *tool* arguments, as well as ulimit arguments.
The speed and chatting seem to work perfectly fine, but the error message is about Harmony format (which GPT OSS uses) for tool use, and unsure if anyone has any open source solution for this, whether it be a VSC extension or a fork, or something else.