For loacl Agent, QWEN3.6 35B OR QWEN3-CODER-NEXT?

RoldyCHN · April 24, 2026, 9:29am

Hey guys, I tested both of them and feel Qwen3codernext is more useful, whats your thoughts? thanks for your sharing.

DannyTup · April 24, 2026, 10:13am

I started running some benchmarks, and Qwen 3.6 seems to come out on top on those:

README.md

main

# DGX Spark Evals

Some basic evals run on various models that fit on a single DGX Spark.

## Leaderboard

<!-- LEADERBOARD -->
| name | AgentBench | bfcl |
| --- | ---: | ---: |
| [Qwen3.6 27B](results/qwen36-27b/README.md) | <u>**59.3%**</u><br>*2h 41m* | **77.3%**<br>*1h 13m* |
| [Qwen3.6 27B FP8](results/qwen36-27b-fp8/README.md) | **58.7%**<br>*1h 44m* | **75.3%**<br>*37m 26s* |
| [Qwen3.6 35B-A3B FP8](results/qwen36-35b-a3b-fp8/README.md) | **55.3%**<br>*2h 9m* | <u>**78.0%**</u><br>*17m 3s* |
| [Qwen3.6 35B-A3B](results/qwen36-35b-a3b/README.md) | **52.7%**<br>*2h 34m* | <u>**78.0%**</u><br>*25m 5s* |
| [Gemma4 31B](results/gemma4-31b/README.md) | **45.3%**<br>*2h 4m* | **77.3%**<br>*19m 49s* |
| [Qwen3 Coder Next FP8](results/qwen3-coder-next-fp8/README.md) | **46.0%**<br>*32m 49s* |  |
| [Gemma4 26B-A4B](results/gemma4-26b-a4b/README.md) | **44.0%**<br>*2h 16m* |  |
| [Qwen3.6 35B-A3B NVFP4](results/qwen36-35b-a3b-nvfp4/README.md) |  |  |
<!-- /LEADERBOARD -->

## Running Evals

This file has been truncated. show original

I haven’t done much real-world testing though, because it’s really hard to compare that way and I wanted to answer the same question you had.

I’m currently running the 27B dense model with dflash to see what different it makes to the speed (in theory the quality should be the same). I was surprised that in the benchmarks I’ve run, the dense model hasn’t actually been as much slower as I’d expected (presumably it’s more efficient and getting the answer and doesn’t go in loops or fail tool calls as often).

giles8 · April 24, 2026, 10:14am

I find myself using one, then finding it is getting stuck in a rut, and then swapping out for the other to fix a problem, then swapping back again once that particular problem is fixed. It is increasingly looking like I need a cluster to run more than one at a time, and then use a smaller model to act as intermediary to the other coding models…

Problem is - things are moving so fast, new models seem to arrive every few days, and runtime recipes with different options seem to float around every few hours - it is difficult to keep track, especially as I’m pretty new to this.

azampatti · April 24, 2026, 9:51pm

I asked this question to myself multiple times. For my coding workflows, both work well and perform almost identically. I found myself going back and forth between these two and 3.5-122B-A10B (with all the optimizations from @Albond and @whpthomas’ recipes it performs very well.

Sometimes “coder-next” get stuck in a few things and 122B fixes it at once.

Sometimes 122B loops itself and 35B-A3B-FP8 solves that at once.

I guess the real answer is “its depends” :) I’m still defaulting back to 35B mosts of the times. Peforms very well and doesn’t make ugly mistakes too often.

Topic		Replies	Views
Qwen3.6-27B is out! DGX Spark / GB10 agentic-ai	292	25592	June 14, 2026
Qwen/Qwen3.6-35B-A3B (and FP8) has landed DGX Spark / GB10 agentic-ai	308	26364	June 9, 2026
Fastest Qwen 3.5 122B Int4 recipe on DGX Spark tested and published on Spark-Arena DGX Spark / GB10 llama	59	2694	June 3, 2026
Implementation Guide: DGX Spark with Qwen3.5-35B-A3B via llama.cpp for Claude Code DGX Spark / GB10 Projects llama , agentic-ai	3	1717	April 2, 2026
Best models/configurations for agentic coding with DGX (Nvidia/Asus/Dell/Lenovo/MSI) DGX Spark / GB10 agentic-ai	2	1298	April 17, 2026
HOW-TO: Run Qwen3-Coder-Next on Spark DGX Spark / GB10 llama	92	10057	March 24, 2026
Qwen3.6-27B-Dflash link DGX Spark / GB10 Projects	20	4182	April 29, 2026
Ok, I've fully bought into the hype now DGX Spark / GB10	9	933	March 2, 2026
Success with QuantTrio/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2-AWQ DGX Spark / GB10	1	2006	April 2, 2026
Collecting eval results for Spark-sized quants of models DGX Spark / GB10 benchmarks , llm	50	1905	May 11, 2026

For loacl Agent, QWEN3.6 35B OR QWEN3-CODER-NEXT?

Related topics