5 releases, +0.3%: open-weight coding hit a wall

Addy Crezee 1 Jun 2026 1 min read

Man shocking a CRT monitor with defibrillator paddles in a dark room, sparks flying, illustrating stalled open-weight coding

swe-bench pro across open-weight models hasn't moved in 10 weeks

mar 27th - jun 1st. 5 major open-weight releases. net progress on swe-bench pro: +0.3%. scores swung -5% and back within that window

looks like open-weight labs are optimizing hard for agent tasks and multimodality – but not for coding anymore

Table comparing five open-weight models on SWE-bench Pro from March to June 2026, scores clustered between 55.4 and 59

Introducing MiniMax M3: The First Open-Weights Model to Combine Three Frontier Capabilities

- Coding & Agentic Frontier: 59.0% SWE-Bench Pro, 66.0% Terminal Bench 2.1, 34.8% SWE-fficiency, 28.8% KernelBench Hard, 74.2% MCP Atlas
- MiniMax Sparse Attention scales context to 1M
-… pic.twitter.com/TF891iJukF
— MiniMax (official) (@MiniMax_AI) June 1, 2026

laguna s 2.1 vs hy3 vs inkling vs deepseek v4 pro max

@poolsideai released laguna s 2.1 on july 21 – a new open-weights agentic coding model, full weights on hugging face. key facts: • 118b total / 8b active moe, 1m context,

24 Jul 2026 3 min read

Comic-style chef in a neon kitchen holding a plate labeled "rosted fennel" and "thryme" — Qwen's misspelled menu

qwen image 3.0 vs gpt image 2

@Alibaba_Qwen just dropped qwen-image-3.0 – the third gen of their image model. the whole pitch is going from "good-looking" to "useful" (their

23 Jul 2026 2 min read

Comic-style architects study glowing blueprints beside neon tower holograms labeled Qwen, Kimi and Gemini in a graffiti studio

gemini 3.6 flash vs qwen3-max vs gpt 5.6 sol vs kimi k3

@OfficialLoganK and @GoogleAIStudio recently shipped gemini 3.6 flash – their new workhorse. key facts: • 17% fewer output tokens than 3.5 flash on the artificial analysis index, up to 65%

22 Jul 2026 3 min read

Stay in the loop

Read next

laguna s 2.1 vs hy3 vs inkling vs deepseek v4 pro max

qwen image 3.0 vs gpt image 2

gemini 3.6 flash vs qwen3-max vs gpt 5.6 sol vs kimi k3