gpt-5.5 is so much better than opus 4.7 that it can challenge mythos

Nick Trenkler 23 Apr 2026 1 min read

Woman in a police lineup with "GPT-5.5" highlighted in neon green above Opus 4.7, Gemini 3.1, GPT-5.4 — public model on top.

- top-1 on artificial analysis intelligence index, above opus 4.7, gemini 3.1, and gpt-5.4
- beats opus 4.7 on almost every benchmark. one exception: swe-bench pro (57.7% vs opus 4.7's 64.3%)
- edges out mythos on terminal bench 2.0 by 0.7pp
- completed a uk aisi cyber attack simulation end-to-end: 32 steps, ~20 hours for a human expert. but only 1 out of 10 attempts. mythos: 3/10 on the same sim.

strong as mythos, public as gpt

Bar chart comparing GPT-5.5 and Claude Mythos: terminal-bench 82.7 vs 82.0, OSWorld 78.7 vs 79.6, BrowseComp 84.4 vs 86.9, CyberGym 81.8 vs 83.1.

Introducing GPT-5.5

A new class of intelligence for real work and powering agents, built to understand complex goals, use tools, check its work, and carry more tasks through to completion. It marks a new way of getting computer work done.

Now available in ChatGPT and Codex. pic.twitter.com/rPLTk99ZH5
— OpenAI (@OpenAI) April 23, 2026

gpt 5.6 sol pro vs claude fable 5 vs grok 4.5 vs glm 5.2

today @OpenAI released the gpt-5.6 family – sol (flagship), terra, luna. key facts: • sol is the new flagship, priced $5/m input and $30/m output • openai claims "

10 Jul 2026 2 min read

xAI worker in jumpsuit between neon bedroom and kitchen blueprints, messy garage with dead code newspapers

grok 4.5 vs fable 5 vs gpt 5.5 vs glm 5.2

today @SpaceXAI released grok 4.5 – a model built around coding, agentic tasks and knowledge work. key facts: • trained across tens of thousands of nvidia gb300 gpus, with rl over

9 Jul 2026 2 min read

Hand holding a phone showing an @-mention field building a woman's face from a photo grid at 60%

muse image vs gpt image 2 vs nano banana 2 vs reve 2.0

@Meta shipped muse image – an image model inside meta ai. the headline feature: @-mention anyone's instagram and generate images of them from their public photos. opt-out exists

8 Jul 2026 1 min read

Stay in the loop

Read next

gpt 5.6 sol pro vs claude fable 5 vs grok 4.5 vs glm 5.2

grok 4.5 vs fable 5 vs gpt 5.5 vs glm 5.2

muse image vs gpt image 2 vs nano banana 2 vs reve 2.0