hermes cut startup time 63% and just beat codex on speed

Nick Trenkler 25 May 2026 1 min read

Armored woman kneels beside a crashing server labeled CODEX with sparks flying, timer reads 258ms on a screen behind her

hermes is now officially faster than codex. here's how they did it

hermes was losing to codex 6–5 across 11 benchmark tasks measuring real-world cli speed – things like file editing, codebase exploration, shell pipelines, and multi-turn sessions

teknium shipped 3 fixes:

• bitwarden disk cache – the app was calling a password manager api on every single startup. added a local cache so it only fetches once
• lazy model loading – a huge list of ai providers was loading into memory at startup even when not needed. now it only loads when actually used
• config file dedupe – the same config file was being read twice on startup for no reason. merged into one read
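for a feel of what those three patterns look like in python, here's a minimal sketch. it is not taken from the hermes pr: every name in it (`cached_secret`, `provider_registry`, `load_config`, the one-hour ttl, the cache file layout) is made up for illustration, and the real fixes may be structured quite differently.

```python
import functools
import json
import time
from pathlib import Path

# hypothetical names throughout; this is not hermes code.


def cached_secret(cache_path: Path, fetch, ttl_seconds: float = 3600.0) -> str:
    """disk cache: only call fetch() when there is no fresh entry on disk."""
    if cache_path.exists():
        entry = json.loads(cache_path.read_text())
        if time.time() - entry["saved_at"] < ttl_seconds:
            return entry["value"]  # cache hit, no network call
    value = fetch()  # the slow call, e.g. a password manager api
    # note: a real secret cache would need tight file permissions or encryption
    cache_path.write_text(json.dumps({"value": value, "saved_at": time.time()}))
    return value


@functools.cache
def provider_registry() -> dict:
    """lazy loading: build the big table on first use, not at import time."""
    return {"example-provider": {"models": ["example-model"]}}


@functools.cache
def load_config(path: str) -> dict:
    """dedupe: every caller shares one parsed copy, so the file is read once."""
    return json.loads(Path(path).read_text())
```

the common idea: work that used to run on every startup now runs at most once, and only if something actually asks for it.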

total: 701ms → 258ms. that's -63% startup time. hermes now wins 6–5. the scoreboard just flipped
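quick check on the math, using only the two numbers above (the function name is just for illustration):

```python
def percent_change(before_ms: float, after_ms: float) -> float:
    """relative change from before to after, in percent (negative = faster)."""
    return (after_ms - before_ms) / before_ms * 100


before_ms, after_ms = 701, 258

print(f"{percent_change(before_ms, after_ms):.1f}%")  # -63.2%
print(f"{before_ms / after_ms:.2f}x")                 # 2.72x
```

so -63% startup time is the same thing as starting roughly 2.7x faster.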

nous research vs openai. the war has just started

Bar chart comparing Hermes and Codex across 11 CLI tasks: Hermes wins on mt_inspect_edit (9.08s vs 17.20s) and most multi-step operations

Some new improvements to performance just went in.

Python gets a bad rap for performance but we ain't looking too shabby against a trillion dollar co's rust codebase, beating codex at most multi-turn tasks we benchmarked (mt stands for multiturn)

PR: https://t.co/nNJz3EPAPA pic.twitter.com/6z7vXpLA9X
— Teknium 🪽 (@Teknium) May 25, 2026
