tencent just open-sourced a translation model that fits in 440mb and beats models 500x its size
how? they compressed a 3.3gb model by storing weights in basically 1 bit instead of 16
most quantization stops at 4-bit. they pushed to 1.25-bit – and somehow didn't lose accuracy
runs fully offline on a normal android phone. no api calls. no data leaving the device. 33 languages. tibetan and mongolian included
if you're building:
• a travel or language learning app — drop in offline translation with no per-call cost
• a chat or email client — translate messages in the background across any app
• anything for low-connectivity markets — it works without internet, forever
• a privacy-first product — user text never touches a server
We're open-sourcing Hy-MT1.5-1.8B-1.25bit — a 440MB translation model that runs fully offline on your phone, supports 33 languages, and outperforms Google Translate.
— Tencent Hy (@TencentHunyuan) April 29, 2026
At 1.8B parameters, it matches commercial translation APIs and 235B-scale models on standard benchmarks. By… pic.twitter.com/4kgWldn4p1
this is crazy

Nick Trenkler