Testing GLM-5.2 on a Real Rails Upgrade | Mario Alberto Chávez
28-Jun-2026 7
Every few weeks, a new open-source model appears with claims that it matches the frontier. I’ve watched this pattern long enough to be skeptical — the charts are fine for YouTube content, but they don’t tell you how a model behaves on a codebase that hasn’t been touched in years, with deprecated gems, a Node-dependent asset pipeline, and data that needs to move between databases.
When GLM-5.2 from Z.ai started circulating on X last week, the comparisons were to Claude Opus 4.8 — specifically on long-horizon coding benchmarks like Terminal-Bench 2.1, where GLM-5.2 scores 81.0 against Opus 4.8’s 85.0, and FrontierSWE, where it trails by about one point. For a fully open-weight model under an MIT license, that’s a meaningful result. Whether it translates to real work is a different question.
Testing GLM-5.2 on a Real Rails Upgrade | Mario Alberto Chávez #ruby #rubydeveloper #rubyonrails #Testing #GLM-5.2 #Rails #Upgrade #Mario #Alberto #Chávez #testing #upgrade https://rubyonrails.ba/link/testing-glm-5-2-on-a-real-rails-upgrade-mario-alberto-chavez