22:36
2026-06-29
gist.github.com
large-language-models
Opus vs GLM-5.2 in a coding-agent pipeline: paired-run findings
A controlled A/B test comparing Claude Opus and GLM-5.2 in a coding-agent pipeline revealed qualitative differences in engineering behavior. Using the same paper-implementation pipeline across 10 repo…