12:08
2026-05-01
arizenai.com
artificial-intelligence
Seven Models Judged Each Other. The Mistakes Were the Most Useful Part.
Seven AI models were each given the same prompt in blind OpenCode sessions and asked to rank all seven, including themselves, on fixed factors. The mistakes โ silent model substitutions, hallucinated โฆ