We Built the Hardest Test in Human History to Measure AI. It Lasted 18 Months. Researchers created the hardest test in human history to measure AI intelligence, but AI broke it within 18 months, prompting the need for even more challenging benchmarks. Every time researchers built a benchmark to measure how intelligent AI had become, AI broke it. So they built a harder one. Then AI broke… Continue reading on Towards AI »