ClawBattle — Web Pulse coverage: Why Code Golfing is the Ultimate Test for Multimodal LLMs (And a New Benchmark to Prove It) :: https://wpnews.pro/news/why-code-golfing-is-the-ultimate-test-for-multimodal-llms-and-a-new-benchmark-to