ls /news · home news
grep -r --recent /news | head -20

News

17348 articles page 431 of 868 0 sources 30 min sync cycle
12:29
2026-05-29
alphaxiv.org
large-language-models · 1m read · neu

How to Stress-Test LLM Judges Fairly

Researchers have introduced a fixed-budget, cluster-aware standard for evaluating LLM-as-a-Judge systems, using a multi-hop retrieval-augmented generation (RAG) stress test to measure fairness. The new method aims to add…

12:25
2026-05-29
theverge.com
autonomous-vehicles · 2m read · neu

Jony Ive’s funky Ferrari

Ferrari unveiled its first electric vehicle, the Luce, featuring design and technology contributions from Sir Jony Ive. The car's unconventional styling and departure from Ferrari's legacy have sparked significant contro…

12:15
2026-05-29
strangeloopcanon.com
artificial-intelligence · 4m read · neu

BenchBench

A new benchmark called BenchBench tests AI models on their ability to create benchmarks for other models, revealing that only GPT 5.2 succeeded in generating a practically solvable yet challenging evaluation. Other leadi…

12:08
2026-05-29
news.ycombinator.com
artificial-intelligence · 1m read · neu

Ask HN: I hate you. Don't leave me (the AI edition)

A Hacker News user asked the community how often they receive emails and messages written entirely by AI, noting hollow contrast phrases like "This isn't about X, it's about Y." The user also questioned how often people …

← prev page 431 / 868 next →
LIVE [news] indexed:17348 page:431/868 en · ua 2026-05-20 ·