04:00
2026-05-29
arxiv.org
large-language-models
Review Arcade: On the Human Alignment and Gameability of LLM Reviews
A new study from the University of Hamburg found that LLM-generated peer reviews for scientific papers show only limited alignment with human evaluations, with alignment varying significantly across dโฆ