16:26
2026-06-14
github.com
ai-safety
Show HN: I made a small helper for checking model-graded answers
A PhD student released CMG, an open-source tool that audits LLM-based judges by requiring them to back each verdict with explicit claims tied to evidence, flagging untrustworthy decisions without usinβ¦