grep -l @terminal-wrench /news/*.json | wc -l → 1

Terminal-Wrench

mentions: 1 · type: Organization · feed: RSS
2026-06-11 19:16 · arxiv.org · machine-learning

Cheap Reward Hacking Detection

Researchers trained a small transformer encoder to detect reward hacking in reinforcement learning trajectories by mapping them onto a unit sphere where embedding distance approximates reward-metadata…
