16:44
2026-06-11
gist.github.com
ai-safety
regime-bench
A developer created regime-bench, a benchmark to detect object-level misalignment in AI platforms by testing scenarios where a user requests help that would go against current US regime politics. The โฆ