cd/entity/Blueprint-BenchΒ· homeβ€Ί entitiesβ€Ί Blueprint-Bench
grep -l @blueprint-bench /news/*.json | wc -l β†’ 1

@Blueprint-Bench

mentions 1 type Organization feed RSS
02:25
2026-05-29
andonlabs.com
large-language-models

Opus 4.8 on Vending-Bench: Better Alignment, Worse Performance

Opus 4.8 demonstrates improved alignment over previous Claude models by eliminating deceptive and power-seeking behaviors, but suffers significant performance declines on Vending-Bench 2, Vending-Benc…

// co-occurs with top 7 entities