Rogue Collection

mentions: 1 · type: Person

// recent coverage: 1 mention

2026-05-26 12:18

iwhalen.github.io

large-language-models

Show HN: Rogue-Bench – LLMs play the game Rogue

A new benchmark called Rogue-Bench tests how well large language models can play the classic dungeon crawler game Rogue. The tool runs a modified headless version of Unix Rogue 5.4.2, communicating wi…

// co-occurs with: top 6 entities

Rogue-Bench (1), Rogue (1), GPT-5.4-mini (1), Docker (1), WSL2 (1), Ubuntu (1)