04:00
2026-06-04
arxiv.org
ai-safety
RUBAS: Rubric-Based Reinforcement Learning for Agent Safety
Researchers have developed RUBAS, a rubric-based reinforcement learning framework designed to improve safety in large language model (LLM) agents that execute real-world tasks. The framework evaluates…