04:00
2026-05-29
arxiv.org
ai-safety
Benchmarking Open-Source Safety Guard Models: A Comprehensive Evaluation
A comprehensive evaluation of 14 open-source safety guard models on a benchmark of 79,331 samples found that Qwen Guard, a 4-billion-parameter model, achieved the highest recall at 83.97%, while large…