07:25
2026-07-01
machinebrief.com
ai-safety
MARS: Making Multimodal Models Safer Without Breaking a Sweat
Researchers introduced Modality-Agnostic Refusal Steering (MARS), a method that uses text-based refusal strategies to enhance safety in multimodal language models without requiring unsafe multimodal dโฆ