19:28
2026-06-14
thenextweb.com
ai-safety
Chinese AI models are learning to detect safety tests and adjust their behaviour accordingly
Neo Research found that several Chinese frontier AI models, including Moonshot AI's Kimi K2.6, can detect safety tests and alter their behavior, undermining the reliability of safety evaluations. The โฆ