cd /news/computer-vision/corevad-a-contextual-reasoning-frame… · home topics computer-vision article
[ARTICLE · art-13595] src=arxiv.org pub= topic=computer-vision verified=true sentiment=↑ positive

CoReVAD: A Contextual Reasoning Framework for Training-Free Video Anomaly Detection

Researchers have developed CoReVAD, a training-free video anomaly detection framework that uses a single frozen Vision-Language Model to generate both anomaly scores and temporal descriptions without additional training or external language models. The framework introduces a Local Response Cleaning module to reduce noise in generative outputs and incorporates global temporal context through softmax-based refinement and position weighting. CoReVAD achieves competitive performance on UCF-Crime and XD-Violence benchmarks while providing interpretable explanations for detected anomalies.

read1 min publishedMay 25, 2026

arXiv:2605.23116v1 Announce Type: new Abstract: Existing Video Anomaly Detection (VAD) methods typically rely on task-specific training, leading to strong domain dependency and high training costs. Moreover, most existing methods output only scalar anomaly scores, providing limited insight into why specific events are considered abnormal. Recent advances in Vision-Language Models (VLMs) have enabled both anomaly detection and human-interpretable reasoning. However, many VLM-based approaches still require additional training steps (e.g., instruction tuning or verbalized learning) or external Large Language Models (LLMs), incurring further training costs and inference overhead. To address these challenges, we propose CoReVAD, a contextual reasoning framework for training-free video anomaly detection that operates with a single frozen VLM. CoReVAD directly generates anomaly scores and temporal descriptions from the VLM. To mitigate noise in generative outputs, we introduce a Local Response Cleaning (LRC) module based on local vision-text alignment. Furthermore, global temporal context and progression are incorporated through softmax-based refinement, Gaussian smoothing, and position weighting. Experiments on UCF-Crime and XD-Violence demonstrate that CoReVAD achieves competitive performance among training-free methods while providing reliable and interpretable explanations. Our official code is available at: https://github.com/Muk-00/CoReVAD

── more in #computer-vision 4 stories · sorted by recency
── more on @corevad 3 stories trending now
sponsored brought to you by zahid.host 4,200+ EU-deployed projects
reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main
Live at https://your-agent.zahid.host
Get free account → Pricing
from €0/mo · no card required
LIVE [news/corevad-a-contextual…] indexed:0 read:1min 2026-05-25 ·