16:32
2026-06-16
arxiv.org
large-language-models
Snyk VulnBench JavaScript 1.0: Can LLMs Find the Same Bugs Twice?
A new benchmark, Snyk VulnBench JavaScript 1.0, reveals that large language models (LLMs) produce inconsistent vulnerability findings across repeated scans of the same JavaScript code, with 80 of 161 …