
LaViSA: A Language and Vision Structural Ambiguity Benchmark

Researchers introduced LaViSA, a benchmark to evaluate vision-language models' ability to resolve structural ambiguity using visual scenes. Tests on proprietary and open-source models showed they can partially leverage visual cues but struggle with certain ambiguity types and subtle semantic distinctions.

Published Jun 19, 2026 · 1 min read

arXiv:2606.19552v1 Abstract: Structural ambiguity arises when a single sentence admits multiple valid interpretations due to its syntactic structure, posing a fundamental challenge for language understanding. Visual scenes serve as useful cues for resolving such ambiguity, and Vision and Language Models (VLMs) need to be capable of deriving possible semantic interpretations from visual scenes. We introduce Language and Vision Structural Ambiguity (LaViSA), a benchmark designed to evaluate the ability of VLMs to resolve structural ambiguity by leveraging visual scenes. LaViSA consists of ambiguous sentences, their disambiguated sentences, and images corresponding to these disambiguated sentences, across seven ambiguity categories. Using LaViSA, we conduct a comprehensive evaluation of diverse VLMs, including both proprietary and open-source models with varying parameter scales and reasoning capabilities. Experimental results show that although recent VLMs can leverage visual scenes to resolve structural ambiguity to some extent, they still struggle with certain ambiguity types and visually subtle semantic distinctions, indicating remaining limitations in resolving structural ambiguity using visual scenes.
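To make the benchmark's described composition concrete, here is a minimal Python sketch of how one item (an ambiguous sentence, its disambiguated readings, and one image per reading) might be represented and scored per category. Everything in it is an assumption for illustration: the abstract does not specify the data format, the task protocol, or the category names, so `AmbiguityItem`, `per_category_accuracy`, the `pp_attachment` label, the image file names, and the "pick the reading that matches the image" setup are all hypothetical.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Callable, Dict, Sequence, Tuple


@dataclass(frozen=True)
class AmbiguityItem:
    """Hypothetical record layout; not the benchmark's actual schema."""
    category: str               # assumed label for the ambiguity category
    ambiguous: str              # the structurally ambiguous sentence
    readings: Tuple[str, ...]   # disambiguated sentences
    images: Tuple[str, ...]     # one image path per reading, same order


# Assumed model interface: given the ambiguous sentence, the candidate
# readings, and an image, return the index of the reading the image depicts.
Predictor = Callable[[str, Sequence[str], str], int]


def per_category_accuracy(
    items: Sequence[AmbiguityItem], predict: Predictor
) -> Dict[str, float]:
    """Score each (item, image) pair and report accuracy per category."""
    correct: Dict[str, int] = defaultdict(int)
    total: Dict[str, int] = defaultdict(int)
    for item in items:
        for gold_index, image in enumerate(item.images):
            total[item.category] += 1
            if predict(item.ambiguous, item.readings, image) == gold_index:
                correct[item.category] += 1
    return {cat: correct[cat] / total[cat] for cat in total}


# Illustrative item built from a textbook attachment ambiguity
# (not taken from the benchmark itself).
items = [
    AmbiguityItem(
        category="pp_attachment",
        ambiguous="I saw the man with the telescope.",
        readings=(
            "Using the telescope, I saw the man.",
            "I saw the man who was holding the telescope.",
        ),
        images=("viewer_has_telescope.png", "man_has_telescope.png"),
    )
]


def always_first(sentence: str, readings: Sequence[str], image: str) -> int:
    """Trivial baseline that ignores the image entirely."""
    return 0


scores = per_category_accuracy(items, always_first)
print(scores)
```

A baseline that ignores the image, like `always_first`, is a useful reference point under this assumed setup: any model that truly uses the visual scene should beat it, and reporting accuracy per category is one way to surface the category-specific weaknesses the abstract mentions.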
