cd /news/computer-vision/language-instructed-vision-embedding… · home topics computer-vision article
[ARTICLE · art-33503] src=arxiv.org ↗ pub= topic=computer-vision verified=true sentiment=↑ positive

Language-Instructed Vision Embeddings for Controllable and Generalizable Perception

Researchers introduced Language-Instructed Vision Embeddings (LIVE), a method that uses language to dynamically guide vision encoders, eliminating the need for task-specific retraining. LIVE reduced visual hallucinations by 34 points on MMVP and outperformed larger vision-language models on visual question answering, demonstrating controllable and generalizable perception.

read1 min views4 publishedJun 19, 2026

arXiv:2606.19584v1 Announce Type: new Abstract: Vision foundation models are typically trained as static feature extractors, placing the burden of task adaptation onto large downstream models. We propose an alternative paradigm: instead of solely feeding visual features into language models, we use language itself to dynamically guide the vision encoder. Our method, Language-Instructed Vision Embeddings (LIVE), leverages language as high-level guidance to produce task-centric embeddings at inference time, removing the need for task-specific retraining. This enables the encoder to focus on contextually relevant aspects of the input, yielding more controllable and generalizable representations. Empirically, LIVE reduces visual hallucinations (+34 points on MMVP), surpasses vision-language models with orders of magnitude more parameters on visual question answering, and generalizes to unseen instructions and tasks -- offering a direct path toward adaptive, instruction-driven visual intelligence.

── more in #computer-vision 4 stories · sorted by recency
── more on @live 3 stories trending now
sponsored brought to you by zahid.host 4,200+ EU-deployed projects
reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main
Live at https://your-agent.zahid.host
Get free account → Pricing
from €0/mo · no card required
LIVE [news/language-instructed-…] indexed:0 read:1min 2026-06-19 ·