04:00
2026-06-18
arxiv.org
artificial-intelligence
VISUALSKILL: Multimodal Skills for Computer-Use Agents
Researchers introduced VISUALSKILL, a multimodal skill library for computer-use agents that combines text and visual figures to improve performance on long-horizon tasks. In tests, the system achievedβ¦