07:06
2026-07-04
pub.towardsai.net
computer-vision
Paper Walkthrough - MACT: A Multi-Agent Collaboration Framework for Visual Document Understanding
Researchers from NUS, Tencent Youtu Lab, and Tsinghua introduced MACT, a multi-agent collaboration framework for visual document understanding that decomposes the task into four specialized agents: pla…