04:00
2026-05-25
arxiv.org
artificial-intelligence
MARGIN: Runtime Confidence Calibration for Multi-Agent Foundation Model Coordination
A new online calibration method called MARGIN corrects systematic mis-calibration in foundation model confidence scores, achieving 3-6x lower calibration error than existing design-time methods under โฆ