00:00
2026-05-11
machinelearning.apple.com
artificial-intelligence
BalCapRL: A Balanced Framework for RL-Based MLLM Image Captioning
Researchers at BalCapRL introduced a balanced reinforcement learning framework for multimodal large language model image captioning that jointly optimizes utility-aware correctness, reference coverageβ¦