arXiv:2605.27385v1 Announce Type: new Abstract: Federated reinforcement learning (FedRL) enables multiple agents to collaboratively train a global policy without sharing raw data, making it ideal for privacy-sensitive applications. However, FedRL faces challenges in heterogeneous environments where differing state-transition dynamics lead to non-identical input distributions and imbalanced parameter updates during aggregation. Therefore, this paper develops a personalized observation normalization (PON) method, allowing each agent to locally normalize raw state inputs using a continuously updated running mean and variance. This design ensures consistent scaling of local feature without overshadowing across agents during aggregation. Furthermore, we demonstrate that sharing normalization parameters across agents is ineffective due to the diverse local input distributions, which highlights the necessity of personalized statistics. Experiments on heterogeneous MuJoCo tasks show that our developed PON accelerates training and achieves superior performance compared to baseline methods.
Personalized Observation Normalization for Federated Reinforcement Learning in Simulation Environments with Heterogeneity
Researchers developed a personalized observation normalization (PON) method to address heterogeneity in federated reinforcement learning, where differing state-transition dynamics cause non-identical input distributions and imbalanced parameter updates. The approach allows each agent to locally normalize raw state inputs using a continuously updated running mean and variance, ensuring consistent scaling without overshadowing during aggregation. Experiments on heterogeneous MuJoCo tasks demonstrated that PON accelerates training and achieves superior performance compared to baseline methods.
Run your AI side-project on zahid.host
EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.