Personalized Observation Normalization for Federated Reinforcement Learning in Simulation Environments with Heterogeneity

wpnews.pro

cd /news/machine-learning/personalized-observation-normalizati… · home › topics › machine-learning › article

[ARTICLE · art-16023] src=arxiv.org ↗ pub=2026-05-28T04:00Z topic=machine-learning verified=true sentiment=↑ positive

Personalized Observation Normalization for Federated Reinforcement Learning in Simulation Environments with Heterogeneity

Researchers developed a personalized observation normalization (PON) method to address heterogeneity in federated reinforcement learning, where differing state-transition dynamics cause non-identical input distributions and imbalanced parameter updates. The approach allows each agent to locally normalize raw state inputs using a continuously updated running mean and variance, ensuring consistent scaling without overshadowing during aggregation. Experiments on heterogeneous MuJoCo tasks demonstrated that PON accelerates training and achieves superior performance compared to baseline methods.

read1 min views8 publishedMay 28, 2026

arXiv:2605.27385v1 Announce Type: new Abstract: Federated reinforcement learning (FedRL) enables multiple agents to collaboratively train a global policy without sharing raw data, making it ideal for privacy-sensitive applications. However, FedRL faces challenges in heterogeneous environments where differing state-transition dynamics lead to non-identical input distributions and imbalanced parameter updates during aggregation. Therefore, this paper develops a personalized observation normalization (PON) method, allowing each agent to locally normalize raw state inputs using a continuously updated running mean and variance. This design ensures consistent scaling of local feature without overshadowing across agents during aggregation. Furthermore, we demonstrate that sharing normalization parameters across agents is ineffective due to the diverse local input distributions, which highlights the necessity of personalized statistics. Experiments on heterogeneous MuJoCo tasks show that our developed PON accelerates training and achieves superior performance compared to baseline methods.

source & further reading

arxiv.org — original article

~/api · this article 200

$curl api.wpnews.pro/v1/news/personalized-observation…

Read original on arxiv.org → arxiv.org/abs/2605.27385

mentioned entities

MuJoCo

metadata

slugpersonalized-observation-normalization-for-federated-reinforcement-learning-in

topic#machine-learning

secondary3 topics

sentimentpositive

canonicalarxiv.org

navigation

← prevOpen House 2026 Day 1: real-time…

next →New poll points to possible Bece…

── more in #machine-learning 4 stories · sorted by recency

byteiota.com · 11 Jul · #machine-learning

Unitree’s $619M IPO Funds the Robot App Store Era: What Developers Need to Know

arxiv.org · 9 Jul · #machine-learning

SPEAR: A Simulator for Photorealistic Embodied AI Research

marktechpost.com · 4 Jul · #machine-learning

NVIDIA AI Introduces ASPIRE: A Self-Improving Robotics Framework Reaching 31% Zero-Shot on LIBERO-Pro Long Tasks

luckyrobots.com · 12 Jun · #machine-learning

The first game engine for robotics

── more on @mujoco 3 stories trending now

wpnews · 30 May · #ai-safety

Nightcord Security Analysis Report - Threat Investigation

wpnews · 8 Jul · #artificial-intelligence

SpaceXAI unveils Grok 4.5 AI model ahead of July 2026 public release

wpnews · 8 Jul · #artificial-intelligence

xAI Launches Grok 4.5 With Pricing Built to Undercut Anthropic's Opus 4.8

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required