TycoonLE: A Jax reinforcement learning environment for long-horizon planning
Researchers released TycoonLE, a JAX-based reinforcement learning environment for long-horizon planning in a simulated logistics economy. The environment supports action legality, delayed rewards, and…