TycoonLE: A JAX reinforcement learning environment for long-horizon planning


Source: github.com. Published: 2026-06-13T02:02Z. Topic: artificial-intelligence.

Researchers released TycoonLE, a JAX-based reinforcement learning environment for long-horizon planning in a simulated logistics economy. The environment supports action legality, delayed rewards, and replayable audit traces, with a companion benchmark report at TycoonBench. It is designed to study agent planning and decision-making under economic constraints.

Tycoon Learning Environment (TycoonLE) is a reinforcement learning environment for economically grounded, long-horizon planning. Agents operate in a simulated logistics economy where they allocate capital, build transport routes, move cargo, manage debt, and optimize delayed returns.

It is designed to study action legality, candidate-frontier decision interfaces, financing timing, delayed rewards, procedural variation, and replayable audit traces.

TycoonLE uses a fixed-shape interface. Agents choose among valid route, finance, and wait candidates, making rollouts compatible with JAX transformations such as jit, vmap, and scan.
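
To illustrate why a fixed-shape candidate interface matters for JAX, here is a minimal sketch on a toy environment. It is not TycoonLE's code: ToyState, COSTS, PAYOFFS, and every function below are hypothetical names invented for this example. The sketch assumes a fixed number of candidate slots with a boolean legality mask, which lets one rollout be written with jax.lax.scan, batched with jax.vmap, and compiled with jax.jit.

```python
from typing import NamedTuple

import jax
import jax.numpy as jnp

# Hypothetical toy setup: four candidate slots, the last one is "wait".
COSTS = jnp.array([5.0, 3.0, 1.0, 0.0], dtype=jnp.float32)
PAYOFFS = jnp.array([8.0, 4.0, 1.5, 0.0], dtype=jnp.float32)
WAIT = 3


class ToyState(NamedTuple):
    cash: jnp.ndarray  # scalar float32
    t: jnp.ndarray     # scalar int32 step counter


def action_mask(state):
    # A candidate is legal when it is affordable; "wait" costs 0, so it is always legal.
    return COSTS <= state.cash


def step(state, action):
    reward = PAYOFFS[action] - COSTS[action]
    return ToyState(cash=state.cash + reward, t=state.t + 1), reward


def greedy_legal_action(state):
    # Illegal candidates get -inf so argmax can only pick a legal slot.
    mask = action_mask(state)
    margins = jnp.where(mask, PAYOFFS - COSTS, -jnp.inf)
    return jnp.argmax(margins), mask


def rollout(start_cash, num_steps=16):
    init = ToyState(cash=start_cash, t=jnp.int32(0))

    def body(state, _):
        action, mask = greedy_legal_action(state)
        next_state, reward = step(state, action)
        return next_state, (action, reward, mask[action])

    # Every step has the same shapes, so the whole episode fits in one scan.
    final, (actions, rewards, legal) = jax.lax.scan(body, init, None, length=num_steps)
    return final.cash, actions, rewards, legal


# Batch over starting capital with vmap, then compile the batched rollout with jit.
batched_rollout = jax.jit(jax.vmap(rollout))
start_cash = jnp.array([0.0, 1.0, 3.0, 5.0], dtype=jnp.float32)
final_cash, actions, rewards, legal = batched_rollout(start_cash)
```

Because the mask has the same shape at every step, the policy never needs a variable-length action list, and this is what keeps the loop traceable. An agent starting with no cash can only wait, while better-funded agents unlock the more profitable candidates earlier.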

The replay UI makes policies inspectable through route choices, cargo flow, financing behavior, reward, score, and profit over time.

TycoonBench provides a companion benchmark report for comparing agent and model performance on TycoonLE planning tasks: vrtnis.github.io/tycoonbench.

Use Python 3.11 or 3.12:

py -3.12 -m venv .venv
.\.venv\Scripts\python.exe -m pip install -e ".[test]"
npm install
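
Quickstart in Python:

import jax
from tycoonle_jax import TycoonLE

# Create an environment on the dev split, using the "chain" family.
env = TycoonLE(split="dev", family="chain")
# Reset with a fixed PRNG key so the episode is reproducible.
state, timestep = env.reset(jax.random.PRNGKey(0))
# argmax over the action mask picks the first candidate with the largest
# mask value, i.e. the first legal action for a 0/1 mask.
action = timestep.observation.action_mask.argmax()
state, timestep = env.step(state, action)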
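
The quickstart always takes the first legal candidate. A random legal policy is a common alternative baseline, and the sketch below shows one way to write it in JAX. The helper sample_legal_action is a hypothetical name, not part of TycoonLE, and the example assumes the action mask is a 1-D array of 0/1 or boolean entries with at least one legal candidate. A stand-in mask is used here so the snippet runs without the environment installed.

```python
import jax
import jax.numpy as jnp


def sample_legal_action(key, action_mask):
    """Sample uniformly among candidates whose mask entry is truthy."""
    mask = jnp.asarray(action_mask).astype(bool)
    # Legal candidates share logit 0; illegal ones get -inf and are never drawn.
    logits = jnp.where(mask, 0.0, -jnp.inf)
    return jax.random.categorical(key, logits)


# Stand-in mask for illustration only. In the quickstart it would come from
# timestep.observation.action_mask (assumed here to be a 1-D 0/1 array).
example_mask = jnp.array([0, 1, 0, 1, 1])

# Draw 64 independent samples to check that only legal slots are chosen.
keys = jax.random.split(jax.random.PRNGKey(0), 64)
sampled = jax.vmap(lambda k: sample_legal_action(k, example_mask))(keys)
```

Under that assumption, the quickstart's argmax line could be swapped for sample_legal_action(key, timestep.observation.action_mask), with a fresh key split off for each step.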

Export a replay:

.\.venv\Scripts\python.exe examples\quickstart.py
npm run dev

Open the browser UI and load runs/quickstart/replay.json.

Run tests:

.\.venv\Scripts\python.exe -m pytest
npm run build

Run a small PPO smoke train:

.\.venv\Scripts\python.exe examples\train_ppo_jax.py --updates 1 --num-envs 4 --rollout-length 4 --update-epochs 1 --hidden-sizes 32

If you find this work useful, consider citing:

@software{tycoonle,
  title = {TycoonLE},
  author = {TycoonLE contributors},
  year = {2026},
  url = {https://github.com/vrtnis/tycoon-learning-environment}
}

TycoonLE uses sprite artwork from OpenGFX, an open-source graphics base set for OpenTTD.

