RL

mentions 1 type Organization feed RSS

// recent coverage 1 mentions

08:35

2026-05-22

developers.googleblog.com

artificial-intelligence

MaxText Expands Post-Training Capabilities: Introducing SFT and RL on Single-Host TPUs

MaxText has introduced new post-training capabilities, specifically Supervised Fine-Tuning (SFT) and Reinforcement Learning (RL), now available on single-host TPU configurations like v5p-8 and v6e-8. …

// co-occurs with top 7 entities

MaxText 1 TPU 1 JAX 1 Tunix 1 Hugging Face 1 vLLM 1 SFT 1

// topics top 5 topics

artificial intelligence 1 machine learning 1 large language models 1 developer tools 1 cloud computing 1