08:35
2026-05-22
developers.googleblog.com
artificial-intelligence
MaxText Expands Post-Training Capabilities: Introducing SFT and RL on Single-Host TPUs
MaxText has introduced new post-training capabilities, specifically Supervised Fine-Tuning (SFT) and Reinforcement Learning (RL), now available on single-host TPU configurations like v5p-8 and v6e-8. โฆ