07:00
2026-06-02
cloud.google.com
artificial-intelligence
Experimenting with TPUs, GKE Managed DRANET, and Multi-cluster Inference Gateway
Google engineers deployed a Gemma 3 large language model across two GKE clusters in separate regions, using four TPU v6e chips per cluster, to test multi-cluster failover with the Inference Gateway. T…