cd/entity/Kimi-K2-Thinkingยท homeโ€บ entitiesโ€บ Kimi-K2-Thinking
grep -l @kimi-k2-thinking /news/*.json | wc -l โ†’ 1

@Kimi-K2-Thinking

mentions 1 type Organization feed RSS
15:05
2026-06-03
pytorch.org
machine-learning

Using Muon Optimizer with DeepSpeed

DeepSpeed has integrated the Muon Optimizer, a memory-efficient optimizer that uses a single momentum buffer and Newton-Schulz orthogonalization to improve training convergence, particularly for 2D weโ€ฆ

// co-occurs with top 6 entities