cd/entity/MiniMax· home entities MiniMax
grep -l @minimax /news/*.json | wc -l → 107

MiniMax

mentions 107 type Organization page 6/6 feed RSS

// recent coverage 107 mentions

00:00
2026-05-21
modular.com
large-language-models

Modular: Why LLM Inference Needs a New Kind of Router - Part 2

Modular has built a new data layer for LLM inference routing that solves the problem of querying cached blocks across hundreds of pods in microseconds. The company's architecture uses a specialized da…

16:02
2026-05-20
dev.to
large-language-models

10 Ways To Reduce Your LLM API Costs

Ten strategies to reduce costs associated with LLM API usage, emphasizing that users pay for input and output tokens rather than the inference itself. Key recommendations include selecting appropriate…

04:51
2026-05-20
andlukyane.com
artificial-intelligence

Testing MiniMax M2.7 via API on three real ML and coding workflows

Testing the MiniMax M2.7 model via API on three real-world workflows—refactoring a PyTorch project, drafting knowledge-base notes, and scaffolding a Kaggle competition entry—using Claude Code as the a…

00:00
2026-05-08
modular.com
ai-infrastructure

Modular: Why LLM Inference Needs a New Kind of Router - Part 1

Modular announced that traditional HTTP-era load balancing algorithms like round-robin, consistent hashing, and least-connections are inadequate for large language model inference because GPU pods are…

00:00
2026-04-27
zonted.com
large-language-models

OpenClaw vs Claude Code: I Choose Freedom

Anthropic capped a user's Claude Max plan at 15 daily routines after a scheduled batch job failed at 2:36 AM, revealing the platform's hidden constraints on heavy usage. The user calculated a $137 API…

← prev page 6 / 6
// co-occurs with top 8 entities