cd/sources/together-ai· home sources Together AI
cat /sources/together-ai.feed | wc -l → 14

Together AI

articles 14 domain together.ai → feed RSS
00:00
2026-05-29
together.ai
artificial-intelligence

How Together AI built the world’s fastest speech-to-text stack

Together AI built the world’s fastest speech-to-text stack, enabling NVIDIA’s Parakeet-TDT 0.6B v3 model to transcribe roughly 20 hours of speech in under 10 seconds. The company achieved this by opti…

00:00
2026-05-19
together.ai
ai-infrastructure

Benchmarking inference at scale: coding agents

Together Inference Engine delivered 31% more tokens per second than the next fastest open-source inference engine on the same hardware during a production coding agent workload, while maintaining twic…

00:00
2026-05-08
together.ai
ai-agents

Deploy and inference any model from HuggingFace

Netflix released void-model on Hugging Face, and a developer used the Goose CLI agent with Together's dedicated containers skill to deploy the model for inference on release day with a single prompt. …

00:00
2026-05-04
together.ai
ai-infrastructure

Foundational research powering efficient inference at scale

NVIDIA CEO Jensen Huang declared at GTC 2026 that agentic AI systems are shifting infrastructure priorities from training to inference, as inference now accounts for 80-90% of the total lifetime cost …

00:00
2026-04-30
together.ai
artificial-intelligence

Announcing Together AI and Adaption Partnership

Together AI has partnered with Adaption to integrate Together Fine-Tuning into Adaption's Adaptive Data platform, enabling users to optimize training datasets and directly execute fine-tuning on leadi…

00:00
2026-04-29
together.ai
artificial-intelligence

DeepSeek-V4 Pro now available on Together AI

Together AI has launched DeepSeek-V4 Pro, a 1.6T-parameter Mixture-of-Experts model with a 512K-token context window, priced at $2.10 per million input tokens and $4.40 per million output tokens. The …