Amazon engineers are reportedly already distilling Anthropic models into smaller, cheaper versions for internal use. Starting next year, Amazon will pay per token processed rather than per compute hour, which could push its costs up sharply. The company is also exploring alternatives such as OpenAI.
The article "Amazon engineers are reportedly distilling Anthropic models to cut costs before new token-based pricing kicks in" appeared first on The Decoder.