Microsoft announced two new text LLMs this morning - ** MAI-Thinking-1** (reasoning, 35B parameters, available to "select early partners") and
It's very interesting to see Microsoft releasing models with such low parameter counts, especially given how expensive larger models are to access right now. They claim MAI-Thinking-1 "is preferred to Sonnet 4.6 in our blind human side-by-side evaluations", which is impressive for a 35B model seeing as I frequently run models larger than that on my own laptop.
Also of note: We trained [MAI-Thinking-1] from the ground up on enterprise grade, clean and commercially licensed data, without distillation from third-party models.
And for MAI-Code-1-Flash as well: It is built end-to-end by Microsoft using clean and appropriately licensed data.
I would very much like to learn more about this "appropriately licensed" data! Could these be the first generally useful code-specialist models that didn't train on an unlicensed dump of the web?
Tags: llm-release, generative-ai, ai, microsoft, llms