Microsoft's new MAI models
· Source: Simon Willison
Microsoft has announced the release of two new language models, MAI-Thinking-1 and MAI-Code-1-Flash, which are part of its family of artificial intelligence (AI) models called MAI. The MAI-Thinking-1 model has 35 billion parameters and focuses on reasoning capabilities, while the MAI-Code-1-Flash model has 5 billion parameters and is designed specifically for use in GitHub Copilot and VS Code, aiming to provide high performance and lower costs. Notably, these models have a relatively low number of parameters compared to current models, which could make them more accessible and cost-effective.
It is worth highlighting that Microsoft claims the MAI-Thinking-1 model was trained from scratch using high-quality commercial data, without relying on third-party model distillation techniques. Similarly, the MAI-Code-1-Flash model was built by Microsoft using clean and properly licensed data. This raises the question of whether these could be the first specialized code models not trained on unlicensed internet data.
This news is significant as it shows progress in the development of more efficient and accessible AI models, which could have a substantial impact on how these technologies are used and applied in various industries and applications. Additionally, the way data is trained and licensed for these models may have important implications for the ethics and legality of AI use.
Read the original article on Simon Willison
This summary is an informational synthesis produced by dataqbs.com. All rights to the original content belong to its author and the cited media outlet. We act solely as curators of technology news and claim no authorship.