Original article excerpt
Server-side extracted preview paragraphs from the original source.
MAI released models that can transcribe voice into text as well as generate audio and images after the group's formation six months ago.
Microsoft AI, the tech giant’s research lab, announced the release of three foundational AI models on Thursday that can generate text, voice, and images.
The release signals Microsoft’s continued push to build out its own stack of multimodal AI models — and compete with rival AI labs — even though it remains tied to OpenAI.