microsoft MAI tech report is a gold mine, one of the most transparent for a model at this scale.
this model uses zero synthetic data or distillation from previous models. this means reasoning, agentic behavior, tool use are all learned fully during post-training with no cold start. bold choice that makes it harder and requires more iterations to reach sota, but you get FULL control over your model series and it proves they are serious about being a frontier lab.
the tech report is insanely detailed and precise about numbers. to give an example, they give the exact MFU across all the iterations of the model, with the exact changes etc. they also share the full scaling ladder recipe, to my knowledge this is the first time i've seen this in a tech report at this scale
let's look at all of this in this likely very long thread 🧵
Super excited to announce seven new world-class MAI models today. They represent what we consider a new era in AI designed to keep you in control and on the frontier.
First is our text foundation model, MAI-Thinking-1, exceptionally strong on reasoning and SWE tasks.
- It’s a
My laptop has become a “satellite device” since I started using Codex from my phone. And my Mac mini has become the “home.” It’s clunky, but the end state feels more like how we’re going to be working in the near future:
I’m currently running the Codex app on 2 devices:
1. my MacBook
2. my Mac mini
My laptop isn’t reliably connected to Wi-Fi enough, so I keep a Mac mini on my desk that is always connected.
When I kick off new threads from my phone, I start them on the Mac mini. When I’m working from my desk, I run them there too.
The cool part is that I’ve added my MacBook and Mac mini as connected devices to each other. That means I can start and resume threads from either device. So if I’m in a meeting but want to continue a thread on my laptop that was started on my Mac mini, I can do that.
I’ve also set up mutual SSH for Mac mini <> MacBook, so files are easy to access from either side. It’s not fully seamless yet, but the model works.
What this means:
- I have an always-on Codex that is accessible from my phone, with its own dev environment
- All threads are always accessible from any of the 3 devices
- I can run heartbeat threads that stay on 24/7
It’s a little makeshift today, but the shape of it feels very real to me: Codex is no longer tied to whichever computer happens to be open in front of me. It starts to feel like something I can stay connected to across whatever device I’m using.
7K Followers 2K FollowingLeading research at @arcee_ai | Formerly Data Research Lead @DbrxMosaicAI | Visiting Researcher @meta | Ph.D | #TXSTFOOTBALL fan | https://t.co/4G6Jf3b0V4
15K Followers 1K FollowingTraining VLMs at @GoogleDeepMind - ex @Amazon @CarnegieMellon @PoliTOnews @IITalk - I post honest non-AI-generated paper reviews
33K Followers 1 FollowingGet AI-powered stock analysis, DCF calculators, earnings calendars, custom KPIs, portfolio insights, and 30+ years of financials - all in beautiful charts.
533K Followers 2K FollowingPolyagentmorous ClawFather. Came back from retirement to mess with AI and help a lobster take over the world.
@OpenClaw🦞 + @OpenAI
21K Followers 727 FollowingML Engineer @huggingface, building https://t.co/nRLLeUdzP9. @KU_Leuven grad. General interest in machine & deep learning. Making AI more accessible for everyone!
804K Followers 323 FollowingTogether with the AI community, we are pushing the boundaries of what’s possible through open science to create a more connected world.
442K Followers 6K FollowingChief Scientist, Google DeepMind & Google Research. Gemini Lead. Opinions stated here are my own, not those of Google. TensorFlow, MapReduce, Bigtable, ...
83K Followers 1K FollowingAgents & Gemini API, MTS @GoogleDeepMind | prev: Tech Lead at @huggingface, AWS ML Hero 🤗 Sharing my own views and AI News 🧑🏻💻 https://t.co/7IosdlO6RA