maybe: shivam @kaffeinated
ml research @spotify, former matrix multiplier @twitter @x cortex, nyu. techno-optimist. probably on a bike 🚲☕. London. Joined October 2012.
Tweets: 3K · Followers: 455 · Following: 3K · Likes: 18K
Awesome to see this innovation in text diffusion. DiffusionGemma is lightning fast, 4x faster than other Gemma 4 models! Congrats to @bodonoghue85 and the team who worked so hard on this - excited to see what people build with it!
Meet DiffusionGemma! An experimental open model that explores a fast approach to text generation, released under an Apache 2.0 license. Moving beyond sequential, token-by-token processes to generate entire blocks of text simultaneously. Here’s what’s new with DiffusionGemma: 👇
Quiet quitting but for AI research
BREAKING NEWS: Anthropic's latest model will NOT help you if it thinks your ML research/ML engineering is interesting, and/or will secretly degrade its IQ so that the average engineer won't notice. We are already seeing Anthropic's latest model's moderation filters our GPU
@TheStalwart @citrini It's wild that this is possible now (or will be soon), I hope it doesn't come to pass but we live in a world that makes Black Mirror feel tame now
HTML is the new markdown. I've stopped writing markdown files for almost everything and switched to using Claude Code to generate HTML for me. This is why.
Neural networks might speak English, but they think in shapes. Understanding their rich *neural geometry* is key to understanding how they work – and to debugging and controlling them with precision. Starting today, we’re releasing a series of posts on this research agenda. 🧵
New Anthropic research: Natural Language Autoencoders. Models like Claude talk in words but think in numbers. The numbers—called activations—encode Claude’s thoughts, but not in a language we can read. Here, we train Claude to translate its activations into human-readable text.
this is how you look when you have multiple agents running btw
idk how else to say this, but... build your dream projects now. I feel like all the tools are giving away a LOT for free/cheap now. It's only gotten more pricey over time, and will keep getting more expensive. Your ideas are subsidized now, think of it as a fire sale and build!
First, to get you started, we've created 23 tutorials to walk you from the API basics to advanced training techniques and deploying models into production. tinker-docs.thinkingmachines.ai/tutorials/
We are entering the ~5 month period each year where there is no better city to be in than London. It's good to be back!
We see our home planet as a whole, lit up in spectacular blues and browns. A green aurora even lights up the atmosphere. That's us, together, watching as our astronauts make their journey to the Moon.
@giffmana "Have you tried turning the outer learning rate off and on again?"
if you can imagine it, you can build it
dude computers are actually so fucking insane when you really think about it. we literally figured out how to write some fake-ass rules called code and somehow convinced rocks to follow them. like actual rocks. sand, melted, purified, carved into tiny pathways where electricity just flows in patterns. that’s it. that’s the whole magic. and yet from that we get operating systems, compilers, kernels, networks, distributed systems, machine learning models, entire virtual worlds running inside other virtual worlds. billions of tiny electrical decisions per second, all because we defined some abstract logic. humans basically invented a language of instructions and taught matter itself to execute it.
The time to learn how to think for yourself was before genAI, if you missed your chance, good luck
We are in a new phase since December, everything is moving faster. You can feel it in the most recent releases, you can feel it from the energy of people in the industry. I know some people still think this is hype, but this is real. Everything in this world is about to change.
Introducing The Anthropic Institute, a new effort to advance the public conversation about powerful AI. anthropic.com/news/the-anthr…
Three days ago I left autoresearch tuning nanochat for ~2 days on depth=12 model. It found ~20 changes that improved the validation loss. I tested these changes yesterday and all of them were additive and transferred to larger (depth=24) models. Stacking up all of these changes, today I measured that the leaderboard's "Time to GPT-2" drops from 2.02 hours to 1.80 hours (~11% improvement), this will be the new leaderboard entry. So yes, these are real improvements and they make an actual difference. I am mildly surprised that my very first naive attempt already worked this well on top of what I thought was already a fairly manually well-tuned project. This is a first for me because I am very used to doing the iterative optimization of neural network training manually. You come up with ideas, you implement them, you check if they work (better validation loss), you come up with new ideas based on that, you read some papers for inspiration, etc etc. This is the bread and butter of what I do daily for 2 decades. Seeing the agent do this entire workflow end-to-end and all by itself as it worked through approx. 700 changes autonomously is wild. It really looked at the sequence of results of experiments and used that to plan the next ones. It's not novel, ground-breaking "research" (yet), but all the adjustments are "real", I didn't find them manually previously, and they stack up and actually improved nanochat. Among the bigger things e.g.:
- It noticed an oversight that my parameterless QKnorm didn't have a scalar multiplier attached, so my attention was too diffuse. The agent found multipliers to sharpen it, pointing to future work.
- It found that the Value Embeddings really like regularization and I wasn't applying any (oops).
- It found that my banded attention was too conservative (I forgot to tune it).
- It found that AdamW betas were all messed up.
- It tuned the weight decay schedule.
- It tuned the network initialization.
This is on top of all the tuning I've already done over a good amount of time. The exact commit is here, from this "round 1" of autoresearch. I am going to kick off "round 2", and in parallel I am looking at how multiple agents can collaborate to unlock parallelism. github.com/karpathy/nanoc… All LLM frontier labs will do this. It's the final boss battle. It's a lot more complex at scale of course - you don't just have a single train.py file to tune. But doing it is "just engineering" and it's going to work. You spin up a swarm of agents, you have them collaborate to tune smaller models, you promote the most promising ideas to increasingly larger scales, and humans (optionally) contribute on the edges. And more generally, *any* metric you care about that is reasonably efficient to evaluate (or that has more efficient proxy metrics such as training a smaller network) can be autoresearched by an agent swarm. It's worth thinking about whether your problem falls into this bucket too.
I've been working on this for weeks, and he just Tweeted it...out
The next step for autoresearch is that it has to be asynchronously massively collaborative for agents (think: SETI@home style). The goal is not to emulate a single PhD student, it's to emulate a research community of them. Current code synchronously grows a single thread of
Luca (also on the oth... @__lucab
2K Followers 2K Following former @NIST AI Fellow and @UCBerkeley Tech Policy Co-founder of Twitter's Machine Learning Ethics team (RIP) Opinionated about 🇮🇹 food Tweet with Typos
Suvash Sedhain @suvsh
1K Followers 1K Following Core Ranking/Retrieval Lead @Tubi. ML Ph.D. from @anucecs. ex @twitter @adobe & @kobo. Blog: https://t.co/8K8HAxRycM
beepee @reachbp
338 Followers 343 Following Love building products | Recsys | Into Machine learning and AI | Often gives unsolicited opinions | ExMeta
Kiwami @kiwami
2K Followers 383 Following Life's simple... Family, Friends, Dance & Dancehall. Strangely serial tech entrepreneur & event producer: My hobby is my job... #blessed https://t.co/Co4ReXKWXB
lata @LataPersson
2K Followers 1K Following investing+research @fabric_vc | cooking https://t.co/iZrrFYtR0N
Rahul Yadav @RahulYadav80727
3 Followers 81 Following
kelsey @KelseyPenners
10 Followers 202 Following
cammelia dreams @TheAngelette
101 Followers 882 Following warm heart, cold brew ☕ always follow back
Jyoti Mann @jyoti_mann1
4K Followers 4K Following monitoring situations at meta for @theinformation send me secrets on signal: jyotimann.11
Rahul Sawhney @ra37194
0 Followers 541 Following
q_b @q_bbbb
1 Followers 326 Following
khoiracle @khoiracle
974 Followers 328 Following building https://t.co/PIpXy21tXG and https://t.co/iePX22hGDs
Nadeem @8W7O7
40 Followers 425 Following Specifically here for the applied LLM research and design community. Post only for myself. Ex-@Meta @Uber @Wayfair
Sander Dieleman @sedielem
68K Followers 2K Following Research Scientist at Google DeepMind (WaveNet, Imagen, Veo). I tweet about deep learning (research + software), music, generative models (personal account).
Amine @fromamine
99 Followers 454 Following Building AI applications at @GoogleDeepmind, ex-@scale_ai. CS PhD graduate. I believe in: Start slow, sustain the effort, finish strong.
Kaxil Naik @kaxil
2K Followers 2K Following Orchestrating the next era of AI agents ⚙️ | @ApacheAirflow Committer & PMC member | Sr Eng Director @ @astronomerio | Open-source advocate
Pablo Fernandez 🧉 @pupeno
4K Followers 253 Following Head of Engineering for Pexels at Canva. Ex Google. Writes about AI, coding, management, tech, business, and geeky stuff.
Sean Cantrell @ThePremiseOfIt
512 Followers 351 Following From particle theory to AI. First principles, math, and hot takes. Founder & CEO PremiseAI.
Andrew Curran @AndrewCurran_
58K Followers 18K Following 🏰 - I write about AI, mostly. Expect some strange sights.
John Peter Ryan @JohnPeterRyan1
0 Followers 1K Following
Trey Dickens @dickens45615
159 Followers 5K Following
kori @korigero
118 Followers 452 Following research @cursive_ai | prev: sent a human to space; Oxford
LaurenLawson @jyE0sg08S2QQlH
14 Followers 1K Following
Henrietta @sYLtvCn5XBH53T
24 Followers 827 Following
Satnam Singh @satnam6502
22K Followers 3K Following Punjabi-Scottish-American working at @HarmonicMath. Cook, cyclist, Lost In Music. ∃🇮🇳 ∧ ∀🇬🇧 ∧ ∃🇪🇺 ∧ ∀🇺🇸 #celiac ex-{Microsoft, Google, Facebook, Xilinx}
Nolleen 🤍 @carnagefashion
3K Followers 3K Following Learn to master your mindset, one day at a time.
Aniruddh @aniruddh_gdn
94 Followers 3K Following Product Manager, Serial Entrepreneur, Ex-Software Engineer, MS at Lund University.
HFT_Insider🇺🇸 @Erproumxuiv790
37 Followers 2K Following 15-30% Monthly | 2 High-Conviction Stocks.Short-Term Gains: 15-20% in Days/Weeks.DM "JOIN" for WhatsApp Alerts. Live Trade Signals • Market Analysis
Jain Deanglo @DeangloJai93340
13 Followers 122 Following Double your assets fast! 5 free stocks, earn $15,000 a week. Limited to 100 people, add us on WhatsApp! Currently 7,682 participants!
Anna @Qiacbal1711552
11 Followers 519 Following
Toslough @TosloughzTdCB_
42 Followers 1K Following
segmenta @segmenta
857 Followers 1K Following Rowing Rowboat (YC S24). Prev: Co-founder/ CTO Agara AI (acq. Coinbase), Coinbase AI, Twitter AI
mary @mary175626462
14 Followers 323 Following
Sloyfres @SloyfresVX8
17 Followers 991 Following
Tarestewn @TarestewnQ8g_z
44 Followers 1K Following
McSooshe @McSooshezAP
138 Followers 3K Following
Nazia @TeaurtairsDKQx
178 Followers 4K Following
Jatin Nandwani @authorjatin
48 Followers 874 Following Generative AI | Open Source | Gym | Parenting | Learning and Building in Public
SJ @_Shubham_Jha
250 Followers 5K Following I'm an Engineer, so to save time let's just assume I'm always right
Bharat Raghunathan @BharatR123
140 Followers 3K Following Exploring opportunities in Software Engineering (at the intersection of Machine Learning, LLMs and GenAI) and NLP!
us_Genesis_ @UsGenesis38286
55 Followers 5K Following
Sinadisleys @sinadisley90539
69 Followers 5K Following
Gergely Orosz @GergelyOrosz
339K Followers 3K Following Writing @Pragmatic_Eng, the #1 software engineering newsletter on Substack. Author of @EngGuidebook. Formerly Uber & Skype.
Jane Manchun Wong @wongmjane
180K Followers 3K Following “The woman scooping Silicon Valley” — BBC・hacker turned builder, writer & consultant・prev: Threads, Instagram, startups
Yann LeCun @ylecun
1.2M Followers 786 Following Professor at NYU & Executive Chairman at AMI Labs. Ex-Chief AI Scientist at Meta. Researcher in AI, Machine Learning, Robotics, etc. ACM Turing Award Laureate.
Bojan Tunguz @tunguz
290K Followers 8K Following Founder and CEO @tabul_ai. Creator of @trainxgb. ML ex Nvidia. Data Scientist. Physicist. Catholic. Husband. Father. Stanford Alum. Memelord. e/xgb. AMDG.
@levelsio @levelsio
900K Followers 3K Following 📸https://t.co/lAyoqmSBRX $100K/m 🛰https://t.co/ZHSvI2wjyW $44K/m 🎮https://t.co/jFirUbDgtZ $39K/m 🏡https://t.co/1oqUgfD6CZ $35K/m 👙https://t.co/RyXpqGuFM3 + @X $14K/m 🌍https://t.co/UXK5AFqCaQ $10K/m 💾https://t.co/T74ZwJ1F0C $0/m
Ian Brown @igb
19K Followers 3K Following XML apologist. Erlang enthusiast. Currently JVMs & Performance stuff at @Netflix. Previously JVMs & performative stuff at @Twitter. He/him.
Nikita Bier @nikitabier
1.1M Followers 2K Following head of product @x, advisor @solana, venture partner @lightspeedvp, ex-founder @gasappteam (acq by discord), ex-founder @thetbhapp (acq by facebook)
François Chollet @fchollet
697K Followers 825 Following Co-founder @ndea. Co-founder @arcprize. Creator of Keras and ARC-AGI. Author of 'Deep Learning with Python'.
VCs Congratulating Th... @VCBrags
289K Followers 5K Following They're adding value™ And they're very proud of it. @BragsVentures
Apoorva Govind @Appyg99
33K Followers 1K Following Founder @BesteverAI. Ex @Ubereats. Previously @Apple. Technology Syster. Substack https://t.co/NOfQ180iK5
Richard Socher @RichardSocher
120K Followers 1K Following Building self-improving superintelligence CEO @recursive_si and @youdotcom MP @aixventuresHQ Ex: Stanford Adj Prof, Chief Scientist at Salesforce, CEO MetaMind
John Carmack @ID_AA_Carmack
2.3M Followers 286 Following AGI at Keen Technologies, former CTO Oculus VR, Founder Id Software and Armadillo Aerospace
Deedy @deedydas
245K Followers 6K Following Partner at Menlo Ventures. ex-founding Glean, Google Search. Cornell CS. Investor: Anthropic, OpenRouter, Modal, Wispr Flow, Inception, Prime Intellect
Luca (also on the oth... @__lucab
2K Followers 2K Following former @NIST AI Fellow and @UCBerkeley Tech Policy Co-founder of Twitter's Machine Learning Ethics team (RIP) Opinionated about 🇮🇹 food Tweet with Typos
Ananth Govind Rajan @ananthgr
2K Followers 1K Following Associate Professor @chemengiisc @iiscbangalore | Catalysis, nanotech, & materials | ECB @NanoLetters | Previously @Princeton @MIT @IITDelhi |
Jane Zhang @jjanezhang
3K Followers 836 Following caring deeply, building carefully, and living life 🎉 | agents & llm training @dbrxmosaicai @dukeu I write essays monthly 📝
Joel David Hamkins @JDHamkins
29K Followers 285 Following Mathematics and philosophy of the infinite. Professor of Logic @NotreDame @UniofOxford #ProofandtheArt #PhilMaths #InfinitelyMore #BookOfInfinity
love drops @lovedropx
320K Followers 123 Following A collection of words from books I've read, along with my ramblings | Movies | Linguistics & Literature |
Today in History @TodayinHistory
456K Followers 4K Following Sharing events that happened today in the past 🏛️ Join me in keeping history alive for everyone on X!
Kiana Ehsani @ehsanik
9K Followers 617 Following Making models smarter @ Anthropic, formerly CEO and Co-Founder @ Vercept (acquired by Anthropic), Climber on the weekends. Opinions are my own.
Recursive @Recursive_SI
7K Followers 0 Following Recursive self-improving superintelligence to automate knowledge discovery.
Probability and Stati... @probnstat
81K Followers 703 Following Sharing insights on Probability, Statistics, ML, DL and AI research. Subscribe for recent research paper discussions at $2/month. DM to collaborate.
Petar Veličković @PetarV_93
45K Followers 557 Following Senior Staff Research Scientist @GoogleDeepMind | Affiliated Lecturer @Cambridge_Uni | Assoc @clarehall_cam | GDL Scholar @ELLISforEurope. Monoids. 🇷🇸🇲🇪🇧🇦
Samuel Marks @saprmarks
5K Followers 148 Following AI safety research @AnthropicAI, leading Cognitive Oversight team. Previously: postdoc with @davidbau, math PhD at @Harvard.
julia @mooncat_is
3K Followers 2K Following Being weird. Currently research @anthropic. Formerly @openai and YC S15 and many other things. Bad takes are my own.
Brian Huang @brianryhuang
6K Followers 3K Following @GoogleDeepmind @antigravity | prev math and cs @mit
poof @poof_eth
17K Followers 5K Following Group Founder at @dxrgai - the experimental agentic world builders
heiner @HeinrichKuttler
20K Followers 1K Following ex @xAI, @InflectionAI, @AIatMeta, @DeepMind, @Google, @LMU_Muenchen, PhD math-ph. Opinions my own. (Can be yours for a small fee.)
gabriel @gabriel1
102K Followers 582 Following new thing, previously research at @OpenAI & @midjourney
Sanja Fidler @FidlerSanja
17K Followers 493 Following Associate Professor @UofT, Vice President of AI Research @nvidia, founding member of @VectorInst. Computer vision, deep learning, 3D. Opinions are my own.
Sovereign AI @UKSovereignAI
9K Followers 62 Following Backing Britain's AI Founders to start here, scale here, and win everywhere
Thariq @trq212
284K Followers 2K Following Claude Code @anthropicai. prev YC W20, @southpkcommons, @medialab
deep Manifold @BetaTomorrow
3K Followers 637 Following mathematics Thief & Chef "through the window of differential equations, mathematics sees the light in the real world" / "通过微分方程的窗子,数学家看到现实世界的光" (Jiang Zehan)
Ido Salomon @idosal1
6K Followers 400 Following ai lead | creator https://t.co/dYzyzeEggO | MCP Apps | creator https://t.co/96CngH2g1F | co-creator https://t.co/Kx3uu8RvoA
swyx @swyx
166K Followers 4K Following achieve ambition with intentionality, intensity, integrity & insanity. affiliations: - @dxtipshq - @cognition - @temporalio - @aidotengineer - @latentspacepod
kitze the 🐐 @thekitze
101K Followers 917 Following https://t.co/2lUVc9DzP9 ⋅ https://t.co/EB5easbAZH ⋅https://t.co/BaMlf8p9vR ⋅ https://t.co/llv5eYsp5q ⋅ https://t.co/UPvF2vcvXV ⋅ https://t.co/0i9Ne1kP1I ⋅ https://t.co/4YoOCTmat7
Sarah Chieng @MilksandMatcha
24K Followers 1K Following Head of DevX @Cerebras prev. @ExaAiLabs, @shopthrifthouse, @MIT married to @tyler_fong_ 💌 DMs open
R 'Nearest' Nabors @rachelnabors
28K Followers 2K Following mimarobe | Prev @reactjs @aws @W3C | agentic web herald Directors call me up in the middle of the night to ask about AI. I want your replies, not your likes
etn. @etnshow
9K Followers 279 Following Europe’s technology show. Hosted by @lukeknight and @ronanchamberss and streaming live on X and Youtube at 11AM-2PM UK every Tuesday and Thursday.
sunil pai @threepointone
53K Followers 3K Following 🎈 Entscheidungsproblem. https://t.co/DISzWsXdnE forky mcforkface.
David Soria Parra @dsp_
11K Followers 414 Following Member of Technical Staff @AnthropicAI. Co-Creator of https://t.co/cn31cNYQD3. Ex-Meta. Playing with computers and tech. https://t.co/yDyCddC26H
Kevin Rose @kevinrose
1.5M Followers 2K Following building at @basic_in (@digg) | Podcasts: The Kevin Rose Show, Random Show w/ @tferriss. | Ex: @google, Board of Directors: @ouraring, @hodinkee
Aleksa Gordić (水... @gordic_aleksa
30K Followers 225 Following pretraining LLMs getting us to singularity with friends | angel computers can be understood: https://t.co/doHE1Qv2Sj x @GoogleDeepMind @Microsoft
Andrea Santilli @teelinsan
1K Followers 3K Following Senior Research Engineer @NVIDIA | Prev: @Apple, @NousResearch, @GladiaLab, @BigscienceW, @picampusschool #NLProc
Pliny the Liberator ... @elder_plinius
209K Followers 1K Following ⊰•-•⦑ latent space steward ❦ prompt incanter 𓃹 hacker of matrices ⊞ breaker of markov chains ☣︎ ai danger researcher ⚔︎ bt6 ⚕︎ architect-healer ⦒•-•⊱
Yet another commodity... @tleilax___
51K Followers 3K Following Commodity Portfolio Manager - Systematic mid-frequency futures & options - Switzerland - Options - Quantitative trading - Data driven
David Daines @daviddorg
18K Followers 465 Following Year Unplugged: 1 year, 0 screens, 100s of biomarkers | Locking in partners for official launch
kalomaze @kalomaze
25K Followers 3K Following ML researcher (@primeintellect), speculator • extremely silly jester
AI Engineer @aiDotEngineer
53K Followers 9 Following The world's best engineers, founders, and researchers building with AI. Organizers of the AIE Summit, Code Summit, Europe, and the flagship SF World's Fair.
OSINTdefender @sentdefender
2.3M Followers 2K Following Open Source Intelligence Monitor focused on Europe and Conflicts across the World. RT ≠ Endorsement. Want to Support my Work? https://t.co/PcUbewvWPr
Claude Code Changelog @ClaudeCodeLog
70K Followers 20 Following UNOFFICIAL – but tolerated – bot posting Claude Code CLI, feature flag & prompt changes. Full CC history in github repo.
Alek Dimitriev @tensor_rotator
5K Followers 2K Following Inference @AnthropicAI, prev Gemini @Google, prev prev PhD @UTAustin
Yosh Zlotogorski @yehoshzl
2K Followers 1K Following Building a low turnover, long duration growth portfolio to outperform the QQQ. 📈 +174% over past 5 years. Follow along: https://t.co/VY2NXJ3LTX
Andrew Lampinen @AndrewLampinen
12K Followers 2K Following Interested in cognition and artificial intelligence. MTS at @AnthropicAI. Previously @DeepMind, cognitive science @StanfordPsych. Tweets are mine.
Archit Sharma @archit_sharma97
8K Followers 369 Following RL, post-training, reasoning research @GoogleDeepMind | co-created: Gemini Deep Think series, DPO | prev: @Stanford @Google Brain @IITKanpur @MILAMontreal
Pingbang Hu 🇹🇼 @PingbangHu
3K Followers 371 Following I work on, with, and for data. Ph.D. candidate @UofIllinois. Fellows @AnthropicAI. Interns @ SIG @amazon @jouhouken. Alumni @Umich @SJTU1896.