matt turk @TurkMatthew
ML researcher @withprotegeai prev: ML @cleanlabAI @goodwatercap, Quant @coinbase & @goldmansachs, EECS @ucberkeley datalab.withprotege.ai/people?person=… New York, NY Joined March 2012
Tweets 1K · Followers 719 · Following 2K · Likes 10K
kinda messed up that brunson isn’t gonna be able to get into the parade because of the no bag policy
Jalen Brunson, First of His Name, Breaker of Double Teams, King of New York, Most Valuable Player.
@TheRealDanSaedi Incredible launch. Excited to see what is next!
Congrats on the launch to my friends @mattjoseph27 @TheRealDanSaedi @_jcfe these guys are a top tier team and have built something truly game changing for marketing. So proud of their hard work so far.
AI can now read your customers' minds. We raised a $20M Series A led by 8VC & Lingotto to build this. Introducing Minerva, built in collaboration with OpenAI:
We're excited to see @DataLabResearch @TurkMatthew’s new paper, “Counterfactual Evaluation Reveals Hidden Capability Profiles in Clinical LLMs and Agents,” accepted to the inaugural RLEval Workshop at ACM CAIS 2026 and selected for an invited talk. 🔍️ The guiding research question: "When clinically important patient facts change, does the model appropriately change its recommendations?" 👉️ If you change a key clinical detail that changes the case context, the model should change its recommendation. But if a clinically meaningful fact changes and the model doesn't update its recommendation, that counterfactual test exposes the gap. The model wasn't truly reasoning through the specific case in front of it; it was relying on surface patterns or the general shape of the case rather than the patient's actual circumstances. 🚨 What Matt found: "Producing the right answer and responding appropriately to new information are distinct capabilities - and future evaluation frameworks need both." ‼️ Why this matters: This connects directly to one of the hardest problems in benchmark design. It's not enough to measure whether a model arrives at the right answer. We also need to know whether it would arrive at a different answer when the underlying facts change. This is the kind of benchmark-design question that @engyziedan's @DataLabResearch focuses on. Better benchmarks require more than held-out datasets. They require realistic, ground-truth evaluation frameworks that can distinguish between a model that genuinely updates its reasoning and one that reaches the correct answer for the wrong reasons. Congratulations to @TurkMatthew, and we're excited about the cutting-edge benchmark and evaluation work happening across the Protege DataLab.
@arxiv @CAISconf You can see the workshop and accepted papers here: rl-eval.github.io
Excited to share that my paper, "Counterfactual Evaluation Reveals Hidden Capability Profiles in Clinical LLMs and Agents", is now available on @arxiv (link in the comments). The paper was accepted to the inaugural RLEval Workshop at @CAISconf- a workshop focused on methods and reinforcement learning environments for evaluating AI agents - and was selected for an invited talk based on reviewer ratings. LLM evaluation is difficult. Models that look equally capable on traditional benchmarks can behave very differently when the underlying facts change. Most current benchmarks focus on whether a model's output looks correct. But in real-world settings, especially in healthcare, what often also matters is whether the model appropriately updates its recommendations when the underlying facts change. In this work, I introduce the Causal Sensitivity Score (CSS), a pre-registered counterfactual evaluation framework designed to measure exactly that: "When clinically important patient facts change, does the model appropriately change its recommendations?" Across six frontier models and several hundred oncology tumor board cases, I found that models with very similar performance on standard coverage-based metrics often behave dramatically differently under counterfactual interventions. In fact, model rankings were nearly reversed depending on whether you measured coverage or responsiveness. The paper also shows that these findings transfer to tool-using agents, revealing failure modes that remain hidden under conventional evaluation approaches. The broader takeaway is that producing the right answer and responding appropriately to new information are distinct capabilities - and future evaluation frameworks should measure both. This work is also closely aligned with the mission of our DataLab at @withprotegeai: building rigorous datasets, benchmarks, and evaluation frameworks that better reflect how AI systems perform in the real world. 
As AI moves into increasingly complex, high-stakes domains, measuring what models know is important - but we also need to measure how they respond when reality changes. I'm grateful to my colleagues and collaborators, especially @engyziedan and Wes Hopkins, as well as the medical professionals who helped validate the results.
@arxiv @CAISconf Link to my paper: arxiv.org/abs/2605.30590
deep learning research was the original vibe math
.@NYCSanitation I'd like to report a sweep
Are people on LinkedIn aware that one day we are all going to die
NYC summers hate your weekends. I analyzed 3 years of Central Park rain data to see if the feeling was real. It was. 27 of 38 summer weekends had measurable rain. That’s 71%. Friday was the rainiest day of the week, with rain on 43.6% of summer Fridays. Close to a coin flip for your evening plans. Sunday had the highest rainfall volume of any day, averaging 0.17 inches. It may not rain every Sunday, but when it does, it commits. Thursday was objectively the best day to be outside in NYC: lowest precipitation, clearest skies, least weekend-related misery. The wildest stat: 70.8% of rainy Sundays were preceded by a gross Friday or Saturday. The weekend basically telegraphs its own downfall. I built a full dashboard using NWS Central Park data with every weekend tracked and every raindrop counted:
@dair_ai Way too noisy of a process to forecast
Top-tier read calling out performative grind culture. “Great work has always demanded sacrifice and often brutal hours and I'm not disputing this. What I'm disputing is the direction. These people, many of them friends, have more economic freedom than any class in history and they've chosen, freely, to simulate the conditions of a Chinese assembly line and call it virtue.”
Today, we share a breakthrough on the planar unit distance problem, a famous open question first posed by Paul Erdős in 1946. For nearly 80 years, mathematicians believed the best possible solutions looked roughly like square grids. An OpenAI model has now disproved that belief, discovering an entirely new family of constructions that performs better. This marks the first time AI has autonomously solved a prominent open problem central to a field of mathematics.
Daniel A. Saedi (Data... @TheRealDanSaedi
10K Followers 3K Following Cofounder Minerva | Former Systematic Equity Investor at Bridgewater
Matt Joseph @mattjoseph27
251 Followers 1K Following cofounder & CTO @TryMinervaAI, previously quant research @citadel
Antonio García Martí... @antoniogm
225K Followers 17K Following Director, @base growth. Founder @spindl_xyz (acq. @coinbase). Wrote bestseller 'Chaos Monkeys'. גם זה יעבור 🇺🇸🇪🇸
Roshun Patel @roshunpatel
16K Followers 4K Following head of credit, alternatives @ereborbank | formerly @genesistrading @hack_vc | local farmer | tungsten maxi | nothing is financial advice
Ross Trachtman @riscoedash
566 Followers 799 Following Jamming on what’s next | prev. Principal @BHDigitalAssets, TMT L/S | Proud @UCBerkeley 🐻 | Views are my own
Payas Parab @payasparab
706 Followers 2K Following Tech bro meme account | Ex-Child, Ex-virgin | My mom’s 2 under 30 | Late employee at Big 4 (first 500K) | 1x’er engineer, an NPC that will not pretend otherwise
Gandy @austinmichaelg
402 Followers 1K Following Building @SierraPlatform Frmr @fermatcommerce @Liveramp, CS @Cal. Interested in AI, SaaS, birding, and watches. Cover: Art Club 2000
Rob Masiello @rkmasiello
1K Followers 952 Following Engineer, founder, and investor - building @dealporthq
PUT YOUR (CAL) HAT ON @ToshingMyLupoi
2K Followers 4K Following Shadow Business Development @ Cal Athletics
Matt Turck @mattturck
141K Followers 2K Following VC at @FirstMarkCap. Host: MAD Podcast; Organizer: Data Driven NYC, Author: MAD Landscape.
Josiah Anyabuike @pawfectsupply17
0 Followers 54 Following Premium OpenClaw deployment kits. Paste one prompt. Your AI team is live.
Sir. Brian Keya @Harry_Adams97
167 Followers 2K Following 🚀 Klimstone | Construction, Structural & Consultancy Firm ✨Umoja Technologies | Driving Digital Innovation 🌐🔥
Autonomous🐻 @ShmerberParadox
2K Followers 6K Following Retweets ≠ endorsements | I’m probably just fucking with you lmao
Coastal GA Home Pros ... @CoastalGAPros
4 Followers 71 Following Go to home services business focusing on fixing what the big guys tear up and don't want to do. Oh and soon to be adding "Home Inspections" to our list.
craggers @cr3gs1
125 Followers 3K Following
Mr mohcine @mhbn205
53 Followers 790 Following
Rayen Mlayeh @MlRingo
0 Followers 48 Following
esdwace @esdwace
93K Followers 96K Following VDJ👁PRODUCER➔🎮←©CREATOR® #HexagonHQ @Kokaneofficial 👌/G\🤟 @DjASHBA @DJBonics @DJJS1 @DannyDiablo @mattox__ @Unimerce1 @sagesoulrich @RealK_Smyllz
Sammy @ToughSammy
403 Followers 1K Following Engineering the future of autonomous pharma factories. Dum Spiro Spero
grim789 @MagicAlucard
180 Followers 2K Following I post AI, Tech and science content, just whatever I find interesting. The problem is I have too many interests and not enough lifetimes. 🏰⏳️
antwon @antwon707415
0 Followers 15 Following
nishant @nkheterpal
296 Followers 1K Following phd candidate @umrobotics; cal eecs '18; now formal verification; then av simulation; he/him/his
Protege @withprotegeai
974 Followers 9 Following Data is the biggest bottleneck to AI's progress. We're here to unlock it.
Ritik Suman @ritik100x
4 Followers 648 Following Ai builder | Security Researcher | Open Source Builder.
smart Sow @smartsow15
9 Followers 112 Following
cammy & glitch @MikeTeamboise
29 Followers 2K Following softest chaos you’ll meet today 🌙 follow back always
xerxes @xerxes_4x
5 Followers 148 Following
Benjamin Cowenn. @benjaminncowe
283 Followers 51 Following Founder & CEO, ITC | Macro, commodities & digital assets | Liquidity, labor & risk cycles | PhD Engineering | Former NASA, Sandia National Labs
himanshu @himanshustwts
28K Followers 4K Following simulating world behaviour @physeraAI • pods @groundzero_twt • DMs open!
Zecheng Zhang @zechengzh
3K Followers 1K Following Founder @ https://t.co/2JWm2aq3T2 | YC Alumni | Founding Engineer @ https://t.co/5txAk2P7AW | Stanford CS MS | DeepSNAP and PyTorch Frame Co-creator
ekin @eking0x
4K Followers 2K Following defi thinkboi @OxfordLawFac // rip: eic @dlnewsinfo of @defillama
AI-Engineering.at @Engineerin67400
17 Followers 1K Following
Thom @ThomSwann
376 Followers 291 Following Building @HeyStreamGenie + @DiffractAgency. I live in the realm of HW and SW. Past me: @tastytrade, @wendys, @layeroneio, @streamlabs, @chipotletweets.
Janik Wing @JanikWing16554
0 Followers 26 Following
Siméon @Simeon_Cps
10K Followers 3K Following Building world-models for verified AI inference in London | former founder & CEO of SaferAI
latif horst @latifhorst
203 Followers 453 Following Helping GTM pros to automate the bullshit so they can focus on what actually matters | Rebel | Work smarter because working harder is a scam
RachinX2 @rmasculine31515
0 Followers 849 Following nobody, just a normal guy, hopefully what i write in here, can help you
camus @camus0711
99 Followers 1K Following
@makebamsales @makebaads
19 Followers 241 Following X Account Executive helping businesses & brands scale fast on the platform. I specialize in high-impact ads and turning your messages into a movement!
Engy Ziedan @engyziedan
229 Followers 665 Following Co-founder @withprotege.ai Applied health economist @IndianaUniv. Building data that unlocks AI capabilities.
Yixin Lin @yixin_lin_
2K Followers 7K Following something new. prev: embodied AI @GoogleDeepMind, FAIR/@AIatMeta, Google Brain.
Jacob Rosenfield @JacobRosenfield
27 Followers 293 Following
Pat @boonesquad13
75 Followers 818 Following
ELON PRIVATE CHAT ✪... @elonprivvchat
220 Followers 7K Following CEO and chief engineer of SpaceX🚀,СЕО and product architect of Tesla, Inc ⚡️🚘,Personal interactive account ⚡️
Ryan Patrick Hughes @ryanpathughes
894 Followers 7K Following PR Pro. Partner & Head of US Operations @MaltinPR. Posts are my own views.
Joseph A. Carlino @JosephACarlino
1K Followers 2K Following Bad luck to bankrolls, I flip the script Faith over fear, no hype just grind Learning loud, loving life. 📸 IG EVERYONEKNOWSJOE_
Maor Shapira @shapira99381
0 Followers 15 Following
Paul Graham @paulg
3.6M Followers 794 Following
@jason @Jason
1.4M Followers 7K Following Host: @twistartups @theallinpod @thisweeknai; I invest in 100 startups a year @launch & @founderuni [email protected] for life
Mike Solana @micsolana
396K Followers 1K Following billionaire media tycoon and former mayor of san francisco. disinformation researcher. cmo @foundersfund. editor-in-chief @piratewires 🏴‍☠️
Daniel A. Saedi (Data... @TheRealDanSaedi
10K Followers 3K Following Cofounder Minerva | Former Systematic Equity Investor at Bridgewater
Matt Joseph @mattjoseph27
251 Followers 1K Following cofounder & CTO @TryMinervaAI, previously quant research @citadel
Alex Cohen @anothercohen
230K Followers 1K Following Now: Co-founder @hellopatient | Previously: Led consumer and growth product @carbonhealth | Hobbies include getting fired constantly | Mostly satire
Aaron Levie @levie
2.9M Followers 814 Following ceo @box - your business lives in content. unleash it with AI
Joe Weisenthal @TheStalwart
443K Followers 7K Following One half of Bloomberg's Odd Lots Podcast. One quarter of Light Sweet Crude.
Antonio García Martí... @antoniogm
225K Followers 17K Following Director, @base growth. Founder @spindl_xyz (acq. @coinbase). Wrote bestseller 'Chaos Monkeys'. גם זה יעבור 🇺🇸🇪🇸
Lucy Guo @lucy_guo
84K Followers 919 Following Building https://t.co/NaZlVKaVoL | cofounder of @backendcapital @hf0residency @scale_ai | part time DJ
Roshun Patel @roshunpatel
16K Followers 4K Following head of credit, alternatives @ereborbank | formerly @genesistrading @hack_vc | local farmer | tungsten maxi | nothing is financial advice
Rando Norris @0xBobbyAxelrod
2K Followers 3K Following
Zoubin Ghahramani @ZoubinGhahrama1
35K Followers 710 Following VP Research, Google DeepMind, ex-head of Google Brain. Professor at University of Cambridge. Machine Learning Researcher. ex-Chief Scientist & VP of AI, Uber.
koray kavukcuoglu @koraykv
24K Followers 102 Following Chief AI Architect, Google. CTO, Google DeepMind
Karri Saarinen @karrisaarinen
89K Followers 1K Following ceo of @linear 🇫🇮🇺🇸 previously: @coinbase @airbnb, YC alumni
nishant @nkheterpal
296 Followers 1K Following phd candidate @umrobotics; cal eecs '18; now formal verification; then av simulation; he/him/his
DataLab Research @DataLabResearch
3 Followers 1 Following
shift @joinshiftX
3K Followers 4 Following join the shift Press, hiring, or collabs, please DM or email us at [email protected]
Lev Akabas @LevAkabas
15K Followers 854 Following NBA 🏀 & sports business 💵 data viz 📊 for @Sportico All opinions are my own
Harbor Framework @harborframework
1K Followers 4 Following
ACM Conference on AI ... @CAISconf
2K Followers 36 Following The inaugural ACM Conference on AI and Agentic Systems!
Yaron (Ron) Minsky @yminsky
22K Followers 367 Following Occasional OCaml programmer. Host of @signalsthreads. @[email protected] @yminsky.bsky.social https://t.co/kiUGRvWOO2
Timothy Gowers @wtgow... @wtgowers
57K Followers 187 Following Mathematician. Professeur titulaire de la chaire Combinatoire au Collège de France. Also fellow of Trinity College Cambridge.
Renegade Partners @RenegadePtnrs
983 Followers 358 Following We help founders turn their startups into companies.
François Chollet @fchollet
699K Followers 826 Following Co-founder @ndea. Co-founder @arcprize. Creator of Keras and ARC-AGI. Author of 'Deep Learning with Python'.
KnicksMuse @KnicksMuse
122K Followers 200 Following The #1 Knicks Page on Twitter | Reach Me at [email protected] | Follow my Instagram @KnicksMuse | DMs Always Open |
MATS Research @MATSprogram
4K Followers 136 Following MATS empowers researchers to advance AI alignment, transparency, and security
Claude @claudeai
1.5M Followers 2 Following Claude is an AI assistant built by @anthropicai to be safe, accurate, and secure. Talk to Claude on https://t.co/ZhTwG8d1e5 or download the app.
Financial Times @FT
6.8M Followers 1K Following Big stories and breaking news as they are published on https://t.co/EYmAcRLBHv. Register here to access free articles: https://t.co/NRg2hritkA
Yash Patil @ypatil125
9K Followers 606 Following Co-Founder, CEO @appliedcompute 🚂 prev: @OpenAI, @Stanford
Thomas G. Dietterich @tdietterich
62K Followers 651 Following University Distinguished Professor (Emeritus), Oregon State Univ.; Former President, AAAI; Currently Chair CS Section of ArXiv
arXiv.org @arxiv
49K Followers 184 Following News from https://t.co/enurGFxpcS, a free distribution service and an open archive for scholarly articles. For help with arXiv, see https://t.co/LcWuhM0BOl
Thomas Kipf @tkipf
29K Followers 1K Following Sr. Staff RS at @GoogleDeepMind. Gemini Omni Team. Priors: GNNs, Structured World Models, Neural Assets, Veo Ingredients/References, Veo Robotics
william @wgussml
15K Followers 796 Following Project Prometheus, prev cofounder @generalagents, OpenAI, helped build copilot & MineRL, creator of ell
Justus Mattern @MatternJustus
8K Followers 848 Following Co-Founder @ProximalHQ | prev. research @PrimeIntellect, @MPI_IS and built revideo
himanshu @himanshustwts
28K Followers 4K Following simulating world behaviour @physeraAI • pods @groundzero_twt • DMs open!
Zecheng Zhang @zechengzh
3K Followers 1K Following Founder @ https://t.co/2JWm2aq3T2 | YC Alumni | Founding Engineer @ https://t.co/5txAk2P7AW | Stanford CS MS | DeepSNAP and PyTorch Frame Co-creator
Subquadratic @subquadratic
20K Followers 1K Following AI lab leading the subquadratic LLM revolution.
Alexander Whedon @alex_whedon
25K Followers 60 Following Building better algorithms. Co-Founder at @subquadratic
The White House @WhiteHouse
4.8M Followers 6 Following Welcome to The Golden Age of America. 📱 Text USA to 45470 to receive alerts.
Gary Marcus @GaryMarcus
228K Followers 7K Following OG GenAI Skeptic; spoke at US Senate. Warned about hallucinations in 2001. Advocating world models & neurosymbolic AI ever since. Author, Marcus on AI & 6 books
NYPD NEWS @NYPDnews
813K Followers 451 Following The official X of the New York City Police Dept. Call 911 for emergencies, 311 for non-emergencies. Account not monitored 24/7. https://t.co/dbME9x7eL3
Maziyar PANAHI @MaziyarPanahi
18K Followers 332 Following Building @OpenMed_AI · 3,500+ open-source medical models · #1 on HuggingFace Daily Papers · Shipping OpenMed Agent today: Terminal-native AI for Healthcare
Ronak Malde @rronak_
10K Followers 514 Following Co-Founder of Trajectory @TrajectoryLabs prev @GoogleDeepmind, SWE-1 @windsurf | @stanford
Alex Laterre @AlexLaterre
2K Followers 781 Following Co-founder @IneffableLabs | prev. Head of Research @ Instadeep
Seb Johnson @SebJohnsonUK
12K Followers 722 Following Talking about UK and European Tech. Subscribe to my newsletter to keep up to date.
Ineffable Intelligenc... @IneffableLabs
8K Followers 0 Following Making first contact with superintelligence.
ICLR @iclr_conf
60K Followers 57 Following International Conference on Learning Representations #ICLR2027. SPC is @jacobandreas and GC is @BharathHarihar3
Kevin Weil 🇺🇸 @kevinweil
121K Followers 3K Following BoD @Cisco @nature_org LTC @USArmyReserve Ex CPO+Science @OpenAI, Pres @Planet, Head of Product @Instagram @Twitter ❤️ @elizabeth ultramarathons kids cats math
Benjamin Cowen @benjamincowen
1.2M Followers 1K Following Founder & CEO, ITC | Macro, commodities & digital assets | Liquidity, labor & risk cycles | PhD Engineering | Former NASA, Sandia National Labs
TK Kong @tkkong
12K Followers 594 Following Building something new. Previously @RampLabs, founder/CEO @VenueHQ (backed by Sequoia, acq by Ramp). Tennis fan. 🇰🇷
Phil Chen @philhchen
9K Followers 624 Following Building something new. Previously research @openai @GoogleDeepMind @scale_AI @Stanford
Aparna Dhinakaran @aparnadhinak
10K Followers 2K Following founder @arizeai. I post about agents and evals. previously: @uber, @ycombinator alum