Alex Havrilla @Dahoas1

Georgia Tech ML Researcher studying neural network learning theory and LLMs for mathematical reasoning. Intern at FAIR, MSFT Research. Co-founder of CarperAI.
dahoas.github.io
Joined August 2021

Tweets: 150
Followers: 1K
Following: 503
Likes: 958

Alex Havrilla @Dahoas1

4 days ago

In my humble opinion the recent Stream of Search paper (arxiv.org/abs/2404.03683) is truly outstanding. Everyone should give it a thorough read.

8 25 169 22K 230

@Thom_Wolf The 3 key elements of a good dataset: 1. quality 2. diversity 3. quantity You can only easily measure the last one but the performance is a sensitive function of all three. Super interesting topic ty for #longread :)!

33 79 1K 132K 360

Tomek Korbak @tomekkorbak

6 days ago

I've finally uploaded the thesis on arXiv: arxiv.org/abs/2404.12150 It ties together a bunch of papers exploring some alternatives to RL for finetuning LMs, including pretraining with human preferences and minimizing KL divergences from pre-defined target distributions.

David Krueger @DavidSKrueger

5 months ago

0 6 52 41K 43

7 39 263 35K 202

Sharath Raparthy @sharathraparthy

a week ago

I am super excited to share our Llama3 preview models (8B and 70B). I am proud to have been a part of this amazing effort over the past 8 months. We still have some super cool stuff coming up in the coming months... until then, enjoy playing with these preview models…

2 4 63 7K 4

Alex Havrilla @Dahoas1

a week ago

Had a great time during our discussion, thanks again for having me!

The TWIML AI Podcast @twimlai

2 weeks ago

Had a great time during our discussion, thanks again for having me!

0 4 18 8K 8

1 3 22 2K 4

Reshinth @reshinth_

3 weeks ago

How to define Diversity in the context of CodeLMs and Programming Languages? 1. Diversity is positively correlated with Performance in solving a problem. 2. Shortcomings of diversity in small codeLMs. 3. Code Embedding models don't capture semantics. reshinthadithyan.github.io/blog/2023/code…

1 10 25 4K 11

Costa Huang @vwxyzjn

a month ago

Happy to share our work on reproducing RLHF scaling behaviors in @OpenAI's work in summarizing from feedback. We built an RLHF pipeline from scratch and enumerated over 20+ implementation details 🚀 Fun collab with @mnoukhov, @arianTBD, @krasul, @weixunwang, and @_lewtun 📜…

6 63 323 42K 240

Wojciech Galuba @wgaluba

2 months ago

Super proud to share what we've been cooking with the amazing team at @cohere - a nimble model that's efficient, retrieves docs and gives citations, knows how to use tools and supports 10 languages. Ready to use in your business! Have a chat with it at: coral.cohere.com

Aidan Gomez @aidangomez

2 months ago

31 191 1K 320K 543

1 3 22 7K 2

Jim Fan @DrJimFan

229K Followers 3K Following @NVIDIA Sr. Research Manager & Lead of Embodied AI (GEAR Lab). Creating foundation models for Humanoid Robots & Gaming. @Stanford Ph.D. @OpenAI's first intern.

SynthLabs @synth_labs

12K Followers 43 Following AI Aligned with Your Vision. We’re doing cutting edge research for transparent, auditable AI alignment.

Louis Castricato @lcastricato

3K Followers 477 Following Math @uwaterloo, RLHF @BrownCSDept, Goosefluencer. x-RS @aieleuther, x-Head of LLMs @stabilityai, x-lead @CarperAI. co-founder @synth_labs. We're hiring.

PhD at 19 |
Founder and CEO at @MedARC_AI |
Research Director at @StabilityAI |
@kaggle Notebooks GM |
Biomed. engineer @ 14 |
TEDx talk➡https://t.co/xPxwKTq6Qb

Tanishq Mathew Abraha.. @iScienceLuvr

Stella Biderman @BlancheMinerva

15K Followers 748 Following Open source LLMs and interpretability research at @BoozAllen and @AiEleuther. My employers disown my tweets. She/her

Sebastian Raschka @rasbt

267K Followers 906 Following Machine learning & AI researcher writing at https://t.co/A0tXWzG1p5. LLM research engineer @LightningAI. Previously stats professor at UW-Madison.

Nathan Benaich @nathanbenaich

51K Followers 32K Following solo member of investment staff @airstreet, brewing ambition @airstreetcafe, next token predictor @airstreetpress

EleutherAI @AiEleuther

19K Followers 76 Following A non-profit research lab focused on interpretability, alignment, and ethics of artificial intelligence. Creators of GPT-J, GPT-NeoX, and VQGAN-CLIP

Delip Rao e/σ @deliprao

46K Followers 5K Following Busy inventing the shipwreck. @Penn. Past: @johnshopkins, @UCSC, @Amazon, @Twitter ||Art: #NLProc, Vision, Speech, #DeepLearning || Life: 道元, improv, running 🌈

Robert Scoble @Scobleizer

504K Followers 68K Following Follow me on my new podcast with AI startups, Unaligned. Tech industry color commentator since 1993. Author/Blogger. Former strategist @Microsoft.

Jeremy Howard @jeremyphoward

222K Followers 5K Following 🇦🇺 Co-founder: @AnswerDotAI & @FastDotAI ; Hon Professor: @UQSchoolITEE ; Digital Fellow: @Stanford

Sharif Shameem @sharifshameem

53K Followers 3K Following founder @LexicaArt • in pursuit of good explanations

Nathan Lambert @natolambert

25K Followers 689 Following Figuring out AI @allen_ai, "rl boi" DM me papers. Writes @interconnectsai, talks @retortai Has phd and some credentials

Haofu Liao @LiaoHaofu

2 Followers 73 Following

Bayan Sporle @BSporle54687

83 Followers 5K Following

Elana Unzicker @ElaUnzicke

91 Followers 5K Following

Al Mamun @al_mamun_sardar

276 Followers 3K Following Looking for NLP/LLM related PhD positions | Research Assistant (NLP) at Jahangirnagar University | MSc (CS)

Eyüp Pastırmacı @eyuppastirmaci

15 Followers 128 Following

Parker Sigona @ParkerSigo31950

12 Followers 2K Following 🤎Parker ~ My Free Content👇

Harriett Becky @HarrieBeck

41 Followers 5K Following

Mojtaba Vàlipour @ValipourMojtaba

391 Followers 3K Following CS PhD at @UWaterloo, Founding Engineer at Coastal Carbon and part time Researcher at @huawei Noah’s Arc Lab, prev enjoyed my time at @oraclelabs, & @CVC_UAB

Harvie Zhang @harvie_zhang

128 Followers 4K Following HyperEvol AI Lab

Janhavee Shinde @SJanhavee

56 Followers 2K Following

Czentye Levente @lordlewo

23 Followers 3K Following

阮小胖 @Pang1231231

0 Followers 7 Following

Sam Kuhn @SamKuhnDev

214 Followers 4K Following Full Stack 3D Web Developer. WebGPU, WebGL/XR, Serverless, Cloud

Varun Gangal @VarunGangal

724 Followers 5K Following Research Scientist @asapp; #NLProc PhD from Carnegie Mellon LTI @LTIatCMU (2016-22) RTs ≠ endorsements. Views personal, not of employers/institutions.

Krishna Srinivasan @krishna2

471 Followers 2K Following I code stuff. I build models. I work in NLP/DL at Google Research.

头雁 @alacheng

21K Followers 3K Following BTC & Web3 & AI & ZK & FHE 研究分享

alonet @alonet

94 Followers 1K Following A dreamer. Life is getting full of Prompts.#cloudnative #GenerativeAI #modelontheEdge #petrolhead #gadgets freak #smart device #SaaS Architect #xR #immersiveExp

Azal Ahmad Khan @azalakhan

22 Followers 466 Following Undergraduate @IITGuwahati

krishna soham @iamkrishnasoham

109 Followers 932 Following i compute therefore i am

DuttonΦ @duttonphi

105 Followers 478 Following ..aagen (double (2x) agent).. ..previously Wołfram|Ałpha.. ..baeksu.ai.. ..jajangmyeon all day/all night..

ZKP @ZKPxyz

557 Followers 5K Following AIxWeb3 hacker and Angel Investor. Web1 and Web2 veteran. Currently exploring the agentic efficient frontier.

Ashutosh Sharma @ashutoshuiuc

33 Followers 813 Following MSCS @IllinoisCS BTech @iitbombay

S. Iqbal @S_Iqbal90

26 Followers 1K Following

Sidney Liquori @LiquoriSidn

50 Followers 5K Following

Fred Bliss @fblissjr

432 Followers 1K Following 🔎 actively pondering the next big adventure, all things Data & AI 🤝 Founder @ Aptitive (acquired Q4 2021 @ ~55 FTEs) 💡 AI/ML/LLM enthusiast, building in open

Anton Volnuhin @antonme

2K Followers 3K Following PM who loves to write a bit of code, into 3d printing and some other stuff. @yandex alumni (12 years, head of blog search service, tech marketing, taxi)

Blake Camp @blake_camp_1

406 Followers 1K Following AI (PhD), markets, aviation, brains, futbol, pizza, wine, books, films, family, friends, life

Andrew Dai @andrewdai99

191 Followers 253 Following Researcher @Aleph__Alpha, a Kerryman, LLMs x open-endedness; prev @tcddublin, @FormulaTrinity Autonomous | 🇮🇪

tom @0xluciusv

149 Followers 454 Following i like cuda kernels, c++, rust, go, and nvim. (cons e/λx.x 🌎/acc) wrong about a lot of things but trying to learn

Josh @JoshPurtell

730 Followers 2K Following ML Researcher. Ex: Cyber microexit, Yale Math. Hiring in ML Ars longa

Nikita @nikitavoloboev

4K Followers 7K Following Make @LearnAnything_ Learn in public: https://t.co/GbFvuErkYn macOS course: https://t.co/JdbJWru6zG https://t.co/94R8ER7K2h https://t.co/ROkqhyhpEK

Markus Rauhalahti, Ph.. @MRauhalahti

844 Followers 5K Following Independent researcher: molecular design/nanotech, human-AI collab, informetrics, DIY-scihw. Prev compchem/biophys/info phd&postdoc @helsinkiuni

Ronnie Cotham @RonnieCoth83122

17 Followers 3K Following

Jonathan Balloch @JonathanBalloch

343 Followers 923 Following I mostly tweet about #ai, #robots, #science, and the @packers... Robotics PhD student @GeorgiaTech studying #RL and #AI My thoughts and opinions are my own

Shannon Turansky @ShanTurans

80 Followers 5K Following

b @tomorrowstae

64 Followers 990 Following --

Kanishk Gandhi @gandhikanishk

921 Followers 692 Following Phd @Stanford CS; w/ Noah Goodman, Dorsa Sadigh | Prev: @LakeBrenden @NYUDataScience, @IITKanpur, @Path_AI

Waypoint AI @waypointai

111 Followers 4K Following Automate bug triage & root cause analysis to solve bugs quickly & get back to building! -- Book an appointment: 📅https://t.co/lgRrvWE0gS

Aaditya ; @Aaditya26082004

524 Followers 7K Following CS'26 • Machine Learning • Open-Source • Web Dev. • Algorithms • Jai Shree Krishna 🦚🪈

..still, who is David.. @jimat944

507 Followers 979 Following curious how will we define the public good i.e., 👉 https://t.co/sE02VgqC1t #righttotheurbancity ..don't die in a crosswalk. #imagineeringatlanta #twitterdeveloper

Hao Tang @tanghao95

53 Followers 517 Following CS PhD student @ Cornell

CardGamesFanatic @CardmeisterGuy

2K Followers 5K Following Guy interested in transit and politics. Tends to talk too much.

ricochicomico (stop/L.. @ricochicomico1

689 Followers 8K Following It is that important.

Briony Kitanik @kita_bri

38 Followers 5K Following

Yofo Diame @YDiame87519

329 Followers 4K Following

Alex Murphy @Alxmrphi

491 Followers 2K Following 🙋‍♂️= NeuroAI Researcher. Postdoc @ Amii / UAlberta studying language, vision & (and in) DNNs and brains. PhD & ex-Google Brain intern (🇮🇪 & 🇬🇧)

Fasil Muhammad @pvfasil

13 Followers 56 Following

Amanda Rendel @rend_aman

36 Followers 5K Following

Martin Andrews @mdda123

601 Followers 1K Following AI Research / Founder @ Red Dragon AI. Co-organiser of Machine Learning Singapore MeetUp. @GoogleDevExpert (ML). Fixed Income quant in NYC during AI winter

Andrej Karpathy @karpathy

978K Followers 904 Following 🧑‍🍳. Previously Director of AI @ Tesla, founding team @ OpenAI, CS231n/PhD @ Stanford. I like to train large deep neural nets 🧠🤖💥

Sam Altman @sama

2.8M Followers 891 Following AI is cool i guess

hardmaru @hardmaru

285K Followers 1K Following Building Collective Intelligence @SakanaAILabs 🧠

Jim Fan @DrJimFan

229K Followers 3K Following @NVIDIA Sr. Research Manager & Lead of Embodied AI (GEAR Lab). Creating foundation models for Humanoid Robots & Gaming. @Stanford Ph.D. @OpenAI's first intern.

Yann LeCun @ylecun

711K Followers 718 Following Professor at NYU. Chief AI Scientist at Meta. Researcher in AI, Machine Learning, Robotics, etc. ACM Turing Award Laureate.

Greg Brockman @gdb

667K Followers 51 Following President & Co-Founder @OpenAI

SynthLabs @synth_labs

12K Followers 43 Following AI Aligned with Your Vision. We’re doing cutting edge research for transparent, auditable AI alignment.

Stability AI @StabilityAI

190K Followers 31 Following We are building the foundation to activate humanity's potential.

François Chollet @fchollet

469K Followers 770 Following Deep learning @google. Creator of Keras. Author of 'Deep Learning with Python'. Opinions are my own.

Google DeepMind @GoogleDeepMind

943K Followers 275 Following We’re a team of scientists, engineers, ethicists and more, committed to solving intelligence, to advance science and benefit humanity.

Louis Castricato @lcastricato

3K Followers 477 Following Math @uwaterloo, RLHF @BrownCSDept, Goosefluencer. x-RS @aieleuther, x-Head of LLMs @stabilityai, x-lead @CarperAI. co-founder @synth_labs. We're hiring.

Aran Komatsuzaki @arankomatsuzaki

95K Followers 78 Following @TeraflopAI

Tanishq Mathew Abraha.. @iScienceLuvr

John Carmack @ID_AA_Carmack

1.1M Followers 241 Following AGI at Keen Technologies, former CTO Oculus VR, Founder Id Software and Armadillo Aerospace

Stella Biderman @BlancheMinerva

15K Followers 748 Following Open source LLMs and interpretability research at @BoozAllen and @AiEleuther. My employers disown my tweets. She/her

Bojan Tunguz @tunguz

187K Followers 8K Following Machine Learning ex Nvidia. Kaggle Quadruple Grandmaster. Data Scientist. Physicist. Catholic. Husband. Father. Stanford Alum. e/xgb. XGBoost.eth. AMDG.

Sebastian Raschka @rasbt

267K Followers 906 Following Machine learning & AI researcher writing at https://t.co/A0tXWzG1p5. LLM research engineer @LightningAI. Previously stats professor at UW-Madison.

Eliezer Yudkowsky ⏹.. @ESYudkowsky

175K Followers 89 Following The original AI alignment person. Missing punctuation at the end of a sentence means it's humor. If you're not sure, it's also very likely humor.

Eric Jang @ericjang11

69K Followers 3K Following physical AGI at 1X. Author of "AI is Good for You" https://t.co/eFg4WXhg0p

OpenAI @OpenAI

3.4M Followers 0 Following OpenAI’s mission is to ensure that artificial general intelligence benefits all of humanity. We’re hiring: https://t.co/dJGr6LgzPA

Andrew Dai @andrewdai99

191 Followers 253 Following Researcher @Aleph__Alpha, a Kerryman, LLMs x open-endedness; prev @tcddublin, @FormulaTrinity Autonomous | 🇮🇪

Teknium (e/λ) @Teknium1

29K Followers 3K Following Cofounder @NousResearch, prev @StabilityAI Github: https://t.co/LZwHTUFwPq HuggingFace: https://t.co/sN2FFU8PVE Support me on Github Sponsors

Ronen Eldan @EldanRonen

2K Followers 115 Following Previously doing maths at @WeizmannScience, currently AI researcher at @MSFTResearch. Pretty good at loading a dishwasher.

Yue Wu @yw_yuewu

122 Followers 129 Following PhD Student @mldcmu

Sebastian Borgeaud @borgeaud_s

994 Followers 259 Following Research Engineer at DeepMind with a focus on Large Language Models and large scale Deep Learning

Qingfei You @qingfeiyou

30 Followers 199 Following

Nathan Cooper @ncooper57

721 Followers 650 Following The world can be ugly and cruel to the most innocent. Consider donating to help children suffering from one of the worst things: https://t.co/PYZWj8o4OW

Alon Albalak @AlbalakAlon

885 Followers 464 Following CS PhD candidate at @ucsbNLP. Research: Data-centric AI, Efficiency in ML, NLP.

Song Mei @Song__Mei

1K Followers 550 Following Assistant Professor at UC Berkeley, Department of Statistics and EECS. Researcher working on foundations of generative AI.

Brian Hie @BrianHie

5K Followers 402 Following Assistant professor @StanfordEng ChemE and @StanfordData, Innovation Investigator @arcinstitute | Machine learning for biology

Aubrey de Grey @aubreydegrey

60K Followers 11 Following I'm spearheading the global crusade to defeat aging. President and CSO of https://t.co/QxFW8fuCt2

Shunyu Yao @ShunyuYao12

7K Followers 857 Following Language agents (ReAct, Reflexion, Tree of Thoughts) for digital automation (WebShop, SWE-bench, SWE-agent)

Aleksandr Sviridov @_sviridov_

993 Followers 106 Following Longevity activist and software engineering lead. CTO at @GeroSense. Co-Founder of @SayForeverOrg.

Tom Goldstein @tomgoldsteincs

23K Followers 2K Following Professor at UMD. AI security & privacy, algorithmic bias, foundations of ML. Follow me for commentary on state-of-the-art AI.

Tom Zahavy @TZahavy

2K Followers 318 Following Building agents that discover knowledge and get better at doing so over time. Staff research scientist @GoogleDeepMind

Taelin @VictorTaelin

17K Followers 903 Following Founder of @HigherOrderComp Building the massively parallel future of computing Reaching AGI to cure all diseases and suffering is all that matters

Ambrosia Path - Longe.. @ambrosiapath

227 Followers 163 Following Reporting the latest longevity news and interviews Subscribe to our weekly newsletter to get the latest longevity news in your inbox. Join 2,354+ subscribers

Sakana AI @SakanaAILabs

19K Followers 0 Following We are a Tokyo-based R&D company on a quest to create a new kind of foundational AI model based on nature-inspired intelligence. https://t.co/1q07mb3TzE

Robin Rombach @robrombach

6K Followers 397 Following Generative enthusiast and long-term PhD Student @LMU_Muenchen. Author of VQGAN, Latent Diffusion, Stable Diffusion.

Tianlin @linylinx

6K Followers 579 Following ML Tech Lead @sourceful ⏩: @illumina AI Lab @qualcomm AI, PhD @LSEStatistics 📜 generative models 🤪 joking not joking

Allan Jie @allanjienlp

58 Followers 327 Following

alewkowycz @alewkowycz

3K Followers 173 Following Member of Technical Staff at @inflectionAI. Former Research Scientist @Google. In a previous life, I did String Theory. Language models and Conversational AI.

Jacob Austin @jacobaustin132

3K Followers 797 Following @Google @DeepMind researcher. AI for math and science. Coding. Gemini. I also play piano. NYC. Opinions my own

Eric Zelikman @ericzelikman

5K Followers 1K Following studying why @xAI // was phd-ing @stanford

Robert Yang @GuangyuRobert

3K Followers 185 Following Co-founder, CEO at @Altera_AL, Computational Neuroscientist, former Assistant Professor @mitbrainandcog & @MITEECS

Alex Gu @minimario1729

2K Followers 2K Following phd @MIT_CSAIL, llm for math and code. intern @MetaAI and analyst @pillar_vc. prev @BigCodeProject, @MITIBMLab, @JaneStreetGroup, @PonyAI_tech

Patrick Collison @patrickc

457K Followers 29 Following @Stripe CEO, @ArcInstitute cofounder.

Arc Institute @arcinstitute

22K Followers 24 Following A new scientific institution for curiosity-driven biomedical science and technology.

Wojciech Galuba @wgaluba

490 Followers 1K Following Head of Data & Evals @Cohere | prev: Research Eng Lead @MetaAI | founded @Meta’s A/B testing platform and the AI annotation platform | @ICepfl alumnus

Cognition @cognition_labs

123K Followers 19 Following Makers of Devin, the first AI software engineer. We are an applied AI lab focused on reasoning, and code is just the beginning. Join us: https://t.co/tpfZwEwGiq

Lachy Groom @lachygroom

23K Followers 774 Following it's pronounced locky. lachygroom@gmail.com. π.com

Ramon Dario Iglesias .. @RamonDarioIT

583 Followers 609 Following Founder of @ClementineInc

Albert Gu @_albertgu

9K Followers 90 Following assistant prof @mldcmu. chief scientist @cartesia_ai. leading the ssm revolution.

Jenny Zhang @jennyzhangzt

752 Followers 430 Following PhD student @UBC • Previously at @ASTARsg • Undergrad @imperialcollege • Reinforcement Learning, Open-endedness, AI-GAs

Cong Lu @cong_ml

635 Followers 864 Following Postdoctoral Research Fellow @UBC_CS in open-endedness, generative models, and deep RL. Prev: PhD @UniofOxford, Research Intern @Waymo, @MSFTResearch!

Lior⚡ @AlphaSignalAI

84K Followers 895 Following Covering the latest in AI R&D • ML Engineer • Ex-Mila researcher • MIT Lecturer • Building AlphaSignal, a technical newsletter read by 180,000+ ML experts.

Rumen Dangovski @dangovski

205 Followers 968 Following Applied Research @GoogleDeepMind

Morph @morph_labs

4K Followers 1 Following Building the future of software, for everyone.

Yunha Hwang @Micro_Yunha

1K Followers 1K Following Building genomic intelligence @tatta_bio

Tatta Bio @tatta_bio

286 Followers 23 Following Building genomic intelligence

Alice Ting @aliceyting

14K Followers 179 Following Molecular designer

nathan lile @NathanThinks

2K Followers 883 Following ceo/cofounder @ https://t.co/bDd3J4Lmzf (we're hiring!) #GenerativeAI recurrent rabbit hole victim. swims in data lakes & pools. nothing great is easy.

Julia Bauman @JuliaBauman2

8K Followers 456 Following PhD student at @Stanford Genetics in @LarsMSteinmetz lab | Prev @broadinstitute | Explaining cool biotech to the world here and @ 60_SecondScience on TikTok

Yuandong Tian @tydsh

16K Followers 801 Following Research Scientist and Senior Manager in Meta AI (FAIR). AI-guided Optimization and Representation Learning. Novelist in spare time. PhD in @CMU_Robotics.

Anton Bakhtin @ SF @anton_bakhtin

2K Followers 127 Following MTS at @AnthropicAI, Ex @MetaAI, Ex @Google Three logicians walk into a bar ...

Anshul Kundaje (anshu.. @anshulkundaje

22K Followers 2K Following Genomics, Machine Learning, Statistics, Big Data and Football (Soccer, GGMU). Post: @anshulkundaje, Threads: anshulkundaje

Michael Poli @MichaelPoli6

2K Followers 279 Following @Stanford @StanfordAILab, Staff Scientist @togethercompute, prev @MSFTResearch. DL, numerics and systems. I like to architect big neural nets that run fast.

Eric Nguyen @exnx

2K Followers 324 Following PhD in BioEngineering & AI @stanford @HazyResearch @StanfordAILab @arcinstitute

Bo Wang @BoWang87

8K Followers 2K Following Assistant Prof. CS,LMP @UofT; CIFAR AI Chair @VectorInst; Chief AI Scientist, @UHN; former PHD, CS @Stanford; opinions my own. #AI #healthcare #combio

Paul Scotti @humanscotti

1K Followers 937 Following Head of Neuroimaging @StabilityAI, leading the @MedARC_AI Neuroimaging & AI Lab | Collaborating w/ @PrincetonNeuro @ptoncompmemlab

Nora Belrose @norabelrose

2 days ago

Cool stuff, we found a similar result back in December arxiv.org/abs/2312.01037. Kind of upset they didn't cite/link to us tbh.

Anthropic @AnthropicAI

5 days ago

New Anthropic research: we find that probing, a simple interpretability technique, can detect when backdoored "sleeper agent" models are about to behave dangerously, after they pretend to be safe in training. Check out our first alignment blog post here: anthropic.com/research/probe…

30 164 956 250K 435

6 10 172 45K 88

Albert Jiang @AlbertQJiang

3 days ago

Happening now!

Albert Jiang @AlbertQJiang

5 days ago

events.nationalacademies.org/42507_04-2024_… Giving a talk on evaluating large language models for mathematics through interactions (work co-led with @katie_m_collins) on Thursday. In the same session is the one and only @ChrSzegedy!

0 3 35 5K 6

0 0 15 1K 2

Kevin Rojas @KevRojas1499

3 days ago

@alec_helbling This can be understood very nicely from the sampling perspective! The score can be written out as an expectation of the conditional distribution. When time is large this conditioning doesn't change the original distribution!

0 0 9 282 0

Yao Fu @Francis_YAO_

4 days ago

Cannot agree more. My intuition is that FFN is for storing knowledge (this is why most knowledge editing is done on FFNs) and Attention is for implementing algorithms (this is why most mechanistic interpretability, e.g., induction heads, is done on Attn). Additionally, it seems that…

Yi Tay @YiTayML

4 days ago

not true, especially for language. if you trained a large & deep MLP language model with no self-attention, no matter how much data you feed it you'll still be lagging behind a transformer (with much less data). will it get to the same point? i don't think so. your tokens…

32 63 654 194K 299

0 27 141 21K 97

Rishabh Agarwal @agarwl_

6 days ago

@Dahoas1 @stefan_fee This is likely one reason why model-generated rationales can be better than human-written rationales.

0 0 1 52 0

Maksym Andriushchenko 🇺🇦 @maksym_andr

5 days ago

Super excited to share that I successfully defended my PhD thesis "Understanding Generalization and Robustness in Modern Deep Learning" today 👨‍🎓 A huge thanks to the thesis examiners @SebastienBubeck, @zicokolter, and @KrzakalaF, jury president Rachid Guerraoui, and, of course,…

61 12 428 26K 104

Andrej Karpathy @karpathy

5 days ago

33 79 1K 132K 360

Yue Wu @yw_yuewu

6 days ago

Introducing AgentKit. AgentKit offers a unified framework for explicitly constructing a complex human "thought process" from simple natural language prompts. The user puts together chains of nodes, like stacking LEGO pieces. (1/3) github.com/Holmeswww/Agen…

1 10 21 3K 13

Guan-Horng Liu @guanhorng_liu

5 days ago

I2SB (i2sb.github.io) + (nearly optimal) flow matching yields 1000x speed-up compared to standard denoising diffusion for generating highly accurate transition states 🧑‍🔬⚗️🧪 Check out our new preprint👇 arxiv.org/abs/2404.13430 Very fun collab w/ @chenru_duan @YuanqiD!

Chenru Duan @chenru_duan

5 days ago

New paper alert: React-OT: Optimal Transport for Generating Transition State in Chemical Reactions (arxiv.org/abs/2404.13430). React-OT formulates TS search as a transport problem, approaching chemical accuracy while taking only 0.5 seconds in inference on a single GPU. #compchem

7 24 119 21K 50

0 5 30 3K 3

Eric Zelikman @ericzelikman

5 days ago

Among the coolest projects I helped with at Stanford. The key idea is very simple: a pragmatic response in one context is something you'd rarely say in other contexts. This basic principle lets LMs teach themselves to generally follow constitutions but has many cool implications

Philipp Fränken @jphilippfranken

6 days ago

Constitutional AI showed LMs can learn to follow constitutions by labeling their own outputs. But why can't we just tell a base model the principles of desired behavior and rely on it to act appropriately? Introducing SAMI: Self-Supervised Alignment with Mutual Information!

3 31 145 59K 139

1 7 64 12K 35

Jonathan Balloch @JonathanBalloch

6 days ago

updated github.com/balloch/awesom… finally. let me know if I am missing anything obvious/exciting

0 0 7 113 0

Shuyan Zhou @shuyanzhxyc

6 days ago

Join us at the fun LLM agent workshop at CMU!

Frank Xu @frankxu2004

6 days ago

On May 2-3, we're going to have a big event in Pittsburgh about LLM Agents. We have invited talks from great speakers inside and outside CMU, student research presentations and posters, tutorials and discussions! Come join us at CMU campus, and register at cmu-agent-workshop.github.io

1 17 108 37K 27

0 1 19 3K 3

Aviral Kumar @aviral_kumar2

6 days ago

Many LLM fine-tuning methods. Unclear what you should use & why? In our new paper, we did an extensive study of on-policy RL, supervised & offline contrastive methods (DPO, IPO) to answer this... 🧵⬇️ On-policy > offline, mode-seeking > mode-covering understanding-rlhf.github.io

3 65 269 32K 246

Dieuwke Hupkes @_dieuwke_

6 days ago

Can LLMs acquire meaning/semantics from just text? Some think it is a priori not possible, I personally think it's a super interesting philosophical question which needs further investigation! Thoughts? arxiv.org/abs/2404.12145

Xenia Ohmer @xenia_ohmer

6 days ago

📜 New preprint! Equipped with our multisense consistency method, we dive deep into an exploration of the semantic understanding of #LLMs. @eliabruni & @_dieuwke_ @metaai #NLProc [1/7]🧵 arxiv.org/abs/2404.12145

2 4 31 7K 17

3 5 52 5K 22

Pengfei Liu @stefan_fee

6 days ago

Crazy finding!!!!! -> "Without introducing any additional data or advanced training techniques, and merely by reformatting the response, LLaMA-2-13B’s mathematical reasoning ability on GSM8K can be improved from 46.77% to 56.63% in accuracy"

Run-Ze Fan @Vfrz525_

a week ago

Been diving into some papers on data synthesis lately, especially those about enhancing math reasoning. Most of them seem to miss our work on 'Reformatted Alignment' (arxiv.org/abs/2402.12219)—another approach to boosting data for math reasoning.

1 7 41 27K 26

3 25 131 23K 82

Tomek Korbak @tomekkorbak

6 days ago

David Krueger @DavidSKrueger

5 months ago

I was very impressed with @tomekkorbak's thesis! Some really nice insights into LLM alignment: 1) RL is not the way --> distribution matching lets us target constraints like "generate as many of these as of those" 2) fine-tuning is not the way --> PHF aligns during pre-training

0 6 52 41K 43

7 39 263 35K 202

Zhangir Azerbayev @zhangir_azerbay

6 days ago

@moinnadeem I used to study mathematics. Sometimes I would not know how to prove something, so I'd stare at the wall and think until I found a proof. I don't think the visual input of the wall was essential to this process.

1 0 27 743 0

Bharath Ramsundar @rbhar90

a week ago

A lot of LLM benchmarks don't properly test out of distribution behavior. As uses of LLMs for science increase, we need benchmarks that actually check generalization beyond training data. My experience so far is that LLMs are pretty weak outside training distribution, but can have…

9 7 80 21K 32

Banghua Zhu @BanghuaZ

7 days ago

Very excited about the release of arena hard, the main benchmark we looked at when selecting the checkpoints for Starling model. It focuses on a subset of very hard prompts from chatbot arena.

lmsys.org @lmsysorg

7 days ago

Introducing Arena-Hard – a pipeline to build our next generation benchmarks with live Arena data. Highlights: - Significantly better separability than MT-bench (22.6% -> 87.4%) - Highest agreement to Chatbot Arena ranking (89.1%) - Fast & cheap to run ($25) - Frequent update…