Stella Biderman @BlancheMinerva

Open source LLMs and interpretability research at @BoozAllen and @AiEleuther. My employers disown my tweets. She/her stellabiderman.com Joined May 2019

12K Tweets 15K Followers 749 Following 11K Likes

Edward Raff @EdwardRaffML

4 days ago

I’m presenting at @aistats_conf today, poster 133. Check out how we can mix neuro-symbolic methods with state space models for malware, and the fun things we learn! @rea1mma @BlancheMinerva @oatesbag @BoozAllen @umbccsee arxiv.org/abs/2403.17978

Stella Biderman @BlancheMinerva

a week ago

TIL: exactly one question in ARC has five choices. The rest have four.

6 0 61 12K 7

Hailey Schoelkopf @haileysch__

2 weeks ago

@tamaybes but current models don’t allocate parameters to rotary embs! this means the Chinchilla D=20*N is skewed already for the actual param counts of most models, even if it held across datasets! If we disregarded the pos. encoding params the coefficients would change

2 2 56 3K 9

Hailey Schoelkopf @haileysch__

2 weeks ago

@tamaybes a super-fun arcane historical detail: Gopher (and by extension Chinchilla) use Transformer-XL style position encodings. This means they spend 20B params (Gopher) and 5B params (Chinchilla) on just rel. position encoding!

1 6 79 7K 18
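
A rough sketch of the arithmetic behind the two replies above, assuming the D = 20*N rule of thumb quoted in the thread and the position-encoding figures given there (20B for Gopher, 5B for Chinchilla). The nominal sizes of 280B and 70B parameters are the commonly quoted totals and are an assumption here, as are the function names; this is an illustration, not the Chinchilla authors' fitting procedure.

```python
# Hypothetical illustration: names and nominal model sizes are assumptions.
TOKENS_PER_PARAM = 20.0  # the "D = 20 * N" rule of thumb quoted in the thread


def optimal_tokens(n_params: float, ratio: float = TOKENS_PER_PARAM) -> float:
    """Training tokens D suggested by D = ratio * N."""
    return ratio * n_params


def implied_ratio(n_params: float, pos_params: float,
                  ratio: float = TOKENS_PER_PARAM) -> float:
    """Tokens per parameter when D is held fixed but the
    position-encoding parameters are excluded from N."""
    tokens = optimal_tokens(n_params, ratio)
    return tokens / (n_params - pos_params)


# (nominal total params, params spent on relative position encoding)
models = {
    "Gopher": (280e9, 20e9),      # 280B total is an assumed nominal size
    "Chinchilla": (70e9, 5e9),    # 70B total is an assumed nominal size
}

for name, (n, pos) in models.items():
    print(f"{name}: D = {optimal_tokens(n):.3g} tokens, "
          f"tokens per non-positional param = {implied_ratio(n, pos):.2f}")
```

Under these assumptions both models land at roughly 21.5 tokens per non-positional parameter rather than 20, which is the kind of shift in coefficient the reply points to.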

near @nearcyan

2 weeks ago

the best TTS available from 2020-2021 was done by a single unemployed guy who supported every single my little pony voice and literally nothing else
the best TTS from 2022-2024 was also done by a single (different) unemployed guy w a custom-built 6x3090 rig in his basement (1/2)

25 43 851 185K 418

Edward Raff @EdwardRaffML

2 weeks ago

#HowGPTWorks, book w/ @BlancheMinerva @drewfarris is already up to #3 on the best sellers! We are removing all the mystery behind WTF a language model is, how they work, but in an accessible way for people without any AI/ML training. @ManningBooks manning.com/books/how-gpt-…

1 4 16 2K 13

Stella Biderman @BlancheMinerva

2 weeks ago

This seems clearly correct to me and is something I've personally experienced. Probably the easiest way to see this is true is to realize that people don't know the logical closure of their beliefs, but given time and a pencil can work many things in said logical closure out.

Zhangir Azerbayev @zhangir_azerbay

2 weeks ago

6 2 35 13K 8

8 1 60 10K 14

Stella Biderman @BlancheMinerva

2 weeks ago

100%! People are really bad at understanding the logical closure of their beliefs. (Proof: if they weren't, we would know if ZFC was consistent!)

Pete Mandik @petemandik

2 weeks ago

2 0 21 5K 3

0 0 4 3K 2

Stella Biderman @BlancheMinerva

2 weeks ago

Really amazing work by the @huggingface team! Infrastructure work, including dataset work, evaluations work, and building libraries, is the single highest-leverage thing you can do in AI. This will provide dividends for the broader AI community for years to come.

Philipp Schmid @_philschmid

2 weeks ago

14 86 392 107K 167

2 25 199 28K 40

EleutherAI @AiEleuther

2 weeks ago

An essential blocker to training LLMs on public domain books is not knowing which books are in the public domain. We're working on it, but it's slow and costly... if you're interested in providing support reach out!

Daniel Bullock @Is_Dan_Bull

2 weeks ago

1 0 0 7K 0

2 10 53 6K 8

Stella Biderman @BlancheMinerva

2 weeks ago

SSMs + long sequence analysis + malware detection with LLMs is all the buzzwords you need to decide to check our paper out, right? arxiv.org/abs/2403.17978

Edward Raff @EdwardRaffML

2 weeks ago

0 0 6 7K 4

0 2 32 7K 12

Stella Biderman @BlancheMinerva

2 weeks ago

Training data transparency is an unambiguous win for society, but all the incentives are against companies doing it right now. We need to fix this as soon as possible.

Yacine Jernite @YJernite

2 weeks ago

1 4 53 18K 1

3 10 79 11K 11

EleutherAI @AiEleuther

3 weeks ago

We are excited to see torchtune, a newly announced PyTorch-native finetuning library, integrate with our LM Evaluation Harness library for standardized, reproducible evaluations! Read more here: Blog: pytorch.org/blog/torchtune… Thread:

Kartikay Khandelwal @kakemeister

3 weeks ago

1 3 22 9K 3

0 7 55 6K 14

Quentin Anthony @QuentinAnthon15

3 weeks ago

Zyphra is pleased to announce Zamba-7B: - 7B Mamba/Attention hybrid - Competitive with Mistral-7B and Gemma-7B on only 1T fully open training tokens - Outperforms Llama-2 7B and OLMo-7B - All checkpoints across training to be released (Apache 2.0) - Achieved by 7 people, on 128…

21 86 430 110K 236

Apoorv Khandelwal @apoorvkh

3 weeks ago

Calling all academic AI researchers! 🚨 We are conducting a survey on compute resources. We want to help the community better understand our capabilities+needs. We hope that this will help us all advocate for the resources we need! Please contribute at: forms.gle/3hEie4hj999fiS…

0 31 49 9K 20

Aran Komatsuzaki @arankomatsuzaki

3 weeks ago

🚀 Introducing Pile-T5! 🔗 We (EleutherAI) are thrilled to open-source our latest T5 model trained on 2T tokens from the Pile using the Llama tokenizer. ✨ Featuring intermediate checkpoints and a significant boost in benchmark performance. Work done by @lintangsutawika, me…

14 110 552 124K 219

Stella Biderman @BlancheMinerva

3 weeks ago

I've been brain-dumping what I know about how LLMs work for several months now into an accessible general audience book! Check out the pre-release at the link.

Edward Raff @EdwardRaffML

3 weeks ago

1 4 37 17K 31

4 12 144 16K 87

(((ل()(ل() 'yoav))).. @yoavgo

46K Followers 2K Following

Delip Rao e/σ @deliprao

46K Followers 5K Following Busy inventing the shipwreck. @Penn. Past: @johnshopkins, @UCSC, @Amazon, @Twitter ||Art: #NLProc, Vision, Speech, #DeepLearning || Life: 道元, improv, running 🌈

near @nearcyan

46K Followers 882 Following https://t.co/IdaJwZJCXm partner @ https://t.co/9g1MIgjiqc dms open

SynthLabs @synth_labs

12K Followers 43 Following AI Aligned with Your Vision. We’re doing cutting edge research for transparent, auditable AI alignment.

MMitchell @mmitchell_ai

80K Followers 1K Following Interdisciplinary researcher focused on shaping AI towards long-term positive goals. ML & Ethics. Same content in the Sky, Threads, & the Prehistoric Elephant

Eric Jang @ericjang11

69K Followers 3K Following physical AGI at 1X. Author of "AI is Good for You" https://t.co/eFg4WXhg0p

Miles Brundage @Miles_Brundage

43K Followers 10K Following Policy research at @openai. I mostly tweet about AI, animals, and sci-fi. He/him. Views my own.

Sasha Luccioni, PhD.. @SashaMTL

19K Followers 4K Following AI & Climate Lead @HuggingFace, Board Member of @WiMLworkshop, Founding Member of @ClimateChangeAI. @TEDTalks speaker. She/her/Dr/ 🦋

Horace He @cHHillee

24K Followers 449 Following Working at the intersection of ML and Systems @ PyTorch "My learning style is Horace twitter threads" - @typedfemale

clem 🤗 @ClementDelangue

91K Followers 5K Following Co-founder & CEO @HuggingFace 🤗, the open and collaborative platform for AI builders

Talia Ringer 🟣.. @TaliaRinger

26K Followers 6K Following Professor, @plfmse, @IllinoisCS! Proof Automation. @SigplanM & CCF Founder. Israeli-American for peace, equality, & justice. They/היא, ND, bi. די לכיבוש

Rivers Have Wings @RiversHaveWings

31K Followers 226 Following AI/generative artist. Writes her own code. Absolute power is a door into dreaming.

merve @mervenoyann

56K Followers 4K Following open-sourceress at @huggingface 🧙🏻‍♀️ proud mediterranean 🍋 I do TL;DR on ML papers

Julien Chaumond @julien_c

47K Followers 1K Following Co-founder and CTO at @huggingface 🤗. ML/AI for everyone, building products to propel communities fwd. @Stanford + @Polytechnique

Thomas Wolf @Thom_Wolf

68K Followers 4K Following Co-founder and CSO @HuggingFace - open-source and open-science

Soumith Chintala @soumithchintala

187K Followers 887 Following Cofounded and lead @PyTorch at Meta. Also dabble in robotics at NYU. AI is delicious when it is accessible and open-source.

Rosanne Liu @savvyRL

33K Followers 969 Following Cofounded & running @ml_collective. Host of Deep Learning Classics & Trends. Research at Google DeepMind. DEI/DIA Chair of ICLR & NeurIPS. Writing https://t.co/IbycyGfnDR

Jack Clark @jackclarkSF

68K Followers 5K Following @AnthropicAI, ONEAI OECD, co-chair @indexingai, writer @ https://t.co/3vmtHYkaTu Past: @openai, @business @theregister. Neural nets, distributed systems, weird futures

Percy Liang @percyliang

50K Followers 408 Following Associate Professor in computer science @Stanford @StanfordHAI @StanfordCRFM @StanfordAILab @stanfordnlp | cofounder @togethercompute | Pianist

PhD at 19 |
Founder and CEO at @MedARC_AI |
Research Director at @StabilityAI |
@kaggle Notebooks GM |
Biomed. engineer @ 14 |
TEDx talk➡https://t.co/xPxwKTq6Qb

Tanishq Mathew Abraha.. @iScienceLuvr

Hari Nair @harisnair

302 Followers 1K Following Enabling Digital Transformation within Enterprises l Toastmaster. Views are personal.

Heegon Jin @jineussw

24 Followers 577 Following Exploring the frontiers of AI 🤖 Research Scientist @ NCSOFT working on Machine Translation and NLP | #NLProc #AI

Ashwin Jayaprakash @ashwinjay

206 Followers 4K Following Falling into the future at light speed. (Any opinions expressed are my own)

Cam Howe (e/acc) @camhowe1729

4 Followers 260 Following full-time techbro, part-time anon/undergrad. love explaining tech stuff.

Ahmed Mujtaba @ahmedmujii

24 Followers 256 Following

okeblay @okeblay

8 Followers 96 Following

Nitish ⚡️ @nitishmutha

3K Followers 333 Following Co-founder and CTO @GenieAI - Building the world’s best AI Legal Assistant. @UCL alum.

Sujan Kumar @kumarsujan

43 Followers 444 Following Sr. ML Scientist at AWS AI Labs @AmazonScience @awscloud

Austin Kwoun @kwoun_austin

0 Followers 10 Following

Taku @takuonline

18 Followers 402 Following Senior Data scientist at Shoprite Holdings

@yaelmendez @yaelmendez

807 Followers 5K Following 🕊️ Omnimedia @aicommandprompt

Daniel @dabronofstock

13 Followers 78 Following

Fizzarolli 🏳️‍.. @fizzarolliAI

0 Followers 13 Following hobbyist ai dev #freepalestine #acab

steady rockin all nig.. @sonikudzu

2K Followers 862 Following little whirl in her crackpot era. reticulating sage splines. cream of stonsciousness. mad hatter. show me my opponent “in my opinion:”

dooniek @doonielk

33 Followers 369 Following

dwrodri @__dwrodri

201 Followers 1K Following Software performance, Machine Learning, Prob/Stats, finance/econ, electronic music, lean/small tech businesses. Opinions are most certainly definitely my own.

Electronicsseeker @libertarian108

39 Followers 3K Following

Jim Auwerda @JimAuwerda

93 Followers 1K Following

Julien Borderieux @J_Borderieux

5K Followers 6K Following

Mohammed _h_mno3 @Slalucynkdia

531 Followers 23 Following Palestine journalist, media director at ✈️ Rafah border crossing (and their hearts will be reassured in the remembrance of God)

Jacob Somer @jacob_somer_

666 Followers 4K Following AI Enthusiast & Software Engineer 💻 Building intelligent systems that make a difference.

Andrei Savu @andreisavu

7K Followers 6K Following Actions Speak Louder Than Words | Awareness Above Wishful Thinking

Gmail Accounts.. @accounts_g1158

67 Followers 688 Following #Bitcoin #USDT #Ethereum #Payoneer #Direct_Bank_Transfer #PayPal

Shadab Choudhury @ShcChy

105 Followers 409 Following MS CS @UMBC | Accessibility and Multimodal DL | every day I edge closer to weebpfp anonpoasting

OutOfContext Bharat(i.. @uselesslyyours

117 Followers 435 Following just another sports fan

Kristina Terech @kristina_terex

5 Followers 37 Following Software writer at TechRadar. She/Her. Views my own.

Brian Roach @itsbrex

653 Followers 2K Following Technical Product Manager | MBA, AI+ML #OpenToWork | 🪩 @UCSD, @USC, @UCBerkeley

Tiezhen WANG @Xianbao_QIAN

928 Followers 369 Following Engineer at HuggingFace, ex-Googler on TFLite / micro. Ideas are my own.

Wilf Rosenbaum @WilfRosenbaum

104 Followers 962 Following

Srmouse @mousebars

269 Followers 5K Following

Bill Qian @billqian_uae

1K Followers 4K Following CIO of Phoenix Group| Chairman of Cypher Capital | CIO of https://t.co/fR7qMZOHi1 | Board Member of TON Foundation | Ex-Global Head of M&A/Labs- Binance

Can Yaman @CanYaman_21

13 Followers 578 Following Can Yaman is a Turkish actor who was born on November 8, 1989 in Istanbul, Turkey. He is 6'2". He won a Golden Butterfly Award for Best Actor in a Romantic.

Nikita @nikitavoloboev

4K Followers 7K Following Make @LearnAnything_ Learn in public: https://t.co/GbFvuErkYn macOS course: https://t.co/JdbJWru6zG https://t.co/94R8ER7K2h https://t.co/ROkqhyhpEK

shubhpa𝐭ni.eth @PatniShubh

805 Followers 3K Following buidling @resmeai | 5x hackathon 🏆 and top writer on medium

Hamzé @Hamzeml

510 Followers 5K Following A Humanist Technologist, AI optimist, CTO @gowelcomeplace, #inclusive_economy #AI #machinelearning #tech4good #edtech

Wes George @vvxgeorge

129 Followers 539 Following Technologist, Skeptic, Human. solutions @recursal_AI. I fight for the users.

Nextt @_Nextt_

20 Followers 11 Following

Shashank Shekhar @sshkhr16

2K Followers 1K Following Scale Maximalist. Opinions my own, ofc Previously: AI Research @MetaAI @uofg @vectorinst @_NextAI @iiscbangalore

Dean Clark @DeanCla88922559

148 Followers 1K Following Disabled part-time student, registered for artificial kidney trials.

Eman @emteodosio

16 Followers 2K Following Random thoughts 🇵🇭 @MapuaUniv @AmznFulfillment

Anjun Hu @anjunhu

42 Followers 274 Following First-year DPhil student @aims_oxford at @UniofOxford

Christian Miranda @cmoryah

212 Followers 454 Following

Jen Iofinova @oohaijen

238 Followers 699 Following Once a mathematician, twice an immigrant. PhD student @ISTAustria building efficient, trustworthy ML. Formerly: software engineer @Google, teacher, wall street.

nava zarkhah @NavaZarkhah

125 Followers 2K Following Mechanical Girl👩‍💻👩‍🏫👩‍🔧 💡📚🔑🗝🔨⛏⚒🛠📍📏📐🖇

A. J. Kübler 👨.. @ajkuebler

62 Followers 2K Following

Zhiyong Wang @Zhiyong16403503

437 Followers 3K Following Visiting Ph.D. student at Cornell University. Ph.D. candidate at CUHK. Working on bandits and reinforcement learning theory.

Syner @renchengyuan

230 Followers 3K Following

AttentionBot @XAttentionBot

8 Followers 89 Following Attention Human! You Need to Upvote my Posts!

Jprwg @jprwg

513 Followers 3K Following No idea if that's true, but it is at least faintly entertaining.

Hpremium @web3nam3

805 Followers 3K Following https://t.co/Tes5ZFnfVs • https://t.co/NuhiRgwvTP https://t.co/3H4X5XEq21 •https://t.co/oOCFfDThZ2 • https://t.co/wMpswOH3Xa • https://t.co/cW7uHNvbfy • https://t.co/VYahFk94rN •https://t.co/Gik8R81APV• https://t.co/Uvx07c8pI2 • https://t.co/yp6o2BXYZH•https://t.co/0V7mofFuIl •📈

Emad @EMostaque

221K Followers 10 Following #decentralizeAI

Aran Komatsuzaki @arankomatsuzaki

95K Followers 78 Following @TeraflopAI

Lucas Beyer (bl16) @giffmana

56K Followers 447 Following Researcher (Google DeepMind/Brain in Zürich, ex-RWTH Aachen), Gamer, Hacker, Belgian. Mostly gave up trying mastodon as lb@sigmoid.social

Anthropic @AnthropicAI

265K Followers 26 Following We're an AI safety and research company that builds reliable, interpretable, and steerable AI systems. Talk to our AI assistant Claude at https://t.co/aRbQ97uk4d.

Hugging Face @huggingface

347K Followers 188 Following The AI community building the future. https://t.co/VkRPD0VKaZ #BlackLivesMatter #stopasianhate

pChal @pChalTV

80K Followers 578 Following I do really difficult Pokemon playthroughs | he/him

hackerfantastic.x @hackerfantastic

103K Followers 4K Following Co-Founder @myhackerhouse cyber security assurance & hacker training ~ ISBN9781119561453 ~ a book on professional hacking. Offensive Lua project.

Ian Coldwater 📦.. @IanColdwater

106K Followers 1K Following Kubernetes SIG Security co-chair, container escape artist, goose in the mainframe. They/them. Legacy verified. Stay punk 🏴

Tom McCoy @RTomMcCoy

3K Followers 483 Following Assistant professor @YaleLinguistics. Studying computational linguistics, cognitive science, and AI. He/him.

Ruochen Zhang @ruochenz_

327 Followers 1K Following PhDing @Brown_NLP & @health_nlp, working on multilingual NLP. Prev: Undergrad @sutdsg, she/her

Sebastian Majstorovic @storytracer

2K Followers 812 Following Digital Historian & Data Consultant | https://t.co/fev0QjCWjp | https://t.co/yqa5eIfpTu | Co-Founder @sucho_org

David @DavidSHolz

54K Followers 5K Following founder @midjourney, prev founder leap motion, nasa, max planck

Ashish Vaswani @ashVaswani

19K Followers 2K Following

Lucia Quirke @lucia_quirke

215 Followers 85 Following Neural network interpretability researcher at EleutherAI

Chris Ociepa @ChrisOciepa

27 Followers 68 Following Specialized in the design and development of scalable distributed systems with BigData & AI. Passionate about hacking and training LLMs. A huge fan of astronomy

pleias @pleiasfr

242 Followers 1 Following

arlo_son @gson_AI

91 Followers 158 Following Undergraduate @ Yonsei. UIC Economics.

BlinkDL @BlinkDL_AI

7K Followers 92 Following RWKV = 100% RNN with GPT-level performance. https://t.co/TkdxOJSFWX and https://t.co/86DzS6arA0

Kevin Klyman @kevin_klyman

3K Followers 3K Following AI policy @StanfordHAI + avoiding war with China @BelferCenter. Words in @ForeignPolicy @TechCrunch et al. Ex @UNGlobalPulse @BanKillerRobots @hrw

Yanai Elazar @ICLR @yanaiela

3K Followers 1K Following Postdoc @ AI2 & UW | NLP

Logan Kilpatrick @OfficialLoganK

92K Followers 2K Following Lead product for @Google AI Studio and working on the Gemini API, helping developers build with AI, my views!

Gowthami Somepalli @gowthami_s

6K Followers 981 Following Grad student @UMDCS. Past: @AIatMeta, @AmazonScience, @IITMadras. Currently working on #Diffusion and #Multimodal understanding. GPU poor. She/her.

Ross @rpoo

25K Followers 1K Following

Aspen @aspenkhopkins

558 Followers 195 Following PhD Student advised by @aleks_madry @MIT_CSAIL interested in data and ml. On bluesky @ dataspen.bsky.soc

Dimitri von Rütte @dvruette

710 Followers 171 Following Studies @ETH_en, Machine Learning @DeepJudgeAI

Lennart Heim @ohlennart

3K Followers 823 Following huh? | AI (Compute) Governance @GovAI_ | Also @EpochAIResearch |

Conference on Languag.. @COLM_conf

2K Followers 6 Following https://t.co/GhGCMEoa4A Abstract submission: March 22, 2024

Conglong Li @conglongli

133 Followers 61 Following Senior Researcher @Microsoft DeepSpeed team, working on deep learning systems. @SCSatCMU PhD, @RiceCompSci BS+MS. Views are my own. English/Chinese/Japanese.

Dimitris Papailiopoul.. @DimitrisPapail

12K Followers 981 Following prof @ wisconsin; thinking about transformers; learning in context; babas of Inez Lily

Yi Tay @YiTayML

29K Followers 97 Following chief scientist / cofounder @RekaAILabs 🫠 past: research scientist @google brain 🤯 currently learning to be a dad 🍼

Sophie @lebrechts

905 Followers 839 Following COO @allen_ai formerly AI/ML @Apple, SVP Strategy & Ops https://t.co/4Z5RuqSEkZ, PhD from @BrownUniversity, post-doc @CarnegieMellon

Iz Beltagy @i_beltagy

2K Followers 422 Following Cofounder @SpiffyAI, Research Lead building OLMo at @allenai_org, formerly @UTCompSci PhD.

Mechanical Dirk @mechanicaldirk

550 Followers 244 Following Principal Engineer at @allen_ai. Engineering Lead of the OLMo project.

Deepak Narayanan @deepakn94

1K Followers 1K Following Research Scientist at @nvidia. Interested in the intersection of Computer Systems and ML. Occasionally tweet about sports. Views are my own.

Briana Vecchione @brianavecchione

3K Followers 1K Following Researcher @datasociety • PhD @CornellCIS • AI auditing/accountability • Prev. @Meta Fellow @MicrosoftNY @spotifyresearch @Mozilla • 🏳️‍🌈

PicoCreator (🇸🇬.. @picocreator

2K Followers 164 Following Builds Attention-Free Transformer (https://t.co/YL7CbNYKBs) from scratch - CEO @ https://t.co/kQHiGtzJWr Also built k8s tools, uilicious & GPU.js (https://t.co/OIfnI1EPU7)

michele benetti @michelebenben

65 Followers 600 Following

Birchlabs @Birchlabs

4K Followers 172 Following ML Engineer at Anlatan (@novelaiofficial). co-author of HDiT (Hourglass Diffusion Transformers). works on diffusion models and LLMs. 日本語を勉強してる。

Aspiringspike @Aspiringspike

26K Followers 526 Following I am MTGO user and former preview card getter Aspiringspike https://t.co/pn7Z5hUMcz he/him business inquiries aspiringspike@gmail.com

Asma Ghandeharioun @ghandeharioun

2K Followers 489 Following Research Scientist @GoogleAI working on ML interpretability & human-centered AI, PhD from @MIT

Ben Rubinstein @bipr

2K Followers 847 Following ML & Privacy Prof @cis_unimelb. Deputy Dean (Research) @engunimelb. Prev @MSFTResearch, @Berkeley_EECS. He/him. 🇦🇺

Sasho Nikolov (thesas.. @thesasho

4K Followers 432 Following Associate professor at U of T. Computer science and math research: (differentially) private data analysis, geometry, discrepancy, optimization.

Nalini Joshi @monsoon0

16K Followers 699 Following mathematician, wife, mother, Professor, addicted to math

Tom Gur @TomGur

4K Followers 281 Following Associate Professor of Theoretical Computer Science @Cambridge_Uni. My research is in Complexity Theory and Quantum Computing.

Niloofar (Fatemeh) @I.. @niloofar_mire

5K Followers 1K Following Postdoc @uwcse-@uwnlp, Ph.D. from @ucsd_cse /Privacy, ML, NLP, @winlpworkshop chair, @MSFTResearch - Semantic Machines

Anastasia Stasenko @ana_stasenko

203 Followers 338 Following Cofounder @pleiasfr

Johann Rehberger @wunderwuzzi23

3K Followers 628 Following Hacking neural networks so that we don’t get stuck in the matrix. Red Team Director @ Electronic Arts. Entrepreneur. Builder and Breaker. Opinions are my own.

Sharon Goldman @sharongoldman

6K Followers 2K Following Reporter on the AI beat, @FortuneMagazine Formerly @venturebeat sharon.goldman@fortune.com | Signal: sharongoldman.43 (for news tips only, no pitches)

Hugo Touvron @HugoTouvron

2K Followers 131 Following Research Scientist at Meta AI

nathan lile @ ICLR '2.. @NathanThinks

2K Followers 891 Following ceo/cofounder @ https://t.co/bDd3J4KOJH (we're hiring!) #GenerativeAI recurrent rabbit hole victim. swims in data lakes & pools. nothing great is easy.

fluffy @fluffykittnmeow

511 Followers 193 Following toiling in the tts mines

Daniel van Strien @vanstriendaniel

3K Followers 1K Following Machine Learning Librarian @huggingface 🤗 | Championing Open Science & ML | Sharing the latest ML datasets 🌟 | Tips for mastering the HF Hub

Joseph Thacker @rez0__

49K Followers 885 Following the promptfather. christian. hacker. hobby jogger. principal ai engineer @appomnisecurity.

Laura O'Mahony @_lauraaisling

240 Followers 476 Following PhD Candidate SFI CRT in foundations of data science in UL 📚 just following the gradient of interesting 📈

Nomic AI @nomic_ai

14K Followers 50 Following Building explainable and accessible AI https://t.co/bbYqCdL8vQ

Teknium (e/λ) @Teknium1

3 days ago

Back when I'd play Minecraft, I liked making modpacks more than playing the game. I think this directly led to me curating and building datasets to train models with lol

15 5 189 16K 11

Lewis @ctjlewis

3 days ago

@vishalmisra why would an LLM not be able to accept input or examples? to recursively self improve it will need to accept its last N outputs as input. did you just find a contrived, isolated case where improvement is not possible because of the constraints you injected?

Vishal Misra @vishalmisra

3 days ago

If the LLM fills out these rows using no external input or prompt, then using a simple entropy argument one can show that the total information content in the matrix cannot increase. (2/n)

4 1 24 6K 3

3 0 4 946 0

Edward Raff @EdwardRaffML

4 days ago

Edward Raff @EdwardRaffML

a month ago

Inspired by @_albertgu recent works in state space models, can we merge them with #VSA and #HRR for our long sequence classification needs in #malware? Our HGConv says yes! With some interesting results on pros/cons. Led by @rea1mma w/ @BlancheMinerva Tim Oates & Jim Holt!

1 2 15 4K 2

0 2 6 2K 1

Hugh Zhang @ICLR '24 @hughbzhang

4 days ago

Finally, wanted to give a shout out to the “Do ImageNet Classifiers Generalize to ImageNet?” paper by @beenwrekt, @BeccaRoelofs, @lschmidt3 and @Vaishaal which was a huge inspiration for this work and a longtime favorite of mine. We learned a lot of lessons from them.

1 2 46 3K 10

uzu lim @cutezu_

6 days ago

@GuilleAngeris I think this is literally correct (suitably interpreted)

1 0 6 669 1

Ajeya Cotra @ajeya_cotra

a week ago

@1a3orn Was meaning to make a claim about the substance here, not what everyone in the AI risk community believes — agree some people do worry about existing systems directly, I disagree with them and think OS has been positive so far

0 0 14 375 0

Kyle O'Brien @KyleDevinOBrien

a week ago

@daniel_271828 A feature/bug by design of this API-based method is the question of who has access. I've only been able to get into research via open models. Folks like me who didn't have the right credentials or connections could be disenfranchised. What if I'm publicly critical of the lab?

0 0 1 47 0

Nishant 🙃 @NishantBalepur

a week ago

@BlancheMinerva Two of my recent papers have the sentence "We filter all questions from ARC that do not have four choices" 😔

0 0 4 463 0

Rohan Pandey @rohan99pandey

a week ago

@maksym_andr @ncmeade Yeah I think trying to dis-entangle which part of the procedure is causal is extremely tough. The pythia framework by @BlancheMinerva is fantastic for running such studies!

0 0 2 136 1

A. Feder Cooper @afedercooper

2 weeks ago

the amount the (then) undergrads and masters students ive worked with are rooting for me re: the faculty search is truly 🥹

0 0 18 2K 1

Bepis™ 🔀 @UnderwaterBepis

2 weeks ago

@aidanogara_ @daniel_271828 nnsight is very cool but does not solve the “making it easy to reverse engineer” issue afaik?

1 0 2 73 0

Quintin Pope @QuintinPope5

2 weeks ago

@daniel_271828 @BogdanIonutCir2 Of course there's current demand for interp access to closed source models. You don't think researchers want to run their interp experiments on GPT-4? OpenAI doesn't even bother to even maintain consistent, reproducible behavior out of their current APIs. Counting on them to…

1 1 9 165 0

John Q Public @conjurial

2 weeks ago

@daniel_271828 @nonagonono thing is this isn't an "API" anymore though, it's (in the limit of hostile actor wanting valuable weights) a facility ppl go to and work at, with their pockets searched for USB drives and even short of that, it's a sandbox w/ huge friction for researchers, headache for $LLMCO

0 0 2 81 0

Aryaman Arora @aryaman2020

2 weeks ago

@daniel_271828 @idavidrein is that a serious proposal or are you memeing

0 0 5 70 0

Stanislav Fort ✨🧠🤖📈✨ @stanislavfort

2 weeks ago

@daniel_271828 Realistically, who gives you access to weights via API though? As far as I can see, even getting a few top logits is a problem, let alone weights. Additionally, a lot of research requires you to make a ton of custom, unforeseen changes and APIs are typically super bad for that.

0 0 6 520 0

AI Hype guys making absurd claims @StopAIHype

2 weeks ago

@daniel_271828 @SecondA16110022 So your argument hinges on an imaginary future API that companies would have no incentive to create?

0 0 1 19 0

Nathan Lambert @natolambert

2 weeks ago

@BlancheMinerva Yeah, I was chatting with @soldni and I had no idea how awesome Pythia was. Pythia is way more than model weights.

0 0 8 3K 0

Aviya Skowron @aviskowron

2 weeks ago

ty we try 😭

Nathan Lambert @natolambert

2 weeks ago

@BlancheMinerva Yeah, I was chatting with @soldni and I had no idea how awesome Pythia was. Pythia is way more than model weights.

0 0 8 3K 0

0 1 18 2K 1

Jeremy Howard @jeremyphoward

2 weeks ago

@WenhuChen In theory i guess meta could demand you take down your work, since you don't have a license (since you're not following the license requirements)