Aran Komatsuzaki @arankomatsuzaki

@TeraflopAI arankomatsuzaki.wordpress.com/about-me/ Joined November 2016

Tweets 5K | Followers 94K | Following 78 | Likes 11K

TeraflopAI @TeraflopAI

22 hours ago

Awesome to see @joespeez, AI Product Director, @Meta, mention our previous research, YaRN, on stage at the @weights_biases Fully Connected conference. We have another very exciting long-context release coming soon.

1 4 6 4K 1

Weiyan Shi @shi_weiyan

24 hours ago

🚨New Paper🚨 We propose 1⃣CultureBank🌎 dataset sourced from TikTok & Reddit 2⃣An extensible pipeline to build cultural knowledge bases 3⃣Evaluation of LLMs’ cultural awareness 4⃣Insights into culturally-aware LLMs Project: culturebank.github.io Data: shorturl.at/hrtwP

4 56 215 32K 120

Aran Komatsuzaki @arankomatsuzaki

2 days ago

Apple presents OpenELM
- An efficient LM family with open-source training and inference framework
- Performs on par with OLMo while requiring 2x fewer pre-training tokens
repo: github.com/apple/corenet
hf: huggingface.co/apple/OpenELM
abs: arxiv.org/abs/2404.14619

6 54 228 56K 119

Aran Komatsuzaki @arankomatsuzaki

2 days ago

SnapKV: LLM Knows What You are Looking for Before Generation
- Automatically compresses KV caches
- Consistent decoding speed with a 3.6x increase in generation speed and an 8.2x enhancement in memory efficiency
repo: github.com/FasterDecoding…
abs: arxiv.org/abs/2404.14469

6 55 300 33K 221

Aran Komatsuzaki @arankomatsuzaki

2 days ago

Twelve Labs presents Pegasus-v1
- Presents a multimodal LM specialized in video content understanding and interaction through natural language
- Achieves SotA in video QA and various other video tasks and outperforms Gemini 1.5 Pro
proj: twelvelabs.io/blog/upgrading…
abs:…

2 17 72 10K 28

Aran Komatsuzaki @arankomatsuzaki

2 days ago

Microsoft presents Multi-Head Mixture-of-Experts
Achieves notable improvements over the baseline MoE by using multiple MoE heads
repo: github.com/yushuiwx/MH-MoE
abs: arxiv.org/abs/2404.15045

6 112 557 38K 378

Aran Komatsuzaki @arankomatsuzaki

2 days ago

Achieving >97% on GSM8K: Deeply Understanding the Problems Makes LLMs Perfect Reasoners
Improves the performance of GPT-4 on GSM8K from 94.6% to 97.1% with three-stage prompting
arxiv.org/abs/2404.14963

3 42 213 19K 153

Aran Komatsuzaki @arankomatsuzaki

2 days ago

ID-Animator: Zero-Shot Identity-Preserving Human Video Generation
Presents a zero-shot human-video generation approach that can perform personalized video generation given a single reference facial image without further training
proj: id-animator.github.io
abs:…

1 24 82 11K 52

Dawei Zhu @dwzhu128

2 days ago

Many thanks to Aran for sharing! @arankomatsuzaki Links are here:
Code: github.com/dwzhu-pku/Long…
Paper Page: huggingface.co/papers/2404.12…
Benchmark: huggingface.co/datasets/dwzhu…
Model: huggingface.co/dwzhu/e5rope-b…

Aran Komatsuzaki @arankomatsuzaki

2 days ago

4 34 208 39K 123

1 2 21 8K 21

Aran Komatsuzaki @arankomatsuzaki

2 days ago

Microsoft presents LongEmbed: Extending Embedding Models for Long Context Retrieval
- Presents the LongEmbed benchmark for long context retrieval
- Releases the E5-Base-4k and E5-RoPE-Base models
repo: github.com/dwzhu-pku/Long…
abs: arxiv.org/abs/2404.12096

4 34 208 39K 123

Aran Komatsuzaki @arankomatsuzaki

3 days ago

A few caveats about Phi-3:
- The figure I attached at the beginning had some errors. Here's the updated one.
- Phi-3-medium performs well on TriviaQA but noticeably underperforms rel. to GPT-3.5. We can guess that Phi-3 recipe doesn't magically make it understand more random…

Aran Komatsuzaki @arankomatsuzaki

3 days ago

32 144 809 323K 272

1 4 57 43K 8

Aran Komatsuzaki @arankomatsuzaki

3 days ago

ByteDance presents Hyper-SD
Achieves SOTA performance from 1 to 8 inference steps for both SDXL and SD1.5
proj: hyper-sd.github.io
abs: arxiv.org/abs/2404.13686

1 11 104 19K 52

Aran Komatsuzaki @arankomatsuzaki

3 days ago

ByteDance presents Graphic Design with Large Multimodal Model
Outperforms prior art and establishes a strong baseline for the field of graphic design
repo: github.com/graphic-design…
abs: arxiv.org/abs/2404.14368

0 40 180 18K 112

Aran Komatsuzaki @arankomatsuzaki

3 days ago

Better Synthetic Data by Retrieving and Transforming Existing Datasets
repo: github.com/neulab/prompt2…
abs: arxiv.org/abs/2404.14361

2 85 407 42K 307

Teknium (e/λ) @Teknium1

3 days ago

Let's give er' a go!

Aran Komatsuzaki @arankomatsuzaki

3 days ago

32 144 809 323K 272

10 6 175 28K 25

Jimmy Apples 🍎/acc @apples_jimmy

3 days ago

Small models excite me.

0 20 336 39K 36

AK @_akhaliq

309K Followers 3K Following AI research paper tweets, ML @Gradio (acq. by @HuggingFace 🤗) dm for promo follow on Hugging Face: https://t.co/q2Qoey80Gx

Jim Fan @DrJimFan

229K Followers 3K Following @NVIDIA Sr. Research Manager & Lead of Embodied AI (GEAR Lab). Creating foundation models for Humanoid Robots & Gaming. @Stanford Ph.D. @OpenAI's first intern.

Lucas Beyer (bl16) @giffmana

56K Followers 444 Following Researcher (Google DeepMind/Brain in Zürich, ex-RWTH Aachen), Gamer, Hacker, Belgian. Mostly gave up trying mastodon as lb@sigmoid.social

Eric Jang @ericjang11

69K Followers 3K Following physical AGI at 1X. Author of "AI is Good for You" https://t.co/eFg4WXhg0p

Delip Rao e/σ @deliprao

46K Followers 5K Following Busy inventing the shipwreck. @Penn. Past: @johnshopkins, @UCSC, @Amazon, @Twitter ||Art: #NLProc, Vision, Speech, #DeepLearning || Life: 道元, improv, running 🌈

Kosta Derpanis @CSProfKGD

48K Followers 198 Following #CS Associate Prof @YorkUniversity, #ComputerVision Scientist Samsung #AI, @VectorInst Faculty Affiliate, TPAMI AE, #CVPR2024/#ECCV2024 Publicity Co-chair

Yannic Kilcher.. @ykilcher

67K Followers 867 Following I make videos. Skill > Destiny. vi / vim

near @nearcyan

45K Followers 883 Following https://t.co/IdaJwZJCXm partner @ https://t.co/9g1MIgjiqc dms open

abhishek @abhi1thakur

81K Followers 662 Following 🤗 I build AutoTrain @huggingface 👨🏽‍💻 World's First 4x Grand Master @kaggle 🎥 YouTube 100k+: https://t.co/BHnem8fTu5 ⭐ GitHub Star

Jason Wei @_jasonwei

56K Followers 490 Following ai researcher @openai

Percy Liang @percyliang

49K Followers 408 Following Associate Professor in computer science @Stanford @StanfordHAI @StanfordCRFM @StanfordAILab @stanfordnlp | cofounder @togethercompute | Pianist

Horace He @cHHillee

23K Followers 448 Following Working at the intersection of ML and Systems @ PyTorch "My learning style is Horace twitter threads" - @typedfemale

Tanishq Mathew Abraha.. @iScienceLuvr

PhD at 19 | Founder and CEO at @MedARC_AI | Research Director at @StabilityAI | @kaggle Notebooks GM | Biomed. engineer @ 14 | TEDx talk➡https://t.co/xPxwKTq6Qb

Yi Tay @YiTayML

29K Followers 97 Following chief scientist / cofounder @RekaAILabs 🫠 past: research scientist @google brain 🤯 currently learning to be a dad 🍼

Rosanne Liu @savvyRL

33K Followers 966 Following Cofounded & running @ml_collective. Host of Deep Learning Classics & Trends. Research at Google DeepMind. DEI/DIA Chair of ICLR & NeurIPS. Writing https://t.co/IbycyGfnDR

Ross Wightman @wightmanr

18K Followers 1K Following Computer Vision @ 🤗. Ex head of Software, Firmware Engineering at a Canadian 🦄. Currently building ML, AI systems or investing in startups that do it better.

SynthLabs @synth_labs

12K Followers 43 Following AI Aligned with Your Vision. We’re doing cutting edge research for transparent, auditable AI alignment.

Sander Dieleman @sedielem

50K Followers 2K Following Research Scientist at Google DeepMind. I tweet about deep learning (research + software), music, generative models (personal account).

Kyunghyun Cho @kchonyc

61K Followers 2K Following a combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre PC at @nyuniversity (@CILVRatNYU) & @genentech (@PrescientDesign).

Omar Sanseviero @osanseviero

31K Followers 2K Following Chief Llama Officer @huggingface 🦙 Founder @AI_Learners. Xoogler (SWE @Google Assistant, 20% PM TF Graphics). 100% Hacker Llama🇵🇪🇲🇽

journeyman coder @xgenaidev

6 Followers 55 Following organic intelligence(?) all tweets are generated using electric circuits

niko @niko1311117

18 Followers 151 Following

Electronicsseeker @libertarian108

2 Followers 379 Following

zouFeng @zouKunkka

12 Followers 154 Following

Lilianahlg @Lilianahlg

9 Followers 1K Following

Guy Swann ⚡️| Act.. @TheGuySwann

80K Followers 3K Following Liberty is a technology problem • Host of @BitcoinAudible, @Ai_Unchained • Pro Memecraft • Audiobook Narrator

Viresh Jagesser @viresh_jagesser

342 Followers 1K Following Entrepreneur

Quarkstar @Quarkstar9

17 Followers 107 Following

Yin-Hong Cao @caoyinhong

68 Followers 890 Following Postdoc in Jiayang Li Lab, Institute of Genetics and Developmental Biology, CAS. Focus on the Multi-omics of the rice & dandelions🌾🌱🧬

Rob Tiffany 🇺🇸 @RobTiffany

28K Followers 22K Following Military Advisor on Emerging Technologies • Author • Speaker • Inventor • US Navy Veteran

itsjusttrash @its_justtrash

1 Followers 32 Following

zelan Luo @ZelanLuo187

2 Followers 39 Following

shanhai @shanhai95147186

0 Followers 587 Following

Koolster @Koolster34

172 Followers 2K Following

sanderhahn @sanderhahn

0 Followers 45 Following

Tjaž Silovsek @silovsek

1 Followers 20 Following

Mpho @dion_mpho

9 Followers 91 Following Peace ✌🏾

Gu @guchenhe

67 Followers 494 Following building @dify_ai

LW Owens @LOwens6923

116 Followers 923 Following For God gave His only begotten Son, that whosoever believeth in Him should not perish, but have everlasting life.”

ibaadkhan @ibaadkhan229427

4K Followers 7K Following

Johnelle Hunt @HuntJohnel39354

187 Followers 3K Following BUSINESS AND DONATIONS ONLY

dhika @erastus_AIL

23 Followers 153 Following Live and Rare

Ramagu @Don_Ramoncillo

229 Followers 2K Following Father and husband | Also, a learner of everything and expert in nothing.

Erlend Fiskerud @ephisx

6 Followers 16 Following

Itqdevs Softwares @itq_devs

4 Followers 357 Following Itqdevs is your one-stop service provider for all your business technology needs. Custom softwares, exceptional design services, data analytics & cybersecurity

Ion Mocanu @IonMocanuion18

7 Followers 198 Following Medtech innovator, virtual clinic@home. Screening, diagnostic, remote patient monitoring and tele-consultation, all cardiac patient journey in one place.

fr13nz @fr13nz

118 Followers 2K Following

Atal Bhushan @atal_23

37 Followers 156 Following PM @Celonis📊👷‍♂️Cook 🧑‍🍳🥘

promise eyo @promiseeyo60399

1 Followers 53 Following

Falko Heinze @falko26

61 Followers 115 Following

Sascha Schmunk @sascha_schmunk

403 Followers 910 Following 🐘 Founder BLACK ELEPHANT COACHING ▫️Excellence Coach ▫️Transforming individuals & teams to EXCELLENCE: Character. Culture. Strengths. Grit. Habits. Emotions.

Leo Kapatos @KapatosLeo53501

79 Followers 578 Following

富江 @JiangFu9920

47 Followers 905 Following hello everybody❤️💙

Alin Ciocan @AlinCiocan4

2 Followers 55 Following

Echo @tony_kk121

72 Followers 1K Following

Xing Zhou @xingzhougmu

8 Followers 99 Following

coffee & AI @realcoffeeAI

33 Followers 462 Following

Zakky'sLordIsJesus @ZakkySJ

256 Followers 519 Following Because, if you confess with your mouth that Jesus is Lord and believe in your heart that God raised him from the dead, you will be saved. Romans 10:9

Alex Lee @Boxcounter

12 Followers 47 Following

Boolom @BIT3DAO

78 Followers 804 Following 💲₿🪙🤑🔥🚀📈

luis buera @luisbuera3

5 Followers 44 Following

nix @nix2liu

10 Followers 24 Following System Architect in Autonomous Driving @Li_Auto_ /// prev.@ZEEKRGlobal @Apple

Jay @jayloofah

38 Followers 95 Following

Yubin Kim @ybkim95_ai

5 Followers 33 Following Graduate student @MIT conducting research on Healthcare AI and Wearable Sensors with Personal Robots.

Shrey Pandey @ShreyPandey1509

8 Followers 22 Following

Stanaaa @stanxingaaa

16 Followers 579 Following My main job is SRE engineer; I like all kinds of new things

A.I.Warper @AIWarper

12K Followers 126 Following Sharing my creative AI experiments • Business Inquiries (consulting ONLY - no commission at this time): aiwarperinc@gmail.com

SiMindLab @SiMindLab149369

1 Followers 11 Following Silicon Brain Lab (硅脑实验室), dedicated to letting low-power AI develop emergent intelligence and reach ordinary households. Silicon Mind/Brain Labs, committed to enabling low-power AI to emerge wisdom and benefiting everyone. (Email: simindlab@hotmail.com)

rockets💰💰💰.. @Bighcbc

0 Followers 50 Following

AK @_akhaliq

309K Followers 3K Following AI research paper tweets, ML @Gradio (acq. by @HuggingFace 🤗) dm for promo follow on Hugging Face: https://t.co/q2Qoey80Gx

Jim Fan @DrJimFan

229K Followers 3K Following @NVIDIA Sr. Research Manager & Lead of Embodied AI (GEAR Lab). Creating foundation models for Humanoid Robots & Gaming. @Stanford Ph.D. @OpenAI's first intern.

AI at Meta @AIatMeta

531K Followers 255 Following Together with the AI community, we are pushing the boundaries of what’s possible through open science to create a more connected world.

Lucas Beyer (bl16) @giffmana

56K Followers 444 Following Researcher (Google DeepMind/Brain in Zürich, ex-RWTH Aachen), Gamer, Hacker, Belgian. Mostly gave up trying mastodon as lb@sigmoid.social

Soumith Chintala @soumithchintala

185K Followers 877 Following Cofounded and lead @PyTorch at Meta. Also dabble in robotics at NYU. AI is delicious when it is accessible and open-source.

Eric Jang @ericjang11

69K Followers 3K Following physical AGI at 1X. Author of "AI is Good for You" https://t.co/eFg4WXhg0p

Delip Rao e/σ @deliprao

46K Followers 5K Following Busy inventing the shipwreck. @Penn. Past: @johnshopkins, @UCSC, @Amazon, @Twitter ||Art: #NLProc, Vision, Speech, #DeepLearning || Life: 道元, improv, running 🌈

near @nearcyan

45K Followers 883 Following https://t.co/IdaJwZJCXm partner @ https://t.co/9g1MIgjiqc dms open

Jason Wei @_jasonwei

56K Followers 490 Following ai researcher @openai

Tanishq Mathew Abraha.. @iScienceLuvr

Hugging Face @huggingface

342K Followers 189 Following The AI community building the future. https://t.co/VkRPD0VKaZ #BlackLivesMatter #stopasianhate

Yi Tay @YiTayML

29K Followers 97 Following chief scientist / cofounder @RekaAILabs 🫠 past: research scientist @google brain 🤯 currently learning to be a dad 🍼

Wojciech Zaremba @woj_zaremba

79K Followers 192 Following Co-Founder of OpenAI

Kyunghyun Cho @kchonyc

61K Followers 2K Following a combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre PC at @nyuniversity (@CILVRatNYU) & @genentech (@PrescientDesign).

Tim Dettmers @Tim_Dettmers

29K Followers 819 Following PhD Student at @UW. I blog about deep learning and PhD life at https://t.co/Y78KDJJFE7.

Sergey Levine @svlevine

79K Followers 122 Following Associate Professor at UC Berkeley Co-founder, Physical Intelligence

Oriol Vinyals @OriolVinyalsML

166K Followers 82 Following VP of Research & Deep Learning Lead, Google DeepMind. Gemini co-lead. Past: AlphaStar, AlphaFold, AlphaCode, WaveNet, seq2seq, distillation, TF.

Shane Gu @shaneguML

28K Followers 1K Following Research Scientist & Manager @GoogleDeepMind Tokyo/MTV. ex: @GoogleAI Brain, @OpenAI. (JP: @shanegJP)

NeurIPS Conference @NeurIPSConf

111K Followers 35 Following New Orleans, Dec 10-16, 23. https://t.co/ga8aOw615g Tweets to this account are not monitored. Please send feedback to townhall@neurips.cc.

Rivers Have Wings @RiversHaveWings

31K Followers 225 Following AI/generative artist. Writes her own code. Absolute power is a door into dreaming.

udio @udiomusic

27K Followers 0 Following

Jimmy Apples 🍎/acc @apples_jimmy

34K Followers 860 Following Wagmi. 2025.

TeraflopAI @TeraflopAI

398 Followers 2 Following @EnricoShippole @arankomatsuzaki

TensoruAI @TensoruAI

19 Followers 3 Following

Jan Leike @janleike

44K Followers 322 Following ML Researcher, co-leading Superalignment @OpenAI. Optimizing for a post-AGI future where humanity flourishes.

DuckAI @TheDuckAI

621 Followers 8 Following An open-source ML research community at Discord: https://t.co/7YDTo6Mo1G

Emad @EMostaque

221K Followers 10 Following #decentralizeAI

Nat Friedman @natfriedman

182K Followers 286 Following https://t.co/Lhh178sIjq

Louis Castricato @lcastricato

3K Followers 477 Following Math @uwaterloo, RLHF @BrownCSDept, Goosefluencer. x-RS @aieleuther, x-Head of LLMs @stabilityai, x-lead @CarperAI. co-founder @synth_labs. We're hiring.

NVIDIA AI @NVIDIAAI

156K Followers 822 Following Solving the unsolvable with AI. #IAMAI

Christian Szegedy @ChrSzegedy

32K Followers 2K Following #deeplearning, #ai research scientist. Opinions are mine.

Microsoft Research @MSFTResearch

553K Followers 2K Following We advance science and technology to benefit humanity. https://t.co/kz0nARXbwT Register for Microsoft Research Forum on June 4 ⬇️ Get our newsletter

Ari Holtzman @universeinanegg

3K Followers 2K Following PI @UChicagoCS & @DSI_UChicago, leader of Conceptualization Lab https://t.co/BVCT3zdaNV, Post-doc @Meta. We don’t really know much about language models...yet.

𝔊𝔴𝔢𝔯𝔫 @gwern

42K Followers 88 Following Internet besserwisser; pedantic, mean reply guy. 𝘞𝘢𝘵𝘢𝘴𝘩𝘪 𝘬𝘪𝘯𝘪𝘯𝘢𝘳𝘪𝘮𝘢𝘴𝘶! (Follow requests ignored due to terrible UI.)

Patrick Lewis @PSH_Lewis

4K Followers 655 Following London-based AI/NLP Research Scientist. I co-lead the RAG & tool use team at Cohere w/ @s_hofstaetter. Previous Fundamental AI Research at Meta AI, FAIR, UCL AI

Mike Lewis @ml_perception

6K Followers 227 Following Llama3 pre-training lead. Partially to blame for things like the Cicero Diplomacy bot, BART, RoBERTa, kNN-LM, top-k sampling & Deal Or No Deal.

Madison May (e/ia) @pragmaticml

2K Followers 2K Following teaching machines @indicodata - professional novice

Covariant @CovariantAI

11K Followers 157 Following Empowering robots to see, think, and act.

ICML Conference @icmlconf

70K Followers 17 Following Int'l Conf on ML • July 21-27, 2024 (Vienna, Austria) • #icml2024 • Contact: https://t.co/6saHKWV01y • https://t.co/sFwmcQNWkE

Colin Raffel @colinraffel

30K Followers 654 Following nonbayesian parameterics, sweet lessons, and random birds. Friend of @srush_nlp

ACL 2024 @aclmeeting

18K Followers 35 Following Association for Computational Linguistics | ACL 2024 conference | The 62nd Annual Meeting of the ACL Hashtags: #NLProc #ACL2024NLP

UW NLP @uwnlp

11K Followers 160 Following The NLP group at the University of Washington.

Allen Institute for A.. @allen_ai

53K Followers 361 Following AI for the Common Good. › Join us: https://t.co/DqTs1G4bGO › Get our newsletter: https://t.co/tvb1VpySfL

Sam Bowman @sleepinyourhat

35K Followers 3K Following AI alignment + LLMs at NYU & Anthropic. Views not employers'. No relation to @s8mb. I think you should join @givingwhatwecan.

Thang Luong @lmthang

20K Followers 100 Following Senior staff scientist @GoogleDeepMind. PhD @StanfordNLP. PI #AlphaGeometry. Co-lead #Bard Multimodality, now #Gemini. Co-founder #MeenaBot (later LaMDA).

trieu @thtrieu_

2K Followers 241 Following thinking about thinking. created alphageometry, darkflow. prev: nyu, google brain/deepmind

rewon @rewonfc

980 Followers 288 Following Here for the research papers

Nal @nalkalc

35K Followers 259 Following Researcher in Deep Learning @GoogleDeepMind. Angel investor. Co-creator @GoogleAI Brain Amsterdam, Ex @DeepMind, Edu at Oxford, UvA and Stanford.

Guillaume Lample @GuillaumeLample

37K Followers 648 Following Cofounder & Chief Scientist https://t.co/hLfvKLkFHd (@MistralAI). Working on LLMs. Ex @MetaAI | PhD @Sorbonne_Univ_ | MSc @CarnegieMellon | X11 @Polytechnique

AllenNLP @ai2_allennlp

14K Followers 31 Following The AllenNLP team works on language-centered AI that equitably serves humanity. We deliver high-impact research and open-source tools to accelerate progress.

Quoc Le @quocleix

49K Followers 107 Following Distinguished Scientist, Google.

Thomas Wolf @Thom_Wolf

68K Followers 4K Following Co-founder and CSO @HuggingFace - open-source and open-science

Stat.ML Papers @StatMLPapers

20K Followers 0 Following Unofficial updates of statistical machine learning papers on arXiv

Aäron van den Oord @avdnoord

13K Followers 176 Following Research Scientist @ DeepMind

Zach Nussbaum @zach_nussbaum

14 hours ago

Benchmarks, especially long-context embedding benchmarks, are few and far between. Great work done by @dwzhu128 and collaborators. Great seeing Nomic Embed stack up well against other long context models! Following some debugging of the evals, we reran the evals and upstreamed…

Aran Komatsuzaki @arankomatsuzaki

2 days ago

4 34 208 39K 123

0 1 7 2K 1

Fuzhao Xue @XueFz

13 hours ago

I always strongly suggest that people read this work (arxiv.org/abs/2207.10551) by @YiTayML and @m__dehghani when discussing model architecture. It takes up almost 50% of the pages of the literature survey chapter in my PhD thesis. It is so visionary to study this in 2022. I can…

Yi Tay @YiTayML

17 hours ago

not true, especially for language. if you trained a large & deep MLP language model with no self-attention, no matter how much data you feed it, you'll still be lagging behind a transformer (with much less data). will it get to the same point? i don't think so. your tokens…

29 52 532 145K 231

1 28 140 19K 156

Sachin @sacmehtauw

18 hours ago

Thank you @_akhaliq @ClementDelangue @Thom_Wolf @arankomatsuzaki @pcuenq @awnihannun and others for sharing our work.

0 1 5 254 0

Joseph Suarez (e/🐡) @jsuarez5341

19 hours ago

@arankomatsuzaki Name taken @lcastricato

0 0 2 320 0

TeraflopAI @TeraflopAI

22 hours ago

1 4 6 4K 1

EnricoShippole @EnricoShippole

22 hours ago

A big thank you to @joespeez @Meta for mentioning our previous research, YaRN, at the @weights_biases Fully Connected conference. We have some exciting long-context releases coming up soon.

TeraflopAI @TeraflopAI

22 hours ago

1 4 6 4K 1

2 0 11 906 0

Alex Havrilla @Dahoas1

23 hours ago

In my humble opinion the recent Stream of Search paper (arxiv.org/abs/2404.03683) is truly outstanding. Everyone should give it a thorough read.

7 23 158 19K 220

Hailey Schoelkopf @haileysch__

a day ago

@natolambert @arankomatsuzaki @herbiebradley tbf, there are also a lot of “pythia”s, though not ones in the LLM subfield

0 0 3 119 0

Hailey Schoelkopf @haileysch__

a day ago

Great to see others follow suit in releasing fully-open and documented LLMs!

Aran Komatsuzaki @arankomatsuzaki

2 days ago

6 54 228 56K 119

0 0 10 2K 1

Weiyan Shi @shi_weiyan

24 hours ago

4 56 215 32K 120

Nathan Lambert @natolambert

a day ago

A modern version of Pythia? Curious how good the models are.

Aran Komatsuzaki @arankomatsuzaki

2 days ago

6 54 228 56K 119

2 2 22 8K 8

Nathan Lambert @natolambert

a day ago

@haileysch__ @arankomatsuzaki @herbiebradley ooooooh noooooo github.com/CarperAI/OpenE…

1 0 3 209 1

Hailey Schoelkopf @haileysch__

a day ago

@arankomatsuzaki heyyy, that name was taken already @herbiebradley

1 0 12 999 0

kache (dingboard.com) @yacineMTB

2 days ago

@arankomatsuzaki wew

0 0 3 910 0

Shengju Qian @thesouthfrog

2 days ago

@letalvoj @arankomatsuzaki Hi Vojta, thanks for your comment. Actually, when working on the model, the first few cases we tested were on our own faces, which look quite good IMHO. Figure 6 in the paper also shows some ordinary faces. We are cleaning code for release and happy for you to test when ready

0 0 6 55 0

Ryan Boyle @_RyanBoyle_

2 days ago

Let's go @Apple !! "Diverging from prior practices that only provide model weights and inference code, and pre-train on private datasets, our release includes the complete framework for training and evaluation of the language model on publicly available datasets, including…

Aran Komatsuzaki @arankomatsuzaki

2 days ago

6 54 228 56K 119

0 2 8 1K 0

Hassan Hayat 🔥 @TheSeaMouse

2 days ago

woah, cohere seriously cooked here

Aran Komatsuzaki @arankomatsuzaki

2 days ago

6 55 300 33K 221

2 2 33 8K 12

🎙Jean-Louis Queguiner @JiliJeanlouis

2 days ago

phi-3 TLDR: Model trained with default 4K context length but Long Rope training coming with 128K context length. Original size 1.8Gb. Able to run on an iPhone A16 bionic chip using 4bit quantization with a rate of 12t/sec Overall really good model with strong performance in…

Aran Komatsuzaki @arankomatsuzaki

3 days ago

Microsoft just released Phi-3
- phi-3-mini: 3.8B model trained on 3.3T tokens rivals Mixtral 8x7B and GPT-3.5
- phi-3-medium: 14B model trained on 4.8T tokens w/ 78% on MMLU and 8.9 on MT-bench
arxiv.org/abs/2404.14219

32 144 809 323K 272

4 2 31 7K 15

dinos @din0s_

2 days ago

Aran Komatsuzaki @arankomatsuzaki

3 days ago

Btw here's the corrected figure. Phi-2 result missing some results is expected.

3 2 39 10K 4

0 0 9 580 0

Underscore @RL_Underscore

2 days ago

@arankomatsuzaki Awesome stuff. Makes you wonder how many people'll be testing this thanks to the small size, can't wait to see how Phi-3 performs outside of benchmarks.