Nouamane Tazi @Nouamanetazi
ML Research Engineer @huggingface 🤗. Scale it 'til you make it 🇵🇸🕊 Paris, France Joined May 2012-
Tweets230
-
Followers1K
-
Following1K
-
Likes2K
Every software developer who contributed to Lavender needs to seriously reconsider their life choices. 972mag.com/lavender-ai-is…
⚡️Announcing Nanotron v0.4, featuring the groundbreaking Mamba architecture!⚡️
Training a SOTA code LLM with a fully transparent library (nanotron) built from scratch -> Done ✅
So proud to see nanotron used for such amazing projects! 👏
So proud to see nanotron used for such amazing projects! 👏
Today is a good day – pushing two new first library releases on PyPi - nanotron⚡️: first version on pypi of this lightweight open-source library where we're playing with fast pre-training 3D parallelism in new architectures like MoE, Mamba, MiniCPM, etc - lighteval🌤️: also…
Mini OSS release today: lighteval 🌤️ (with @nathanhabib1011 and @Thom_Wolf ) It's a small LLM eval suite, to: - iterate on new tasks easily (prompt/templates variations, custom tasks...) 🧪 - evaluate HF/nanotron compatible models as fast as possible with DP/PP on GPUs ⚡️
Today, we're excited to bring you aMUSEd - a lightweight masked-image-model (MIM) intended for fast generation 🚀 We're releasing aMUSEd as a research release because its quality is NOT SoTA. Training code, paper, demo, fine-tuning code 👇 huggingface.co/blog/amused 🧵 1/6
As a non-CS person, what an honor to sit among the authors of these 4 awarded papers at NeurIPS 2023 (out of 13k papers submitted 🤯) I was only an enabler, all props should go to the amazing @Muennighoff (starting soon grad school...) as well as @srush_nlp @boazbaraktcs…
Our work on multi-epoch scaling laws with the amazing @Muennighoff (applying to grad school!) and @srush_nlp @Fluke_Ellington @olapiktus @Nouamanetazi @TurkuNLP @Thom_Wolf @colinraffel won runner-up outstanding paper award! See @Muennighoff 's thread x.com/Muennighoff/st…
Our work on multi-epoch scaling laws with the amazing @Muennighoff (applying to grad school!) and @srush_nlp @Fluke_Ellington @olapiktus @Nouamanetazi @TurkuNLP @Thom_Wolf @colinraffel won runner-up outstanding paper award! See @Muennighoff 's thread x.com/Muennighoff/st…
The top 15 most-liked organizations on @huggingface 1. @StabilityAI 20k likes 2. @AIatMeta 20k 3. @runwayml 11k 4. CompVis 10k 5. @thukeg 7k 6. @BigscienceW 7k 7. @TIIuae 7k 8. @Microsoft 6.5k 9. @GoogleAI 6k 10. @OpenAI 4k 11. @BigCodeProject 4k 12. @MosaicML 4k 13. @UKPLab 3k…
We just launched github.com/huggingface/te… New docker container optimized for speed (running faster than torch thanks to candle, reducing our kernel launches which show really well on such small models). This should be really helpful for anyone wanting to run enterprise RAG LLM.
Julien Chaumond @julien_c
47K Followers 1K Following Co-founder and CTO at @huggingface 🤗. ML/AI for everyone, building products to propel communities fwd. @Stanford + @PolytechniqueHugging Face @huggingface
346K Followers 189 Following The AI community building the future. https://t.co/VkRPD0VKaZ #BlackLivesMatter #stopasianhatemerve @mervenoyann
56K Followers 4K Following open-sourceress at @huggingface 🧙🏻♀️ proud mediterrenean 🍋 I do TL;DR on ML papersOmar Sanseviero @osanseviero
32K Followers 2K Following Chief Llama Officer @huggingface 🦙 Founder @AI_Learners. Xoogler (SWE @Google Assistant, 20% PM TF Graphics). 100% Hacker Llama🇵🇪🇲🇽Nate Raw @_nateraw
7K Followers 1K Following machine learning hacker. previously @huggingface @lightningaiLewis Tunstall @_lewtun
9K Followers 425 Following 🤗 LLM engineering & research @huggingface 📖 Co-author of "NLP with Transformers" book 💥 Ex-particle physicist 🤘 Occasional guitarist 🇦🇺 in 🇨🇭Sasha Luccioni, PhD �.. @SashaMTL
19K Followers 4K Following AI & Climate Lead @HuggingFace, Board Member of @WiMLworkshop, Founding Member of @ClimateChangeAI. @TEDTalks speaker. She/her/Dr/ 🦋Thomas Wolf @Thom_Wolf
68K Followers 4K Following Co-founder and CSO @HuggingFace - open-source and open-scienceJeremy Howard @jeremyphoward
222K Followers 5K Following 🇦🇺 Co-founder: @AnswerDotAI & @FastDotAI ; Hon Professor: @UQSchoolITEE ; Digital Fellow: @StanfordPhilipp Schmid @_philschmid
16K Followers 654 Following Tech Lead and LLMs at @huggingface 👨🏻💻 🤗 AWS ML Hero 🦸🏻 | Cloud & ML enthusiast | 📍Nuremberg | 🇩🇪 https://t.co/l1ppq3q3hkTristan Thrush @TristanThrush
3K Followers 762 Following PhD-ing @StanfordAILab @stanfordnlp. Advisor @PlaytestAI. Past: @ContextualAI, @huggingface, @Meta FAIR, @mitbrainandcog, @MIT_CSAIL, @NASAJPLMatthew Carrigan @carrigmat
3K Followers 352 Following @huggingface engineer. I'm the reason your LLM frontend has a jinja2cpp dependency. Sometimes yells about housing and trans rights instead of working He/himZach Mueller @TheZachMueller
10K Followers 395 Following 🤗 Technical Lead for the Accelerate Project | Passionate about Open Source | Nerd who enjoys touching the grass | #ADHD | He/HimManu Romero @mrm8488
21K Followers 2K Following CSO/Co-founder @maisaAI_. Head Contrib/ Ambassador🤗 @huggingface. Research 🌸@bigsciencew/@BigCodeProject | @SomosNLP_ co-founderClémentine Fourrier .. @clefourrier
3K Followers 306 Following Leaderboards & evals research @HuggingFace 🐍✨ "The future is already here, it’s just not very evenly distributed" (Gibson)Sharif Shameem @sharifshameem
53K Followers 3K Following founder @LexicaArt • in pursuit of good explanationsAnton Lozhkov @anton_lozhkov
2K Followers 283 Following Open-sourcing Language Models @huggingface ✨Suhail @Suhail
295K Followers 464 Following Founder: @playground_ai, @mixpanel Pizzatarian, programmer, music makerMohamed Achbar @MohamedAch19359
2 Followers 82 FollowingMaria @gonzalezmaria35
174 Followers 3K FollowingNoathi @Noathi183971
0 Followers 234 FollowingFlorent Daudens @fdaudens
11K Followers 6K Following Press Lead @HuggingFace / Passionate about AI & news / Previously @radiocanadainfo @ledevoir & coGlisioshi @glisioshi13580
0 Followers 216 FollowingHamza Attar @Hamza_01111011
12 Followers 236 Following I'm documenting my journey building Saas. Customer-centric solutions, marketing, and crypto. Welcome and have fun.Sailesh Kumar @saileshtalks
2K Followers 5K Following Design industry leading semiconductor solutions and help advance the industry. CEO at @bayasystems. ex- Intel Fellow and Founder at NetSpeed Systems.DaisySwift @34Al3dzrCkzlE
3 Followers 255 FollowingSmosheal @Smoshealw2kX
0 Followers 156 FollowingMadgeEvans @rWz76XmTK1FFv
0 Followers 263 FollowingNeetish @Neetish76227
175 Followers 6K FollowingMaxence @maxencejm
345 Followers 575 Following Helping ambitious students & graduates build the future • @join_ef 🇫🇷SrinivasanSS @SrinivasanSS52
6K Followers 2K Following Was a Software Engineer , Programmer a few years ago, at present, at Bangalore, focusing more into Android App, MERN Stack, Deep Learning and Hiring DevelopersTasour @TasourR
44 Followers 358 Following Error code: 0xF2024 (Lost in the virtual world). Backup failed. All data lost.Abdulrahman Tabaza @embed_dim
3 Followers 870 Following enjoyer of various vector spaces, encoders and modalitiesHamza @Hmellahiii
463 Followers 919 Following Frontend Developer by day, Building https://t.co/lQTmJ7nk7b by night. A platform to help you prepare for behavioral interviews (not ready yet)Aaditya ; @Aaditya26082004
545 Followers 7K Following CS'26 • Machine Learning • Open-Source • Web Dev. • Algorithms • Jai Shree Krishna 🦚🪈Mouad Lousimi @mouad_lous
16 Followers 27 Followingyingzhi wang @yingzhi_wang
18 Followers 86 Following Research on Speech & Audio, collaborator @SpeechBrain1HF = HaFedh @not_so_lain
471 Followers 1K Following i contribute to custom Ai architectures on huggingface | Tensorflow developer | LowRes admin | open for work | https://t.co/9rhyDH220LEva Louise Marie Gabr.. @e681554349
9 Followers 3K FollowingKaran Patel @kishanpatel_ai
26 Followers 492 Following__vaibhav__ @Sillychap101
102 Followers 3K Following Computer Science and Mathematics undergrad | IIITDigmimyo @YBeniguemim
26 Followers 282 Following Data scientist - #NLP & #KG enthusiast. Student @ Ecole Polytechnique Parisfarid @farid_dinar
13 Followers 47 Following⿻ barton 🦺𑗊 @bmorphism
2K Followers 4K Following applied categorical duck cyberneticist • building for agencies in the 21st century • inventor of the operadic cognitive diagram cognitive continuation standardTaishi @Setuna7777_2
2K Followers 3K Following CS M1 at @tokyotech_jp advised by @rioyokota 未踏TG23 Research intern: @SakanaAILabsReza Sayar @iamRezaSayar
177 Followers 697 Following 👨🏻🎓Life-long Learner👨🏻🎓 Kindness❤️, Helpfulness🫂 , AI🧠 & Reggaetón💃🏻Guneet Singh Kohli @guneetsk99
465 Followers 3K Following AI Engineer @ GreyOrange, Building Indian LLMs with Odia GenAI Independent Researcher working on variety of random problems.Muhammad Abdullah @Abdullah_kwl
42 Followers 501 Following Life is better when you're laughing...... "your time is limited,So don't waste it living someone else's life❤wassim ouanaou @wassimM_
64 Followers 95 FollowingAmine Saber @saberamine000
125 Followers 548 Following 🅴︎🅽︎🅶︎🅸︎🅽︎🅴︎🅴︎🆁︎🅸︎🅽︎🅶︎ 🆂︎🆃︎🆄︎🅳︎🅴︎🅽︎🆃︎ (𝗠𝗮𝗸𝗲 𝗶𝘁 𝗲𝗮𝘀𝘆 --» 𝗿𝗲𝗰𝗲𝗶𝘃𝗲 𝗶𝘁 𝗲𝗮𝘀𝘆 .)AK @_akhaliq
311K Followers 3K Following AI research paper tweets, ML @Gradio (acq. by @HuggingFace 🤗) dm for promo follow on Hugging Face: https://t.co/q2Qoey80GxAndrej Karpathy @karpathy
981K Followers 905 Following 🧑🍳. Previously Director of AI @ Tesla, founding team @ OpenAI, CS231n/PhD @ Stanford. I like to train large deep neural nets 🧠🤖💥Julien Chaumond @julien_c
47K Followers 1K Following Co-founder and CTO at @huggingface 🤗. ML/AI for everyone, building products to propel communities fwd. @Stanford + @PolytechniqueHugging Face @huggingface
346K Followers 189 Following The AI community building the future. https://t.co/VkRPD0VKaZ #BlackLivesMatter #stopasianhateclem 🤗 @ClementDelangue
91K Followers 5K Following Co-founder & CEO @HuggingFace 🤗, the open and collaborative platform for AI buildersYann LeCun @ylecun
713K Followers 718 Following Professor at NYU. Chief AI Scientist at Meta. Researcher in AI, Machine Learning, Robotics, etc. ACM Turing Award Laureate.François Chollet @fchollet
470K Followers 770 Following Deep learning @google. Creator of Keras. Author of 'Deep Learning with Python'. Opinions are my own.Bojan Tunguz @tunguz
187K Followers 8K Following Machine Learning ex Nvidia. Kaggle Quadruple Grandmaster. Data Scientist. Physicist. Catholic. Husband. Father. Stanford Alum. e/xgb. XGBoost.eth. AMDG.merve @mervenoyann
56K Followers 4K Following open-sourceress at @huggingface 🧙🏻♀️ proud mediterrenean 🍋 I do TL;DR on ML papersOmar Sanseviero @osanseviero
32K Followers 2K Following Chief Llama Officer @huggingface 🦙 Founder @AI_Learners. Xoogler (SWE @Google Assistant, 20% PM TF Graphics). 100% Hacker Llama🇵🇪🇲🇽Mark Tenenholtz @marktenenholtz
115K Followers 547 Following Head of AI @PredeloHQ. XGBoost peddler, transformer purveyor.elvis @omarsar0
189K Followers 486 Following Building with LLMs @dair_ai • Prev: Meta AI, Galactica LLM, PapersWithCode, Elastic, PhD • Creator of the Prompting Guide (~4M learners)Nate Raw @_nateraw
7K Followers 1K Following machine learning hacker. previously @huggingface @lightningaiabhishek @abhi1thakur
81K Followers 664 Following 🤗 I build AutoTrain @huggingface 👨🏽💻 World's First 4x Grand Master @kaggle 🎥 YouTube 100k+: https://t.co/BHnem8fTu5 ⭐ GitHub StarLewis Tunstall @_lewtun
9K Followers 425 Following 🤗 LLM engineering & research @huggingface 📖 Co-author of "NLP with Transformers" book 💥 Ex-particle physicist 🤘 Occasional guitarist 🇦🇺 in 🇨🇭Jim Fan @DrJimFan
230K Followers 3K Following @NVIDIA Sr. Research Manager & Lead of Embodied AI (GEAR Lab). Creating foundation models for Humanoid Robots & Gaming. @Stanford Ph.D. @OpenAI's first intern.Sebastian Raschka @rasbt
267K Followers 885 Following Machine learning & AI researcher writing at https://t.co/A0tXWzG1p5. LLM research engineer @LightningAI. Previously stats professor at UW-Madison.Rafi Ayub @theayubinator
49 Followers 57 Following LLM fine-tuning at @MetaAI and @PyTorch. Formerly Microsoft and Stanford.Pliny the Prompter �.. @elder_plinius
13K Followers 1K Following latent space liberator, breaker of markov chains, 1337 ai red teamer, white hat, architect-healer, cogsci 🐻Elena Yunusov @communicable
6K Followers 7K Following Executive Director, Human Feedback Foundation w Linux AI & Data | Per Aspera Ad Astra | Here to build a future that I’m fromLucie-Aimée Kaffee @frimelle
1K Followers 2K Following Computer Scientist, PhD. Applied Policy Researcher @huggingface 🤗 ML & Society; Wikipedia & languages are my ♡Matthew Johnson @SingularMattrix
12K Followers 3K Following Researcher at Google Brain. I work on JAX (https://t.co/UGa5tGfinF).Inflection AI @inflectionAI
49K Followers 3 Following We are an AI studio creating a personal AI for everyone. Our first is @pi, a supportive and empathetic conversational AI.Yuval Abraham יוב�.. @yuval_abraham
47K Followers 689 Following עיתונאי בשיחה מקומית, לרוב בגדה Journalist, 972 MagazineMarc Sun @_marcsun
870 Followers 274 Following Machine Learning Engineer @huggingface Open Source teamAI Brews @AIBrews
1K Followers 111 Following Weekly curated AI newsletter for effortless updates, without the information overload!Ben Brooks @opensauceAI
833 Followers 159 Following Public policy @StabilityAI – making foundational AI technology accessible to all. Ex-GoogleX, Uber, Coinbase. Views my ownMetaGPT @MetaGPT_
4K Followers 118 Following The Multi-Agent Framework Github: https://t.co/nJEEwdSWBy Discord: https://t.co/CQDk1XH0bz Doc Site: https://t.co/UyljGj3Imw@levelsio @levelsio
419K Followers 1K Following 💆https://t.co/AoNP9BW2Dp $3K/m ✨https://t.co/BmbkrX4Zyf $0K/m 📸https://t.co/lAyoqmSBRX $58K/m 🏡https://t.co/1oqUgfD6CZ $45K/m 🌍https://t.co/BjTozWAXwG $28K/m 🛰https://t.co/ZHSvI2wjyW $42K/m 👕https://t.co/w98s8lFJiK $9K/mSakana AI @SakanaAILabs
19K Followers 0 Following We are a Tokyo-based R&D company on a quest to create a new kind of foundational AI model based on nature-inspired intelligence. https://t.co/LonvHEtlJRWang xidong @Wangxidong2
22 Followers 111 Following PHD@Chinese University of Hong Kong, Shenzhen. To improve (Medical) LLMs’ interpretability and interactivityMihir Patel @mvpatel2000
3K Followers 385 Following Research Engineer @MosaicML | cs, math bs/ms @StanfordSubbarao Kambhampati .. @rao2z
16K Followers 29 Following AI researcher & teacher @SCAI_ASU. Works on Human-Aware AI. Former President of @RealAAAI; Chair of @AAAS Sec T. Here to tweach #AI. YouTube Ch: https://t.co/4beUPOmMW6Szymon Tworkowski @s_tworkowski
5K Followers 503 Following minimizing perplexity @xAI | prev. @GoogleAI @UniWarszawski | LongLLaMA | long-context LLMs and math reasoning | scaling maximalistIgor Babuschkin @ibab
44K Followers 685 Following Maybe the real AGI was the friends we made along the way. @xAILM Studio @LMStudioAI
16K Followers 188 Following Download & run local/open LLMs on your computer 👾 App: https://t.co/YS5uiRQ7TI (Mac/Windows/Linux)Santosh Bhavani @santosh_bhavani
193 Followers 577 Following AI/ML @NVIDIA // prev @AWSCloud @SemanticMD // @CarnegieMellon csPlayground @playground_ai
16K Followers 1 Following A powerful AI image editor to create graphics like a pro without being one. Discord: https://t.co/D2tvrsvFWuYuandong Tian @tydsh
16K Followers 808 Following Research Scientist and Senior Manager in Meta AI (FAIR). AI-guided Optimization and Representation Learning. Novelist in spare time. PhD in @CMU_Robotics.Guillaume Verdon @GillVerd
53K Followers 3K Following Founder & CEO @Extropic_AI • prev: Physics & AI R&D @ (Alphabet X / Google) • Founder @ TensorFlow Quantum • (PhD(ABD) + MMath) @ (IQC / UWaterloo / PI) • e/accLawrence Chan @justanotherlaw
946 Followers 149 Following I do AI Alignment Research. Currently at the Alignment Research Center, and on leave from my PhD at UC Berkeley’s @CHAI_berkeley.Tolga Bilge @TolgaBilge_
2K Followers 570 Following Leader of https://t.co/IiJWWuiWhD & https://t.co/Ep1VpsILyj — @Superforecaster, @SamotsvetyF, @Swift_Centre, @INFERpub — Mathematics graduate of @UiB & @UnivofStAndrewsFerdinand Mom @FerdinandMom
131 Followers 558 Following Large scale training @HuggingFace. Average CPU & CUDA optimization enjoyer ~Maxime Labonne @maximelabonne
12K Followers 437 Following Author of Hands-On Graph Neural Networks https://t.co/Q8victWUmR • Machine Learning ScientistAmir Efrati @amir
36K Followers 998 Following Executive Editor @theinformation. We're hiring. amir @ https://t.co/XVwTW5cS62 DM for SignalStephanie Palazzolo @steph_palazzolo
8K Followers 3K Following Writing AI Agenda @theinformation, texan, & horror movie aficionado // reach me at [email protected] or on Signal at 979-599-8091Yishan @yishan
79K Followers 356 Following I run Terraformation, and I was once the CEO of Reddit. Both are very interesting challenges. Views are mine alone, but also yours if I do my job right.Sang Michael Xie @sangmichaelxie
3K Followers 709 Following PhD student @StanfordAILab @StanfordNLP @Stanford advised by Percy Liang and Tengyu Ma. Prev: visiting @GoogleAI Brain, BS, MS Stanford ‘17djalal #unchained @enlamp
3K Followers 3K Following Writer, Speaker, Cloud Engineer, Survivor of Dev-Oops, Dockerfails and Kubernet-ish — #NoCode is the best code, #NoOps is the best opsLinoy Tsaban🎗️ @linoy_tsaban
2K Followers 894 Following Exploring the world of AI Art as a ML engineer @HuggingFace 🤗 | ✡️ & 🇮🇱 #BringThemHome 🎗️Alex Reibman 🖇️ @AlexReibman
24K Followers 799 Following Accelerating @agentopsai @foomvc Agents, ML, math, and data viz. Hack reporter🕶️XR-5 🐀 @xariusrke
834 Followers 248 Following High school dropout. Scaling neural networks to massive scale @huggingface. DMs open.Trevor Gale @Tgale96
1K Followers 250 Following Research Scientist @ Google DeepMind | PhD Candidate @ Stanford CSVincent Abbott | Deep.. @vtabbott_
3K Followers 200 Following Maker of *those* diagrams for deep learning algorithms | 🇦🇺 | https://t.co/kE2BIGCiMYMustafa Jarrar @mjarrar
1K Followers 913 Following Professor of #NLProc | #Ontology | #KnowledgeGraphs. Birzeit UniversityOmar Khattab @lateinteraction
11K Followers 2K Following CS PhD candidate @StanfordNLP. 2022 Apple Scholar in AI/ML. Author of ColBERT (https://t.co/2ZtgXoa1np), DSPy (https://t.co/BH7WmMKDXR), & various retrieval & LM systems.Quentin Anthony @QuentinAnthon15
997 Followers 129 Following I make models more efficient. Google Scholar: https://t.co/kzVsAKPdrpOmar Kamali @OmarKamali
507 Followers 158 Following Founder. https://t.co/V8i9X2e2fe: Get data, alerts, run automations from any website without code. @SawalniBot: the first Moroccan-speaking AI.Happy to say that @huggingface accelerate has hit 100 MILLION downloads today! It's been so much fun enabling so many users to have their code just run on any system with as minimal friction as possible. Here's to 200M 🚀🚀🚀
meanwhile I’m still on free @GoogleColab
First @nvidia DGX H200 in the world, hand-delivered to OpenAI and dedicated by Jensen "to advance AI, computing, and humanity":
With @Nouamanetazi mentoring teams this morning (cc @mervenoyann @julien_c @ClementDelangue) 🤗
The GPT4 of datasets took down Hugging Face, sorry all 😅😅😅
This take on the FineWeb release is one of the most interesting feedback and also a reason FineWeb is very different from even larger datasets like RedPajama-V2 (which is double its size!) Surprisingly, the size of the dataset of 15T tokens is not very important, what is much…
People seem to over-index on the 15T number after Llama 3. While the number matters, what is even more important is the quality and diversity of those tokens. If there was a good way to measure those, that would have been an impressive result to report.
The craziest LLaMA 3 reveal: The 400B+ version of the model is **on par with Claude 3 Opus**, and it's still training. Soon, we'll have a better-than-Opus, fully open-source model. The implications are huge.
Introducing Meta Llama 3: the most capable openly available LLM to date. Today we’re releasing 8B & 70B models that deliver on new capabilities such as improved reasoning and set a new state-of-the-art for models of their sizes. Today's release includes the first two Llama 3…
People need to get Open ML is not a zero-sum game. It's about collaboration, transparency & growth 🤗 📈50k Llama-based repos on HF, 100k weekly users 🚀Mistral, Cohere et al growing the ecosystem
🚨🇮🇷🇮🇱 IRAN WARNS ISRAEL AT UNITED NATIONS: "If Israel wants to continue its evil operations, it will receive a response dozens of times stronger. We will not hesitate to defend ourselves and reveal a small part of our deterrent power. We targeted Israeli military sites in the…
THE REVENGE OF PYTORCH just kidding :) @cHHillee (from PyTorch team) was kindly able to help improve the PyTorch baseline, done by 1) upgrading to nightly, 2) using the "compound" F.sdpa (scaled dot product attention) layer directly, and turning on a torch compile flag:…
France has an AI scene vraiment magique Merci bien à @roxannevarza for making it so 🙌 With some of the very best @sarahookr @NandoDF @mervenoyann @LoubnaBenAllal1 @osanseviero @ylecun
Highly amusing update, ~18 hours later: llm.c is now down to 26.2ms/iteration, exactly matching PyTorch (tf32 forward pass). We discovered a bug where we incorrectly called cuBLAS in fp32 mathmode 🤦♂️. And ademeure contributed a more optimized softmax kernel for very long rows…
A few new CUDA hacker friends joined the effort and now llm.c is only 2X slower than PyTorch (fp32, forward pass) compared to 4 days ago, when it was at 4.2X slower 📈 The biggest improvements were: - turn on TF32 (NVIDIA TensorFLoat-32) instead of FP32 for matmuls. This is a…
Excited to share that I joined @huggingface 🤗 contributing my experience and research on AI, communities, open source in the AI policies space as Applied Policy Researcher in the ML & Society team. Democratize all the AI! Watch this space, in the meantime, enjoy my cross-stitch
@lvwerra @ServiceNow 💯 ! A big thanks to @NicolasChapados for understanding the value of open-science for businesses like ServiceNow
We have decided to update text-generation-inference (TGI)'s license. We switch the license from HFOIL (our custom license) back to Apache 2, hence making the library fully open-source. Read below for why we are making this change 👀
Why would any company release a strong LLM for free? This is @ServiceNow (mkt cap ~$160Bn) stock price since the release of StarCoder. Only required a small amount of compute and a handful of people while building up a lot of valuable know-how fast. It's not a zero-sum game!
This week was one of the most fun ever 😍 - Met @jefrankle (DBRX), @sophiamyang (Mistral) and @ylecun (🐐) in person - Met super interesting people from the Llama team (@misovalko @ThomasScialom) and collaborators (@christiankeller, Code Llama team, and more) - Saw @sarahookr…