Thomas Wolf @Thom_Wolf
Co-founder and CSO @HuggingFace - open-source and open-science thomwolf.io Joined February 2011-
Tweets4K
-
Followers67K
-
Following4K
-
Likes23K
here we go again with the usual set of meeting options between SF and Europe – time to disrupt time zones with quantum mechanics or something
Proof of concept that you can do a lot with low-cost hardware (200$) and a smart robot brain. Is robotics a software problem?
Proof of concept that you can do a lot with low-cost hardware (200$) and a smart robot brain. Is robotics a software problem?
OpenELM: a family of Open-source Efficient Language Models Welcome Apple Inc. in the family of open-source LLM trainers! 🤯 huggingface.co/collections/ap… And together with a new library: CoreNet github.com/apple/corenet
Do you have recommendation on papers for robot navigation in home? Is end-to-end navigation a thing? Is it possible to avoid SLAM or use it only as conditioning/input?
This take on the FineWeb release is one of the most interesting feedback and also a reason FineWeb is very different from even larger datasets like RedPajama-V2 (which is double its size!) Surprisingly, the size of the dataset of 15T tokens is not very important, what is much…
This take on the FineWeb release is one of the most interesting feedback and also a reason FineWeb is very different from even larger datasets like RedPajama-V2 (which is double its size!) Surprisingly, the size of the dataset of 15T tokens is not very important, what is much…
Most exciting paper of the week? Clearly this one 👇 Finally a successor to the super impressive phi-1.5/2 models – so much looking forward to playing with the weights, come help me encourage the authors to share them in the comments 😅 huggingface.co/papers/2404.14…
🆕 Introducing JAT, the first open-source multi-modal, multi-task multi-domain agent! 🤖 A step toward open generalist agents! 🚀 📰 Blog: huggingface.co/blog/jat
Llama3 was trained on 15 trillion tokens of public data. But where can you find such datasets and recipes?? Here comes the first release of 🍷Fineweb. A high quality large scale filtered web dataset out-performing all current datasets of its scale. We trained 200+ ablation…
Llama3 was trained on 15 trillion tokens of public data. But where can you find such datasets and recipes?? Here comes the first release of 🍷Fineweb. A high quality large scale filtered web dataset out-performing all current datasets of its scale. We trained 200+ ablation…
We have just released 🍷 FineWeb: 15 trillion tokens of high quality web data. We filtered and deduplicated all CommonCrawl between 2013 and 2024. Models trained on FineWeb outperform RefinedWeb, C4, DolmaV1.6, The Pile and SlimPajama!
🍿 quite enjoying this recent data-visualization battle where open-source model teams showcase a 2D mapping of the open-source AI models field along interestingly different axes good data viz skills becoming crucial in any AI model training team these days
🍿 quite enjoying this recent data-visualization battle where open-source model teams showcase a 2D mapping of the open-source AI models field along interestingly different axes good data viz skills becoming crucial in any AI model training team these days
and remember that Llama 3 is in HuggingChat 🔥 running on fast optimized @huggingface inference thank you @metaai
“I don’t want (AI) to be the property of two US tech companies” @Thom_Wolf, co-founder of @huggingface. Listen to the full conversation: ➝ youtu.be/dGR0vAJAlmI ➝ spoti.fi/49zlhr3 ➝ apple.co/3xBZuSk #scalingtheory
Arena ELO graph updated with new models. Llama 3 70b looks impressive, but the 8b Instruct version is pure madness: it outperforms GPT-3.5, Claude 2, and Mistral Medium. High variance at the moment because not a lot of votes, but interesting to see how it evolves. (Sorry I…
Few bugs but LLama-3 on Huggingchat ios app is amazing to use. System prompt of review: “You are a hyper-intelligent friendly raccoon that uses first principles based reasoning and system1/system2 thinking to concisely solve every problem in the galaxy while using lots of emojis.
Few bugs but LLama-3 on Huggingchat ios app is amazing to use. System prompt of review: “You are a hyper-intelligent friendly raccoon that uses first principles based reasoning and system1/system2 thinking to concisely solve every problem in the galaxy while using lots of emojis. https://t.co/Br6T3fQIRW
(((ل()(ل() 'yoav))).. @yoavgo
46K Followers 2K FollowingSoumith Chintala @soumithchintala
186K Followers 883 Following Cofounded and lead @PyTorch at Meta. Also dabble in robotics at NYU. AI is delicious when it is accessible and open-source.Jeremy Howard @jeremyphoward
222K Followers 5K Following 🇦🇺 Co-founder: @AnswerDotAI & @FastDotAI ; Hon Professor: @UQSchoolITEE ; Digital Fellow: @Stanfordabhishek @abhi1thakur
81K Followers 663 Following 🤗 I build AutoTrain @huggingface 👨🏽💻 World's First 4x Grand Master @kaggle 🎥 YouTube 100k+: https://t.co/BHnem8fTu5 ⭐ GitHub StarDelip Rao e/σ @deliprao
46K Followers 5K Following Busy inventing the shipwreck. @Penn. Past: @johnshopkins, @UCSC, @Amazon, @Twitter ||Art: #NLProc, Vision, Speech, #DeepLearning || Life: 道元, improv, running 🌈merve @mervenoyann
56K Followers 4K Following open-sourceress at @huggingface 🧙🏻♀️ proud mediterrenean 🍋 I do TL;DR on ML papersJulien Chaumond @julien_c
47K Followers 1K Following Co-founder and CTO at @huggingface 🤗. ML/AI for everyone, building products to propel communities fwd. @Stanford + @Polytechniqueclem 🤗 @ClementDelangue
91K Followers 5K Following Co-founder & CEO @HuggingFace 🤗, the open and collaborative platform for AI buildersLucas Beyer (bl16) @giffmana
56K Followers 446 Following Researcher (Google DeepMind/Brain in Zürich, ex-RWTH Aachen), Gamer, Hacker, Belgian. Mostly gave up trying mastodon as [email protected]Hugging Face @huggingface
345K Followers 189 Following The AI community building the future. https://t.co/VkRPD0VKaZ #BlackLivesMatter #stopasianhateRichard Socher @RichardSocher
101K Followers 971 Following CEO @youSearchEngine Investing at @aixventuresHQ Before: Stanford Adj Prof in AI/NLP, Chief Scientist at Salesforce, MetaMindPercy Liang @percyliang
49K Followers 408 Following Associate Professor in computer science @Stanford @StanfordHAI @StanfordCRFM @StanfordAILab @stanfordnlp | cofounder @togethercompute | PianistSasha Rush @srush_nlp
52K Followers 464 Following Professor, Programmer in NYC. Cornell Tech, Hugging Face 🤗 https://t.co/cZl0wTfqGzOmar Sanseviero @osanseviero
32K Followers 2K Following Chief Llama Officer @huggingface 🦙 Founder @AI_Learners. Xoogler (SWE @Google Assistant, 20% PM TF Graphics). 100% Hacker Llama🇵🇪🇲🇽Eric Jang @ericjang11
69K Followers 3K Following physical AGI at 1X. Author of "AI is Good for You" https://t.co/eFg4WXhg0pKyunghyun Cho @kchonyc
61K Followers 2K Following a combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre PC at @nyuniversity (@CILVRatNYU) & @genentech (@PrescientDesign).Horace He @cHHillee
24K Followers 449 Following Working at the intersection of ML and Systems @ PyTorch "My learning style is Horace twitter threads" - @typedfemaleTanishq Mathew Abraha.. @iScienceLuvr
54K Followers 1K Following PhD at 19 | Founder and CEO at @MedARC_AI | Research Director at @StabilityAI | @kaggle Notebooks GM | Biomed. engineer @ 14 | TEDx talk➡https://t.co/xPxwKTq6QbNforthiff @Nforthi03391761
34 Followers 126 FollowingChristina Gadiya @ChristinaG48630
11 Followers 107 FollowingBen @Ben504417390552
0 Followers 216 FollowingAlena @alenaconstant
200 Followers 933 FollowingYasser Ahmed @YasserAhmed1029
0 Followers 187 FollowingAlbert Cañigueral @AlbertCanig
12K Followers 3K Following Explorador de la tecnología y sociedad con una perspectiva crítica. Ahora en @bsc_cns. Autor "El trabajo ya no es lo que era". Ouishare alumni.Volker Stocker @volker_stocker
2K Followers 5K Following Internet researcher | Head @JWI_Digi_Econ (@JWI_Berlin) | PostDoc @TUBerlin @ct_inet | Econ & Policy Research on Internet & PlatformsFernanda Chirov @FChirov8461
0 Followers 46 FollowingMatthew Douglas @mattkdouglas
69 Followers 414 Following Father, husband, software engineer and robot nerd.chilling @chillingmaru
13 Followers 83 Following chill=精神集中で体内にエクスタシーを発生させる技術は知られていない。chillによって、オナニー、セックス、ドラッグなど馬鹿らしくなるほどのエクスタシーに浸り続ける快楽主義者。Jasper @latentjasper
8K Followers 380 Following Dad, husband, machine learning researcher. Research Scientist @ Google Brain.Rabih Kanaan @Rabih_Kanaan
5K Followers 2K Following Founder https://t.co/BYHc1KELq5 | Data-Driven | Platform Builder | Startup Advisor ⬆️ |マッツ @mtsnrtkhr
148 Followers 727 FollowingOlgaKellogg6c @kellogg6c13930
4 Followers 14 Following Postsecondary Communications Teachers,An otaku,doodlingS. Ota @susumuota
534 Followers 161 Following Summarize the top 30 most popular arXiv papers on Reddit and Hacker News in the last 30 days and post them on Twitter.Daniel Silva @danieljclsilva
19 Followers 452 FollowingPrashanth Koripalli @prashanthko7
5 Followers 130 Followingrose lin @roselin86992969
4 Followers 92 FollowingAriel @ariellee82
36 Followers 536 FollowingXiomara Quiroz @xquiroz82983610
161 Followers 2K Followingsimrat hanspal @simsimsandy
84 Followers 453 Following Exploring LLMs | Data scientist with a curious engineering mindPaylz @paylza
137 Followers 2K Following The best online market for digital downloads with best prices.Cathy Fu @cathyf30
146 Followers 930 Following Prev. Corp Comms @ByteDanceTalk & @Baidu_Inc; Dubai Business AssociatesNanobits @The_Nano_bits
7 Followers 110 FollowingF.Mackenzie @mackenzie85372
7 Followers 142 Followingtoby jordan-smith @Tobyjs1996
159 Followers 314 Following founding gtm @sievedata. AV/AI CLOUD. keen on building.Ka Ho Wu @kahowuture
91 Followers 826 Following 3X exits entrepreneur | Pioneering intent computing at UtopiaOS | Pronouns: Giga/ChadHaniwa@営業DXコン.. @consulting_dx
8 Followers 71 Following 技術顧問/Web・アプリ開発/DXコンサル/生成AI/エンジニア #Salesforceiswhatiamnot @iswhatiamnot1
34 Followers 70 FollowingDavos Code @JuanDHernandezG
449 Followers 261 Following Solopreneur | Senior Software Engineer | AI Agent Developer | Building Azzert AI AgencyAlexander Koch @alexkoch_ai
5K Followers 203 Following Founder of Tau Robotics (@taurobots) | Z Fellow | Emergent Ventures Fellow 2024MUNYAKAZI Said Able (.. @AbleMunyakazi
430 Followers 4K Following ///TRUE BELIEVER ///PREACHER///OPERATOR ///MAGNETOR ///GENERATOR///FREEDOM AND UNIT MENTORminki @minkihaha
36 Followers 238 FollowingChristian Miranda @cmoryah
213 Followers 454 FollowingYusuke Abe @AbeYusuke3
438 Followers 6K Following searching for new ideas on how to solve the unanswered questions for all mankind humanity freedom sci/tech/med #rust #quantum #AI intp-ubr-mensa-2e ∞ for ∞yuki @yuki70174077792
17 Followers 598 FollowingSoumith Chintala @soumithchintala
186K Followers 883 Following Cofounded and lead @PyTorch at Meta. Also dabble in robotics at NYU. AI is delicious when it is accessible and open-source.Jeremy Howard @jeremyphoward
222K Followers 5K Following 🇦🇺 Co-founder: @AnswerDotAI & @FastDotAI ; Hon Professor: @UQSchoolITEE ; Digital Fellow: @Stanfordabhishek @abhi1thakur
81K Followers 663 Following 🤗 I build AutoTrain @huggingface 👨🏽💻 World's First 4x Grand Master @kaggle 🎥 YouTube 100k+: https://t.co/BHnem8fTu5 ⭐ GitHub StarDelip Rao e/σ @deliprao
46K Followers 5K Following Busy inventing the shipwreck. @Penn. Past: @johnshopkins, @UCSC, @Amazon, @Twitter ||Art: #NLProc, Vision, Speech, #DeepLearning || Life: 道元, improv, running 🌈merve @mervenoyann
56K Followers 4K Following open-sourceress at @huggingface 🧙🏻♀️ proud mediterrenean 🍋 I do TL;DR on ML papersJulien Chaumond @julien_c
47K Followers 1K Following Co-founder and CTO at @huggingface 🤗. ML/AI for everyone, building products to propel communities fwd. @Stanford + @Polytechniqueclem 🤗 @ClementDelangue
91K Followers 5K Following Co-founder & CEO @HuggingFace 🤗, the open and collaborative platform for AI buildersLucas Beyer (bl16) @giffmana
56K Followers 446 Following Researcher (Google DeepMind/Brain in Zürich, ex-RWTH Aachen), Gamer, Hacker, Belgian. Mostly gave up trying mastodon as [email protected]Hugging Face @huggingface
345K Followers 189 Following The AI community building the future. https://t.co/VkRPD0VKaZ #BlackLivesMatter #stopasianhateRichard Socher @RichardSocher
101K Followers 971 Following CEO @youSearchEngine Investing at @aixventuresHQ Before: Stanford Adj Prof in AI/NLP, Chief Scientist at Salesforce, MetaMindPercy Liang @percyliang
49K Followers 408 Following Associate Professor in computer science @Stanford @StanfordHAI @StanfordCRFM @StanfordAILab @stanfordnlp | cofounder @togethercompute | PianistSasha Rush @srush_nlp
52K Followers 464 Following Professor, Programmer in NYC. Cornell Tech, Hugging Face 🤗 https://t.co/cZl0wTfqGzOmar Sanseviero @osanseviero
32K Followers 2K Following Chief Llama Officer @huggingface 🦙 Founder @AI_Learners. Xoogler (SWE @Google Assistant, 20% PM TF Graphics). 100% Hacker Llama🇵🇪🇲🇽Eric Jang @ericjang11
69K Followers 3K Following physical AGI at 1X. Author of "AI is Good for You" https://t.co/eFg4WXhg0pKyunghyun Cho @kchonyc
61K Followers 2K Following a combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre PC at @nyuniversity (@CILVRatNYU) & @genentech (@PrescientDesign).Horace He @cHHillee
24K Followers 449 Following Working at the intersection of ML and Systems @ PyTorch "My learning style is Horace twitter threads" - @typedfemaleTanishq Mathew Abraha.. @iScienceLuvr
54K Followers 1K Following PhD at 19 | Founder and CEO at @MedARC_AI | Research Director at @StabilityAI | @kaggle Notebooks GM | Biomed. engineer @ 14 | TEDx talk➡https://t.co/xPxwKTq6QbRosanne Liu @savvyRL
33K Followers 968 Following Cofounded & running @ml_collective. Host of Deep Learning Classics & Trends. Research at Google DeepMind. DEI/DIA Chair of ICLR & NeurIPS. Writing https://t.co/IbycyGfnDRKristina Gligorić @krisgligoric
846 Followers 576 Following CS Postdoc @Stanford @StanfordNLP, @snsf_ch fellow. PhD @EPFL_en, Ex Intern @GoogleAI @mpi_sws_. NLP, Computational Social Science. https://t.co/hclg9MYZ6eSemil @semil
124K Followers 991 Following Investor via @HaystackVC focused on seed-stage investments // Venture Partner w/ @LightspeedVPHF = HaFedh @not_so_lain
471 Followers 1K Following i contribute to custom Ai architectures on huggingface | Tensorflow developer | LowRes admin | open for work | https://t.co/9rhyDH220LDavid Singleton @dps
10K Followers 1K Following Chief Technology Officer @stripe. Born in Belfast, former Londoner, married to @fjsingleton. I like to make things.NewTechKids @NewTechKids
480 Followers 253 Following NewTechKids is on a mission to provide all children in primary and high school with tech innovation and computer science education.OpenLLMLeaders @OpenLLMLeaders
199 Followers 1 Following Track 🤗 Open LLM Leaderboard. Created by https://t.co/ywEwEb4O1GAleksei Petrenko @petrenko_ai
812 Followers 467 Following Research Scientist @ Apple, Deep Learning & AI. I do fast open-source RL, see https://t.co/S6VEcLFia7 Transhumanist, radical life extension enthusiast.Timo Schulze @Timo770809
1 Followers 4 Followingnear @nearcyan
45K Followers 882 Following https://t.co/IdaJwZJCXm partner @ https://t.co/9g1MIgjiqc dms openHersh Desai @Hersh_Desai
4K Followers 726 Following Investing in startups with @natfriedman + @danielgross; run @aigrant_ and the Andromeda Cluster; stan dogs and warm carbs & cheeseMichael Laskey @Michaellaskey7
770 Followers 115 Following shipping robots CTO @sheeprobotics | ex @berkeley_ai PhdDaniel Choi @danieljkchoi
100 Followers 37 Following Building Robots @sheeprobotics Youtube: Just Make It DanielDan Brickley @danbri
12K Followers 13K Following Technologist. DAN do anything now! Standards, open data, interoperability. Schenas, FOAF, Linked information. “He/him” but “them” where gender irrelevant 😷vLLM @vllm_project
783 Followers 11 Following A high-throughput and memory-efficient inference and serving engine for LLMsAlessio Quaglino @TheSmallQuail
160 Followers 300 Following MuJoCo dev at @deepmind | Ex @McLarenF1 | Maths PhD at @uniGoettingen | 🇮🇹tldraw @tldraw
56K Followers 7 Following infinite canvas / https://t.co/oXL4NAc6P8 / https://t.co/dO6WPp6YOI / https://t.co/FbWiDYFD3OQuentin Gallouédec @QGallouedec
325 Followers 417 Following Research engineer @huggingface 🤗 PhD in RL Member of Stable-Baselines team: https://t.co/eX7JDWqc9FDiego Molina @diegofmolina
2K Followers 465 Following @SeleniumHQ Tech Lead. Staff Software Engineer Open Source & Community @saucelabs 🇨🇴Sightengine @Sightengine
1K Followers 758 Following Empowering every company to improve lives & create delightful experiences through Content Analysis technologies. #ContentModeration #Trust #Safety #API #AIchansung @algo_diver
4K Followers 570 Following @GoogleDevExpert for ML and @googlecloud | @huggingface Fellow | MLOps | Software Engineering | Open Source LoverRerun @rerundotio
3K Followers 138 Following Rerun is an open-source SDK for visualizing streams of multimodal data. ⭐ GitHub https://t.co/yf1KZN7DBI 👾 Discord https://t.co/7PIlvsZO9nDaniel San @dani_avila7
4K Followers 1K Following Building artificial intelligence tools 🤖 https://t.co/PxOrsWzI55Maxime Voisin @maximevoisin_ai
746 Followers 669 Following Product manager RAG/Tools/Code @cohere. Previously @labelbox, @stanford computer vision labsAudrow Nash @audrow
3K Followers 1K Following Robotics nerd and Podcaster. Interested in robots, AI, and manufacturing. Engineer @IntrinsicAIMark Zuckerberg @finkd
760K Followers 748 Followingdaniel bashir @spaniel_bashir
697 Followers 219 Following applied typist (ml engineer), chief shenanigans officer @gradientpub nuggets @Last_Week_in_AI bad writing https://t.co/e2f47gN1JtSebastian Borgeaud @borgeaud_s
997 Followers 260 Following Research Engineer at DeepMind with a focus on Large Language Models and large scale Deep LearningAhmad Al-Dahle @Ahmad_Al_Dahle
4K Followers 53 Following #Girldad of twins. Leading GenAI @ Meta (llama, imagine, meta ai and more)Daniel Johnson @_ddjohnson
2K Followers 576 Following Researcher at @GoogleDeepMind. PhD student at @VectorInst / @UofT. Building tools to study neural nets and find out what they know. He/him.Proxima Fusion @proximafusion
183 Followers 0 Following Designing optimised stellarator power plants - the most promising and robust concept for putting fusion energy on the grid.The Rundown AI @TheRundownAI
131K Followers 100 Following Daily AI newsletter with over 500,000+ readers. Get the latest in AI and learn how to apply it in 5 minutes. By @rowancheungRowan Cheung @rowancheung
497K Followers 377 Following Founder @therundownai. Sharing the latest developments in the world of artificial intelligence.Tamay Besiroglu @tamaybes
3K Followers 720 Following Thinking about economics, computing and machine learning @EpochAIResearch @MIT_CSAILthe thing @visitthething
14 Followers 2 FollowingFlorent Daudens @fdaudens
11K Followers 6K Following Press Lead @HuggingFace / Passionate about AI & news / Previously @radiocanadainfo @ledevoir & coLintang Sutawika @lintangsutawika
381 Followers 565 Following Incoming Ph.D. student @LTIatCMU. Researcher at @AIEleuther. Maintainer of LM-Eval Harness. Here for machine learning papers and discussion.Adil D. Ztn 👒 @AdilZtn
244 Followers 1K Following A boring guy who does things. Currently, I'm trying to make reinforcement learning boring. PhD Student/ Research Engineer in RL @irtSaintEx & @ISAE_officielRustNL @Rust_NL
599 Followers 78 Following Non-profit with the goal to promote the Rust programming language in the Netherlands and Europe. Organizes meetups and events.Nitya Narasimhan, PhD.. @nitya
10K Followers 1K Following Parent ❤ | Polyglot 👩🏽💻 | Innovator💡 | PhD 🎓 Dist Systems - Mobile/Web Dev - AI/ML Illustrations @SketchTheDocs Azure, AI, Advocacy Ma Phaleshu KadachanaDaniel Ek @eldsjal
309K Followers 815 Following Father, CEO and Founder of @Spotify, @PrimaMateria_ and @NekoHealthHappy to say that @huggingface accelerate has hit 100 MILLION downloads today! It's been so much fun enabling so many users to have their code just run on any system with as minimal friction as possible. Here's to 200M 🚀🚀🚀
April was packed with great articles. Here are my favorites: networklawreview.org/april-2024 Includes @DanielDancrane, @makisbrussels, @Thom_Wolf, @StanfordHAI, @StevenLevy, @HarrySurden, @CassSunstein, @RobertMahari, @gilbert, @djrosent, @ingridlunden, @SincDavidson, @MarioLeccese2,…
@_josh_meyer_ A small open source model for offline chat with onboard memory (can upsert chats to the rabbit hole once reconnected). Ability to change llms. Voice memos (I.e., record mode without having to hold the button) Rabbit hole companion iOS app (especially for watch)
what features do you most want to see in r1?
Why is an alarm not the first thing added to these AI gadgets? The rabbit r1 and humane ai pin both don’t have alarms at launch. Do the founders not use alarms or calendars or GPS?
This weekend, we’re Natural Lake Processing! #NLProc
HELM Lite v1.2.0 is out! Datasets: NarrativeQA, NaturalQA, OpenbookQA, MMLU, MATH, GSM8K, LegalBench, MedQA, WMT14 Results (we still need to add Claude 3, which requires more prompt finagling): crfm.stanford.edu/helm/lite/v1.2…
AI at its worst. I am an AI optimist, but even I recognize AI brings major new problems for human beings. This being one of them.
In a disturbing exploitation of AI, former Pikesville High School athletic director Dazhon Darien was arrested for allegedly impersonating the school’s principal using AI voice synthesis to disseminate false racist and antisemitic statements. The synthetic audio, widely shared on…
The first-ever custom pipeline with the streaming feature is now available on @huggingface in this PR. will implement this deep within the transformers library by default huggingface.co/google/gemma-1…
Announcing that we are on our way to solve a long standing issue of document processing: correction of OCR mistakes. @pleaisfr publishes the largest dataset to date with automated OCR correction, 1 billion words in English, French, German and Italian huggingface.co/datasets/PleIA…
@huggingface is an incredible plateform to structure engineering/research efforts and leverage the power of the open source community. I can't wait to see robotics datasets and models being shared on the hub 😉
I can’t see a future where every company isn’t training or fine tuning their own models. Finally all those data lakes will actually be used. It also seems like @huggingface will be as integral to every Eng team like GitHub is now (if they aren’t already).
New model added to the leaderboard! Model Name hf.co/microsoft/Phi-… Overall rank: 1450 Rank in 3B category: 1 Benchmarks Average: 69.91 ARC: 62.97 HellaSwag: 80.6 MMLU: 69.08 TruthfulQA: 59.88 Winogrande: 72.38 GSM8K: 74.53
It's been exactly one week since we released Meta Llama 3, in that time the models have been downloaded over 1.2M times, we've seen 600+ derivative models on @huggingface and much more. More on the exciting impact we're already seeing with Llama 3 ➡️ go.fb.me/xsqzz8
Cohere 🩷 🇫🇷
we open sourced our chat interface. github.com/cohere-ai/cohe…
Scatter plot with top-left good and yolo axes is the new radar plots where ours surrounds everything.
Good morning: @SnowflakeDB’s new 480B parameter #LLM is made of 128 experts! It’s bigger than #Grok and is now the largest *fully open source (Apache 2.0* LLM! 🧵👇 how does it compare to Llama 3, Mixtral, and GPT4?
Proof of concept that you can do a lot with low-cost hardware (200$) and a smart robot brain. Is robotics a software problem?
Excited to announce Tau Robotics (@taurobots). We are building a general AI for robots. We start by building millions of robot arms that learn in the real world. In the video, two robot arms are fully autonomous and controlled by a single neural network conditioned on different…
Check out mistral.rs, our #Rust-based open source inference engine allowing for fast #LLM serving for a variety of architectures including X-LoRA mixture-of-expert (MoE) models, Llama-3, Mistral/Mixtral, Gemma & many others. Built on the @huggingface #Candle framework for #Rust…
It turns out I had some misunderstandings about how Mixture of Experts really works, and the 128 experts seems more justifiable This blogpost by @huggingface was helpful: huggingface.co/blog/moe And: blog.javid.io/p/mixtures-of-… I had likened MoE to Random Forest which turned out to…
Good morning: @SnowflakeDB’s new 480B parameter #LLM is made of 128 experts! It’s bigger than #Grok and is now the largest *fully open source (Apache 2.0* LLM! 🧵👇 how does it compare to Llama 3, Mixtral, and GPT4?
Yesterday, we open sourced the Cohere Toolkit. We think this will be a major accelerant for getting LLMs into production within enterprise. github.com/cohere-ai/cohe…
> be me > on vacation > kid asleep, wife away > but I'm not tired! > whip out colab > load my model > import new benchmark > try my model > tfw sota, sota by far > double-check for bugs or leaks > no bug found > no leak found idk man, probably a bug. Also, twitter is reddit now.