Niels Rogge @NielsRogge
ML Engineer @ML6team, part-time at @huggingface. @KU_Leuven grad. General interest in machine learning, deep learning. Making AI more accessible for everyone!
nielsrogge.github.io
Belgium
Joined April 2010
Tweets 2K
Followers 10K
Following 690
Likes 1K
In case you're wondering who the goat is who implemented Whisper, Mistral, Mixtral, Llama-2 and 3 in Transformers, this is him!👇 CEOs would call him a 10x engineer
Today we really appreciated @narsilou and @huggingface for the hf_transfer package. Transferring xlarge model checkpoints couldn’t be easier! 🚀🔥 github.com/huggingface/hf…
Llama3 was trained on 15 trillion tokens of public data. But where can you find such datasets and recipes?? Here comes the first release of 🍷Fineweb. A high quality large scale filtered web dataset out-performing all current datasets of its scale. We trained 200+ ablation…
One question I have here for the @lmsysorg team - are ELO ratings only based on the blind tests or also on the "side-by-side" tab? As otherwise people could boost Llama-3 results
If you weren't aware, LLaMa-3 is on @huggingface from day 1! Thanks @finkd huggingface.co/blog/llama3
Super interesting paper about 3D awareness of various vision models. All available on @huggingface: DINOv2: huggingface.co/collections/fa… SigLIP: huggingface.co/collections/go… MiDaS: huggingface.co/collections/In… SAM: huggingface.co/facebook/sam-v… MAE: huggingface.co/facebook/vit-m… etc.
Mistral dropped their official 8x22B checkpoints on the hub, including a new instruction tuned one! huggingface.co/mistralai/Mixt…
The @huggingface M4 (multimodal) team released a new model, Idefics-2, and look at those numbers! With only 64 image tokens, the model outperforms LLaVA-NeXT-13B, which uses 2880 image tokens. Clever techniques include NaViT and Perceiver resampling. Blog: huggingface.co/blog/idefics2
Introducing Idefics 2 🤯 An 8B Vision-Language Model - literally punching above its weight.
> Apache 2.0 licensed! 🔥
> Competitive with 30B models like MM1-Chat
> 12 point increase in VQAv2, 30 point increase in TextVQA (compared to Idefics 1)
> 10x fewer parameters than…
Pretty crazy how little effort it takes to push a custom dataset to @huggingface: huggingface.co/datasets/niels… This is the recent dataset that @xai shared for their Grok-1.5 vision preview: x.ai/blog/grok-1.5v
I miss the days of 140 characters Twitter, without influencers putting their text in bold
Weights on @huggingface or it didn't happen Mr. Musk 😎
Anybody can now train a multimodal model on their own dataset in just a few lines of code with TRL 🚀! The SFTTrainer now has support for vision LLMs like LLaVa, so you can fine-tune your models to both see and follow your instructions 👀 TRL: github.com/huggingface/trl Full…
We've made it easier to fine-tune VLMs like LLaVa, check out the tutorial below for more info! The `SFTTrainer` class of TRL now includes experimental support for fine-tuning vision-language models on custom data :)
AK @_akhaliq
309K Followers 3K Following AI research paper tweets, ML @Gradio (acq. by @HuggingFace 🤗) dm for promo follow on Hugging Face: https://t.co/q2Qoey80Gx
Mark Tenenholtz @marktenenholtz
114K Followers 544 Following Head of AI @PredeloHQ. XGBoost peddler, transformer purveyor.
Hugging Face @huggingface
343K Followers 189 Following The AI community building the future. https://t.co/VkRPD0VKaZ #BlackLivesMatter #stopasianhate
merve @mervenoyann
56K Followers 4K Following open-sourceress at @huggingface 🧙🏻♀️ proud mediterrenean 🍋 I do TL;DR on ML papers sometimes. RTs != endorsements
Omar Sanseviero @osanseviero
31K Followers 2K Following Chief Llama Officer @huggingface 🦙 Founder @AI_Learners. Xoogler (SWE @Google Assistant, 20% PM TF Graphics). 100% Hacker Llama🇵🇪🇲🇽
Julien Chaumond @julien_c
47K Followers 1K Following Co-founder and CTO at @huggingface 🤗. ML/AI for everyone, building products to propel communities fwd. @Stanford + @Polytechnique
Jim Fan @DrJimFan
229K Followers 3K Following @NVIDIA Sr. Research Manager & Lead of Embodied AI (GEAR Lab). Creating foundation models for Humanoid Robots & Gaming. @Stanford Ph.D. @OpenAI's first intern.
abhishek @abhi1thakur
81K Followers 662 Following 🤗 I build AutoTrain @huggingface 👨🏽💻 World's First 4x Grand Master @kaggle 🎥 YouTube 100k+: https://t.co/BHnem8fTu5 ⭐ GitHub Star
Jeremy Howard @jeremyphoward
222K Followers 5K Following 🇦🇺 Co-founder: @AnswerDotAI & @FastDotAI ; Hon Professor: @UQSchoolITEE ; Digital Fellow: @Stanford
Nate Raw @_nateraw
7K Followers 1K Following machine learning hacker. previously @huggingface @lightningai
Lior⚡ @AlphaSignalAI
84K Followers 895 Following Covering the latest in AI R&D • ML Engineer • Ex-Mila researcher • MIT Lecturer • Building AlphaSignal, a technical newsletter read by 180,000+ ML experts.
Lewis Tunstall @_lewtun
9K Followers 425 Following 🤗 LLM engineering & research @huggingface 📖 Co-author of "NLP with Transformers" book 💥 Ex-particle physicist 🤘 Occasional guitarist 🇦🇺 in 🇨🇭
AI Pub @ai__pub
72K Followers 343 Following AI papers and AI research explained, for technical people. Get hired by the best AI companies: https://t.co/MySVjUGOQ3
Jean de Nyandwi @Jeande_d
38K Followers 771 Following Deep Learning, Vision 🤍 Language, Multimodal LLMs • AI Education • CMU Research blog: https://t.co/1BEFLZAqe7 ML Pack: https://t.co/7PkTyDvuri
Thomas Wolf @Thom_Wolf
68K Followers 4K Following Co-founder and CSO @HuggingFace - open-source and open-science
Ross Wightman @wightmanr
18K Followers 1K Following Computer Vision @ 🤗. Ex head of Software, Firmware Engineering at a Canadian 🦄. Currently building ML, AI systems or investing in startups that do it better.
TuringPost @TheTuringPost
62K Followers 16K Following Newsletter exploring AI & ML - Weekly trends - LLM/FM insights - Unicorn spotlights - Global dynamics - History Led by @kseniase_ Elevate your AI game 👇🏼
Delip Rao e/σ @deliprao
46K Followers 5K Following Busy inventing the shipwreck. @Penn. Past: @johnshopkins, @UCSC, @Amazon, @Twitter ||Art: #NLProc, Vision, Speech, #DeepLearning || Life: 道元, improv, running 🌈
Nils Reimers @Nils_Reimers
10K Followers 434 Following Director of Machine Learning @Cohere | ex-huggingface | Creator of SBERT (https://t.co/MKKOMfuQ4C)
Fiana | A Learner @fianfitr
174 Followers 5K Following @F_Nurfitriana | Psy. | Always go with the choice that scares you the most, because that's the one that is going to help you grow. | Bismillah CHRO.
Khiem Vinh Tran @vinhkhiem
16 Followers 146 Following NLP Enthusiast. My Google Scholar: https://t.co/GwQ5ZUTphW…
abhi pandey @vylericD3vil
2 Followers 104 Following
Master @MasterXing88
86 Followers 858 Following
christopher pinier @chris_pinier
15 Followers 1K Following
Nikita @nikitavoloboev
4K Followers 6K Following Make @LearnAnything_ Learn in public: https://t.co/GbFvuErkYn macOS course: https://t.co/JdbJWru6zG https://t.co/94R8ER7K2h https://t.co/ROkqhyhpEK
Frаnçois @fpaupier
149 Followers 763 Following Engineering, data, ML - sharing what I learn along the way
Timo Denk @TimoDenk
89 Followers 445 Following Software, Music, Machine Learning, Philosophy, Sports, Science, Aviation
Vincent Christlein @v_christlein
464 Followers 499 Following Researcher at pattern recognition lab, @FAU_Germany, interests: #MachineLearning, #PatternRecognition, #DeepLearning, #DocumentAnalysis, #ComputationaHumanities
ADITYA KABRA @adityakabra
286 Followers 1K Following I love science, entrepreneurship, and building things. This is my playground to run little experiments and share my ideas, projects, and learnings.
Diego Prayudha @nerooo_11
0 Followers 27 Following
miro @mirofurtado
75 Followers 583 Following engineering with gradients | ml @linkedin, prev @harvard
安餒啊 @qiu48939
2 Followers 19 Following
Mel Mashiku @mgmashiku
124 Followers 1K Following
Aakanksha Chowdhery @achowdhery
7K Followers 3K Following LLMs @ Google DeepMind :: PaLM, Gemini // Previously @MSFTResearch, @Stanford, @Princeton // views my own and subject to change
Vijaylaxmi Lendale @VJLaxmiLendale
29 Followers 243 Following
Michal Lečbych @alka3tras123
1 Followers 99 Following
Alain @Alain53078872
201 Followers 1K Following
Carles Illa @carles_illa
83 Followers 180 Following
Abdullah Maraş @AbdullahMa70560
0 Followers 9 Following
Amgad Hasan @AmgadGamalHasan
236 Followers 293 Following A machine learning engineer specializing in LLMs and ASR models
Lea_liu @Lealiu32431828
8 Followers 168 Following
Abhijit.eth(e/acc) @geekyabhijit
2K Followers 4K Following Integrated Circuit Design Engineer,Blogger, YouTuber. Astrophysics, Web3, Stocks and AI enthusiast, Geeky, In search of answer to life universe and Everything
Buchla Savage Sample .. @BuchlaSavage
182 Followers 1K Following
Mica Teo @micateo94
22 Followers 111 Following
Eljan Mahammadli @eljanmahammadli
79 Followers 336 Following ML Engineer @polygraf_ai | MSc Computer Science @gwuengineering | training Deep Neural Nets
PRANAVI BAJJURI @PranaviBajjuri
18 Followers 149 Following
abderrahim zine @abderrahimzine6
25 Followers 593 Following
P.M @p_misirov
2K Followers 535 Following InfoSec, Web3 Dev & UX Research. ex-ForEx trader. Interdisciplinary script kiddie & polyglot 🇪🇸, 🇺🇲, 🇷🇺, 🇫🇷, 🇳🇱 Building @SpearbitDAO @cantinaxyz
Tasour @TasourR
37 Followers 334 Following Error code: 0xF2024 (Lost in the virtual world). Backup failed. All data lost.
Yulin Wang @YulinWang272829
9 Followers 28 Following
Timothy Fowler @teslafsd69
113 Followers 545 Following
Investronauts @Investronautss
197 Followers 1K Following Not a financial advice 🚀✨ Tesla - AI - Nvidia - Tech - Spacex - the latest in companies, trends, and investment #Finance #Investing #Tech #Space
Apoorv Reddy @ApoorvReddy3
0 Followers 78 Following
Muizz @muizzkhan77
31 Followers 1K Following
Pranav Silimkhan @PranavSilimkhan
12 Followers 93 Following
Evans @nnnnninteen
205 Followers 1K Following 'Wanderer, your footsteps are the road, and nothing more, wanderer there is no road, the road is made by walking...' Antonio Machado
Soumyadip @soumyadip_mal
151 Followers 2K Following Faye Valentine simp. ML/Music/Movies and everything in between.
Raul Campos Nasciment.. @raulprogru
71 Followers 367 Following Entusiasta de tecnologias emancipadoras, a vida é curta para não mergulhar em tudo que importa.
西树 @xishudev
41 Followers 458 Following
AK @_akhaliq
309K Followers 3K Following AI research paper tweets, ML @Gradio (acq. by @HuggingFace 🤗) dm for promo follow on Hugging Face: https://t.co/q2Qoey80Gx
Hugging Face @huggingface
343K Followers 189 Following The AI community building the future. https://t.co/VkRPD0VKaZ #BlackLivesMatter #stopasianhate
Omar Sanseviero @osanseviero
31K Followers 2K Following Chief Llama Officer @huggingface 🦙 Founder @AI_Learners. Xoogler (SWE @Google Assistant, 20% PM TF Graphics). 100% Hacker Llama🇵🇪🇲🇽
Julien Chaumond @julien_c
47K Followers 1K Following Co-founder and CTO at @huggingface 🤗. ML/AI for everyone, building products to propel communities fwd. @Stanford + @Polytechnique
abhishek @abhi1thakur
81K Followers 662 Following 🤗 I build AutoTrain @huggingface 👨🏽💻 World's First 4x Grand Master @kaggle 🎥 YouTube 100k+: https://t.co/BHnem8fTu5 ⭐ GitHub Star
clem 🤗 @ClementDelangue
90K Followers 5K Following Co-founder & CEO @HuggingFace 🤗, the open and collaborative platform for AI builders
AI at Meta @AIatMeta
531K Followers 255 Following Together with the AI community, we are pushing the boundaries of what’s possible through open science to create a more connected world.
Lucas Beyer (bl16) @giffmana
56K Followers 444 Following Researcher (Google DeepMind/Brain in Zürich, ex-RWTH Aachen), Gamer, Hacker, Belgian. Mostly gave up trying mastodon as [email protected]
Nate Raw @_nateraw
7K Followers 1K Following machine learning hacker. previously @huggingface @lightningai
Lewis Tunstall @_lewtun
9K Followers 425 Following 🤗 LLM engineering & research @huggingface 📖 Co-author of "NLP with Transformers" book 💥 Ex-particle physicist 🤘 Occasional guitarist 🇦🇺 in 🇨🇭
AI Pub @ai__pub
72K Followers 343 Following AI papers and AI research explained, for technical people. Get hired by the best AI companies: https://t.co/MySVjUGOQ3
Thomas Wolf @Thom_Wolf
68K Followers 4K Following Co-founder and CSO @HuggingFace - open-source and open-science
Ross Wightman @wightmanr
18K Followers 1K Following Computer Vision @ 🤗. Ex head of Software, Firmware Engineering at a Canadian 🦄. Currently building ML, AI systems or investing in startups that do it better.
Delip Rao e/σ @deliprao
46K Followers 5K Following Busy inventing the shipwreck. @Penn. Past: @johnshopkins, @UCSC, @Amazon, @Twitter ||Art: #NLProc, Vision, Speech, #DeepLearning || Life: 道元, improv, running 🌈
Sanyam Bhutani @bhutanisanyam1
35K Followers 994 Following 👨💻 Sr Data Scientist @h2oai | Previously: @weights_biases 🎙 Podcast Host @ctdsshow 👨🎓 International Fellow @fastdotai 🎲 Grandmaster @Kaggle
(((ل()(ل() 'yoav))).. @yoavgo
46K Followers 2K Following
Google AI @GoogleAI
2.2M Followers 23 Following Google AI is focused on bringing the benefits of AI to everyone. In conducting and applying our research, we advance the state-of-the-art in many domains.
Pablo Montalvo @m_olbap
484 Followers 316 Following ML Engineer @HuggingFace. Previously ML R&D @ Rakuten. Computer vision and NLP mixer, ex-physicist. Dice thrower, dreamer, learner. He/him. Usually friendly :)
Tesla Optimus @Tesla_Optimus
189K Followers 11 Following A general purpose, bi-pedal, humanoid robot capable of performing tasks that are unsafe, repetitive or boring.
Sanchit Gandhi @sanchitgandhi99
4K Followers 37 Following Open-source speech @huggingface 🤗. Previously Masters' at @Cambridge_Uni.
Perplexity @perplexity_ai
132K Followers 28 Following Our mission is to serve the world’s curiosity. https://t.co/BBZ1kG0TVG
Loubna Ben Allal @LoubnaBenAllal1
4K Followers 622 Following ML Engineer @huggingface 🤗 | @ENS_ParisSaclay - MVA
Jiarui Xu @Jerry_XU_Jiarui
789 Followers 433 Following Fourth-year Ph.D in UC San Diego Undergrad. from HKUST
Ola Piktus @olapiktus
1K Followers 395 Following
niki parmar @nikiparmar09
10K Followers 775 Following
Jonathan Ho @hojonathanho
4K Followers 151 Following
apolinario (multimoda.. @multimodalart
10K Followers 376 Following ML for Art and Creativity, working @HuggingFace ([email protected])
OpenMMLab @OpenMMLab
6K Followers 127 Following From MMDetection to AI Exploration. Empowering AI research and development with OpenMMLab. Discord:https://t.co/BWaz5KtF5e
João Gante @joao_gante
2K Followers 545 Following ML @huggingface 🤗, making text generation users happy. PhD from @istecnico 🇵🇹
Eric Jang @ericjang11
69K Followers 3K Following physical AGI at 1X. Author of "AI is Good for You" https://t.co/eFg4WXhg0p
Dieuwke Hupkes @_dieuwke_
2K Followers 238 Following
Drew Jaegle @drew_jaegle
1K Followers 642 Following AI, music. Staff research scientist @GoogleDeepMind.🦿
Sofie Van Landeghem @OxyKodit
2K Followers 541 Following Software engineer with a passion for data & NLP. Open-source maintainer (spaCy, Typer). Project-based consulting through my company https://t.co/gdBBtGjV2I
Albert Villanova @avillanovamoral
2K Followers 5K Following ML Engineer @huggingface. Data Scientist, PhD Theoretical Particle Physics, BSc Computer Science. Always learning. he/him
Stas Bekman @StasBekman
7K Followers 268 Following Toolmaker. Software creator, optimizer and harmonizer. Makes things work and fly at @ContextualAI Training LLM/RAG/Generative AI/Machine Learning/Scalability
Stella Biderman @BlancheMinerva
15K Followers 748 Following Open source LLMs and interpretability research at @BoozAllen and @AiEleuther. My employers disown my tweets. She/her
Anton Lozhkov @anton_lozhkov
2K Followers 283 Following Open-sourcing Language Models @huggingface ✨
Lilian Weng @lilianweng
94K Followers 148 Following Working on AI safety, past on robotics, applied research @OpenAI; Writing ML blogs to help myself & others to learn; Ideas my own.
Saulnier Lucile @LucileSaulnier
4K Followers 432 Following AI Specialist @ Mistral AI | Former ML @ Hugging Face | ENS Paris-Saclay (MVA) | Centrale Paris
Nils Reimers @Nils_Reimers
10K Followers 434 Following Director of Machine Learning @Cohere | ex-huggingface | Creator of SBERT (https://t.co/MKKOMfuQ4C)
Simon Brandeis @SimonBrandeis
745 Followers 284 Following Software Engineer @HuggingFace - opinions my own
Matthew Carrigan @carrigmat
3K Followers 352 Following @huggingface engineer. I'm the reason your LLM frontend has a jinja2cpp dependency. Sometimes yells about housing and trans rights instead of working He/him
Suzana Ilić @suzatweet
22K Followers 2K Following Minds and Machines 🧠 Principal Product Manager, RAI | AI content safety for LLMs / Azure OpenAI @Microsoft Prev: @huggingface @CausalyAI Leading: @__MLT__
Zalando Technology @ZalandoTech
14K Followers 986 Following We're the technology team at Zalando, Europe’s leading multi-brand fashion and lifestyle destination. For company updates, check out @Zalando_Press 👈
Kashif Rasul @krasul
2K Followers 311 Following Research Scientist working on Deep Learning, Time Series Forecasting, Reinforcement Learning and HPC.
Vasudev Gupta @thevasudevgupta
362 Followers 589 Following trying to learn what AI learns | getting stuff done @unboxai_ | its all about investing
Ikuya Yamada @ikuyamada
2K Followers 493 Following Chief scientist @StudioOusia working on NLP. Visiting scientist @RIKEN_AIP. Tweets in English & 日本語. LUKE, Wikipedia2Vec. Books: 大規模言語モデル入門, ディープラーニングによる自然言語処理.
Neuralink @neuralink
1.4M Followers 1 Following Creating a general-purpose, high-bandwidth interface to the brain
Justin Johnson @jcjohnss
17K Followers 552 Following Assistant Professor @UMich CSE; Previously Research Scientist @MetaAI; CS PhD @Stanford. Deep Learning + Computer Vision.
Pieter Abbeel @pabbeel
78K Followers 435 Following Diffusion Models; Large World Model; UniSim; TRPO; SAC; Ring Attention; MAML; HER; Domain Randomization; Decision Transformer; LLM as Zero-Shot Planners; RFM-1
Karel D’Oosterlinck @KarelDoostrlnck
2K Followers 593 Following Interpretable AI, RAG, Biomedical NLP. Intern @ContextualAI, PhD student @ugent, visitor @stanfordnlp. Instigator of hikes.
Pieter Delobelle @pieterdelobelle
305 Followers 387 Following Postdoc researching fairer large language models and Dutch NLP at @KU_Leuven | ex @apple | PhD @KU_Leuven
Yesterday I gave an overview of the LLM alignment landscape at the @zurichnlp meetup - thank you @AlekFicek and @FlorianCaesar for hosting me 🤗! Here's the slides from the talk: docs.google.com/presentation/d…
@NielsRogge Yes. I'm buying him a t-shirt that says "I'm him"
@levelsio I feel like it’s cause of that 45 TB dataset they uploaded yesterday 😭
15T tokens DataLoader, you're welcome
We have just released 🍷 FineWeb: 15 trillion tokens of high quality web data. We filtered and deduplicated all CommonCrawl between 2013 and 2024. Models trained on FineWeb outperform RefinedWeb, C4, DolmaV1.6, The Pile and SlimPajama!
@NielsRogge blind tests only. vote will be invalid if identity leaked.
@NielsRogge @LangChainAI Good point! Let me add this to my todo list ;)
🤗🌐 Ready to explore Gemma models within the Hugging Face ecosystem? Join in on this demo on the open collaboration between Google's open models and @huggingface. → goo.gle/3vXbJZg
1 year diff of the Hugging Face Hub 🤗 (2023) -> (2024)
Models: 220k -> 1.025 million (x4.6)
Datasets: 50k -> 341k (x6.8)
Spaces: 39k -> 550k (x14)
And people were telling me HF would die when ChatGPT came out 🤔 Open ML is here to stay 🚀
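The growth factors quoted in the post above can be checked with a few lines of arithmetic. This is only a minimal sketch: the counts are copied from the post, expressed in thousands, and the names `hub_counts` and `growth_factor` are hypothetical helpers introduced here for illustration.

```python
# Counts as quoted in the post, in thousands (1.025 million models = 1025k).
# Each entry maps a Hub category to (2023 count, 2024 count).
hub_counts = {
    "Models": (220, 1025),
    "Datasets": (50, 341),
    "Spaces": (39, 550),
}


def growth_factor(before: float, after: float) -> float:
    """Return how many times larger `after` is than `before`."""
    return after / before


# Compute the year-over-year multiplier for each category.
factors = {name: growth_factor(b, a) for name, (b, a) in hub_counts.items()}

for name, factor in factors.items():
    print(f"{name}: x{factor:.2f}")
```

The Models ratio comes out slightly above 4.65, so the x4.6 in the post looks truncated rather than rounded; the Datasets and Spaces ratios agree with the quoted x6.8 and x14.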
Back with more Apache 2.0! We just released Mixtral-8x22B-v0.1 and Mixtral-8x22B-Instruct-v0.1: - Outperforms all open models with only 39B active parameters - Native function calling and 64K context Blog: mistral.ai/news/mixtral-8… HF base: huggingface.co/mistralai/Mixt… HF instruct:…
Time for the open-source AI robots revolution 🚀 We’ve been playing with a low-cost DJI robot controlled by 3 local open-source AI models (Whisper, Idefics2, Parler-TTS - all Apache2) & orchestrated by Dora-cs In comments a 250 lines code gist to build on top of it => enjoy!!
@NielsRogge Am I an influencer now 😱? x.com/_lewtun/status…
Alright, strap in. Support for Command-R+ was merged into llama.cpp exactly 4 hours ago. We're going to start talking to a GPT-4 level model on local hardware without a GPU. If you have 64GB of RAM, feel free to follow along 🧵
@lvwerra @ServiceNow Why do we allow Nicolas Cage to appear in any more films?! This is people drowned by falling into a pool. Only required were a few films with Nicolas Cage. It's not a zero-sum game, Nicolas Cage films can be deadly!
@NielsRogge You might find our work Striped Attention relevant and interesting (all credit goes to @exists_forall). It's Ring Attention optimized for causal LM. arxiv.org/abs/2311.09431
This week was one of the most fun ever 😍 - Met @jefrankle (DBRX), @sophiamyang (Mistral) and @ylecun (🐐) in person - Met super interesting people from the Llama team (@misovalko @ThomasScialom) and collaborators (@christiankeller, Code Llama team, and more) - Saw @sarahookr…
These are the faces of people that open source 🤗 Amazing to be with @jefrankle from Databricks, @sarahookr from Cohere, @sophiamyang from Mistral, @dvilasuero from Argilla, and @_lewtun from Hugging Face
Principal Llama Engineer finally got to meet his boss Chief Llama Officer @osanseviero for the first time in person in Parisian Tigermilk bar and it was a blast! @huggingface @AIatMeta partnership and #llamalove 🦙💚
@NielsRogge @karpathy @3blue1brown 3b1b is awesome!