Tweets 331
Followers 125
Following 108
Likes 691
After two incredible years, it feels weird to announce that today is my last day at @gladia_io. I can't express how rich those two years were for me, both technically and on the human side. Going from no product, trying to find the PMF, to having our first clients and finally…
magnet:?xt=urn:btih:9238b09245d0d8cd915be09927769d5f7584c1c9&dn=mixtral-8x22b&tr=udp%3A%2F%2Fopen.demonii.com%3A1337%2Fannounce&tr=http%3A%2F%https://t.co/OdtBUsbeV5%3A1337%2Fannounce
Just finished reading the ML Engineering book by @StasBekman, it's without a doubt the best resource I have come across so far regarding the training of LLMs. Highly recommend giving it a read: github.com/stas00/ml-engi…
One of the mega trends in this YC batch is the wave of consumer AI companies. Consumer was stuck in the doldrums for years, but AI has brought it back in a big way. Here are the 21 consumer AI companies that launched today. 🧵
⚡️AutoRound, new SOTA LLM low-bit quantization approach developed by Intel Neural Compressor team (github.com/intel/neural-c…) 🎯Lots of interesting comparison with GPTQ, AWQ, HQQ, etc. Check out the blog for more details: medium.com/intel-analytic… @huggingface #IAmIntel
[1/7] Happy to release 🥕QuaRot, a post-training quantization scheme that enables 4-bit inference of LLMs by removing the outlier features. With @akmohtashami_a @max_croci @DAlistarh @thoefler @jameshensman and others Paper: arxiv.org/abs/2404.00456 Code: github.com/spcl/QuaRot
Has anyone tried it yet?
Exactly what I needed
One more time
A tweak in the architecture of #Transformers can significantly boost accuracy! With direct access to all previous blocks’ outputs, a 48-block #DenseFormer outperforms a 72-block Transformer, with faster inference! A work with @akmohtashami_a,@francoisfleuret, Martin Jaggi. 1/🧵
Looking to save a few GB of VRAM while using @huggingface's Trainer? Just upgrade to unstable, you will thank me later: pip install git+github.com/huggingface/tr… It also solved some NaNs I experienced while using fp16
Today we're excited to introduce Devin, the first AI software engineer. Devin is the new state-of-the-art on the SWE-Bench coding benchmark, has successfully passed practical engineering interviews from leading AI companies, and has even completed real jobs on Upwork. Devin is…
Another finding while comparing two training runs with @huggingface Trainer: one with compute_metrics, the other without. Interestingly, enabling this parameter generates spikes on VRAM usage, even before calling the metric function. Might be worth investigating
Gjoni Tapan @GjoniTapan78239
91 Followers 245 Following
V1na @V1nab0y
0 Followers 197 Following
Blaze (Balázs Galamb.. @gblazex
1K Followers 975 Following A Smooth Guy; Developer of SmoothScroll for macOS, Windows & Google Chrome.
Franklin Tranié @franklintrn
76 Followers 100 Following 🎓 CS Student @EPFL 📍Lausanne 🤖 Sharing the latest in ML & Tech News 🌍
Dhruv Malik @lesDecroissant
398 Followers 3K Following fullstack developer @ https://t.co/nKH7HyDDgQ / freelance Auditor/ researcher in #systemdesign, #web3 security
uuganbayar temuujin @ugshanyu
0 Followers 45 Following
AtlasMLRunner @ozgurgulerx
373 Followers 692 Following Generative AI and Startups. Loves Mediterrenean and maritime history 🇹🇷 Ataturk
Andy @anonymousLoco09
70 Followers 967 Following
jona @_jon_gg
98 Followers 629 Following
Florian Labaye @Flo_Labaye
23 Followers 43 Following
raphaelsrty @raphaelsrty
234 Followers 447 Following Language Models, Knowledge Bases, Knowledge Distillation PhD | Data Scientist @ ManoMano
Francois Kruta @fkruta
110 Followers 203 Following Passionate by technology and internet, world citizen Co-Founder & CEO Ubudu
Nicolas Guyon - e/acc @nico16184
3K Followers 5K Following 🤖 Comptoir IA Podcast & Meetup mensuel ChatGPT
Thomas Chardonnens @thomas_chardo
148 Followers 627 Following Software Engineer doing research about embeddings | CS alumni @UCBerkeley and @ISEP | proud alumni of the @IHouseBerkeley family
Ben @RunningTooLean
60 Followers 279 Following APM @ Google | Also building https://t.co/Oc4MP11m62 | Thoughts = my own | עם ישראל חי
Pierre Vannier (e/acc.. @pierre_vannier
3K Followers 4K Following Founder & CEO @flint_company, #futureOfWork. Technologist - #AI #genAI advocate, #podcast producer and guest.
LeNSansChinoiseries @Mega_dz_NRV
104 Followers 607 Following
Maxence @maxencejm
338 Followers 569 Following Helping ambitious students & graduates build the future • @join_ef 🇫🇷
Anicet @AniC_dev
183 Followers 179 Following Teaching code to 13yrs old kids and stochastic parrots Detecting black holes around billions of suns @esa My follows reflect my curiosity not my approval
Sandra Stewart @SandraStew46287
0 Followers 296 Following
CherylAdam @z34vhXk93ukEU32
0 Followers 90 Following
Michael Gackstatter @Michael__Ga
669 Followers 590 Following Building something new | Ex https://t.co/cQ9Bwmb9Yp | InsurTech & AI | Win and help win.
Nikola Liverpool @LiverpoolN58891
94 Followers 5K Following
Haihao Shen @HaihaoShen
3K Followers 3K Following Creator of Intel Neural Compressor/Speed/Coder, Intel Ext. for Transformers, AutoRound; HF Optimum-Intel Maintainer; Founding member of OPEA; Opinions my own
Molaryy @molaryy22
14 Followers 50 Following
Idriss @idrisrupt
160 Followers 2K Following AI & Data Engineer 🤖 | Digital Marketer 🌐 | Web3 believer ⛓️. "Mom raised a man, So it shall be"
ぬこぬこ @schroneko
2K Followers 2K Following 文章と生成物との互換性 | 岐阜高専 → 名大 B/M → 京大 D → 法人化 → LLM 無職 → 某 AI 企業 | 核融合は良いぞ!| Claude | Vim | Ollama | Any to Any | https://t.co/0NY8zrz7dL
Lucas Simões @_lsimoes
208 Followers 373 Following ML @pimloc || PhD CompNeuro/ML @GatsbyUCL || MSc Physics @usponline 🇬🇧 🇧🇷 🇻🇦
Amelia @khairulneymar10
28 Followers 253 Following Every new friend is a new adventure the start of more memories # Looking for a new good friend ☺️🌼
AI Deeply @AiDeeply
406 Followers 5K Following AI is reshaping the world. Who are the people and companies driving the change? Visit our website to search more than 5,000 profiles.
. @kyhwksh4b
10 Followers 0 Following
curator @jeromepatel_
174 Followers 745 Following MSc student at @UniHeidelberg, working in AI field helping to achieve AGI by small contributions (and achieve NIRVANA)
Lucas D. Rbnt 🏴.. @LTenib
23 Followers 134 Following enjoy lame stuffs such as fortnite, Olympique de Marseille and deep learning. Interested in foundation models for medical applications
Charlie GARAYT @charliegarayt
47 Followers 511 Following PhD Student at @Mines_Paris and @Geovariances Working and deep learning generative method applied to geological modeling
Stas Bekman @StasBekman
7K Followers 268 Following Toolmaker. Software creator, optimizer and harmonizer. Makes things work and fly at @ContextualAI Training LLM/RAG/Generative AI/Machine Learning/Scalability
Tarek Khalil @cmdkhalilov
802 Followers 979 Following I help sales & support teams succeed using #WhatsApp 🚀📊📈 @rasayelio
Thibault Douzon, PhD @thibaultdouzon
131 Followers 389 Following Docteur Informatique / Machine Learning @LIRISLyon / @EskerFrance. GPU-poor. Mes tweets n'engagent que moi.
Ferdinand Mom @FerdinandMom
129 Followers 559 Following Large scale training @HuggingFace. Average CPU & CUDA optimization enjoyer ~
Regina Galeas @regina_gal27361
74 Followers 3K Following
Rokacha @Rokacha183360
7 Followers 1K Following With you, I don’t lack anything. No matter how wild my heart is, I know how to say no.
Lady Falstaff @NewOne88315
268 Followers 2K Following Actress, artist, musician, dancer, courtesan and a fantastic gardener
Eiso Kant @eisokant
7K Followers 1K Following Co-founder & CTO @poolsideai w/ @jasoncwarner “The best way to predict the future is to invent it.” - Alan Kay Prev: Athenian & source{d}
Jared Friedman @snowmaker
25K Followers 521 Following founder, techno-optimist, college dropout, partner at @ycombinator.
CarperAI @carperai
20 Followers 1 Following
Stas Bekman @StasBekman
7K Followers 268 Following Toolmaker. Software creator, optimizer and harmonizer. Makes things work and fly at @ContextualAI Training LLM/RAG/Generative AI/Machine Learning/Scalability
Lucas Beyer (bl16) @giffmana
56K Followers 444 Following Researcher (Google DeepMind/Brain in Zürich, ex-RWTH Aachen), Gamer, Hacker, Belgian. Mostly gave up trying mastodon as [email protected]
Justine Tunney @JustineTunney
33K Followers 272 Following I built a C library that lets you compile 12kb static binaries that run natively on Linux, Mac, Windows, FreeBSD, OpenBSD, NetBSD and BIOS using just GCC/Clang.
Adil D. Ztn 👒 @AdilZtn
245 Followers 1K Following A boring guy who does things. Currently, I'm trying to make reinforcement learning boring. PhD Student/ Research Engineer in RL @irtSaintEx & @ISAE_officiel
Arthur Mensch @arthurmensch
40K Followers 872 Following Co-founder and CEO @MistralAI. Apply https://t.co/yHGRZAtjcx
Zachary Nado @zacharynado
5K Followers 648 Following Research engineer @googlebrain. Past: software intern @SpaceX, ugrad researcher in @tserre lab @BrownUniversity. All opinions my own.
kyutai @kyutai_labs
6K Followers 6 Following
Thomas Scialom @ThomasScialom
6K Followers 231 Following AGI Researcher @MetaAI -- Lead Llama 2 and Postraining Llama 3. Also CodeLlama, Galactica, Toolformer, Bloom, Nougat, GAIA, ..
Mathieu Acher @acherm
2K Followers 5K Following Professor @INSA_Rennes Researcher @DiverSE_Inria Junior member #IUF @InstUnivFr #SciencesDuLogiciel, Software #Variability, Artificial Intelligence and #Chess
Mistral AI @MistralAI
90K Followers 0 Following Fast, open-source and secure language models. Join us https://t.co/INALdNGvCP
Tom Jobbins @TheBlokeAI
15K Followers 237 Following My Hugging Face repos: https://t.co/yh7J4DFGTc Discord server: https://t.co/5h6rGsGfBx Patreon: https://t.co/yfQwFggGtx
Nicolas Debock @ndebock
7K Followers 2K Following VC @eurazeo (ex @balderton / @xangevc / @laposte). Tweet about the #web #tech #startups #mathematics . Fan of Claude Shannon. #FOCS
Tim Dettmers @Tim_Dettmers
29K Followers 820 Following PhD Student at @UW. I blog about deep learning and PhD life at https://t.co/Y78KDJJFE7.
AI Breakfast @AiBreakfast
167K Followers 209 Following The latest rumors and developments in the world of artificial intelligence. DM to include your AI project in the newsletter.
Jim Fan @DrJimFan
229K Followers 3K Following @NVIDIA Sr. Research Manager & Lead of Embodied AI (GEAR Lab). Creating foundation models for Humanoid Robots & Gaming. @Stanford Ph.D. @OpenAI's first intern.
Krystal Hu @readkrystalhu
5K Followers 1K Following Covering VC, startups & tech investments for @Reuters. Made in China, trained by NYC, trying to vibe with Silicon Valley. Signal: krystalhu.01
Lior⚡ @AlphaSignalAI
84K Followers 895 Following Covering the latest in AI R&D • ML Engineer • Ex-Mila researcher • MIT Lecturer • Building AlphaSignal, a technical newsletter read by 180,000+ ML experts.
AI Pub @ai__pub
72K Followers 343 Following AI papers and AI research explained, for technical people. Get hired by the best AI companies: https://t.co/MySVjUGOQ3
Amit Patel @redblobgames
19K Followers 8 Following I explain algorithms and math with interactive web pages (incl. pathfinding, hexagons, procgen maps, voronoi). Wrote Solar Realms Elite; helped w/@rotmg_news
Shek Azizi @AziziShekoofeh
7K Followers 1K Following Staff Research Scientist @Google @GoogleDeepMind 🧠 Opinions are my own.
CobolStone @CobolStone
40 Followers 0 Following The place for hackers and builders. Live soon. #web3 #security #ai #software
John Schulman @johnschulman2
39K Followers 609 Following Cofounder @openai, lead post-training for ChatGPT and the API. Interested in reinforcement learning, alignment, birds, jazz music
Jeff Rasley @jeffra45
671 Followers 926 Following @SnowflakeDB AI Research Team. @MSFTDeepSpeed co-founder, @BrownCSDept PhD, @uwcse alum
Lilian Weng @lilianweng
94K Followers 148 Following Working on AI safety, past on robotics, applied research @OpenAI; Writing ML blogs to help myself & others to learn; Ideas my own.
Jeff Dean (@🏡) @JeffDean
296K Followers 6K Following Chief Scientist, Google DeepMind and Google Research. Co-designer/implementor of things like @TensorFlow, MapReduce, Bigtable, Spanner, Gemini .. (he/him)
Yi Tay @YiTayML
29K Followers 97 Following chief scientist / cofounder @RekaAILabs 🫠 past: research scientist @google brain 🤯 currently learning to be a dad 🍼
Andrew D. Huberman, P.. @hubermanlab
1.3M Followers 1K Following Professor of Neurobiology & Ophthalmology at Stanford Medicine • Host of the Huberman Lab podcast • Focused on science & health research & public education
Harrison Kinsley @Sentdex
71K Followers 200 Following Neural networks from Scratch book: https://t.co/MWlYbXicwc YouTube: https://t.co/5osPue5EW9 @skunkworks_ai
OpenAI @OpenAI
3.4M Followers 0 Following OpenAI’s mission is to ensure that artificial general intelligence benefits all of humanity. We’re hiring: https://t.co/dJGr6LgzPA
Ilya Sutskever @ilyasut
370K Followers 2 Following towards a plurality of humanity loving AGIs @openai
Jean de La Rochebroch.. @2lr
38K Followers 320 Following People ^ Venture ^ Capital | With @Xavier75 | @kimaventures | @newwavevc | Jean @ https://t.co/YdynWVrQMH | email only
Randall Balestriero @randall_balestr
3K Followers 228 Following AI Researcher: From theory to practice (and back) Postdoc @MetaAI with @ylecun PhD @RiceUniversity with @rbaraniuk Masters @ENS_Ulm @Paris_Sorbonne
NVIDIA AI Developer @NVIDIAAIDev
51K Followers 277 Following All things AI for developers from @NVIDIA. Additional developer channels: @NVIDIADeveloper, @NVIDIAHPCDev, and @NVIDIAGameDev.
Michaël Benesty @pommedeterre33
3K Followers 612 Following Apply mathemagic to law understanding Head of R&D @LefebvreSarrut ex tax lawyer @Deloitte, CPA, financial audit, core dev @XGBoostProject
50 hackers at the LLM Hack in Paris organised by @join_ef, @MistralAI and @huggingface Strong teams building innovative LLMs projects You can check them below 1/n 🧵
1/4 Have you wondered how to optimize sys-perf for training Arctic-like models (MoE arch)? Let’s dive in! Our first technique: custom fused kernels. By crafting these kernels, we streamline irregular and sparse operators, boosting efficiency. #SnowflakeArctic #SystemOptimization
60 hackers reunited in Paris to build at Europe’s largest LLM hackathon @join_ef @MistralAI @huggingface
@ThytuVDM @gladia_io HB mate. Take care :) You need another Kili to recharge I think
@ThytuVDM @gladia_io An inspiring path, happy birthday and I wish you success in your goals for the future.
@ThytuVDM @gladia_io Thanks buddy ! Good luck in your new gig try not to play too much chess with your own AI
@ThytuVDM @gladia_io Good luck for your future projects Valentin and happy birthday! 🥳
Big announcement: @pleiasfr releases a massive open corpus of 2 million Youtube videos in Creative Commons (CC-By) on @huggingface. Youtube-Commons features 30 billion words of audio transcriptions in multiple languages, and soon other modalities huggingface.co/datasets/PleIA…
Boom! Meta announces Megalodon Efficient LLM Pretraining and Inference with Unlimited Context Length! Paper: huggingface.co/papers/2404.08…
Google presents Best Practices and Lessons Learned on Synthetic Data for Language Models Provides an overview of synthetic data research, discussing its applications, challenges, and future directions arxiv.org/abs/2404.07503
Google announces Leave No Context Behind Efficient Infinite Context Transformers with Infini-attention This work introduces an efficient method to scale Transformer-based Large Language Models (LLMs) to infinitely long inputs with bounded memory and computation. A key
We introduce LLM2Vec, a simple approach to transform any decoder-only LLM into a text encoder. We achieve SOTA performance on MTEB in the unsupervised and supervised category (among the models trained only on publicly available data). 🧵1/N Paper: arxiv.org/abs/2404.05961