Rajiv Shah @rajistics
occasionally funny videos along with practical AI posts, now at ML/AI @snowflakedb - was @huggingface @datarobot @snorkelai rajivshah.com Illinois, USA Joined September 2014-
Tweets1K
-
Followers2K
-
Following332
-
Likes3K
Generative AI Plateauing 🤔 I closely monitor technology trends in AI. Following huge developments at the end of 2022 and throughout 2023, the Generative AI space is now beginning to mature. You also have many vendors entering the market, and technological changes are becoming…
3rd week in a row, 3rd LLM from @SnowflakeDB ... Arctic-TILT is a 800M model that has GPT-4 quality performance on information extraction tasks, as measured by the DocVQA benchmark. And it fits in an A10!
3rd week in a row, 3rd LLM from @SnowflakeDB ... Arctic-TILT is a 800M model that has GPT-4 quality performance on information extraction tasks, as measured by the DocVQA benchmark. And it fits in an A10!
Excited to share that our paper introducing the REFORMS checklist is now out @ScienceAdvances! In it, we: - review common errors in ML for science - create a checklist of 32 items applicable across disciplines - provide in-depth guidelines for each item science.org/doi/10.1126/sc…
There's a special place in hell reserved for those who provide docs examples without imports.
St. Louis on Wednesday! Let's talk Generative AI. I will be at the Gateway to Innovation conference: @STLG2I. Catch me talking about Practical Perspectives on Generative AI. My talk will catch everyone up on Generative AI/LLMs and then focus on the biggest risks. I will be…
📈 of training data 🤯 my way of processing this:
Introducing Snowflake Arctic. An efficiently intelligent and truly open LLM built by Snowflake.
Most leaderboards just give you scores, leaving one wondering: what does 76.8% mean? In HELM, we are committed to full transparency, meaning clicking on a score will reveal the full set of instances, and you can even inspect the exact prompt (which we know makes a big…
Is Llama 3 the beginning of the end? A skit based on a thread by @carmguti on the effects of Llama 3. Also used some of the comments in the threads: @Iliane_5 @irl_danB @prof_wiley @byrnemluke @04HM04 @OliNorwell @JoshuaSegeren @coffeewithjer @chiaralalalah @burkov
5 things I check when a new model is announced 📜 License Real Open Source? Apache/MIT Commercial use allowed? Any strange conditions? 🤔 📊 Size of the Model 7B, 70B, 200B models Indicates likely performance 🚀 Compute resources required 💻 📏 Benchmarks Can be manipulated,…
Preliminary testing on my agent benchmark (based on github.com/aymeric-rouche…): Llama3-70B-Instruct is on par with GPT4! 🤯🤯 cc @lvwerra
i’ve been thinking about Meta’s support for open-source AI all day, wondering what zuck’s business justification must be. you can’t run a $1T co on ideology alone. but when you google the financials of Meta ($134B/yr) compared to a co like OAI, ($1.6B) it starts to make sense:
Synthetic Data has arrived! Survey Paper: Best Practices and Lessons Learned on Synthetic Data for Language Models: Examples: MetaMath used synthetic examples to provide step-by-step solutions for problems Lambda used synthetic examples for teaching LLMs to use tools like…
I am very proud of my small but mighty team for releasing this model. At 4x smaller and requiring just a (free) Apache-2 license, I am excited to see how this massively improved accessibility will propel research in complex reasoning systems to new heights!
I am very proud of my small but mighty team for releasing this model. At 4x smaller and requiring just a (free) Apache-2 license, I am excited to see how this massively improved accessibility will propel research in complex reasoning systems to new heights!
A lot of chatter about "LLM Overload"- new LLMs (increasingly OSS) coming out by the hour!! This is great for the space- but largely irrelevant for most enterprise use cases. Small deltas in base LLMs are noise compared to the quality & curation of the data you tune/align on!
Omar Sanseviero @osanseviero
32K Followers 2K Following Chief Llama Officer @huggingface 🦙 Founder @AI_Learners. Xoogler (SWE @Google Assistant, 20% PM TF Graphics). 100% Hacker Llama🇵🇪🇲🇽merve @mervenoyann
56K Followers 4K Following open-sourceress at @huggingface 🧙🏻♀️ proud mediterrenean 🍋 I do TL;DR on ML papersChristoph Molnar @ChristophMolnar
30K Followers 1K Following Author of Interpretable Machine Learning https://t.co/gJKlTA2deP | Newsletter: https://t.co/6fQuMr8yI8Hamel Husain @HamelHusain
23K Followers 2K Following Researcher focusing on LLMs: https://t.co/iVZDFdIQiE Previously, dev tools and infra for ML. Ex @Github, @Airbnb. @fastdotai core contributor.Nate Raw @_nateraw
7K Followers 1K Following machine learning hacker. previously @huggingface @lightningaiThomas Wolf @Thom_Wolf
69K Followers 4K Following Co-founder and CSO @HuggingFace - open-source and open-scienceAlex Ratner @ajratner
5K Followers 552 Following @SnorkelAI @uwcse / prev @StanfordAILab – Interested in data management systems for machine learning, weak supervision, and impactful applications.Zach Mueller @TheZachMueller
10K Followers 398 Following 🤗 Technical Lead for the Accelerate Project | Passionate about Open Source | Nerd who enjoys touching the grass | #ADHD | He/HimPhilipp Schmid @_philschmid
16K Followers 656 Following Tech Lead and LLMs at @huggingface 👨🏻💻 🤗 AWS ML Hero 🦸🏻 | Cloud & ML enthusiast | 📍Nuremberg | 🇩🇪 https://t.co/l1ppq3q3hkNathan Benaich @nathanbenaich
51K Followers 32K Following solo member of investment staff @airstreet, brewing ambition @airstreetcafe, next token predictor @airstreetpressValeriy M., PhD, MBA,.. @predict_addict
18K Followers 3K Following PhD in machine learning | conformal prediction | time-series | author of bestselling Practical Guide to Applied Conformal-Prediction https://t.co/ugR9TtXd29Thair @ThairAid
0 Followers 233 FollowingDavid Stephenson PhD @Stephenson_Data
770 Followers 61 Following Data Strategy Consultant, Author, Trainer Professor Univ. of Amsterdam, Chair PAW UK Linkedin: https://t.co/x6Qm5s3924Bhabaranjan Panigrahi @im_ranjan_
55 Followers 224 Following Coder | Ex SDE -II @Kroger | Gen AI Research | LLM | Learning Rust | #OpenToWorkCynthiaWillard @jGG3Pao1EuoGnF
0 Followers 238 FollowingLewis Walker ➲ @lewiswalkerai
5K Followers 5K Following Follow for Generative AI insights shared daily | Deloitte AI | Ex-Goldman Sachs | LinkedIn Top AI VoiceSaumil Jariwala @SaumilJariwala
705 Followers 977 Following Check out my non-private account, @FetaFundSewtheth @sewtheth45685
0 Followers 437 Followingnedned @nletcher
1K Followers 5K Following data (science | analytics | visualisation | engineering), @thoughtworks, #Python, #nlproc, ML, & assorted whimsical miscellaniaContra Quant @ContraQuant
89 Followers 487 Following Algo/ML pretender writing about inflation, macro bs, and ILS'. previously high yield credit tradingVENKATESWARA REDDY TH.. @TVENREDDY
17 Followers 170 Following@Aurigraph.io @Aurigraph_io
603 Followers 4K Following Layer 0 DLT protocol and platform for Digital assets tokenisation Carbon Markets https://t.co/K9ZPkihAbhLloyd Lagrange @LloydLagrange
6 Followers 1K FollowingDan Hoogterp @dan_hoog
1K Followers 3K Following AI, VC, tech, politics; Left of centrist; he/him #democracyfirst #blm Personal opinions; follow & RT ≠ endorsement Threads: @danhoogterpTakes from the crypt @devonyates
210 Followers 2K Following Wind Energy. Grid Modernization. Mother. Data Scientist. Cyclist. Pinballer. Opinions are my own. [email protected]Hanifi @magnostick
115 Followers 960 Following In paws 🐾 and odd numbers we trust. @BilkentUniv, @RiceUniversity alum.Maria Khalusova @mariaKhalusova
5K Followers 726 Following Always growing. LLM whisperer, RAG tinkerer, tech generalist, educator. She/her. 🥑 at @UnstructuredIO, previously @huggingface, @DVCorg, @JetBrainskaren . @ms_firewall
253 Followers 463 FollowingMorgan McGuire @morgymcg
3K Followers 4K Following Learning Machine Learning...came for the bants, stayed for the rants. | Growth ML Eng @weights_biases | ex-Facebook Safety | https://t.co/a7i7G5dkLG | 🇮🇪Phlo @YoungPhlo_
682 Followers 2K Following trying to go from idea to execution faster than yesterday.Chuck Sugnet @chuckcode
188 Followers 539 Following Connecting technology with people in Google Cloud's Office of CTO. Wrangler of data and children. Inherently curious but try to stay pragmatic. Opinions my ownRatnesh Kumar Sharma @rksiitd
383 Followers 3K Following On a reading challenge. If you have a strong thesis, you may change the world. Founder @aixb_world. Alumnus @DBEBtweeting @iitdelhi.N Sreeram @NSreeram5
57 Followers 504 FollowingFredrik Ehne @enehne
53 Followers 376 FollowingGarrett Mooney @GarrettRMooney
450 Followers 3K Following LATELY: "Open Source CS Masters" or something ¯\_(ツ)_/¯ PREVIOUS: Baby Bayesian...wanting to learn more, ML generalist Basketball and metal on occasion.CuriousDeveloper @TheCanadianDev
17 Followers 70 FollowingFrédéric Branchaud-.. @fbranchaud1
126 Followers 305 Following Founding ML Eng at https://t.co/vMnjjlYuF2 / Open-source : HF, Azimuth, Baal. Interest in: Active learning, uncertainty estimation, XAI, Bias detectionAllWorkNoPlay @AllWorkNoPlay
143 Followers 1K Following Always leaving the campground cleaner than I found itOlivia | Gen AI commu.. @oliviadeka_
26 Followers 308 Following Building @Lyzrai | SaaS | Democratising Gen-AI | Community Connoisseur 🌎Andrés Ontaneda @aontaneda
1K Followers 2K Following CEO @ https://t.co/BTc5CeOr31 / Web3.0 + AI + LowCode + politics. Building stuff in tech is like playing with legos!Prasanth K V @prasanth_kvarma
21 Followers 237 Following Data Science Expert | Public Policy | Altruist | Alma matter @CUSATLuc Plessier @luc_plessier
74 Followers 379 Following GTM Professional & Commercial Director @PaloAltoNtwks Open to angel investing opportunities, and always happy to connect with start-up teams.Simon Guo 🦝 @simonguozirui
1K Followers 4K Following Incoming CS PhD student @Stanford and curr training models at @cohere | 🎓 @Berkeley_EECS | prev built things at @ @anyscalecompute @nvidiaMbappe = 200M de prim.. @1lbert
130 Followers 1K FollowingDigital Dance AI and .. @digitaldanceai
50 Followers 428 Following We are a development agency specializing in Generative AI, Machine Learning and Digital Transformation at Scale.Victor Bustos Sazo @vbstw
198 Followers 3K Following Gloton :! Python Blue team Tecnólogo medico - Creo negocios para financiar desarrollo IT e investigacion cientifica.Sepehr Sadighpour @sepehr125
316 Followers 1K FollowingCasimir Rajnerowicz @KaziRajnerowicz
18 Followers 67 Following Product marketing, SEO & content creation. Product Marketing Manager at V7Shivakumar KY @shiva0010131
76 Followers 1K Following THINKING on AI / AGI / Technology / Robotics / Advancement.Carmen Montes @CarmenMont68835
115 Followers 3K FollowingAndrej Karpathy @karpathy
983K Followers 905 Following 🧑🍳. Previously Director of AI @ Tesla, founding team @ OpenAI, CS231n/PhD @ Stanford. I like to train large deep neural nets 🧠🤖💥François Chollet @fchollet
471K Followers 770 Following Deep learning @google. Creator of Keras. Author of 'Deep Learning with Python'. Opinions are my own.AK @_akhaliq
311K Followers 3K Following AI research paper tweets, ML @Gradio (acq. by @HuggingFace 🤗) dm for promo follow on Hugging Face: https://t.co/q2Qoey80Gxelvis @omarsar0
190K Followers 486 Following Building with LLMs @dair_ai • Prev: Meta AI, Galactica LLM, PapersWithCode, Elastic, PhD • Creator of the Prompting Guide (~4M learners)Hugging Face @huggingface
348K Followers 188 Following The AI community building the future. https://t.co/VkRPD0VKaZ #BlackLivesMatter #stopasianhateSantiago @svpino
353K Followers 445 Following I tell stories about technology and teach hard-core Machine Learning at https://t.co/iZifcK7n47. YouTube: https://t.co/pROi08OZYJOmar Sanseviero @osanseviero
32K Followers 2K Following Chief Llama Officer @huggingface 🦙 Founder @AI_Learners. Xoogler (SWE @Google Assistant, 20% PM TF Graphics). 100% Hacker Llama🇵🇪🇲🇽merve @mervenoyann
56K Followers 4K Following open-sourceress at @huggingface 🧙🏻♀️ proud mediterrenean 🍋 I do TL;DR on ML papersclem 🤗 @ClementDelangue
92K Followers 5K Following Co-founder & CEO @HuggingFace 🤗, the open and collaborative platform for AI buildersChristoph Molnar @ChristophMolnar
30K Followers 1K Following Author of Interpretable Machine Learning https://t.co/gJKlTA2deP | Newsletter: https://t.co/6fQuMr8yI8Lewis Tunstall @_lewtun
9K Followers 424 Following 🤗 LLM engineering & research @huggingface 📖 Co-author of "NLP with Transformers" book 💥 Ex-particle physicist 🤘 Occasional guitarist 🇦🇺 in 🇨🇭Hamel Husain @HamelHusain
23K Followers 2K Following Researcher focusing on LLMs: https://t.co/iVZDFdIQiE Previously, dev tools and infra for ML. Ex @Github, @Airbnb. @fastdotai core contributor.Niels Rogge @NielsRogge
10K Followers 690 Following ML Engineer @ML6team, part-time at @huggingface. @KU_Leuven grad. General interest in machine learning, deep learning. Making AI more accessible for everyone!Soumith Chintala @soumithchintala
187K Followers 888 Following Cofounded and lead @PyTorch at Meta. Also dabble in robotics at NYU. AI is delicious when it is accessible and open-source.Nate Raw @_nateraw
7K Followers 1K Following machine learning hacker. previously @huggingface @lightningaiJim Fan @DrJimFan
232K Followers 3K Following @NVIDIA Sr. Research Manager & Lead of Embodied AI (GEAR Lab). Creating foundation models for Humanoid Robots & Gaming. @Stanford Ph.D. @OpenAI's first intern.vicki @vboykis
52K Followers 1K Following Born: USSR. Raised: USA. ML Eng @mozillaai Ex: @duosec @Tumblr, @automattic Nights: 👦 & 👧 working on some ✨ new vectors ✨Jay Alammar @JayAlammar
35K Followers 1K Following Machine learning and language models R&D. Builder. Writer. Visualizing AI, ML, and LLMs one concept at a time. @Cohere. https://t.co/TquuQXlLOJSasha Luccioni, PhD �.. @SashaMTL
19K Followers 4K Following AI & Climate Lead @HuggingFace, Board Member of @WiMLworkshop, Founding Member of @ClimateChangeAI. @TEDTalks speaker. She/her/Dr/ 🦋Maria Khalusova @mariaKhalusova
5K Followers 726 Following Always growing. LLM whisperer, RAG tinkerer, tech generalist, educator. She/her. 🥑 at @UnstructuredIO, previously @huggingface, @DVCorg, @JetBrainsRajhans Samdani @rajhans_samdani
1K Followers 233 Following Principal Eng at Snowflake. Previously: Head of ML @Neeva, Chief scientist at @askspoke, research scientist @google. IIT Bombay. UIUC.Dr. Donut ☕️ @BEBischof
3K Followers 2K Following Superciliously super silly🐊 Leading AI @_hex_tech; Teach ML @rutgersu; Prev: Head of Data Science @weights_biases, ML @stitchfix, Data @bluebottleroast; he/himDaniel Campos @spacemanidol
435 Followers 303 Following Shitposting the future of search one thought at a time. Sauna and Ice bath addict. Lover of 🍷☕️ ⛷🛹🛠. BS @rpi, MS @UW, Ph.D. @Illinois_AlmaBrigitte 🤗 @BrigitteTousi
2K Followers 2K Following Not an engineer @huggingface | Comms 🤗 | ex @Mila_Quebec | Aspiring 🍄 forager | She/Her/ElleAxolotl @axolotl_ai
877 Followers 18 Following Axolotl is the premier open source LLM fine tuning framework. find us on discord https://t.co/wlcE2wlJa9Yacine Jernite @YJernite
4K Followers 1K Following ML & Society lead @huggingface, NLPer at heart, focusing on data and ML systems governance these days he/him #BlackLivesMattersridhar @RamaswmySridhar
26K Followers 620 Following CEO @snowflakedb; founder @neeva Ex-@GreylockVC Ex-@Google SVP of Ads Ex-@BellLabs.Subham De @SubhamDe2021
174 Followers 543 Following Founding Engineer at Sumble. Senior ML Research Scientist at Meta. CS PhD at UIUC. Previous DL intern at LinkedIN. Deep Learning. Natural Language Processing.Sean J. Taylor @seanjtaylor
46K Followers 4K Following Building @MotifAnalytics. Formerly @Lyft and @Facebook. Keywords: Experiments, Causal Inference, Statistics, Machine Learning, Economics.Jeff Kao @jeffykao
4K Followers 2K Following Computational Journalist @ProPublica. I investigate tech w/ data and machine learning. Send me secret tips, data & documents. DMs open/contact link in profile.Streamlit @streamlit
33K Followers 40 Following Streamlit is an open-source Python framework for data scientists and AI/ML engineers to deliver dynamic data apps -- in only a few lines of code.Roy Ballard 🇺🇦 @RoyBallard16
111 Followers 222 Following VA to CA transplant. National Park Junkie. Artificial Intelligence Platforms Expert. Pediatric Cancer Research Fundraiser. Late to Twitter. Just getting startedKashif Rasul @krasul
2K Followers 312 Following Research Scientist working on Deep Learning, Time Series Forecasting, Reinforcement Learning and HPC.James Briggs @jamescalam
9K Followers 173 Following 👾 AI engineering: https://t.co/rLYkOCH5gb 🥑 Dev advocate @Pinecone ✏️ Learning and talking about everything https://t.co/aydfSKEar9Kunal Tangri @TangriKunal
423 Followers 347 Following Cofounder at @Farsight_ai | prev: Machine Learning Engineer @huggingface @MIT alumJavier Rando @javirandor
919 Followers 589 Following Red-Teaming LLMs | PhD Student @ETH_AI_Center | Incoming intern @Meta | Vegan 🌱tomaarsen @tomaarsen
702 Followers 122 Following Sentence Transformers, SetFit & NLTK maintainer Machine Learning Engineer at 🤗 Hugging FaceZach Mueller @TheZachMueller
10K Followers 398 Following 🤗 Technical Lead for the Accelerate Project | Passionate about Open Source | Nerd who enjoys touching the grass | #ADHD | He/Himharpreet @DataScienceHarp
7K Followers 1K Following 🤖 Generative AI Hacker | 👨🏽💻 AI Engineer | 👷🏽♀️ Developer Advocate | Building🏗️-Shipping🚢-Sharing🚀David Hundley @dkhundley
839 Followers 290 Following Principal machine learning engineer at a Fortune 50 company, AI/ML blogger, live coding streamer, girl dad, optimystic! Opinions mine.Stas Bekman @StasBekman
7K Followers 268 Following Toolmaker. Software creator, optimizer and harmonizer. Makes things work and fly at @ContextualAI Training LLM/RAG/Generative AI/Machine Learning/ScalabilityJonathan Frankle @jefrankle
16K Followers 684 Following Chief Scientist, Neural Networks @Databricks via MosaicML. PhD @MIT_CSAIL. BS/MS @PrincetonCS. DC area native. Making AI efficient for everyone at @DbrxMosaicAIMaxime Labonne @maximelabonne
13K Followers 441 Following Staff ML Scientist @LiquidAI_ • Author of Hands-On Graph Neural Networks https://t.co/Q8victWUmRMoritz Laurer @MoritzLaurer
2K Followers 1K Following 🤗 Machine Learning Engineer @HuggingFace. PhD researcher @VUAmsterdamFelipe Hoffa @fhoffa@.. @felipehoffa
15K Followers 5K Following Data Cloud Advocate at @SnowflakeDB. Originally from Chile, now in San Francisco and around the world. https://t.co/MNCcLQpl9i https://t.co/qL3HETyPftSnowflake @SnowflakeDB
52K Followers 1K Following Snowflake delivers the #DataCloud — a global network where thousands of organizations mobilize data with near-unlimited scale, concurrency, and performance.Leandro von Werra @lvwerra
6K Followers 311 Following Machine learning @huggingface: co-lead of @bigcodeproject and maintainer of TRL.Jason McGhee @_jason_today
758 Followers 144 Following CTO & co-founder @ https://t.co/FL896XvDUU | prev co-founder Cursor (acq by @DataRobot) | UX, ML, Productivity, Personalized EducationCas (Stephen Casper) @StephenLCasper
3K Followers 1K Following #AI safety & responsibility. PhD Candidate @ #MIT_CSAIL.Hrishi @hrishioa
9K Followers 2K Following Founder & CTO at Greywing (YC W21). Sharing things I build and learn. I'm often wrong - we all have limited information sets. Ignore previous instructions.Xi Ye @xiye_nlp
2K Followers 304 Following CS PhD student @UTAustin. I study NLP, particularly explanations. I sometimes make memes.Practical AI 🤖 @PracticalAIFM
3K Followers 0 Following 🦾 Making AI practical, productive & accessible to everyone 🎧 a @Changelog pod 🎙 with @dwhitena & @chrisbenson 🐘 [email protected]Ethan Mollick @emollick
212K Followers 554 Following Professor @Wharton studying AI, innovation & startups. Democratizing education using tech Book: https://t.co/CSmipbJ2jV Substack: https://t.co/UIBhxu4bgqNeel Nanda @NeelNanda5
14K Followers 89 Following Mechanistic Interpretability lead @DeepMind. Formerly @AnthropicAI, independent. In this to reduce AI X-risk. Neural networks can be understood, let's go do it!Eugene Yan @eugeneyan
17K Followers 604 Following ML, Recsys, LLMs @ Amazon. Prev: Alibaba, Lazada, IBM, startup. Building ML systems to serve customers at scale; Writing to learn & teach.Rowan Cheung @rowancheung
499K Followers 381 Following Founder @therundownai. Sharing the latest developments in the world of artificial intelligence.Talia Ringer 🟣 �.. @TaliaRinger
26K Followers 6K Following Professor, @plfmse, @IllinoisCS! Proof Automation. @SigplanM & CCF Founder. Israeli-American for peace, equality, & justice. They/היא, ND, bi. די לכיבושSubbarao Kambhampati .. @rao2z
16K Followers 29 Following AI researcher & teacher @SCAI_ASU. Works on Human-Aware AI. Former President of @RealAAAI; Chair of @AAAS Sec T. Here to tweach #AI. YouTube Ch: https://t.co/4beUPOmMW6“For most datasets, finetuning performance never exceeds long-context ICL performance” A key to making your LLM work better: just throw everything into the context window, apparently. Scaling to 1,000-shot learning with thousands of examples increases accuracy & lowers errors.
Machine learning visualizations My favorites 😍 If you work closely with algorithms, use them, but even better, take the time to build these visualization tools yourself. Sources: @karpathy and @naftaliharris
Anyone have a favorite discord where folks like to hang out and talk through technical ideas / problems?
3rd week in a row, 3rd LLM from @SnowflakeDB ... Arctic-TILT is a 800M model that has GPT-4 quality performance on information extraction tasks, as measured by the DocVQA benchmark. And it fits in an A10!
Snowflake’s Arctic-TILT model, powering our Document, Al beats GPT-4 with just 0.8B parameters, securing a top spot in the standard benchmark for document understanding DocQVA.
It's now become very, very easy to create a leaderboard... Just use the integrated @Gradio space template on the @huggingface hub! 🚀 Thanks to @pngwn and @kramp for their help 🤗
If you think that Scale is an unbiased neutral party it's because you don't know where their money comes from and who their clients are.
Academic benchmarks are losing their potency. Moving forward, there’re 3 types of LLM evaluations that matter: 1. Privately held test set but publicly reported scores, by a trusted 3rd party who doesn’t have their own LLM to promote. @scale_AI’s latest GSM1k is a great example.…
@tcarpenter216 For performance, no issues. For feature importance and interaction assessment, the impact of each feature will be obfuscated across all of the correlated or co-linear features.
If your team shares HF Hub models/datasets/etc under the same dir by overriding `HF_HOME` or similar env variables, you have likely run into the issue that the auth token gets shared as well, which is a problem if you're trying to access gated models. A new feature has been…
LLMs are often biased towards a few dominant languages. What can we do? Ruter decided to build their own Norwegian language model, RuterGPT. How can you do this? They did this by fine-tuning a base Llama 13b model along with some custom datasets they built. It's a very…
@seanjtaylor Like the time the country of Namibia was off all dashboards because the country code of NA was automatically removed in a R based data pipeline.
Excited to share that our paper introducing the REFORMS checklist is now out @ScienceAdvances! In it, we: - review common errors in ML for science - create a checklist of 32 items applicable across disciplines - provide in-depth guidelines for each item science.org/doi/10.1126/sc…
The 2024 Data Breach Investigations Report from @VZDBIR is out this morning, and I make sense of it in my new post: kellyshortridge.com/blog/posts/sho… I have insights, quibbles, and hot takes as always — but the fact remains it’s our best source of empirical data on cyberattack impacts.
Multiple LLM calls to optimize the prompt, multiple LLM calls to create LLMs-as-juries. Better results? Yes, but have we all agreed to just ignore the costs?
There's a special place in hell reserved for those who provide docs examples without imports.
@deb_fillman Whom amongst us hasn’t hired an Ivy League on a tossup and not come to regret it.
💾 LLM Datasets LLM development is increasingly moving towards curating high-quality datasets, as shown by Llama 3. I've compiled a collection of fine-tuning datasets along with advice and tools for creating your own. 💻 GitHub: github.com/mlabonne/llm-d…
Comically unfair norms in startup employee life: - employees must purchase shares their time earned - didn't buy shares 90 days post-leaving? Vaporized - don't have the cash to buy shares? Take on debt
@fishnets88 I did some academic research on them: arxiv.org/abs/1708.05824