Drew Farris @drewfarris
Software, Data, Search, IR, ML & Distributed Systems. Open Source Geek, @theasf member. Author @tamingtext, Principal @BoozAllen, opinions are my own. he/him
linkedin.com/in/drewfarris · MD / DC · Joined January 2009
Tweets: 2K · Followers: 1K · Following: 2K · Likes: 3K
🎙️🎉 New pod episode is live! S2Ep3: "Evaluating Real-World Adversarial ML Attack Risks and Effective Management: Robustness vs. Non-ML Mitigations" with guests, Drew Farris and Edward Raff. 👀 Watch, listen, or read the transcript ➡️bit.ly/47BUomi #MLSecOps #ProtectAI
Thanks so much to @EdwardRaffML and @drewfarris for joining us in the studio for this conversation. Check out their paper that inspired the talk here: arxiv.org/abs/2306.09951 "You Don't Need Robust Machine Learning to Manage Adversarial Attack Risks."
When AI startup discovers their RAG problems are the same old search relevance problems we've struggled with mightily for years :)
By 𝑅𝑒𝑐𝑎𝑠𝑡𝑖𝑛𝑔 𝑆𝑒𝑙𝑓-𝐴𝑡𝑡𝑒𝑛𝑡𝑖𝑜𝑛 𝑤𝑖𝑡𝘩 𝐻𝑜𝑙𝑜𝑔𝑟𝑎𝑝𝘩𝑖𝑐 𝑅𝑒𝑑𝑢𝑐𝑒𝑑 𝑅𝑒𝑝𝑟𝑒𝑠𝑒𝑛𝑡𝑎𝑡𝑖𝑜𝑛𝑠 you too can classify sequence with 1 𝗺𝗶𝗹𝗹𝗶𝗼𝗻 𝘁𝗼𝗸𝗲𝗻𝘀! New #neurosymbolic awesomeness led by @rea1mma w @BlancheMinerva @oatesbag @icmlconf
Seconded! I am proud of the work @BlancheMinerva has done, continues to develop, and the fact @BoozAllen can support and drive value from open research.
What is the state-of-the-art for Semantic Search over text? Actual semantic *search*, not some chatbot hallucinating content that I need to go check.
(7/7) LEACE wouldn’t be possible without @TheDavidSJ, who proved the theorem that led to this paper. I'd also like to thank our other coauthors @ravfogel @ryandcotterell @EdwardRaffML @BlancheMinerva!
Ever wanted to mindwipe an LLM? Our method, LEAst-squares Concept Erasure (LEACE), provably erases all linearly-encoded information about a concept from neural net activations. It does so surgically, inflicting minimal damage to other concepts. 🧵 arxiv.org/abs/2306.03819
“We tend to overestimate the effect of a technology in the short run and underestimate the effect in the long run.” - Amara’s Law en.m.wikipedia.org/wiki/Roy_Amara
“When there is a smart person in the loop, unreliable advice [from an LLM] is better than no advice, and the advice comes much more explicitly than from carrying out a conventional search on a search engine” - Rodney Brooks rodneybrooks.com/what-will-tran…
A nice analysis of how parameters are used in transformer models. Good Sunday morning reading. lesswrong.com/posts/3duR8Crv…
Congratulations on the publication of Inside Deep Learning @EdwardRaffML! It is great to finally hold it - it’s Chonky! @ManningBooks manning.com/books/inside-d…
Come by today for the @iclr_conf #MLEvaluationStandards workshop. Also super excited that @drewfarris and I @BoozAllen received an outstanding paper award! And another for reviewing! A thread on two papers in the workshop: 🧵
Excited to share the @icmlconf 2022 Workshop on Knowledge Retrieval and Language Models knowledge-retrieval-workshop.github.io Please consider submitting! We welcome work across topics including LM grounding, open-domain Q&A, bias in retrieval, analyses of scale, transfer and LM phenomena.
List of ICML Workshops. Please add yours to this thread. ➡️Follow this list for updates. x.com/i/lists/126110…
We release LAION-5B: 5.85B CLIP-filtered image-text pairs, an intuitive search-engine-like web interface for exploration & one-click subset creation, CLIP ViT L/14 embeddings, NSFW & watermark scores (+ the models used to compute them), kNN indices, ... laion.ai/laion-5b-a-new…
Watching the economic news out of Russia as sanctions hit, I was reminded of the programmatic statement Putin made 22 years ago on his first day in office. It was called "Russia on the Eve of the Millennium." This is what he promised Russians. A short thread of quotes: 1/7
My current favorite worst error message: "PKIX path building failed: sun.security.provider.certpath.SunCertPathBuilderException: unable to find valid certification path to the requested target". Inevitably met with much wailing and gnashing of teeth.
UMBC has officially reached the nation’s highest level of research performance. The Carnegie Classification of Institutions of Higher Education announced UMBC has been placed into the category of doctoral universities w/ very high research activity (R1). news.umbc.edu/umbc-ascends-t…
Vector search isn't just VectorSearch™️ It's all the other, completely different and orthogonal data structures you append to your vector softwaredoug.com/blog/2024/03/2…
The ethical questions that people are asking and should be thinking through will also be discussed at both a usage and design level. New chapter drafts are already written and prepped, and @ManningBooks uses an early-access system so you can give us feedback right now!
A year ago, my mom asked me if I had heard about ChatGPT since she had seen it on CNN. Yet I'm still fielding questions about what it is, how it works, and why it is right, wrong, or insubordinate. So that must mean it's time for a new book? How GPT Works manning.com/books/how-gpt-…
This has been a long time coming, thanks to co-authors @BlancheMinerva and @drewfarris, and of course, @BoozAllen for giving us the time to support this crazy plan/idea. Oh, and of course, the discount code is mlboozallen for 45% off :)
Proud of #BoozAllen's Holly Levanto for leading the discussion on broadening the boundaries of #AI training & implementation! #AI #innovation #BoozAllen bit.ly/3VygWRu
"My benchmark for large language models" nicholas.carlini.com/writing/2024/m… Nice post but even more than the 100 tests specifically, the Github code looks excellent - full-featured test evaluation framework, easy to extend with further tests and run against many LLMs.…
The first few branches of (I think) cherry blossoms began. Little pink petals dotted wet moss-surrounded stones beneath the downward-hanging branches
Let's implement Mamba in Triton. (srush.github.io/annotated-mamb…) A gentle, (but mildly obsessive) tutorial notebook about GPU programming in Triton. We're getting close to mere mortals being able to do this 😂
Interesting trend in AI: the best results are increasingly obtained by compound systems, not monolithic models. AlphaCode, ChatGPT+, Gemini are examples. In this post, we discuss why this is and emerging research on designing & optimizing such systems. bair.berkeley.edu/blog/2024/02/1…
We will see that a lot of weird behaviors and problems of LLMs actually trace back to tokenization. We'll go through a number of these issues, discuss why tokenization is at fault, and why someone out there ideally finds a way to delete this stage entirely.
Announcing GA of hauler.dev, our Air Gap / Disconnected distribution tool. Simplifies cloud-native asset distribution in environments that lack connectivity such as DoD, FinSec, PharmaSec.
Major life update: My family and I will return home to Berlin this summer, after five exciting years abroad in NYC and Amsterdam! I will join the newly founded @bifoldberlin institute as a full professor and start my own research group. PhD and Postdoc openings coming soon!
BM25
Year-end review, pick for Best Paper of 2023: “First Tragedy, then Parse: History Repeats Itself in the New Era of Large Language Models” Naomi Saphra, Eve Fleisig, Kyunghyun Cho, Adam Lopez This happens in the tech industry, just about every 20 years. blog.derwen.ai/best-paper-202…
This blog post by @clefourrier shows some really excellent work analyzing the DROP benchmark and its limitations. If you use the eval harness and want to chime in to the conversation, we'd love your feedback! Join the conversation here: github.com/EleutherAI/lm-…
⚠️ We are removing DROP from the Open LLM Leaderboard! With leaderboard evaluation data openly shared on 2000+ models, we did a deep dive with our friends @AiEleuther and @try_zeno, & found out that its original implementation is unfair to many models 😱 huggingface.co/blog/leaderboa…
A silly math question posed in our discord: There are two hospitals in a city, a small and a bigger one. In which of them is the likelihood higher of more boys than girls being born in one day? (Provided that girls and boys are on average born equally frequently).
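One way to check the question as literally posed is to compute the probability exactly from the binomial distribution. The sketch below assumes each birth is independently a boy with probability 1/2; the hospital sizes (10, 11, and 100 births per day) are hypothetical, chosen only for illustration.

```python
from math import comb


def p_more_boys(n):
    """Exact P(boys > girls) among n births, each a boy with probability 1/2."""
    # Count the outcomes with strictly more than n/2 boys, out of 2**n equally likely ones.
    favourable = sum(comb(n, k) for k in range(n // 2 + 1, n + 1))
    return favourable / 2**n


# Hypothetical daily birth counts for a small and a bigger hospital.
for births in (10, 11, 100):
    print(births, p_more_boys(births))
```

Under these assumptions the answer depends on parity: with an odd number of births a tie is impossible, so the probability is exactly 1/2, while with an even number it is (1 - P(tie)) / 2, which creeps up toward 1/2 as the hospital grows. That differs from the better-known variant that asks about days with more than 60% boys, where the small hospital is favoured.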
A couple years ago, I was rejected from a job when I bombed an interview asking me to implement a backend to check if a Connect Four game was won. I am now obsessed with how beautiful this solution using scipy.signal.convolve2d is: stackoverflow.com/questions/2994…
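The idea behind the convolution approach can be sketched without any dependencies: slide four small kernels (horizontal, vertical, and the two diagonals) over a 0/1 mask of one player's pieces and look for a window that sums to 4. This is not the code from the linked answer, only a pure-Python illustration of the same technique; the board encoding (0 for empty, 1 and 2 for the players) and the name `has_won` are assumptions.

```python
# Four-in-a-row patterns, written as small 0/1 kernels.
KERNELS = [
    [[1, 1, 1, 1]],                                            # horizontal
    [[1], [1], [1], [1]],                                      # vertical
    [[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, 1, 0], [0, 0, 0, 1]],  # diagonal
    [[0, 0, 0, 1], [0, 0, 1, 0], [0, 1, 0, 0], [1, 0, 0, 0]],  # anti-diagonal
]


def has_won(board, player):
    """Return True if `player` has four in a row anywhere on `board`."""
    rows, cols = len(board), len(board[0])
    # Binary mask: 1 where the player has a piece, 0 elsewhere.
    mask = [[1 if cell == player else 0 for cell in row] for row in board]
    for kernel in KERNELS:
        kh, kw = len(kernel), len(kernel[0])
        # "Valid"-mode sliding window: every position where the kernel fits.
        for r in range(rows - kh + 1):
            for c in range(cols - kw + 1):
                total = sum(
                    kernel[i][j] * mask[r + i][c + j]
                    for i in range(kh)
                    for j in range(kw)
                )
                if total == 4:
                    return True
    return False
```

Strictly speaking this is a cross-correlation rather than a convolution, but flipping any of these kernels along both axes gives the same pattern back, so the two checks find the same wins.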
one easy way to make a simulation of falling sand piling up- just define what happens for all 16 possible 2x2 grids, and repeatedly apply these rules to the picture
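A minimal sketch of that idea, assuming a grid of 0 (empty) and 1 (sand): build a lookup table with one entry for each of the 16 possible 2x2 blocks, then apply it to every block of the picture. The specific rules (grains fall straight down, then topple diagonally off a stack) and the alternating block offset are assumptions made for this illustration, not necessarily the rules the tweet's author used.

```python
from itertools import product


def sand_rule(block):
    """Map one 2x2 block (top-left, top-right, bottom-left, bottom-right) to its successor."""
    tl, tr, bl, br = block
    # Grains fall straight down into an empty cell below.
    if tl and not bl:
        tl, bl = 0, 1
    if tr and not br:
        tr, br = 0, 1
    # A grain resting on another grain topples diagonally into an empty bottom cell.
    if tl and not br:
        tl, br = 0, 1
    elif tr and not bl:
        tr, bl = 0, 1
    return (tl, tr, bl, br)


# The full rule table: one entry for each of the 16 possible 2x2 blocks.
RULES = {block: sand_rule(block) for block in product((0, 1), repeat=4)}


def step(grid, offset):
    """Apply the rule table to every 2x2 block whose top-left corner starts at `offset`."""
    h, w = len(grid), len(grid[0])
    new = [row[:] for row in grid]
    for r in range(offset, h - 1, 2):
        for c in range(offset, w - 1, 2):
            block = (grid[r][c], grid[r][c + 1], grid[r + 1][c], grid[r + 1][c + 1])
            new[r][c], new[r][c + 1], new[r + 1][c], new[r + 1][c + 1] = RULES[block]
    return new


def simulate(grid, steps):
    """Alternate the block offset between 0 and 1 so grains can cross block borders."""
    for i in range(steps):
        grid = step(grid, i % 2)
    return grid
```

Because blocks never overlap within a step, each block can be updated independently, and every rule keeps the number of grains in its block unchanged, so sand is neither created nor destroyed. The bottom row acts as the floor because no block extends past it.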