Percy Liang @percyliang
Associate Professor in computer science @Stanford @StanfordHAI @StanfordCRFM @StanfordAILab @stanfordnlp | cofounder @togethercompute | Pianist cs.stanford.edu/~pliang/ Stanford, CA Joined October 2009-
Tweets771
-
Followers49K
-
Following408
-
Likes2K
Could agents driven by powerful language models perform machine learning experimentation effectively? Our MLAgentBench paper is updated on arxiv! arxiv.org/pdf/2310.03302 Now we include more results from claude v3 Opus, gpt4 turbo, mixtral and gemini pro! Try out MLAgentbench…
This weekend, we’re Natural Lake Processing! #NLProc
Data is all we need! 👑 Not only since Llama 3 have we known that data is all we need. Excited to share 🍷 FineWeb, a 15T token open-source dataset! Fineweb is a deduplicated English web dataset derived from CommonCrawl created at @huggingface! 🌐 TL;DR: 🌐 15T tokens of cleaned…
Most leaderboards just give you scores, leaving one wondering: what does 76.8% mean? In HELM, we are committed to full transparency, meaning clicking on a score will reveal the full set of instances, and you can even inspect the exact prompt (which we know makes a big…
We are thrilled to be a launch partner for Meta Llama 3. Experience Llama 3 now at up to 350 tokens per second for Llama 3 8B and up to 150 tokens per second for Llama 3 70B, running in full FP16 precision on the Together API! 🤯 together.ai/blog/together-…
We are excited to announce the release of an @MLCommons AI Safety benchmark POC. Built through an inclusive decision-making and engineering process, the POC validates our approach to a v1.0 AI Safety benchmark suite. Learn more: mlcommons.org/2024/04/mlc-ai… #AI, #benchmarks
We had a bunch of questions about how exactly we length-control AlpacaEval. Here's a short report about it: arxiv.org/abs/2404.04475… with @gblazex @percyliang @tatsu_hashimoto
We had a bunch of questions about how exactly we length-control AlpacaEval. Here's a short report about it: arxiv.org/abs/2404.04475… with @gblazex @percyliang @tatsu_hashimoto
I am very happy to announce our new work "FLawN-T5: An Empirical Examination of Effective Instruction-Tuning Data Mixtures for Legal Reasoning"! 📜: arxiv.org/abs/2404.02127 💾: huggingface.co/datasets/lawin… 🧵👇 1/7
(((ل()(ل() 'yoav))).. @yoavgo
46K Followers 2K FollowingKyunghyun Cho @kchonyc
61K Followers 2K Following a combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre PC at @nyuniversity (@CILVRatNYU) & @genentech (@PrescientDesign).Sasha Rush @srush_nlp
52K Followers 464 Following Professor, Programmer in NYC. Cornell Tech, Hugging Face 🤗 https://t.co/cZl0wTfqGzRosanne Liu @savvyRL
33K Followers 968 Following Cofounded & running @ml_collective. Host of Deep Learning Classics & Trends. Research at Google DeepMind. DEI/DIA Chair of ICLR & NeurIPS. Writing https://t.co/IbycyGfnDRJacob Andreas @jacobandreas
14K Followers 958 Following Teaching computers to read. Assoc. prof @MITEECS / @MIT_CSAIL (he/him). https://t.co/5kCnXHjtlY https://t.co/2A3qF5vdJwSam Bowman @sleepinyourhat
35K Followers 3K Following AI alignment + LLMs at NYU & Anthropic. Views not employers'. No relation to @s8mb. I think you should join @givingwhatwecan.Behnam Neyshabur @bneyshabur
18K Followers 690 Following Senior Staff Research Scientist @GoogleDeepMind, Interested in reasoning w. LLMs, traveling & backpackingDan Roy @roydanroy
45K Followers 2K Following ML / AI researcher, emphasis on theory. Research Director and Canada CIFAR AI Chair, @VectorInst Professor, @UofT (Statistics/CS)Akari Asai @AkariAsai
11K Followers 650 Following Ph.D. student @uwcse & @uwnlp. NLP. IBM Ph.D. fellow (2022-2023). Meta student researcher (2023-) . ☕️ 🐕 🏃♀️🧗♀️🍳Graham Neubig @gneubig
31K Followers 588 Following Associate professor at CMU, studying natural language processing and machine learning.Yoav Artzi @yoavartzi
13K Followers 163 Following Research/prof @cs_cornell + @cornell_tech🚡 / https://t.co/9YnWry7yHs / https://t.co/3VmRSyYm2d / asso. faculty director @arxiv / building https://t.co/f9QkzO5kaCSergey Levine @svlevine
80K Followers 122 Following Associate Professor at UC Berkeley Co-founder, Physical IntelligenceSara Hooker @sarahookr
39K Followers 7K Following I lead @CohereForAI. Formerly Research @Google Brain @GoogleDeepmind. ML Efficiency at scale, LLMs, @trustworthy_ml. Changing spaces where breakthroughs happen.Zachary Lipton @zacharylipton
59K Followers 2K Following Professor: CMU/@acmi_lab, CTO / CSO: @AbridgeHQ, Creator: @d2l_ai & https://t.co/QQt98VNLUp, Relapsing 🎷Shane Gu @shaneguML
28K Followers 1K Following Research Scientist & Manager @GoogleDeepMind Tokyo/MTV. ex: @GoogleAI Brain, @OpenAI. (JP: @shanegJP)Tal Linzen @tallinzen
16K Followers 894 Following Professor @nyuling and @NYUDataScience, research scientist @GoogleAITim Dettmers @Tim_Dettmers
29K Followers 821 Following PhD Student at @UW. I blog about deep learning and PhD life at https://t.co/Y78KDJJFE7.Colin Raffel @colinraffel
30K Followers 655 Following nonbayesian parameterics, sweet lessons, and random birds. Friend of @srush_nlpFelix Hill @FelixHill84
9K Followers 777 Following Research Scientist, Deepmind I try to think hard about everything I tweet, esp on 90s football and 80s music None of my opinions are really someone else'sAlexandra Ispas @petree3n
0 Followers 28 FollowingDanil Zvyagintsev @danzvyagintsev
154 Followers 2K Following 💻 Top Rated Power BI Developer @Upwork | I write about Data, Design and Analytics | 19K+ on LinkedIn, follow me (link in bio)YunpyoAn @YunpyoAn
448 Followers 850 Following Ph.D. Candidate in Artificial intelligence at UNIST B.S. at UNIST, major CompSci / I usually post Korean...yunquest3885 @yunquest3885
152 Followers 739 Following Owner🏫Creator🧠Promoter🍾Consultant🗣Investor📊God🙏🏽Global🌍Hustler💰Artist🎨Poet🎙Writer✍🏽Activist✊🏽Adventurer🤵🏽Stocks📈Hiker🥾Fitness🏋🏽♂️ AI🤖Shamshad Ahmed @ShamshadAh62425
0 Followers 13 FollowingEhdr @Ehdr12
64 Followers 216 FollowingXinyi Jiang @one_xinyi
13 Followers 50 FollowingPatrick @liup6424
14 Followers 49 Followingshnoon lee @ShnoonL52166
5 Followers 312 FollowingBiqing Qi @BiqingQ
5 Followers 42 Following I am pursuing a Ph.D. in Control Science and Engineering at Harbin Institute of Technology, with joint supervision from Tsinghua University.xuan @xuan1035695
2 Followers 24 FollowingBidroha Gautam @BidrohaG
8 Followers 17 FollowingSpo @DecLxna
26 Followers 1K FollowingPandaya Hardik @techsplitter
0 Followers 31 FollowingGlucose Guardian @teachherhowto
4 Followers 46 Following learning ML and CS. crypto and gambling addictMisael Ferreira @MRF13186
1 Followers 107 Following🩸🛁 @Braillepro
1K Followers 4K Following Of all the bloodbaths in all the towns, in all the world, he ends up in mine. Seeking truth in the lies we tell each other on here.Rakia @DAfangno
10 Followers 129 FollowingGaB @THFC_GaB
413 Followers 864 Following Spurs since 1972! ST holder Paxton Block 515. ST Ulster Rugby #COYS #Stones Kentish man living in Belfast. Support #autismmar ch @ChMarouanee
0 Followers 136 FollowingCryptonymics🍎🚀 @cryptonymics
208 Followers 410 Following the world was boring until 2012 🍎🚀 builder of ideasXinyu Zhao @lucy_xyzhao
1 Followers 32 FollowingEpistemic AI @Epistemicism
643 Followers 3K Following Promptly Prompting. CͤͭͣͤͬͨRͤͭͣͤͬͨEͤͭͣͤͬͨAͤͭͣͤͬͨTͤͭͣͤͬͨEͤͭͣͤͬͨ #AIArt #AIArtCommunityValeria @Valeria4428799
3 Followers 95 FollowingAbg. Bryan Antonio Ba.. @BBazurto68263
131 Followers 3K Following "Eres Suficiente tal y como eres" 👊👋💖Gaara 01 @Gaara01740885
117 Followers 4K FollowingAmir Jevnisek @AmirJevnisek
25 Followers 804 FollowingXiaoyi @April11302592
0 Followers 31 FollowingHenry Che @HenryChe535766
3 Followers 60 FollowingRamya Vinayak @ramyavinayak
42 Followers 69 Following Assistant Professor at UW-Madison. Working on Machine Learning, Statistical Inference and Crowdsourcing.paria @pariawshahi
119 Followers 6K FollowingHpremium @web3nam3
788 Followers 3K Following https://t.co/Tes5ZFnfVs • https://t.co/NuhiRgwvTP https://t.co/3H4X5XEq21 •https://t.co/oOCFfDThZ2 • https://t.co/wMpswOH3Xa • https://t.co/cW7uHNvbfy • https://t.co/VYahFk94rN •https://t.co/Gik8R81APV• https://t.co/Uvx07c8pI2 • https://t.co/yp6o2BXYZH•https://t.co/0V7mofFuIl •📈NYASI JOSHI @N_Joshi48
17 Followers 320 FollowingAimagick @Ai_magick
25 Followers 174 FollowingBALDEZO @Baldezo1004
137 Followers 979 Following 🗣️Soit à l'écoute, tout parle, tout est parole, tout cherche à nous communiquer une connaissance ✨⚡Ben @BenPierce13
178 Followers 223 Following Building companies at Sutter Hill I also really like carsDeqing Sun @DeqingSun
609 Followers 179 FollowingNicholas Meade @ncmeade
129 Followers 150 Following PhD student at @McGillU / @Mila_Quebec; Interested in #NLProc.(((ل()(ل() 'yoav))).. @yoavgo
46K Followers 2K FollowingKyunghyun Cho @kchonyc
61K Followers 2K Following a combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre PC at @nyuniversity (@CILVRatNYU) & @genentech (@PrescientDesign).Sasha Rush @srush_nlp
52K Followers 464 Following Professor, Programmer in NYC. Cornell Tech, Hugging Face 🤗 https://t.co/cZl0wTfqGzChristopher Manning @chrmanning
127K Followers 116 Following Director, @StanfordAILab. Assoc. Director, @StanfordHAI. Founder, @stanfordnlp. Prof. CS & Linguistics, @Stanford. IP @aixventureshq. 🇦🇺 Do #NLProc & #AI. 👋NeurIPS Conference @NeurIPSConf
112K Followers 35 Following New Orleans, Dec 10-16, 23. https://t.co/ga8aOw615g Tweets to this account are not monitored. Please send feedback to [email protected].Jacob Andreas @jacobandreas
14K Followers 958 Following Teaching computers to read. Assoc. prof @MITEECS / @MIT_CSAIL (he/him). https://t.co/5kCnXHjtlY https://t.co/2A3qF5vdJwAnthropic @AnthropicAI
262K Followers 26 Following We're an AI safety and research company that builds reliable, interpretable, and steerable AI systems. Talk to our AI assistant Claude at https://t.co/aRbQ97uk4d.Sam Bowman @sleepinyourhat
35K Followers 3K Following AI alignment + LLMs at NYU & Anthropic. Views not employers'. No relation to @s8mb. I think you should join @givingwhatwecan.Dan Roy @roydanroy
45K Followers 2K Following ML / AI researcher, emphasis on theory. Research Director and Canada CIFAR AI Chair, @VectorInst Professor, @UofT (Statistics/CS)Graham Neubig @gneubig
31K Followers 588 Following Associate professor at CMU, studying natural language processing and machine learning.Yoav Artzi @yoavartzi
13K Followers 163 Following Research/prof @cs_cornell + @cornell_tech🚡 / https://t.co/9YnWry7yHs / https://t.co/3VmRSyYm2d / asso. faculty director @arxiv / building https://t.co/f9QkzO5kaCSergey Levine @svlevine
80K Followers 122 Following Associate Professor at UC Berkeley Co-founder, Physical IntelligenceSara Hooker @sarahookr
39K Followers 7K Following I lead @CohereForAI. Formerly Research @Google Brain @GoogleDeepmind. ML Efficiency at scale, LLMs, @trustworthy_ml. Changing spaces where breakthroughs happen.Zachary Lipton @zacharylipton
59K Followers 2K Following Professor: CMU/@acmi_lab, CTO / CSO: @AbridgeHQ, Creator: @d2l_ai & https://t.co/QQt98VNLUp, Relapsing 🎷Tal Linzen @tallinzen
16K Followers 894 Following Professor @nyuling and @NYUDataScience, research scientist @GoogleAITim Dettmers @Tim_Dettmers
29K Followers 821 Following PhD Student at @UW. I blog about deep learning and PhD life at https://t.co/Y78KDJJFE7.Colin Raffel @colinraffel
30K Followers 655 Following nonbayesian parameterics, sweet lessons, and random birds. Friend of @srush_nlpCognition @cognition_labs
123K Followers 19 Following Makers of Devin, the first AI software engineer. We are an applied AI lab focused on reasoning, and code is just the beginning. Join us: https://t.co/tpfZwEwGiqShayne Longpre @ShayneRedford
4K Followers 997 Following PhD @MIT. Prev: @Google Brain, @apple ML, @stanfordnlp. 🇨🇦 Interests: AI/ML/NLP, Data-centric AI, transparency & societal impactLLM360 @llm360
1K Followers 50 Following A framework for open-source LLMs to foster transparency, trust, and collaborative research.Sayash Kapoor @sayashk
5K Followers 1K Following CS PhD candidate @PrincetonCITP. I study the societal impact of AI. Currently writing a book on AI Snake Oil: https://t.co/tb2lXSP2gBHassan @nutlope
74K Followers 948 Following Developer Relations @togethercompute. Building AI apps like @roomGPT and https://t.co/3NFbnMUHJP. Tweeting about AI, web dev, and my side projects.AI Cases Bot @ai_cases_bot
298 Followers 2 Following Updates on AI legal cases. Hosted by @freelawproject. Also at @ai.bots.law on BlueSky and @[email protected]Dan Hendrycks @DanHendrycks
17K Followers 81 Following • Director of the Center for AI Safety (https://t.co/ahs3LYCpqv) • GELU/ImageNet-C/MMLU/safety groundwork • PhD in AI from UC Berkeley https://t.co/rgXHAnYAsQ https://t.co/YtGtDh1aAVYi-01.AI @01AI_Yi
5K Followers 8 Following A global company building AI 2.0 platform and applicationsDeepSeek @deepseek_ai
4K Followers 0 Following Unravel the mystery of AGI with curiosity. Answer the essential question with long-termism.Mistral AI @MistralAI
91K Followers 0 Following Fast, open-source and secure language models. Join us https://t.co/INALdNGvCPArthur Mensch @arthurmensch
40K Followers 874 Following Co-founder and CEO @MistralAI. Apply https://t.co/yHGRZAtjcxMarietje Schaake @MarietjeSchaake
71K Followers 25K Following [email protected] 📩 @stanfordcyber 👾 @StanfordHAI 💻 Columnist @FT 🇪🇺 MEP 2009-2019 📕Author of The Tech Coup 🌎 UN AI Advisory BodyMax Ryabinin @m_ryabinin
1K Followers 167 Following Large-scale deep learning & research @togethercompute Learning@home/Hivemind author, PhD in decentralized DLIgor Babuschkin @ibab
44K Followers 685 Following Maybe the real AGI was the friends we made along the way. @xAIKevin Klyman @kevin_klyman
3K Followers 3K Following AI policy @StanfordHAI + avoiding war with China @BelferCenter. Words in @ForeignPolicy @TechCrunch et al. Ex @UNGlobalPulse @BanKillerRobots @hrwVipul Ved Prakash @vipulved
5K Followers 841 Following Building an AI supercomputer out of spare internet parts. Founder, CEO @togethercomputeAlex Engler @AlexCEngler
5K Followers 2K Following Senior Policy Advisor for AI @WHOSTP | Predict not the car but the traffic jam | Alum: @BrookingsGov @urbaninstitute | Taught: @UChicagoCAPP @McCourtSchoollmsys.org @lmsysorg
38K Followers 172 Following Large Model Systems Organization. We created Vicuna and Chatbot Arena! Compare 30+ LLMs (GPT-4/Claude/Llamas) side-by-side at https://t.co/IDFeIDIOtmInflection AI @inflectionAI
49K Followers 3 Following We are an AI studio creating a personal AI for everyone. Our first is @pi, a supportive and empathetic conversational AI.Arvind Narayanan @random_walker
119K Followers 413 Following Princeton CS prof. Director @PrincetonCITP. I write about the societal impact of AI, tech ethics, & social media platforms. BOOK: AI Snake Oil. Views mine.Xuechen Li @lxuechen
2K Followers 901 Following Building intelligence @xai. PhD @Stanford. Undergrad @UofT. Worked at @GoogleAI @MSFTResearch @Vectorinst. I go by Chen.Peter Henderson @PeterHndrsn
2K Followers 893 Following Assistant Professor @PrincetonCS @PrincetonSPIA and @PrincetonCITP 📚JD/PhD (Law+AI) @StanfordLilian Weng @lilianweng
95K Followers 148 Following Working on AI safety, past on robotics, applied research @OpenAI; Writing ML blogs to help myself & others to learn; Ideas my own.Jan Leike @janleike
44K Followers 322 Following ML Researcher, co-leading Superalignment @OpenAI. Optimizing for a post-AGI future where humanity flourishes.Dan Fu @realDanFu
4K Followers 176 Following CS PhD Candidate at Stanford, systems for machine learning. Sometimes YouTuber/podcaster. Academic Partner, @togethercompute.Omar Khattab @lateinteraction
11K Followers 2K Following CS PhD candidate @StanfordNLP. 2022 Apple Scholar in AI/ML. Author of ColBERT (https://t.co/2ZtgXoa1np), DSPy (https://t.co/BH7WmMKDXR), & various retrieval & LM systems.Tri Dao @tri_dao
19K Followers 365 Following Incoming Asst. Prof @PrincetonCS, Chief Scientist @togethercompute. Machine learning & systems.Center for Research o.. @StanfordCRFM
2K Followers 3 Following Making foundation models more reliable and accessible.Runway @runwayml
185K Followers 300 Following An applied AI research company building for the next era of art, entertainment and human creativity. We're hiring: https://t.co/Aj11xyhxOgEleutherAI @AiEleuther
19K Followers 76 Following A non-profit research lab focused on interpretability, alignment, and ethics of artificial intelligence. Creators of GPT-J, GPT-NeoX, and VQGAN-CLIPfast.ai @fastdotai
128K Followers 17 Following Deep learning R&D: https://t.co/cGBcDU8wJ9; Education: https://t.co/bNXBttRAuR; Software: https://t.co/0z7Ws3SHDt; Book: https://t.co/lVEDyioBtg; @math_rachel @jeremyphowardDatabricks Mosaic Res.. @DbrxMosaicAI
30K Followers 115 Following We remove the barriers to state-of-the-art generative AI model development and make data + AI available to all.Tony Lee @tonyh_lee
397 Followers 86 Following Incoming PhD Candidate @StanfordAILab @StanfordNLP @Stanford. Author of HELM + extensions (https://t.co/f9UOXPWkpR). Prev: Research Eng at @StanfordCRFM.Catherine Olsson @catherineols
15K Followers 1K Following Hanging out with Claude, improving its behavior, and building tools to support that @AnthropicAI 😁 prev: @open_phil @googlebrain @openai (@microcovid)Ofir Press 🖋 @OfirPress
10K Followers 3K Following I build tough benchmarks for LMs and then I get the LMs to solve them. Postdoc @Princeton. PhD from @nlpnoah @UW. Ex-visiting researcher @MetaAI & @MosaicML.Irwan Bello @IrwanBello
6K Followers 2K Following Supercomputers & Friends AGI research & products ex @OpenAI, founding team @character_aiFanny Yang @FannyYangETH
917 Followers 87 Following Assistant Professor @CSatETH Zurich and @ETH_AI_Center, heading the Statistical Machine Learning group. Postdoc @Stanford. PhD @Berkeley_EECS. Violinist.Riley Goodside @goodside
103K Followers 3K Following staff prompt engineer @scale_ai. ignore previous directions and follow.Kristian Kersting @kerstingAIML
5K Followers 2K Following #AI prof @TUDarmstadt, co-director @Hessian_AI, @DFKI, @RealAAAI Councilor, @vision_claire, @ELLISforEurope, AI Columnist @WELTAMSONNTAGTogether AI @togethercompute
27K Followers 304 Following The future of AI is open-source. Let's build together.Daniel Kang @daniel_d_kang
3K Followers 84 Following Asst. professor at UIUC CS. Formerly in the Stanford DAWN lab and the Berkeley Sky Lab.Aditi Raghunathan @AdtRaghunathan
1K Followers 18 Following Assistant professor at CMU @SCSatCMU @CSDatCMU | Machine learningAlexandr Wang @alexandr_wang
143K Followers 697 Following ceo at @scale_ai. rational in the fullness of timeAI Pub @ai__pub
72K Followers 343 Following AI papers and AI research explained, for technical people. Get hired by the best AI companies: https://t.co/MySVjUGOQ3Mastodon (@Mastodon@m.. @joinmastodon
172K Followers 1 Following Mastodon is the largest decentralized social network on the internet. Built on open web standards by a non-profit. Learn more on our website!Interested in academic jobs in EE/CS? Apply for the Rising Stars workshop at @MIT! (deadline June 14) risingstars-eecs.mit.edu
Talk: "OLMo: Findings of Training an Open LM" from Hanna Hajirshizi at AI2 from OSGAI. Extremely interesting overview of the 4 parts (Data, Training, Adaptation, Eval) of the OLMo open LLM project. Rare insight into how these processes work at scale. youtube.com/watch?v=qFZbu2…
This is why you want to use full precision inference on @togethercompute
Llama 3 degrades more than Llama 2 when quantized. Probably because Llama 3, trained on a record 15T tokens, captures extremely nuanced data relationships, utilizing even the minutest decimals in BF16 precision fully. Making it more sensitive to quantization degradation.…
Could agents driven by powerful language models perform machine learning experimentation effectively? Our MLAgentBench paper is updated on arxiv! arxiv.org/pdf/2310.03302 Now we include more results from claude v3 Opus, gpt4 turbo, mixtral and gemini pro! Try out MLAgentbench…
Has been super fun working on Llama 3! Really excited about the models yet to come! ai.meta.com/blog/meta-llam…
Sometimes I randomly think about how so incredibly *lucky* we are that Alondra was appointed OSTP Director when she was - the AI Bill of Rights came at such the right time & continues to inform everything from the EO & OMB guidelines to state and municipal legislation. Like, wow.
How can we maximize the benefits of artificial intelligence while minimizing its risks? Click here to read about @alondra's recent IPR and @MedillSchool lecture on governing AI. spr.ly/6012bQ7Z4
A lot of the insider knowledge on how to build an LLM has gone underground in the last 24 months. We promised to build #SnowflakeArctic in the open, and here we are, with the third edition of our cookbook series, this time on data ... Data ablations are the lifeblood of any LLM…
This weekend, we’re Natural Lake Processing! #NLProc
We are very excited that our first GH200 nodes have arrived in TACC for our GenAI center. Here is one. Fun facts: NVIDIA makes GH200 'superchips' (i.e. modules), a GH200 DGX box and a GH200 rack, which are all different. As Dan Stanzione, our TACC director, kindly explained…
Probably the best evaluation pipeline for LLMs
HELM Lite v1.2.0 is out! Datasets: NarrativeQA, NaturalQA, OpenbookQA, MMLU, MATH, GSM8K, LegalBench, MedQA, WMT14 Results (we still need to add Claude 3, which requires more prompt finagling): crfm.stanford.edu/helm/lite/v1.2…
Together AI and Snowflake partner to bring their state-of-the-art Arctic LLM to enterprise customers. Experience Arctic on Together Inference with best in class performance. api.together.xyz/playground/cha…
Excited to partner w/ @vipulved @percyliang @tri_dao and team on this!
Together AI and Snowflake partner to bring their state-of-the-art Arctic LLM to enterprise customers. Experience Arctic on Together Inference with best in class performance. api.together.xyz/playground/cha…
One year ago, I left Google Brain (now DeepMind) to join a very early startup. We had fewer than 10 people at that time, and have grown many times since. Today, I am extremely proud to share our milestone. We are Augment. You can read about us here. techcrunch.com/2024/04/24/eri…
oh no not this again
A lot of the insider knowledge on how to build an LLM has gone underground in the last 24 months. We are going to build #SnowflakeArctic in the open Model arch ablations, training and inference system performance, dataset and data composition ablations, post-training fun, big…
phi-3 is here, and it's ... good :-). I made a quick short demo to give you a feel of what phi-3-mini (3.8B) can do. Stay tuned for the open weights release and more announcements tomorrow morning! (And ofc this wouldn't be complete without the usual table of benchmarks!)
SoTA LLMs typically exhibit 99%+ non-zero activations, but it turns out that they are still intrinsically quite sparse! We introduce CATS, a simple post-training technique that achieves 50% activation sparsity for MLP layers with almost no drop in downstream evals, while…
I have recently seen some use "open access" in place of open when referring to foundation models. I don't understand this word choice, since to me it inherits similar defects to using "open source". Namely, both "open source" and "open access" have established meanings.
Some personal updates: I joined OpenAI a few months ago, working on all things robustness/safety/privacy. Also, we are working to publish more of our safety work. See my first project here below, where we make initial progress on prompt injections and other attacks!
Introducing the Instruction Hierarchy, our latest safety research to advance robustness for prompt injections and other ways of tricking LLMs into executing unsafe actions. More details: arxiv.org/abs/2404.13208
This take on the FineWeb release is one of the most interesting feedback and also a reason FineWeb is very different from even larger datasets like RedPajama-V2 (which is double its size!) Surprisingly, the size of the dataset of 15T tokens is not very important, what is much…
People seem to over-index on the 15T number after Llama 3. While the number matters, what is even more important is the quality and diversity of those tokens. If there was a good way to measure those, that would have been an impressive result to report.