- Tweets: 121
- Followers: 736
- Following: 394
- Likes: 611
The standard way to supervise ML models is to give many, many examples. We can provide supervision *far* more efficiently: 1. verbally describe how the model fails 2. translate feedback w/ VLMs 3. reweight the data to globally fix the misconception Paper: arxiv.org/abs/2402.03715
Join us at the NeurIPS Workshop on Distribution Shifts (DistShift) tomorrow! When: Friday, Dec 15, 9am-5pm Where: Room R06-R09 Website: sites.google.com/view/distshift… Virtual site: neurips.cc/virtual/2023/w…
We just released code for our paper! github.com/tajwarfahim/dcm
Interested in detecting text generated by language models? Come see poster #609 in Exhibit Hall 1 at #ICML2023 **today** from 11am-12:30pm! You can also come to the oral presentation in Ballroom C (oral session B1) at 4:32pm 😊
Huaxiu Yao @HuaxiuYaoML
3K Followers 527 Following Assistant Professor of Computer Science @UNC @unccs @uncsdss | Postdoc @StanfordAILab | Ph.D. @PennState | #foundationmodels, #AISafety, #AIforScience | he/him
Yiding Jiang @yidingjiang
1K Followers 469 Following PhD student @mldcmu @SCSatCMU. Formerly intern @MetaAI, AI resident @GoogleAI. BS from @Berkeley_EECS. Trying to understand stuff.
Ananya Kumar @ananyaku
4K Followers 472 Following Researcher at @openai Previously PhD at Stanford University (@StanfordAILab) advised by Percy Liang and Tengyu Ma
rishi @RishiBommasani
4K Followers 2K Following Stanford CS PhD @StanfordCRFM @StanfordNLP @StanfordAILab @StanfordHAI Advisers: @percyliang @jurafsky Previous: @CornellCIS @clairecardie #FoundationModels
Dimitris Papailiopoul.. @DimitrisPapail
12K Followers 977 Following prof @ wisconsin; thinking about transformers; learning in context; babas of Inez Lily
Polina Kirichenko @polkirichenko
3K Followers 1K Following PhD student at New York University, Visiting Researcher at @MetaAI FAIR Labs 🇺🇦
Kangwook Lee @Kangwook_Lee
2K Followers 676 Following Assistant Professor, ECE, UW-Madison / Leading deep learning research @ KRAFTON
Amelia-rose Schwartzm.. @RoseSchwar19143
91 Followers 5K Following
Anette Fazekas @AnetteFaze43248
63 Followers 5K Following
Madelyn Aldinger @MadelynA38694
84 Followers 5K Following
Magnolia Brustkern @MagnoliaB48789
74 Followers 5K Following
Annette Hoyland @AnnetHoylan
85 Followers 5K Following
Wesson @Wesson0872
8 Followers 85 Following
Jane Lautman @jane_laut
58 Followers 5K Following
Kaja Jansons @KaJansons
72 Followers 5K Following
Laura Liu @lauraqq
24 Followers 412 Following
Britta Davy @BrittaDavy2572
70 Followers 5K Following
Juan Hmmm @JuanAH03488233
76 Followers 3K Following
Tammara Karangelen @TKarangele28904
52 Followers 5K Following
Aaditya ; @Aaditya26082004
541 Followers 7K Following CS'26 • Machine Learning • Open-Source • Web Dev. • Algorithms • Jai Shree Krishna 🦚🪈
Lorelei Kreig @LorelKr
36 Followers 4K Following
RMO Digital @RMODigital
98 Followers 2K Following
Camellia Udo @CamelliaU58359
83 Followers 5K Following
Noreen Sevy @noreen_nore
68 Followers 5K Following
Kate Conyer @KateConye
65 Followers 5K Following
Suzie Grossberg @GrossbergS75186
58 Followers 5K Following
Fasa fiso @fasafiso_22
8 Followers 420 Following This account is not fake; it is an account used on the basis of the right to remain anonymous. I kiss the trolls here.
Serbest Çağrışım @SerbestCagrsm
410 Followers 323 Following Free but high-quality associations. Artificial Intelligence 🤖 Electric Vehicles 🔋 Technology 📱 Analysis 📈
Danuta Voeltner @danuta92786
65 Followers 5K Following
Tian Gao @TianGao_19
220 Followers 149 Following CS PhD @Stanford | Prev: @UTAustin and @Tsinghua_Uni | Embodied AI/RL/Robotics
Katie Nohel @KatieNohel5306
57 Followers 5K Following
Eva Louise Marie Gabr.. @e681554349
9 Followers 3K Following
Onkar hanchate @Onkarhanchate1
103 Followers 1K Following 💡 Building something amazing! 👨💻Full-stack developer 📚Expanding my knowledge and skillset
Eufemia Lipner @EufeLipne
45 Followers 5K Following
Nieves Strowder @NStrowder89252
79 Followers 5K Following
Allen Schmaltz @Allen_Schmaltz
365 Followers 809 Following AI/ML @ Reexpress AI (https://t.co/z0yUIZBt1l) | Research: AI:=Introspection+Updatability+Uncertainty (Similarity⧉Distance⧈Magnitude⧇) | Prev: @Harvard @Stanford
Talisha Sagraves @TalisSagrave
41 Followers 5K Following
Izzy Talk @iz_talk
71 Followers 5K Following
Stefani Atienza @AtienzaSte44786
84 Followers 5K Following
Calvin McCarter @CalvinMccarter
420 Followers 1K Following AI + biology. Views not always my own, but never more than my own. Occasional notes at https://t.co/KmxL03ZcAR.
Raileanu @raileanu13
0 Followers 60 Following
Jonathan Klein @jonathanbklein
309 Followers 1K Following Founder & CEO of @Teknoir - Operational AI for the physical world | Former Cofounder & CEO of Cimation (acquired by NYSE: ACN)
Jiachen Luo @jiachenluo96
100 Followers 1K Following
srlxprmntsleon @srlxprmntsleon
141 Followers 3K Following
Azade Sanjari @azades
607 Followers 2K Following Data Scientist | Machine Learning Engineer | Education Advocate #زن_زندگی_آزادی #AI #MachineLearning #DataScience #WomenInTech #WomenInAI #EducationMatters
cain1517 — e/acc .. @cain151714
237 Followers 2K Following Technopositivism - Longevity - Singularity - Pro UBI - Pro Free Speech - Fight against Fascism from the Left, Right and Centre
Prhp1 @mojrad24
17 Followers 4K Following not a bot, Just someone with a strong thirst for knowledge
AK @_akhaliq
311K Followers 3K Following AI research paper tweets, ML @Gradio (acq. by @HuggingFace 🤗) dm for promo follow on Hugging Face: https://t.co/q2Qoey80Gx
Yann LeCun @ylecun
713K Followers 718 Following Professor at NYU. Chief AI Scientist at Meta. Researcher in AI, Machine Learning, Robotics, etc. ACM Turing Award Laureate.
Andrej Karpathy @karpathy
981K Followers 905 Following 🧑🍳. Previously Director of AI @ Tesla, founding team @ OpenAI, CS231n/PhD @ Stanford. I like to train large deep neural nets 🧠🤖💥
Kyunghyun Cho @kchonyc
61K Followers 2K Following a combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre PC at @nyuniversity (@CILVRatNYU) & @genentech (@PrescientDesign).
Percy Liang @percyliang
49K Followers 408 Following Associate Professor in computer science @Stanford @StanfordHAI @StanfordCRFM @StanfordAILab @stanfordnlp | cofounder @togethercompute | Pianist
Jim Fan @DrJimFan
230K Followers 3K Following @NVIDIA Sr. Research Manager & Lead of Embodied AI (GEAR Lab). Creating foundation models for Humanoid Robots & Gaming. @Stanford Ph.D. @OpenAI's first intern.
Google DeepMind @GoogleDeepMind
945K Followers 275 Following We’re a team of scientists, engineers, ethicists and more, committed to solving intelligence, to advance science and benefit humanity.
Sergey Levine @svlevine
80K Followers 122 Following Associate Professor at UC Berkeley Co-founder, Physical Intelligence
Dan Roy @roydanroy
45K Followers 2K Following ML / AI researcher, emphasis on theory. Research Director and Canada CIFAR AI Chair, @VectorInst Professor, @UofT (Statistics/CS)
Rosanne Liu @savvyRL
33K Followers 969 Following Cofounded & running @ml_collective. Host of Deep Learning Classics & Trends. Research at Google DeepMind. DEI/DIA Chair of ICLR & NeurIPS. Writing https://t.co/IbycyGfnDR
Yi Tay @YiTayML
29K Followers 97 Following chief scientist / cofounder @RekaAILabs 🫠 past: research scientist @google brain 🤯 currently learning to be a dad 🍼
Huaxiu Yao @HuaxiuYaoML
3K Followers 527 Following Assistant Professor of Computer Science @UNC @unccs @uncsdss | Postdoc @StanfordAILab | Ph.D. @PennState | #foundationmodels, #AISafety, #AIforScience | he/him
Karol Hausman @hausman_k
22K Followers 141 Following @Physical_int ex: researcher @GoogleAI/@DeepMind, adj. Prof. @Stanford. Into robots, AI, NBA, philosophy, soccer and almond croissants. 🇵🇱🇺🇸
Yi Ma @YiMaTweets
71K Followers 124 Following Chair Professor in AI, Director of IDS, Head of CS, HKU; Professor of EECS, Berkeley; Author of Book: High-Dim Data Analysis, https://t.co/gwaqMJp8av.
Kevin Patrick Murphy @sirbayes
43K Followers 334 Following Research Scientist at Google Brain / Deepmind. Interested in Bayesian Machine Learning.
Anthropic @AnthropicAI
264K Followers 26 Following We're an AI safety and research company that builds reliable, interpretable, and steerable AI systems. Talk to our AI assistant Claude at https://t.co/aRbQ97uk4d.
Ziming Liu @ZimingLiu11
8K Followers 632 Following PhD student@MIT, AI for Physics/Science, Science of Intelligence & Interpretability for Science
Max Tegmark @tegmark
146K Followers 29 Following Known as Mad Max for my unorthodox ideas and passion for adventure, my scientific interests range from artificial intelligence to the ultimate nature of reality
Kayo Yin @kayo_yin
8K Followers 561 Following PhD student @berkeley_ai @berkeleynlp working on interpretability and signed languages. Former @msftresearch @deepmind @carnegiemellon @polytechnique. 🇫🇷🇯🇵
Tim Carden @timjcarden
13K Followers 675 Following Co-Founder at https://t.co/1ocm7Vmvwi. Building your favorite personal brands on X | 80M monthly impressions for clients
Alexander Terenin @avt_im
6K Followers 950 Following Machine learning, artificial intelligence, decision theory | anti-ideological | thinking carefully about incentives | Assistant Research Professor @Cornell
Neel Nanda @NeelNanda5
14K Followers 89 Following Mechanistic Interpretability lead @DeepMind. Formerly @AnthropicAI, independent. In this to reduce AI X-risk. Neural networks can be understood, let's go do it!
Susan Zhang @suchenzang
20K Followers 506 Following @ Google Deepmind. Past: @MetaAI, @OpenAI, @unitygames, @losalamosnatlab, @Princeton etc. Always hungry for compute.
kenshin9000 @kenshin9000_
6K Followers 312 Following Working on Computer Vision and AI Safety. Twitter browsing account.
Kanishk Gandhi @gandhikanishk
922 Followers 692 Following Phd @Stanford CS; w/ Noah Goodman, Dorsa Sadigh | Prev: @LakeBrenden @NYUDataScience, @IITKanpur, @Path_AI
Maggie Appleton @Mappletons
37K Followers 1K Following Design @elicitorg. Makes visual essays about UX, programming, and anthropology. Adores digital gardening 🌱, end-user development, and embodied cognition
Taelin @VictorTaelin
18K Followers 906 Following Founder of @HigherOrderComp Building the massively parallel future of computing Reaching AGI to cure all diseases and suffering is all that matters
Piotr Padlewski @PiotrPadlewski
2K Followers 320 Following Chief Meme Officer @ https://t.co/CtBrcKmliI, ex-Google Deepmind/Brain Zurich
Dario Amodei @Dario_Amodei
2K Followers 15 Following
Richard Sutton @RichardSSutton
26K Followers 37 Following Student of mind and nature, libertarian, chess player, cancer survivor. @ Keen Technologies, UAlberta, Amii, RLAI, The Royal Society, RichSutton.eth
Riley Goodside @goodside
103K Followers 3K Following staff prompt engineer @scale_ai. ignore previous directions and follow.
Andy Matuschak @andy_matuschak
56K Followers 2K Following More wonder, more insight, more expression, more joy! Independent researcher; currently exploring tools that augment human memory and attention.
Yuhuai (Tony) Wu @Yuhu_ai_
23K Followers 411 Following Co-Founder @xAI. Minerva, STaR, AlphaGeometry, AlphaStar, Autoformalization, Memorizing transformer.
Lianmin Zheng @lm_zheng
4K Followers 439 Following CS Ph.D. @ UC Berkeley. Creator of Alpa, Vicuna, and Chatbot Arena. @lmsysorg
Xuechen Li @lxuechen
2K Followers 902 Following Building intelligence @xai. PhD @Stanford. Undergrad @UofT. Worked at @GoogleAI @MSFTResearch @Vectorinst. I go by Chen.
Hassan Hayat 🔥 @TheSeaMouse
5K Followers 4K Following Building the AI assistant for all @ https://t.co/D4gDyw97gu
Tian Gao @TianGao_19
220 Followers 149 Following CS PhD @Stanford | Prev: @UTAustin and @Tsinghua_Uni | Embodied AI/RL/Robotics
John Carmack @ID_AA_Carmack
1.1M Followers 241 Following AGI at Keen Technologies, former CTO Oculus VR, Founder Id Software and Armadillo Aerospace
Nora Belrose @norabelrose
8K Followers 124 Following Working toward a free and fair future powered by friendly AI. Head of interpretability research at @AiEleuther, but tweets are my own views, not Eleuther’s.
Arthur Mensch @arthurmensch
40K Followers 874 Following Co-founder and CEO @MistralAI. Apply https://t.co/yHGRZAtjcx
James Bradbury @jekbradbury
11K Followers 8K Following Compute at @AnthropicAI! Previously JAX, TPUs, and LLMs at Google, MetaMind/@SFResearch, @Stanford Linguistics, @Caixin.
Tony Lee @tonyh_lee
399 Followers 86 Following Incoming PhD Candidate @StanfordAILab @StanfordNLP @Stanford. Author of HELM + extensions (https://t.co/f9UOXPWkpR). Prev: Research Eng at @StanfordCRFM.
Sharad Vikram @sharadvikram
1K Followers 510 Following Researcher @ Google Deepmind. I work on JAX + Pallas (https://t.co/lPMsq3yzgL) and Gemini. In the past I worked on Oryx and TFP. I like learning.
Enrique Piqueras @epiqueras1
2K Followers 234 Following Organizing the world's information and making it universally accessible and useful using JAX @Google @Deepmind.
Dwarkesh Patel @dwarkesh_sp
55K Followers 701 Following Being pretrained Host of Dwarkesh Podcast https://t.co/3SXlu7fy6N https://t.co/rEhnfYywXY https://t.co/hQfIWdM1Un
Trenton Bricken @TrentonBricken
7K Followers 2K Following Trying to figure out what makes minds and machines go "Beep Bop!" @AnthropicAI
Sholto Douglas @_sholtodouglas
15K Followers 860 Following Scaling Gemini @Deepmind - working towards intelligence too cheap to meter
Rob Haisfield (robhai.. @RobertHaisfield
7K Followers 3K Following Imagining new internets @websim_ai. GenAI, TfT, BeSci, HCI, UX. Ex-Tana, Edge & Node, Spark Wave
Jerry Wei @JerryWeiAI
5K Followers 270 Following 🧐 Improving and aligning large language models 🧠 Research Engineer @GoogleDeepMind ⏰ Past: @Stanford, @Google Brain
Aviral Kumar @aviral_kumar2
2K Followers 338 Following Research Scientist at Google DeepMind. Incoming Assistant Professor of CS & ML at CMU (Fall 2024). PhD from UC Berkeley.
𝔊𝔴𝔢𝔯𝔫 @gwern
42K Followers 88 Following Internet besserwisser; pedantic, mean reply guy. 𝘞𝘢𝘵𝘢𝘴𝘩𝘪 𝘬𝘪𝘯𝘪𝘯𝘢𝘳𝘪𝘮𝘢𝘴𝘶! (Follow requests ignored due to terrible UI.)
noahdgoodman @noahdgoodman
2K Followers 109 Following Professor of natural and artificial intelligence @Stanford. Research Scientist at @GoogleDeepMind. (@StanfordNLP @StanfordAILab etc)
Garry Tan @garrytan
434K Followers 4K Following President & CEO @ycombinator —Founder @Initialized—PM/designer/engineer who helps founders—YouTuber—San Francisco Democrat accelerating the boom loop—e/acc
Peter J. Liu @peterjliu
4K Followers 2K Following Research Scientist @ Google B̵r̵a̵i̵n̵ DeepMind, frontier language models research (aka chatbot engineer). Opinions are my own. 🤖🔄🚀
Mustafa Suleyman @mustafasuleyman
131K Followers 536 Following CEO, Microsoft AI | Author: The Coming Wave | Past: Co-founder, @InflectionAI & @GoogleDeepMind
Anikait Singh @Anikait_Singh_
126 Followers 265 Following PhD Student @StanfordAILab, Previously Student Researcher @GoogleDeepMind, Undergraduate @Berkeley_AI Deep Learning, Reinforcement Learning, Robotics.
🌴 Brian @wfhbrian
9K Followers 9 Following 🪄 https://t.co/PCtPXaKAqS • an open-source Obsidian Plugin • real-time relevant notes • chat with your notes • cloud-less ChatGPT integration • user-funded 🦄🌴
MLPs are so foundational, but are there alternatives? MLPs place activation functions on neurons, but can we instead place (learnable) activation functions on weights? Yes, we KAN! We propose Kolmogorov-Arnold Networks (KAN), which are more accurate and interpretable than MLPs.🧵
Thrilled to release 🌟STaRK 🌟 - A large-scale LLM retrieval benchmark on semi-structured knowledge bases. While LLMs excel at reasoning and semantic retrieval, they struggle with more complex tasks. Especially when real-world user queries require a combination of unstructured…
@cHHillee asked for an example of why you might want to ignore the constraint you're optimising over, and so i've knocked together the picture i had in my head
also true in optimisation algs! if you wanna optimise (f + g)(x), often much faster to optimise f(x) + g(y) + error(x, y)
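A minimal sketch of the splitting idea in the tweet above, under stated assumptions: the toy objective f(x) = (x - 1)^2 and g(y) = |y|, the penalty weight `rho`, and the helper names are all made up for illustration, and ADMM is used here as one well-known way to handle the "error(x, y)" coupling term. The tweet does not name a specific algorithm. Each half of the problem has a closed-form update once x and y are decoupled, which is the usual reason splitting can be cheaper than attacking (f + g)(x) directly.

```python
def soft_threshold(v, t):
    """Closed-form minimiser of |y| + (1 / (2 * t)) * (y - v) ** 2."""
    if v > t:
        return v - t
    if v < -t:
        return v + t
    return 0.0


def admm_split(rho=1.0, iters=200):
    """Minimise f(x) + g(y) subject to x == y, with
    f(x) = (x - 1)^2 and g(y) = |y| (hypothetical toy choices).

    The coupling "error" term is the scaled augmented-Lagrangian
    penalty (rho / 2) * (x - y + u)^2, where u is the scaled dual variable.
    """
    x = y = u = 0.0
    for _ in range(iters):
        # x-step: minimise (x - 1)^2 + (rho / 2) * (x - y + u)^2 in closed form
        x = (2.0 + rho * (y - u)) / (2.0 + rho)
        # y-step: minimise |y| + (rho / 2) * (x - y + u)^2 via soft-thresholding
        y = soft_threshold(x + u, 1.0 / rho)
        # dual step: accumulate the remaining disagreement between x and y
        u += x - y
    return x, y


x, y = admm_split()
print(x, y)
```

For this toy problem the unsplit objective (x - 1)^2 + |x| has its minimum at x = 0.5, so both `x` and `y` should approach 0.5 as the iterations proceed.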
✨New Paper Alert✨ Excited to introduce ExPO, an extremely simple method to boost LLMs' alignment with human preference, via weak-to-strong model extrapolation 👇 #LLMs #MachineLearning #NLProc #ArtificialIntelligence #AI
Happy to share our work on preference learning methods for LLMs. Key insights: 1. Use more on-policy samples > off-policy samples 2. Contrastive DPO > Pref-FT. Also we provide insights on DPO's training mechanism. 3. Theoretical unification under mode-covering/seeking KL
Many LLM fine-tuning methods. Unclear what you should use & why? In our new paper, we did an extensive study of on-policy RL, supervised & offline contrastive methods (DPO, IPO) to answer this... 🧵⬇️ On-policy > offline, mode-seeking > mode-covering understanding-rlhf.github.io
Excited to announce that I will be continuing my next chapter at @StanfordAILab for my Ph.D. in computer science!
“Can we get a new text analysis tool?” “No—we have Topic Model at home” Topic Model at home: outputs vague keywords; needs constant parameter fiddling🫠 Is there a better way? We introduce LLooM, a concept induction tool to explore text data in terms of interpretable concepts🧵
Introducing 𝐀𝐋𝐎𝐇𝐀 𝐔𝐧𝐥𝐞𝐚𝐬𝐡𝐞𝐝 🌋 - Pushing the boundaries of dexterity with low-cost robots and AI. @GoogleDeepMind Finally got to share some videos after a few months. Robots are fully autonomous filmed in one continuous shot. Enjoy!
1/Let me tell you the dark secrets 🔮 behind developing *new* scaling laws that no one wants you to know. A tale of “Another day. Another (failed) Scaling Law”. Working through key design decisions, limited compute, and other difficulties🧵.
1/ 🥁Scaling Laws for Data Filtering 🥁 TLDR: Data Curation *cannot* be compute agnostic! In our #CVPR2024 paper, we develop the first scaling laws for heterogeneous & limited web data. w/@goyalsachin007 @zacharylipton @AdtRaghunathan @zicokolter 📝:arxiv.org/abs/2404.07177
Suppose that we train two INRs: One for a natural image, and another for its pixel-shuffled version. Which INR would fit faster? Expected: Natural Image Reality: Pixel-permuted image 🤯 (under some conditions) We look closer into when & why this happens in our #CVPR2024 oral.
🧵Let me explain why the early ascent phenomenon occurs🔥 We must first understand that in-context learning exhibits two distinct modes. When given samples from a novel task, the model actually learns the pattern from the examples. We call this mode the "task learning" mode.
📢New research on mechanistic architecture design and scaling laws. - We perform the largest scaling laws analysis (500+ models, up to 7B) of beyond Transformer architectures to date - For the first time, we show that architecture performance on a set of isolated token…
AlpacaEval is now length-controlled (LC)! ✅ highest correlation with Chat Arena (0.98) ✅ no reannotation ✅ simple interpretation: win rate if model length = baseline length ✅ robust to length gamification 0.98 that’s essentially evaluation on Arena but in 3min and <$10.
I'm honored to receive the NSF CAREER Award! Our group will develop a unified theory and new algorithms with provable guarantees for learning with frozen pretrained models, also known as foundation models. Huge thanks to NSF and my amazing collaborators and students! 🥳
After two years, it is my pleasure to introduce “DROID: A Large-Scale In-the-Wild Robot Manipulation Dataset” DROID is the most diverse robotic interaction dataset ever released, including 385 hours of data collected across 564 diverse scenes in real-world households and offices
As good a time as any to say I recently graduated and joined @xai. It’s going to be an exciting year, buckle up =)
“One needs to learn to love and enjoy the little things in life. One also needs to discover one’s true calling and then should do everything to pursue the selected path,” - wise words @archit_sharma97 tribuneindia.com/news/amritsar/…
commoditize your complement gwern.net/complement
We live in such strange times. Apple, a company famous for its secrecy, published a paper with staggering amount of details on their multimodal foundation model. Those who are supposed to be open are now wayyy less than Apple. MM1 is a treasure trove of analysis. They discuss…
I’m really excited to be starting a new adventure with multiple amazing friends & colleagues. Our company is called Physical Intelligence (Pi or π, like the policy). A short thread 🧵