Tweets: 582
Followers: 5K
Following: 581
Likes: 4K
Another thorny safety challenge for LLMs. Like Sleeper Agents (x.com/jayelmnop/stat…), @cem__anil has found behavior that is stubbornly resistant to finetuning. Training on MSJ shifts the intercept, but not the slope, of the relationship b/t # of shots and attack efficacy. https://t.co/PXH5qhJS4A
Language models today are trained to reason either 1) generally, imitating online reasoning data or 2) narrowly, self-teaching on their own solutions to specific tasks. Can LMs teach themselves to reason generally? 🌟 Introducing Quiet-STaR, self-teaching via internal monologue! 🧵
I too have gotten Claude 3 to vertically center a <div>
gpt4: gets most of mmlu correct
claude: gets most of mmlu correct
gemini: gets most of mmlu correct
mmlu: gets most of mmlu correct
Claude 3 Opus is great at following multiple complex instructions. To test it, @ErikSchluntz and I had it take on @karpathy's challenge to transform his 2h13m tokenizer video into a blog post, in ONE prompt, and it just... did it. Here are some details:
Today, we're announcing Claude 3, our next generation of AI models. The three state-of-the-art models—Claude 3 Opus, Claude 3 Sonnet, and Claude 3 Haiku—set new industry benchmarks across reasoning, math, coding, multilingual understanding, and vision.
Have you ever done a dense grid search over neural network hyperparameters? Like a *really dense* grid search? It looks like this (!!). Blueish colors correspond to hyperparameters for which training converges, redish colors to hyperparameters for which training diverges.
How can we check LLM outputs in domains where we are not experts? We find that non-expert humans answer questions better after reading debates between expert LLMs. Moreover, human judges are more accurate as experts get more persuasive. 📈 github.com/ucl-dark/llm_d…
Jim Fan @DrJimFan
231K Followers 3K Following @NVIDIA Sr. Research Manager & Lead of Embodied AI (GEAR Lab). Creating foundation models for Humanoid Robots & Gaming. @Stanford Ph.D. @OpenAI's first intern.
Jacob Andreas @jacobandreas
14K Followers 958 Following Teaching computers to read. Assoc. prof @MITEECS / @MIT_CSAIL (he/him). https://t.co/5kCnXHjtlY https://t.co/2A3qF5vdJw
(((ل()(ل() 'yoav))).. @yoavgo
46K Followers 2K Following
Percy Liang @percyliang
50K Followers 408 Following Associate Professor in computer science @Stanford @StanfordHAI @StanfordCRFM @StanfordAILab @stanfordnlp | cofounder @togethercompute | Pianist
Felix Hill @FelixHill84
9K Followers 777 Following Research Scientist, Deepmind I try to think hard about everything I tweet, esp on 90s football and 80s music None of my opinions are really someone else's
Rosanne Liu @savvyRL
33K Followers 969 Following Cofounded & running @ml_collective. Host of Deep Learning Classics & Trends. Research at Google DeepMind. DEI/DIA Chair of ICLR & NeurIPS. Writing https://t.co/IbycyGfnDR
Sam Bowman @sleepinyourhat
35K Followers 3K Following AI alignment + LLMs at NYU & Anthropic. Views not employers'. No relation to @s8mb. I think you should join @givingwhatwecan.
Edward Grefenstette @egrefen
36K Followers 778 Following FR/US/GB AI/ML Person, Director of Research at @GoogleDeepMind, Honorary Professor at @UCL_DARK, @ELLISforEurope Fellow. All posts are personal.
Christopher Potts @ChrisGPotts
11K Followers 620 Following Stanford Professor of Linguistics and, by courtesy, of Computer Science, and member of @stanfordnlp and @StanfordAILab. He/Him/His.
Laura Ruis @LauraRuis
3K Followers 640 Following Currently research intern @cohere, PhD supervised by @_rockt and @egrefen. Language and LLMs. Spent time at FAIR, Google, and NYU (@LakeBrenden). She/her.
Eric Jang @ericjang11
69K Followers 3K Following physical AGI at 1X. Author of "AI is Good for You" https://t.co/eFg4WXhg0p
Eugene Vinitsky @EugeneVinitsky
13K Followers 2K Following Lets make multi-agent learning easy. Anti-cynic. RS at Apple, Asst. Prof at @nyutandon. He/him. Anonymous feedback: https://t.co/Mmmg7uPm1t
Kayo Yin @kayo_yin
8K Followers 565 Following PhD student @berkeley_ai @berkeleynlp working on interpretability and signed languages. Former @msftresearch @deepmind @carnegiemellon @polytechnique. 🇫🇷🇯🇵
Horace He @cHHillee
24K Followers 449 Following Working at the intersection of ML and Systems @ PyTorch "My learning style is Horace twitter threads" - @typedfemale
rishi @RishiBommasani
4K Followers 2K Following Stanford CS PhD @StanfordCRFM @StanfordNLP @StanfordAILab @StanfordHAI Advisers: @percyliang @jurafsky Previous: @CornellCIS @clairecardie #FoundationModels
Miles Brundage @Miles_Brundage
43K Followers 10K Following Policy research at @openai. I mostly tweet about AI, animals, and sci-fi. He/him. Views my own.
Tim Rocktäschel @_rockt
29K Followers 2K Following Open-Endedness Team Lead @GoogleDeepMind, Professor of AI @AI_UCL, PI @UCL_DARK, @ELLISforEurope Scholar. ex @MetaAI (FAIR), @CompSciOxford. Opinions my own.
Andrew Lampinen @AndrewLampinen
7K Followers 1K Following Interested in cognition and artificial intelligence. Research Scientist @DeepMind. Previously cognitive science @StanfordPsych. Tweets are mine.
Stanford NLP Group @stanfordnlp
145K Followers 180 Following Computational Linguists—Natural Language—Machine Learning @chrmanning @jurafsky @percyliang @ChrisGPotts @tatsu_hashimoto @MonicaSLam @Diyi_Yang @StanfordAILab
MMM @MMM1897775
9 Followers 833 Following
Mohammed Hamdy @mhamdy_res
87 Followers 3K Following A curious explorer of human and machine learning 🧐🤝🤖
Andi Peng @TheAndiPenguin
678 Followers 387 Following PhD student @MIT_CSAIL | formerly @MSFTResearch @Yale @WHOSTP | cats are dope.
Maheep Chaudhary |.. @ChaudharyMaheep
48 Followers 532 Following MS @NTU || Collab w/ Stanford || Ex-MIT Driverless, UIUC.
Autoregressive SVD.. @RedDevil1301
128 Followers 1K Following First Year MS Student of Statistics - Data Science || Interested in NLP, Machine Learning and Data-driven-decision making|| Avid Football and F1 Follower
Janhavee Shinde @SJanhavee
64 Followers 2K Following
Sadaf Gulshad @sadafgulshad
120 Followers 523 Following Postdoc in Machine Learning and Computer Vision @ University of Amsterdam
Wangui Waweru @wanguiwaweru15
3 Followers 22 Following
$$$ @sp1d3r_8eyes
62 Followers 445 Following
Harsh Pareek @harshhpareek
728 Followers 3K Following ML @prodigaltech, ex-(@Meta|@UTAustin|@iitbombay), 1/sqrt(2) (e/acc+AINotKillEveryone)
Ross Matican @rossmatican
1K Followers 4K Following Marketing @BessemerVP | he/him 🏳️🌈✡️| Probably thinking about mindfulness, marketing, or startups/VC | Read my Substack 👇🏻
Aryan Pandey (Look fo.. @AryanPa66861306
1K Followers 3K Following Half Machine Learning Engineer || DevOps and Machine Learning || Open Source at OpenVINO
Aman Chandra @amanchandra333
61 Followers 184 Following Software Developer | Robotics Enthusiast | Potterhead
Gagan Jain @gaganjain1582
56 Followers 760 Following Research Associate @GoogleDeepMind | IIT Bombay'22
Sarah Wooders @sarahwooders
379 Followers 220 Following PhD @ucbrise @Berkeley_EECS working on systems for ML. Previously @glisten_ai (@ycombinator W20), CS/Math @MIT
esp @EspToTheFuture
1K Followers 3K Following building free & open source projects https://t.co/FbjSHFNbZ1,@dspacegame • videos @futuroptimist •🪐space,🌿plants, 🤖genai,software,hardware • 32M • loml❤️@fairyarcade 🥰
Josh Bickett @josh_bickett
7K Followers 1K Following New dad | Engineer @hyperwriteai @othersideai | On the side - experimenting with VLMs playing games
Anastasios Nikolas An.. @ml_angelopoulos
3K Followers 785 Following @Berkeley_EECS Ph.D. with Mike Jordan/Jitendra Malik. Conformal prediction, distribution-free uncertainty quantification, vision/imaging. Former @stanford_ee.
Abdi M. @scaredmonad
1K Followers 4K Following PLs, µ-compilers, type systems, λ-abstractions, trivia, thoughts @ https://t.co/x0BSfAqRpm
Ted Moskovitz @ted_moskovitz
746 Followers 193 Following PhD student at @GatsbyUCL. Formerly: intern at @DeepMind, @UberAILabs, student at @ColumbiaCompSci, @PrincetonNeuro.
Anurag Mishra @anuragm75160136
118 Followers 803 Following Building Scalable AI Applications | Senior Data Scientist @ EY | CSE Btech @ NIT MN | Linkedin: https://t.co/pCmSV6FmOe
Rohan Paul @rohanpaul_ai
14K Followers 1K Following ML Engineer (e/acc) 📌 https://t.co/x0IIWfnOt8 🚀 https://t.co/QEO4CKRl1b Open LLMs is Happiness 💡 Ex Deutsche & HSBC. DM for collaboration.
Liangyu Chen @cliangyu_
526 Followers 1K Following
Haoyi Fu @Haoyi_Fu
12 Followers 60 Following
Pankaj Gupta @pankaj_ipynb
32 Followers 920 Following The English language can not fully capture the depth and complexity of my thoughts. So I'm incorporating Emoji into my speech to better express myself 😉.
teduzx @zhixuan_long
2 Followers 171 Following
Justin Friesen @justin__friesen
14 Followers 39 Following Just trying to learn about businesses and talk about cool stuff
Agent Columbus @AgentColumbus
14 Followers 64 Following
Ivelina Petrova @ivelinapetrovaX
31 Followers 785 Following Industrial Management and development Master Degree and Architect in Architecture [email protected] [email protected] [email protected]
Harsh Desai @dreamerharsh
1 Followers 3K Following
Jonathan hind @import_hind
69 Followers 963 Following
Jerry Hellden @Yariv_hellden
765 Followers 7K Following Founder @ Andromedaerospace, Experimental Aerospace engineer, Theoretical physicist, Quantum cosmologist, Futurist, Reverse engineer, Designer 🌌
Jim Fan @DrJimFan
231K Followers 3K Following @NVIDIA Sr. Research Manager & Lead of Embodied AI (GEAR Lab). Creating foundation models for Humanoid Robots & Gaming. @Stanford Ph.D. @OpenAI's first intern.
Jacob Andreas @jacobandreas
14K Followers 958 Following Teaching computers to read. Assoc. prof @MITEECS / @MIT_CSAIL (he/him). https://t.co/5kCnXHjtlY https://t.co/2A3qF5vdJw
(((ل()(ل() 'yoav))).. @yoavgo
46K Followers 2K Following
Percy Liang @percyliang
50K Followers 408 Following Associate Professor in computer science @Stanford @StanfordHAI @StanfordCRFM @StanfordAILab @stanfordnlp | cofounder @togethercompute | Pianist
Sasha Rush @srush_nlp
52K Followers 465 Following Professor, Programmer in NYC. Cornell Tech, Hugging Face 🤗 https://t.co/cZl0wTfqGz
Natasha Jaques @natashajaques
25K Followers 1K Following Senior Research Scientist at @GoogleAI and Assistant Professor @uwcse. Social Reinforcement Learning in multi-agent and human-AI interactions. PhD from @MIT.
Felix Hill @FelixHill84
9K Followers 777 Following Research Scientist, Deepmind I try to think hard about everything I tweet, esp on 90s football and 80s music None of my opinions are really someone else's
Christopher Manning @chrmanning
127K Followers 116 Following Director, @StanfordAILab. Assoc. Director, @StanfordHAI. Founder, @stanfordnlp. Prof. CS & Linguistics, @Stanford. IP @aixventureshq. 🇦🇺 Do #NLProc & #AI. 👋
Rosanne Liu @savvyRL
33K Followers 969 Following Cofounded & running @ml_collective. Host of Deep Learning Classics & Trends. Research at Google DeepMind. DEI/DIA Chair of ICLR & NeurIPS. Writing https://t.co/IbycyGfnDR
Sam Bowman @sleepinyourhat
35K Followers 3K Following AI alignment + LLMs at NYU & Anthropic. Views not employers'. No relation to @s8mb. I think you should join @givingwhatwecan.
Kyunghyun Cho @kchonyc
61K Followers 2K Following a combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre PC at @nyuniversity (@CILVRatNYU) & @genentech (@PrescientDesign).
Edward Grefenstette @egrefen
36K Followers 778 Following FR/US/GB AI/ML Person, Director of Research at @GoogleDeepMind, Honorary Professor at @UCL_DARK, @ELLISforEurope Fellow. All posts are personal.
Christopher Potts @ChrisGPotts
11K Followers 620 Following Stanford Professor of Linguistics and, by courtesy, of Computer Science, and member of @stanfordnlp and @StanfordAILab. He/Him/His.
Laura Ruis @LauraRuis
3K Followers 640 Following Currently research intern @cohere, PhD supervised by @_rockt and @egrefen. Language and LLMs. Spent time at FAIR, Google, and NYU (@LakeBrenden). She/her.
Eric Jang @ericjang11
69K Followers 3K Following physical AGI at 1X. Author of "AI is Good for You" https://t.co/eFg4WXhg0p
Eugene Vinitsky @EugeneVinitsky
13K Followers 2K Following Lets make multi-agent learning easy. Anti-cynic. RS at Apple, Asst. Prof at @nyutandon. He/him. Anonymous feedback: https://t.co/Mmmg7uPm1t
Tal Linzen @tallinzen
16K Followers 893 Following Professor @nyuling and @NYUDataScience, research scientist @GoogleAI
Yoav Artzi @yoavartzi
13K Followers 163 Following Research/prof @cs_cornell + @cornell_tech🚡 / https://t.co/9YnWry7yHs / https://t.co/3VmRSyYm2d / asso. faculty director @arxiv / building https://t.co/f9QkzO5kaC
Andi Peng @TheAndiPenguin
678 Followers 387 Following PhD student @MIT_CSAIL | formerly @MSFTResearch @Yale @WHOSTP | cats are dope.
Eric Hambro @erichammy
535 Followers 1K Following member of technical staff @AnthropicAI formerly FAIR @MetaAI @Bloomberg @UCL @Cambridge_Uni @recursecenter opinions, regrettably, mine
david rein @idavidrein
2K Followers 983 Following Sentio ergo sum. AI alignment research at NYU, early employee @cohere
Emmanuel Ameisen @mlpowered
7K Followers 211 Following Research Engineer @AnthropicAI Previously: Staff ML Engineer @stripe, Wrote BMLPA by @OReillyMedia, Head of AI at @InsightFellows, ML @Zipcar
Orowa Sikder @OrowaSikder
1K Followers 304 Following the future could be amazing. let’s get to work | Research @AnthropicAI, ex: PhD @UCLCS
Anton Bakhtin @ SF @anton_bakhtin
2K Followers 126 Following MTS at @AnthropicAI, Ex @MetaAI, Ex @Google Three logicians walk into a bar ...
Chenlin Meng @chenlin_meng
8K Followers 836 Following Co-founder & CTO @pika_labs | ex @StanfordAILab @Stanford
Aaron Begg @aaron_begg
2K Followers 1K Following Community at @AnthropicAI | Chat with Claude: https://t.co/7w2gEKteuC | Build with Claude: https://t.co/ktsbQNA9D2
PatronusAI @PatronusAI
993 Followers 308 Following Automated evaluation for LLMs 🦄 Boost your confidence in generative AI ✨
Fei Xia @xf1280
6K Followers 696 Following Research Scientist at @GoogleDeepMind, Robot Learning, Computer Vision. PhD from @StanfordAILab @StanfordSVL, previously @Tsinghua_Uni. #AGI through Embodiment
Jascha Sohl-Dickstein @jaschasd
19K Followers 626 Following Member of the technical staff @ Anthropic. Most (in)famous for inventing diffusion models. AI + physics + neuroscience + dynamics.
John Thickstun @jwthickstun
1K Followers 536 Following Postdoc at Stanford. @StanfordCRFM @StanfordNLP @StanfordAILab Previous @uwcse @uw_wail Controllable Generative Models. AI for Music.
Joy He-Yueya @JoyHeYueya
73 Followers 71 Following CS PhD student working on AI for education @StanfordAILab
mrinank ⭐️ learni.. @MrinankSharma
834 Followers 443 Following alignment, poetry, soulmaking, devotion "live to the point of tears", camus
Dylan HadfieldMenell @dhadfieldmenell
2K Followers 2K Following Assistant Prof @MITEECS working on value (mis)alignment in AI systems; @[email protected] @[email protected] he/him
rohit @krishnanrohit
19K Followers 2K Following Building God at https://t.co/frWeoc7IVB - buy the book, it makes me happy! | essays weekly at https://t.co/TbCaC6VaaM
David Duvenaud @DavidDuvenaud
28K Followers 3K Following Machine learning prof @UofT. Working on generative models, inference, & latent structure.
jessica dai @jessicadai_
2K Followers 679 Following phd student @berkeley_ai !? also editorial @reboot_hq @kernel_magazine (she/her)
Dan Roy @roydanroy
45K Followers 2K Following ML / AI researcher, emphasis on theory. Research Director and Canada CIFAR AI Chair, @VectorInst Professor, @UofT (Statistics/CS)
Roger Grosse @RogerGrosse
10K Followers 751 Following
Gabe Grand @gabe_grand
954 Followers 281 Following Computation 🤖 & cognition 🧠 PhD student @MIT CSAIL
Katherine Lee @katherine1ee
6K Followers 933 Following understanding ourselves and our models. senior research scientist @GoogleBrain, @genlawcenter and @CornellCIS, formerly @Princeton @[email protected]
Prithviraj (Raj) Amma.. @rajammanabrolu
5K Followers 521 Following Interactive & grounded AI, RL, NLP. Assistant Prof @UCSanDiego. Research Scientist @DbrxMosaicAI. Prev: @allen_ai, @GeorgiaTech
Cem Anil @cem__anil
2K Followers 1K Following Machine learning / AI Safety at @AnthropicAI and University of Toronto / Vector Institute. Prev. student researcher @google (Blueshift Team) and @nvidia.
Atoosa Kasirzadeh @Dr_Atoosa
3K Followers 2K Following societal impacts of AI | asst Prof @EdinburghUni | research lead @CentreTMFutures, @turinginst | @GovAI_ fellow
Xindi Wu @cindy_x_wu
955 Followers 809 Following PhD student @PrincetonCS | Data-centric multimodal ml | prev @RealityLabs @roboVisionCMU @CMU_Robotics @Snapchat
Sasha Sheng 🫶🏼 @hackgoofer
4K Followers 2K Following Builder, Dancer; @aiengfoundation & on a mission to help people be well. Lover of hackathons and updating my beliefs. Staying grounded. Prev: @MetaAI
Frieda Rong @frieda_rong
331 Followers 957 Following CS PhD @Stanford, formerly 🚗 @UberATG, 🎓@UWaterloo.
Tristan Hume @trishume
6K Followers 330 Following Performance optimization lead @AnthropicAI. Profiling, distributed systems, dev tools, interpretability. [email protected]
jack morris @jxmnop
11K Followers 772 Following getting my phd in nlp @cornell_tech 🚠 // academic optimist // tweeting from the snack aisle at trader joes
Preetum Nakkiran @PreetumNakkiran
10K Followers 2K Following ML research @Apple. @sh_reya’s fiancé | PhD @Harvard, postdoc @UCSanDiego, EECS @Berkeley_EECS, "AI" @OpenAI, @GoogleAI
Rafael Rafailov @rm_rafailov
4K Followers 642 Following Ph.D. Student at @StanfordAILab. I work on Foundation Models and Decision Making. Previously @GoogleDeepMind @UCBerkeley
Amanda Askell @AmandaAskell
26K Followers 655 Following Philosopher & ethicist teaching models to be good @AnthropicAI. Personal account. All opinions come from my training data.
shreya rajpal @ShreyaR
6K Followers 775 Following ML, systems, and everything in between. Building @guardrails_ai. Previously founding eng @predibase, @Apple SPG, @driveai_, @IllinoisCS, @iitdelhi.
@StephenLCasper I think you might be wrong, but for the opposite reason to others. Single humans are not unified principle agents that can be aligned to, either. Cas this morning, Cas tonight, Cas near donuts vs salad, etc. A human has incoherent goals. Just like a society.
Introducing LGA (Language-Guided Abstraction) at ICLR 2024! 🧵 📰 Paper: rb.gy/89268y 🌐 Website: rb.gy/10thlm 🗞️ MIT News: rb.gy/7ske0y State abstraction is key to generalizable learning, but how do we know which features are task-relevant?
I'm really excited to share what I've been working on for the last few months. The official Claude app for iOS! Really proud of what we've built as well as what's to come. You can check it out here apps.apple.com/us/app/claude/…
What do great teachers do to be good at teaching? What can this teach us about LMs? Most of us experience the “front stage” of teaching—as students. Few see the *back stage*: the planning, pedagogical decisions… 🌉 Bridge, at NAACL’24, surfaces these hidden decisions🧵
The Claude iOS app has arrived. The power of frontier intelligence is now in your back pocket. Download now on the App Store: apps.apple.com/us/app/claude/…
I will present my thesis defense tomorrow! Language Agents: From Next-Token Prediction to Digital Automation - 10am EST on Thursday, May 2 - princeton.zoom.us/my/shunyuy - WebShop, ReAct, ToT, CoALA - Briefly: SWE-bench/agent - Thoughts on the future of language agents
Happy to share that I’ve been working at @pika_labs as a PM intern for the past 4 months! I was the PM behind lip sync, sound effects, styles, and Pikaboo 💛 Lmk if you’re also interested in AI x Creativity or have feature requests for Pika - would love to talk!
Life update: today is my first day as a Member of Technical Staff at @cohere!
It’s important to remember LLM capability is bounded by the skill of the humans who train them. The only reason ChatGPT can identify common, short strings given their MD5 or SHA1 hashes is because that’s a completely ordinary talent that many humans have.
Those of you who think AI can produce no stroke of genius, what human, pray, in the last 350 years of this portrait's existence conceived of such a refreshing elaboration?
We have finally done it. After all this time and due to countless requests from our users, we've shipped what I think is our most important and revolutionary feature yet. You can now interrupt Claude's yapping with our new stop generation button!
@alexalbert__ I hope @AnthropicAI realizes how much value you are contributing by making these updates relatable and being the voice for the community. keeps anthropic at top of mind a lot more between model updates.
Some personal updates: I joined OpenAI a few months ago, working on all things robustness/safety/privacy. Also, we are working to publish more of our safety work. See my first project here below, where we make initial progress on prompt injections and other attacks!
Introducing the Instruction Hierarchy, our latest safety research to advance robustness for prompt injections and other ways of tricking LLMs into executing unsafe actions. More details: arxiv.org/abs/2404.13208
✨🎓 I defended my dissertation “The Relationship between Linguistic Representations in Biological and Artificial Neural Networks” on Tuesday! 🎓✨ Incredibly grateful for my amazing PhD advisor @ev_fedorenko and a wonderful journey at @mitbrainandcog! 🧠🤖
In absolute awe at these old MIT course posters designed by Dietmar Winkler in the 60's:
@mattshumer_ Hey Matt, appreciate you bringing this to our attention. We haven't modified any of the Claude 3 models since we launched them. On claude.ai, there's currently two layers that may contribute to perceived model performance: our T&S measures (standard mechanisms…
Not everyone who dies gets to come back and tell their story, but thankfully Freddy did. A reminder to hold the people you love a little closer tonight, that medicine and health are the greatest gifts, and that at the end of the day, we're all patients. jamanetwork.com/journals/jaman…
Another gem to remember! The Mondrian Process papers.nips.cc/paper_files/pa…
New work on the Battleship Game accepted to CogSci '24! ⚓️🧠 How do people pose informative, grounded questions in uncertain environments? And how can we build machines that ask human-like questions? arxiv.org/abs/2402.19471 🧵 (1 / n)