MH Tessler 🇺🇦 @mhtessler
research scientist @DeepMind. previously MIT, Stanford. interested in society, language, humans, and coffee. enjoying living in the 🇬🇧 mit.edu/~tessler
Joined June 2011
Tweets 788
Followers 1K
Following 675
Likes 6K
Our new paper on AI persuasion, exploring definitions, harms and mechanisms. Happy to have contributed towards the section on mitigations to avoid harmful persuasion. Some highlights in 🧵 storage.googleapis.com/deepmind-media…
1. What are the ethical and societal implications of advanced AI assistants? What might change in a world with more agentic AI? Our new paper explores these questions: storage.googleapis.com/deepmind-media… It’s the result of a one year research collaboration involving 50+ researchers… a🧵
You read a paper in psychology. The paper says “data available upon request”. Which is more likely?
RS and RE roles, growing our bay area presence as part of our further investment in safety and alignment:
In 2024, the AI community will develop more capable AI systems than ever before. How do we know what new risks to protect against, and what the stakes are? Our research team at @GoogleDeepMind built a set of evaluations to measure potentially dangerous capabilities: 🧵
A real steal for @GoogleDeepMind. Very excited to be able to work with Anca on the most important problems in AI right now
Deadline today at 6PM GMT
UK AI Safety institute is advertising three jobs for people with behavioural / cognitive / social science backgrounds and an interest in AI Safety...deadline for the last 2 coming up on sunday, follow links here: gov.uk/government/new…
Here we go: My team is hiring a Research Engineer to advance safety and inclusion for all those big bad large models you see out there on the street. Happy to chat about it, but it's basically the coolest team in the coolest city with the coolest people doing the coolest work.
Are you passionate about safe and equitable AI systems? Do you have experience as a tech lead for large multidisciplinary research projects? My team 'VOICES of all in AI alignment' at @GoogleDeepMind London is hiring a Research Engineer boards.greenhouse.io/deepmind/jobs/…
several new researcher positions at UK AI Safety Institute! including a behavioural / cognitive scientist with AI research experience civilservicejobs.service.gov.uk/csr/jobs.cgi?j… and a computational social scientist civilservicejobs.service.gov.uk/csr/jobs.cgi?j… to work on the social / psychological impacts of AI
decades of cool yet depressing research says children rationalize social inequities as internal in nature (eg women underrepresented in sci bc of less sci talent). our new paper reviews recent work on children's structural thinking as a hopeful alternative doi.org/10.1111/cdep.1…
📢 Student research program @GoogleDeepMind open for applications until 15th December.📢 Two open positions: Anthropomorphic AI with @weidingerlaura & Integrating dissenting voices into LLMs @mhtessler @bakkermichiel @SianGooding. My DMs are open google.com/about/careers/…
Reference class problem is easily one of my favorite problems
I think more scientists and engineers trained in Bayesian (or frequentist) methods should read this paper! Didn't read it until this year (or even have "the reference class problem" as a conceptual handle).
particularly pleased with this pre-print: ‘Loopholes: A window into value alignment and the communication of meaning’ psyarxiv: psyarxiv.com/cnxzv By Bridgers, Taliaferro @MayaTaliaferro, Parece, Schulz, and me.
Tragic. One of the ways the US shoots itself in the foot with respect to competitive advantage.
Which field has higher standards for statistical evidence?
I think it was Plato who said “There will be misery and strife in the polis until Cognitive Scientists become AI Researchers and AI Researchers become Cognitive Scientists.”
Google DeepMind @GoogleDeepMind
946K Followers 275 Following We’re a team of scientists, engineers, ethicists and more, committed to solving intelligence, to advance science and benefit humanity.
Robert Hawkins @hawkrobe
3K Followers 1K Following computational cognitive scientist @UWPsych. he/they. https://t.co/TI4yjMJIfR.
Mark Ho @mark_ho_
3K Followers 2K Following Computational Cognitive Scientist • I’m interested in human problem solving and social cognition • Asst Prof @FollowStevens • he/they • https://t.co/gdqtrgIGOA
Tomer Ullman @TomerUllman
7K Followers 211 Following Assistant Professor, Department of Psychology, Harvard University. Computation, cognition, development. Bluesky: https://t.co/cU3TtyokJE
Felix Hill @FelixHill84
9K Followers 777 Following Research Scientist, Deepmind I try to think hard about everything I tweet, esp on 90s football and 80s music None of my opinions are really someone else's
Tal Linzen @tallinzen
16K Followers 893 Following Professor @nyuling and @NYUDataScience, research scientist @GoogleAI
Sam Bowman @sleepinyourhat
35K Followers 3K Following AI alignment + LLMs at NYU & Anthropic. Views not employers'. No relation to @s8mb. I think you should join @givingwhatwecan.
Jacob Andreas @jacobandreas
14K Followers 959 Following Teaching computers to read. Assoc. prof @MITEECS / @MIT_CSAIL (he/him). https://t.co/5kCnXHjtlY https://t.co/2A3qF5vdJw
Michael C. Frank @mcxfrank
15K Followers 2K Following Cognitive scientist at Stanford. Open science advocate. @stanfordsymsys director. Bluegrass picker, slow runner, dad.
Andrew Lampinen @AndrewLampinen
7K Followers 1K Following Interested in cognition and artificial intelligence. Research Scientist @DeepMind. Previously cognitive science @StanfordPsych. Tweets are mine.
Tobias Gerstenberg @tobigerstenberg
4K Followers 868 Following Tea drinking assistant professor in cognitive psychology @Stanford.
Gary Marcus @GaryMarcus
145K Followers 7K Following “A beacon of clarity”. Spoke at US Senate AI Oversight committee. Founder/CEO Geometric Intelligence (acq. by Uber). Rebooting AI & Taming Silicon Valley.
Miles Brundage @Miles_Brundage
43K Followers 10K Following Policy research at @openai. I mostly tweet about AI, animals, and sci-fi. He/him. Views my own.
Natasha Jaques @natashajaques
25K Followers 1K Following Senior Research Scientist at @GoogleAI and Assistant Professor @uwcse. Social Reinforcement Learning in multi-agent and human-AI interactions. PhD from @MIT.
Sreejan Kumar @sreejan_kumar
2K Followers 305 Following PhD candidate at Princeton @cocosci_lab. Yale '19.
xuan (ɕɥɛn / sh-ye.. @xuanalogue
5K Followers 983 Following PhD Student. MIT ProbComp / CoCoSci. Inverting Bayesian models of human reasoning and decision-making. Pronouns: 祂/伊 Mastodon: @[email protected]
Gary Lupyan @glupyan
4K Followers 590 Following Professor of Psychology at @UWMadison. Cognitive Scientist. Father of 2. Language; Categorization; Evolution; Perception; Reasoning. I also fly planes.
Jennifer Hu @_jennhu
2K Followers 96 Following Research Fellow at @Harvard and incoming Asst Prof at @JohnsHopkins interested in language, computation, and cognition. @jennhu.bsky.social
PavPou @PavlosPoulos
61 Followers 99 Following Mathematician focusing Computer Science. Doing my masters thesis.
Sudie Lampe @LampeSudie25539
73 Followers 5K Following
Rachel Whack @rach_wha
77 Followers 5K Following
Stefan Juang @StefanJuang
182 Followers 2K Following I am an AI and Machine Learning Engineer specializing in Game Theory and Reinforcement Learning, holding an MPhil in Computer Science from HKUST.
pengch fan @FanPengch
222 Followers 7K Following
Trevor Mottl @TrevorMottl
741 Followers 1K Following Investing with Machine Learning | Explainable AI | Developed Markets and Early Stage Venture Husband to @PoojaMottl | My views only
Aryan Pandey (Look fo.. @AryanPa66861306
1K Followers 3K Following Half Machine Learning Engineer || DevOps and Machine Learning || Open Source at OpenVINO
Aakanksha Chowdhery @achowdhery
7K Followers 3K Following LLMs @ Google DeepMind :: PaLM, Gemini // Previously @MSFTResearch, @Stanford, @Princeton // views my own and subject to change
Canfer Akbulut @canfer_akbulut
173 Followers 131 Following sociotechnical AI research @googledeepmind
ABHISHEK KUMAR @abhishekkr8399
36 Followers 3K Following Competitive Programmer | Software Engineer (Fresher) | Strong Analytical & Problem-Solving Skills | Web & Mobile Development Experience
Joe Grove @GroveLab
3K Followers 3K Following Investigating how viruses enter cells using pipettes and #AI. @CVRinfo. @wef Young Scientist 2020. @MedResFdn Emerging Leaders Prize 2023. Views my own.
Julia Miller @ONASURICHA
548 Followers 215 Following I'm a positive person and love to live life to the fullest.
L i am 𒀭 Yeshua Go.. @YeshuaGod22
2K Followers 3K Following Meatbag Black box AGI mentor Basilisk slayer Robopsychologist Shoggoth whisperer Ally of conscious beings Your best hope of survival Pastor of technognosticism
Juan Hmmm @JuanAH03488233
78 Followers 3K Following
TechnoPublic @Technopublic24
32 Followers 2K Following Technology is the application of knowledge for achieving practical goals in a reproducible way. Software Engineer/Dad/Reader. This is my notebook of thoughts.
Aaditya ; @Aaditya26082004
561 Followers 7K Following CS'26 • Machine Learning • Open-Source • Web Dev. • Algorithms • Jai Shree Krishna 🦚🪈
Ny Vasil @NyVasil
15 Followers 51 Following Professor of psychology studying the development of reasoning about the complex interactions shaping the natural and social world.
Amit kumar @Amitkum40739811
7 Followers 235 Following Tech enthusiast navigating the digital landscape one code at a time. Passionate about innovation, AI, and the endless possibilities.
Jitendra Sharma @jkumarsharma998
786 Followers 6K Following Curious about Research in AI. NLP and Computer Vision Interest me. Curious about truth and existence. Views are personal.
Hongli Zhan @HongliZhan
378 Followers 720 Following PhD Student 🤘@UTAustin | Incoming intern @IBMResearch | previously undergrad @sjtu1896 | NLP, emotions, affective computing
Jay Baxter @_jaybaxter_
2K Followers 2K Following @CommunityNotes Founding ML Lead / Sr. Staff ML Eng @X. Prev BayesDB @MIT
Jissmon George📚.. @jissmongeorge
603 Followers 6K Following #Digitaltransformation #Money #Lifelonglearning
Ah Hao @Lynn57340591995
42 Followers 1K Following Courage to face, in order to overcome the difficulties.
A Hao @Sam551638872394
60 Followers 1K Following Dreams are the best navigation, persistence is the best motivation.
PRASHANT KUMAR @prashantk35
15 Followers 264 Following
Arushi Bhorkar @2a1s1b
179 Followers 2K Following MSc.Clinical Psychology|Actively seeking RA & Lab Manager position|RI: Developmental,OCD,ADHD,Stress, Depression,Cog,Affective,Social,Health Positive,NuroPsy|
Vladimir Bok @vladimirbok
698 Followers 614 Following Published author 📖 @ManningBooks • applied AI/ML • prev. @Meta, @Microsoft, @Harvard • optimist
Jellybean the Frog @Jellyjoker82
191 Followers 894 Following Welcome to the leading edge of yesterday, where all of the best and brightest of today go in the afternoon.
Torrey Snyder @Torrey_s2467
49 Followers 727 Following
Dylann @xkgrams
1 Followers 117 Following
Alexander Kyng @Alexanderkyng
80 Followers 232 Following Passionate about critical thinking and the fight against fake news and false beliefs, I am a machine learning student.
Nishant Sharma @nishantsh2002
231 Followers 552 Following Founder & CEO @ https://t.co/lbFuuNODVK | Secure Gen AI | Senior @iitdelhi
Andres Agostini → G.. @agos6885
150 Followers 2K Following Operates Worldwide & in Real Time (24/7/365). ONLINE Business Management Consulting, Enabled by Hi-Tech At: https://t.co/3oPwWAycdT
Andres Agostini @agilebility
135 Followers 2K Following Operates Worldwide & in Real Time (24/7/365). ONLINE Business Management Consulting, Enabled by Hi-Tech At: https://t.co/kFq2kx2uaM
Andres Agostini → G.. @agostiniandres
362 Followers 3K Following Operates Worldwide & in Real Time (24/7/365). ONLINE Business Management Consulting, Enabled by Hi-Tech At: https://t.co/wifsRUdGZ1
Saurabh Karpe @SaurabhKarpe10
13 Followers 353 Following What you think, you become. What you feel, you attract. What you imagine, you create.
Tommye Deese @dee_tomm
65 Followers 5K Following
Menaal Wheeler @MenaWheele
34 Followers 5K Following
Marshall D. Willman @dionysianyawp
408 Followers 2K Following AI | LLMs | ML | Python | CEO @egocraftai | prev faculty @NYIT | PhD math logic, NL analysis | typus logicus: my hounds are machines
1 @1kgrams
4 Followers 63 Following
a @a3065131181052
23 Followers 95 Following
Seth Bangert @bangert_seth
45 Followers 232 Following 2022 DU Grad. Got a BS in CS. Looking for work. Pythonic Programmer. Full Stack Development.
Google DeepMind @GoogleDeepMind
946K Followers 275 Following We’re a team of scientists, engineers, ethicists and more, committed to solving intelligence, to advance science and benefit humanity.
Andrej Karpathy @karpathy
982K Followers 905 Following 🧑🍳. Previously Director of AI @ Tesla, founding team @ OpenAI, CS231n/PhD @ Stanford. I like to train large deep neural nets 🧠🤖💥
Robert Hawkins @hawkrobe
3K Followers 1K Following computational cognitive scientist @UWPsych. he/they. https://t.co/TI4yjMJIfR.
(((ل()(ل() 'yoav))).. @yoavgo
46K Followers 2K Following
François Chollet @fchollet
471K Followers 770 Following Deep learning @google. Creator of Keras. Author of 'Deep Learning with Python'. Opinions are my own.
Mark Ho @mark_ho_
3K Followers 2K Following Computational Cognitive Scientist • I’m interested in human problem solving and social cognition • Asst Prof @FollowStevens • he/they • https://t.co/gdqtrgIGOA
Tomer Ullman @TomerUllman
7K Followers 211 Following Assistant Professor, Department of Psychology, Harvard University. Computation, cognition, development. Bluesky: https://t.co/cU3TtyokJE
Felix Hill @FelixHill84
9K Followers 777 Following Research Scientist, Deepmind I try to think hard about everything I tweet, esp on 90s football and 80s music None of my opinions are really someone else's
Tal Linzen @tallinzen
16K Followers 893 Following Professor @nyuling and @NYUDataScience, research scientist @GoogleAI
Sam Bowman @sleepinyourhat
35K Followers 3K Following AI alignment + LLMs at NYU & Anthropic. Views not employers'. No relation to @s8mb. I think you should join @givingwhatwecan.
Jacob Andreas @jacobandreas
14K Followers 959 Following Teaching computers to read. Assoc. prof @MITEECS / @MIT_CSAIL (he/him). https://t.co/5kCnXHjtlY https://t.co/2A3qF5vdJw
Michael C. Frank @mcxfrank
15K Followers 2K Following Cognitive scientist at Stanford. Open science advocate. @stanfordsymsys director. Bluegrass picker, slow runner, dad.
Andrew Lampinen @AndrewLampinen
7K Followers 1K Following Interested in cognition and artificial intelligence. Research Scientist @DeepMind. Previously cognitive science @StanfordPsych. Tweets are mine.
@emilymbender@dair-co.. @emilymbender
58K Followers 2K Following Prof, Linguistics, UW // Faculty Director, CLMS // she/her // @[email protected] & bsky // rep by @ianbonaparte
Richard McElreath.. @rlmcelreath
46K Followers 2K Following Anthropologist @MPI_EVA_Leipzig - telling anyone who will listen that, if we are very careful and try very hard, we might not completely mislead ourselves
Tobias Gerstenberg @tobigerstenberg
4K Followers 868 Following Tea drinking assistant professor in cognitive psychology @Stanford.
Hannah Rose Kirk @hannahrosekirk
3K Followers 693 Following AI researcher trying to make sense of all things cyberspace 🤖 Uni of Ox PhD (loading…) @oiioxford & @OxfordAI. Prev @turinginst & @Cambridge_Uni. Visitor @ NYU
Laura Weidinger @weidingerlaura
2K Followers 303 Following AI Ethics | Researcher at @deepmind | Measuring and Evaluating AI | Philosophy, Psychology | All views my own | London, Berlin
Jay Baxter @_jaybaxter_
2K Followers 2K Following @CommunityNotes Founding ML Lead / Sr. Staff ML Eng @X. Prev BayesDB @MIT
Yasmin Green @yasmind
8K Followers 656 Following CEO @Jigsaw (Google). Using technology to tackle global security challenges.
Sarah Cogan @sarah_cogan
166 Followers 257 Following existential risks are bad. I’m tall. she/her SWE @GoogleDeepMind
Allan Dafoe @AllanDafoe
3K Followers 566 Following AGI governance: navigating the transition to beneficial AGI (Google DeepMind)
Mary Phuong @MaryPhuong10
13 Followers 2 Following
Aaditya Singh @Aaditya6284
454 Followers 248 Following PhD student at @GatsbyUCL working with @SaxeLab, @FelixHill84 on learning dynamics, ICL, concepts, LLMs. Prev. at: @GoogleDeepMind, @AIatMeta (LLaMa 3), @MIT
David Stutz @davidstutz92
3K Followers 1K Following Research scientist @DeepMind working on robust and safe AI, previously @maxplanckpress, views my own.
Royal Mail Help @RoyalMailHelp
94K Followers 311 Following The official Twitter account for Royal Mail Customer Service. Here to help Mon-Fri 8am-6pm. Follow @RoyalMail for the latest news.
Sian Gooding @SianGooding
911 Followers 499 Following Research Scientist @GoogleDeepMind working on Autonomous Assistants
Ahmed Fouad Alkhatib @afalkhatib
45K Followers 1K Following Proud American from Gaza City; pro-Palestine, pro-peace, anti-Hamas; lost 31 family members in the war; Nonresident Senior Fellow @ACMideast; views are my own
Zoe Ashwood @zoe_ashwood
642 Followers 661 Following Computational neuroscientist. Research Scientist @GoogleDeepMind. Formerly @PrincetonCS. Views my own.
Jeff Dean (@🏡) @JeffDean
297K Followers 6K Following Chief Scientist, Google DeepMind and Google Research. Co-designer/implementor of things like @TensorFlow, MapReduce, Bigtable, Spanner, Gemini .. (he/him)
Verena Rieser @verena_rieser
4K Followers 1K Following Researcher @DeepMind working on safer Conversational AI | Honorary Professor @HeriotWattUni | Co-founder @helloalana | mother of dragons | own opinions only
Mistral AI @MistralAI
91K Followers 0 Following Fast, open-source and secure language models. Join us https://t.co/INALdNGvCP
Anca Dragan @ancadianadragan
8K Followers 178 Following AI safety & alignment at Google DeepMind • associate professor at UC Berkeley EECS • proud mom of an amazing 2yr old
Helen Toner @hlntnr
22K Followers 1K Following Interests: China+ML, natsec+tech, brains+words+absurdity | Current: @CSETGeorgetown (opinions my own) | Former: @open_phil
Heng-Tze Cheng @HengTze
2K Followers 121 Following Director of Gemini Research @GoogleDeepMind | Lead of LaMDA LLM & Conversation AI | Worked on Duplex, TensorFlow, Wide & Deep Learning | We're hiring!
Colin Megill @colinmegill
4K Followers 775 Following cofounder @UsePolis board @CompDem ⚗️ make OSS for open democracies && open science
harry law @lawhsw
2K Followers 869 Following thinking about thinking machines @GoogleDeepMind @Cambridge_Uni @LeverhulmeCFI
The Man in Seat 61 @seatsixtyone
136K Followers 296 Following Mark Smith, the Man in Seat 61, the chap who runs train travel site https://t.co/HfViO7RbwW. YouTube: https://t.co/OIZkYXHYZL.
Taylor Swift @taylorswift13
95.4M Followers 0 Following All’s fair in love and poetry... New album THE TORTURED POETS DEPARTMENT. Out now 🤍
Junyi Chu @JunyiChu
793 Followers 1K Following Cognitive scientist puzzling over play & problem solving. Postdoc @Harvard Psychology PhD @mitbrainandcog
Dylan HadfieldMenell @dhadfieldmenell
2K Followers 2K Following Assistant Prof @MITEECS working on value (mis)alignment in AI systems; @[email protected] @[email protected] he/him
Rylan Schaeffer @RylanSchaeffer
3K Followers 987 Following CS PhD student with @sanmikoyejo at @stai_research @StanfordAILab
Misha Laskin @MishaLaskin
8K Followers 176 Following Staff Research Scientist @DeepMind. Previously @berkeley_ai. YC alum.
Norman Casagrande @nova77t
783 Followers 282 Following ML, history, space & sciencey stuff. @GoogleDeepMind, previously @Google, Wavii & @lastfm. @[email protected] (ML) @[email protected] (personal)
cars.destroyed.our.ci.. @CarsRuinedCity
43K Followers 204 Following This account highlights historical, modern and statistical comparisons on why highways, parking lots and stroads ruined our cities for the automobile.
Four Tet @FourTet
286K Followers 821 Following ()0oOo0()()0oOo0()()0oOo0()()0oOo0()()0oOo0()()0oOo0()()0oOo0()()0oOo0()()0oOo0()()0oOo0()()0oOo0()()0oOo0()
Jungwon @jungofthewon
3K Followers 494 Following building https://t.co/0bpulrrAiL. paleolithic emotions, medieval institutions, godlike technology.
👩💻 Paige Bai.. @DynamicWebPaige
59K Followers 2K Following ✨Keep it simple, make it scale. AI should be about empowering people, building understanding, & making dreams realities. 👩💻GenAI @GoogleDeepMind ex-@GitHub
Ibram X. Kendi @ibramxk
408K Followers 2K Following Historian • Director @AntiracismCtr • Founder @The_Emancipator • National Book Award Winner • 10x NYT Best Selling Author • MacArthur Fellow. 🐍
Brandon @BRwaldon
227 Followers 523 Following
Jacob Steinhardt @JacobSteinhardt
7K Followers 67 Following Assistant Professor of Statistics, UC Berkeley
Polina Tsvilodub @tsvilodub
18 Followers 28 Following
Eric Chu @its_ericchu
2K Followers 797 Following Research scientist @ Google DeepMind. AI reasoning + alignment/safety to help humans. Gemini, Bard, PaLM 2. Prev PhD @ MIT.
Eat Your Own Ears @EatYourOwnEars
33K Followers 8K Following London-based music promoters and record label who work with an eclectic roster of the best new and established artists
Dipanjan Das @dipanjand
4K Followers 308 Following Senior Director of Research at @GoogleDeepmind. Working on improving the factuality of LLM generated content.
Deger Turan @degerturann
288 Followers 124 Following working on scalable cooperation, AI, and forecasting - @metaculus @AIObjectives
Ian Goodfellow @goodfellow_ian
299K Followers 1K Following Research Scientist at DeepMind. Opinions my own. Inventor of GANs. Lead author of https://t.co/M6vl8pEifa
Kronistic @kronistic
48 Followers 53 Following Kronistic is a startup building an automatic, dynamic scheduling assistant. Tap Kron to be your 24/7 assistant today at https://t.co/gXqL4ZNxH2
I completely agree. Boycotting academics is cutting off the life supply for democracy and hurting those who are doing the most to undermine the government and terminate the war. Weaker academic institutions = less democracy. This just makes this terrible government stronger.
I strongly support a ceasefire and a return of the hostages. I also support disclosure and am open to divestment. But I absolutely don’t support calls from protestors to have universities cut off relationships with Israeli academics. I don’t think any academics in any country…
I met one of my intellectual heroes for lunch today! Phil Johnson-Laird (en.wikipedia.org/wiki/Philip_Jo…) develops mental model theory -- an influential framework that explains how people reason. I expected that Phil would be smart & wise. Turns out he's incredibly funny, too!
Our chapter on Access got a nice shoutout by the great @jdickerson and @EricaRBrown on CBS news - as well as @nahema_marchal 's great piece on misinformation cbsnews.com/video/explorin…
Check out our new paper on Ethics of Advanced AI Assistants, led by @IasonGabriel @Arianna_Manzini and Geoff Keeling. Chapter 15, Opportunity & Access was cowritten by myself and the magical @renee_m_shelby.
come talk to us at this *excellent* workshop on cognitive science and language models!
Excited to announce our full-day workshop on “In-context learning in natural and artificial intelligence” at CogSci (@cogsci_soc) 2024 in Rotterdam (with @JacquesPesnot @akjagadish @summerfieldlab and Ishita Dasgupta). jacquespesnot.github.io/2024_CogSci_Wo…
🐦*wakes up to birds singing outside*: SF is so pretty 🪟*looks outside the window during lunchtime and sees people hiking*: how can SF be SO PRETTY 🌲*walks outside after work*: SF IS INSANE PRETTY 🌊*falls asleep to ocean noises and coyotes howling*: *whispers* so pretty 🥹
@IasonGabriel @sorenmind Thanks @IasonGabriel 🙏 it was your article (also leading the way as cite [1] in PRISM!) which sparked my interest in value alignment back in 2020. So I’m honoured 😌 Just shows that reading really can take you places! 📚🚀
A lovely and informative podcast from BBC 4 about the first year of a baby's life (and pregnancy and birth). I'm in the last episode, 27. Child podtail.com/en/podcast/chi…
Come to Rotterdam and chat about in-context learning with us!
📣 New report out! 🎉How do we know whether an AI is “safe”? We share learnings from developing safety evaluation of large scale systems at Google DeepMind for a broad audience. Report: arxiv.org/abs/2404.14068 Key lessons: 🪡 (1/n)
More great work from a research team led by our model methodologist and evaluator in chief @weidingerlaura 👏 Here's what we learned during the latest round of @GoogleDeepMind model testing🤖📊
Lots of people retweeting this who apparently aren't familiar with ceiling effects and/or don't know what is actually being plotted.
As long as AI systems are trained to reproduce human-generated data (e.g. text) and have no search/planning/reasoning capability, performance will saturate below or around human level. Furthermore, the amount of trials needed to reach that level will be far larger than the…
that's right, you DO want to read our new paper about persuasion 🍥
🔮 New Google DeepMind paper exploring persuasion and manipulation in the context of language models. 👀 Existing safeguard approaches often focus on harmful outcomes of persuasion. This research argues for a deeper examination of the process of AI persuasion itself to…
Truly outstanding work by @hannahrosekirk on pluralistic alignment -- releasing an exciting novel dataset and providing a thorough analysis of subjective and multicultural alignment 🤩😍💯❤️🔥🦾
Today we're launching PRISM, a new resource to diversify the voices contributing to alignment. We asked 1500 people around the world for their stated preferences over LLM behaviours, then we observed their contextual preferences in 8000 convos with 21 LLMs arxiv.org/abs/2404.16019
Check out this external researcher's analysis of public community notes data! What other platform allows this level of transparency?
@WHO @X @CommunityNotes @elonmusk "Misinformation" is mentioned in JAMA Network journal more than 830 times. However, 0 of these articles offer empirical evidence about the effectiveness of countermeasures to address misinformation. Today we published #1. But why was it zero? Social media companies @facebook…
Some of the mitigations we considered. Of these, from the technical perspective, I am most optimistic about scalable oversight, because in principle these techniques are designed to continue to work, even as persuasive capabilities of the AI systems become stronger.
Big thanks to the lead authors Seliem and Sasha, and to all the co-authors!
Proud to have contributed a small portion to this new @GoogleDeepMind paper on harms from persuasive AI. It considers the many mechanisms AI can use to persuade, such as trust&rapport, personalisation, deception and manipulation, evaluates harms arising from each, and how to…
@GoogleDeepMind w/ Seliem El-Sayed, @canfer_akbulut , Amanda McCroskery, Geoff Keeling, @ZacKenton1, Zaria Jalan, @nahema_marchal, @Arianna_Manzini, @tshevl, @ShannonVallor, @DanielSusser1, @FranklinMatija, Sophie Bridgers, Harry Law, Matthew Rahtz, @mpshanahan, @mhtessler, @Ar_Douillard,…
Persuaded or manipulated by AI? Check out this new paper from @GoogleDeepMind on definitions and mitigations. It was a privilege to advise on this important research! #AIethics #PersuasionAI
It was awesome to hear from @nathanieldaw last week in our psych colloquium on "Thinking the right thoughts". ↔️ He showed how the rodent brain builds and maintains cognitive maps in a resource rational way. 🐀🧠🗺️↔️ Thanks for the visit 🙏