Paul Röttger @paul_rottger
Postdoc @MilaNLProc, working on evaluating and improving LLM safety. Previously PhD @oiioxford & CTO/co-founder @rewire_online paulrottger.com Joined July 2020
Tweets 274
Followers 2K
Following 455
Likes 2K
If you are working on AI alignment, you should really check out PRISM. It is hard to overstate how rich and exciting this dataset is. What a great week to be a co-author of @hannahrosekirk!
Personalised LLMs are great, but should there be limits to personalisation? If so, who should set these limits? For answers to these questions and more, check out our paper on the risks and benefits of personalising LLMs, led by @hannahrosekirk 👇 out in @NatMachIntell today! https://t.co/6cwwb8aoLF
New paper at #NAACL2024 🥳 We present GAHD, an 11k German Adversarial Hate speech Dataset 📜 and show that mixing annotator support strategies for finding adv. examples leads to a more effective dataset! Great collab with @paul_rottger and @center_text! Highlights below ⬇️
How many safety examples do #LLMs need? What examples are most useful? Why is it unethical to kill Python processes?🤯 Our new #ICLR2024 paper studies these + more! openreview.net/pdf?id=gT5hALc… We analyze the safety/utility tradeoff (100s safe demos suffice) and exaggerated safety. Great…
This is very good, careful, and considerate work. Come for the headline results, stay for the many details!
Hannah Rose Kirk @hannahrosekirk
3K Followers 689 Following AI researcher trying to make sense of all things cyberspace 🤖 Uni of Ox PhD (loading…) @oiioxford & @OxfordAI. Prev @turinginst & @Cambridge_Uni. Visitor @ NYU
Felix M. Simon @_FelixSimon_
5K Followers 1K Following Leverhulme Scholar @oiioxford | AI in news and journalism & Misinfo | Research Assistant @risj_oxford | Fellow @TowCenter | Affiliate @unc_citap | My views etc…
Nayana Prakash @NayanaPrakash1
1K Followers 1K Following PhDing @oiioxford ✍🏼 // podcasting @skeptechs 🎙️// writing about India, gender, storytelling, postcolonialism, Internet. loves dogs. unusually tall 💅
Zeerak@{mastodon,bsky.. @ZeerakTalat
3K Followers 3K Following Past: @SFU_DDI @SheffieldNLP @CoastalCPH. Reluctant Machine Learner. Researching hate online & AI ethics. Organising @woahworkshop.
Dr. Mona Elswah من.. @monaelswah
2K Followers 2K Following I research tech stuff @CenDemTech @oiioxford @OxDemTech @CarrCenter, @AUC @CairoUniv
Oxford Internet Insti.. @oiioxford
50K Followers 2K Following The Oxford Internet Institute (OII) is a multidisciplinary research and teaching dept at University of Oxford, dedicated to the social science of the Internet.
Luca Soldaini 🎀 @ .. @soldni
6K Followers 1K Following I like tokens! Lead for OLMo data team at @allen_ai (Dolma 🍇), OSS is fun, @QueerInAI organizer 🤖☕️🍕they/them (views mine, not my employer’s)
Anne Lauscher (she/he.. @anne_lauscher
1K Followers 477 Following Ethical AI, Computational Argumentation, Conversational AI Assoc. Professor of Data Science @unihh Previously @MilaNLProc @dwsunima @allen_ai @grammarly
Dirk Hovy @dirk_hovy
9K Followers 1K Following Prof @Unibocconi @MilaNLProc: #NLProc, compsocsci, #ML. #ERCStG INTEGRATOR Python NLProc books: https://t.co/vXm98Ns3D7 https://t.co/Ll7E6PDOKE
Dr. Nahema Marchal @nahema_marchal
2K Followers 1K Following research scientist @deepmind | 🔎 tech governance, online harms, socio-technical ai | previously @prodigi_erc @oiioxford | flibbertigibbet, views her own
Shaily @shaily99
5K Followers 2K Following PhD @LTIatCMU Prev: @GoogleAI @MSFTResearch. Working on ethics and evaluation in #NLProc. Usually ranting, often about research & DEI. 📚 @readsndrants
skeptechs @skeptechs
633 Followers 398 Following Spotify award-winning podcast hosted by Dr @JoshCowls & @NayanaPrakash1 featuring tech news and research from @oiioxford students. Views all our own
Verena Rieser @verena_rieser
4K Followers 1K Following Researcher @DeepMind working on safer Conversational AI | Honorary Professor @HeriotWattUni | Co-founder @helloalana | mother of dragons | own opinions only
Manoel @manoelribeiro
3K Followers 1K Following CS PhD student @ EPFL — On the job market for 2023-2024! 🐘: @[email protected] Keywords: Computational Social Science, Platforms, Communities, Moderation
MilaNLP @MilaNLProc
4K Followers 447 Following The Milan Natural Language Processing Group #NLProc #ML #AI
Vivek Gupta @keviv9
2K Followers 5K Following PostDoc @cogcomp UPenn | Ph.D. CS @UUtah | @iitkanpur. @Bloomberg & @MSFTResearch Fellow | ex-@MetaAI, @IBM, @Verisk, @samsungresearch, @Synopsys #nlp #ml
Kathryn Eccles @KathrynEccles
2K Followers 2K Following Associate Professor and Senior Research Fellow, Oxford Internet Institute and Pembroke College, University of Oxford
Cassidy Bereskin @cassidybereskin
6K Followers 1K Following PhD-ing @UniofOxford | Director @OxGenAI | [email protected]
Preslav Nakov @preslav_nakov
2K Followers 2K Following Professor, MBZUAI LLMs, Jais-chat, "fake news", disinformation, propaganda, media bias Past: UC Berkeley, NUS, BAS, SU
Jekanyika| @tedoex@te.. @tedoex
2K Followers 1K Following Gender and Technology Researcher• PhDing @UniofOxford,@oiioxford• Views=mine• Sometimes Writer/Poet•TEDx2016 Speaker• Founder:@tedoexmedia
Myah Dedaj @dedaj_my
89 Followers 5K Following
Salvatore Greco @_salvatoregreco
107 Followers 495 Following PhD Student | Explainable Artificial Intelligence and Natural Language Processing at Politecnico di Torino
Twin 2 @danielr4s
43 Followers 502 Following
Jacek (Jomsborg.eth) @timelessdev
1K Followers 5K Following The DAO investor. Early @Aleph__zero inv. Decentralization. Born on Vikings island called Jomsborg. Applied math. My posts are not financial advise.
Clare Schmiesing @cl_schmies
80 Followers 5K Following
huansong @huansong514
8 Followers 173 Following
Lj Miranda @ljvmiranda
780 Followers 431 Following processing language naturally • predoc at @allen_ai
Iulia Cioroianu @IuliaCioroianu
686 Followers 2K Following Senior Lecturer (Associate Professor) @UniofBath. Computational social science, social media, campaigns and elections. Ph.D. New York University
liuyong @forrestbing
281 Followers 5K Following I am a researcher in AIGC, Multi-modality and VitrualHuman tech direction
Ece Takmaz @ecekt2
887 Followers 2K Following Postdoc at @UniUtrecht, previously PhD candidate at @UvA_Amsterdam
Torrey Snyder @Torrey_s2467
47 Followers 726 Following
Jaap Jumelet @JumeletJ
492 Followers 315 Following PhD candidate at UvA with @wzuidema NLP ∩ Interpretability ∩ Linguistics PhD in 5 words: Finding structure in language models
et al. @basujindal
154 Followers 1K Following Grad @UCSanDiego | @iitmadras | https://t.co/NbicDWiTMT
Song Yifan @gooffanita
14 Followers 44 Following ling student @UniofOxford; interested in language, speech, gender and tech.
Capybara ai @capybara_ai
49 Followers 505 Following Capybara doing PhD@TsinghuaCS, checkout my blog @ https://t.co/2Iz05C84xd. Interested in Reinforcement Learning, LLM-based Agents, Alignment.
Oskar van der Wal @oskarvanderwal
298 Followers 350 Following PhD student social bias NLP @AmsterdamNLP @UvA_Amsterdam @AiEleuther Supervisors @wzuidema & Katrin Schulz 🟦@ovdw.bsky.social 🐘@[email protected]
Sayash Kapoor @sayashk
5K Followers 1K Following CS PhD candidate @PrincetonCITP. I study the societal impact of AI. Currently writing a book on AI Snake Oil: https://t.co/tb2lXSP2gB
Daniel Hershcovich @daniel_hers
877 Followers 2K Following Assistant Professor @coastalcph @DIKU_Institut, University of Copenhagen. Interests: meaning, multiculturalism and promoting responsible behavior with #NLProc
Scott Hale @computermacgyve
2K Followers 1K Following Associate Prof @oiioxford, Director of Research @meedan, Fellow @turinginst.・widening access to quality info・multilingualism・mobilization・NLP・agenda setting
Yan Meng @vivian_yanmy
81 Followers 185 Following PhD student at Language Technology Lab, University of Amsterdam
Andreas Kirsch .. @BlackHC
9K Followers 5K Following Past: 🧑🎓 DPhil @AIMS_oxford @ExeterCollegeOx @UniofOxford (4.5yr) 🧙♂️ RE @DeepMind (1yr) 📺 SWE @Google (3yrs) 🎓 @TU_Muenchen 👤 Fellow @nwspk
Mimansa Jaiswal @MimansaJ
1K Followers 3K Following MoTS @normativeai. Ex @UMichCSE, 2x @MetaAI, @allen_ai | Speech & NLP | Robustness, Data & Annotations, Evaluation & Interpretability in LLMs
Hunar Batra @hunarbatra
2K Followers 2K Following DPhil/MSc CS @UniOfOxford | prev: aligning LLMs @AnthropicAI @NYUDataScience @stanford | Tweets are my own, not through a bot 🤖
ByeRose @byerose365
0 Followers 520 Following
Jiaxing Cui @cuijiaxing
13 Followers 119 Following
Eva Maria Vecchi @emvecchi
175 Followers 373 Following NLP Researcher @ims_stuttgart & @Cambridge_NLP Argument Mining, e-Deliberation, Bias, Meaning representations, Cognitive Modeling, #NLProc methodology
Congfeng Cao @Congfeng_Cao
11 Followers 73 Following NLP, Graph Learning CV, Remote Sensing PhD student @UvA_Amsterdam @illc_amsterdam
Joan @Joanvelja
24 Followers 149 Following Container of multitudes | MS AI @ University of Amsterdam
Luisa Fernanda Isaza @luisaza
2K Followers 3K Following I am a bunch of human things mixed with holy things | Free speech - Digital Tech - Human Rights https://t.co/dvYvixLssL
Minje Choi @minje__choi
370 Followers 557 Following Postdoctoral fellow @ICatGT | PhD @UMSI | Prev. intern at @BellLabs, @Snap & @Visa | Interested in #NLProc #CompSocialScience #socialnetworks #misinformation
oxford.games @GamesOxford
81 Followers 89 Following Bringing together students, academics, and industry professionals to discuss research initiatives and future directions for inclusive gaming 🎮👾
j soma @dangerscarf
3K Followers 2K Following data head, python kid, prof @columbiajourn, director @ledeprog, co-founder @bkbrains + @catrepublicbk. collecting all cats, widening all gyres
Ryan Hauser @R__Hauser
2K Followers 4K Following Editor, https://t.co/2vFebqEBeg | HPS/STS + Political Economy | Grad Student @STS_VT | Usual caveats.
Ana @alekseevaas21
44 Followers 153 Following Master's student at the University of Tübingen. I wanted to be an artist, but have ended up with political science, machine learning and Spätzle.
Bóxī Wú 吴泊曦 @boxiwu_
490 Followers 1K Following AI ethics & safety @googledeepmind / Politics of AI infrastructure @oiioxford @uniofoxford / organising + climate justice with ESEA Green Lions 🐉💚 they/them
Pavel Goldman Kalaidi.. @facultyofwonder
399 Followers 777 Following Head of AI at a startup, borzoi dogs owner, language enthusiast (en-ru-de-jp-he) #IStandWithIsrael
testuser @testuser12331
11 Followers 51 Following
Saurav Sahay @sauravsahay
507 Followers 1K Following Research Science Manager, Multimodal Dialogue and Interactions at Intel Labs
🅱 @b_douik
2 Followers 70 Following
smith jane76 @AdebayoM44392
3 Followers 35 Following
Mikel K. Ngueajio @KNew_Mikel
241 Followers 82 Following Machine Learning Research Scholar @Apple Siri. Ph.D. student in computer science at Howard University. Research assistant for the Affective Biometric Lab.
Hannah Rose Kirk @hannahrosekirk
3K Followers 689 Following AI researcher trying to make sense of all things cyberspace 🤖 Uni of Ox PhD (loading…) @oiioxford & @OxfordAI. Prev @turinginst & @Cambridge_Uni. Visitor @ NYU
MMitchell @mmitchell_ai
80K Followers 1K Following Interdisciplinary researcher focused on shaping AI towards long-term positive goals. ML & Ethics. Same content in the Sky, Threads, & the Prehistoric Elephant
Felix M. Simon @_FelixSimon_
5K Followers 1K Following Leverhulme Scholar @oiioxford | AI in news and journalism & Misinfo | Research Assistant @risj_oxford | Fellow @TowCenter | Affiliate @unc_citap | My views etc…
Nayana Prakash @NayanaPrakash1
1K Followers 1K Following PhDing @oiioxford ✍🏼 // podcasting @skeptechs 🎙️// writing about India, gender, storytelling, postcolonialism, Internet. loves dogs. unusually tall 💅
EMNLP 2024 @emnlpmeeting
12K Followers 41 Following EMNLP 2024 - The 2024 Conference on Empirical Methods in Natural Language Processing, November 12 –16, 2024 Hashtag: #EMNLP2024
Zeerak@{mastodon,bsky.. @ZeerakTalat
3K Followers 3K Following Past: @SFU_DDI @SheffieldNLP @CoastalCPH. Reluctant Machine Learner. Researching hate online & AI ethics. Organising @woahworkshop.
@emilymbender@dair-co.. @emilymbender
58K Followers 2K Following Prof, Linguistics, UW // Faculty Director, CLMS // she/her // @[email protected] & bsky // rep by @ianbonaparte
Abeba Birhane @Abebab
53K Followers 2K Following Senior Advisor, AI Accountability @Mozilla |Cognitive science PhD |Adjunct prof @tcddublinscss, @tcddublin |Ethiopian in Ireland |She/her @abeba.bsky.social
Leon Derczynski ✍.. @LeonDerczynski
6K Followers 1K Following NLP/ML/language/security. Principal research scientist @NVIDIA, & Prof @ITUkbh. Views ostensibly professional. llmsec stan acct
Barbara Plank @barbara_plank
9K Followers 1K Following Prof, Chair for AI & Computational Linguistics @LMU_Muenchen, Head of @MaiNLPlab & co-director @CisLMU Prof at @ITUkbh @NLPnorth @ELLISforEurope scholar
(((ل()(ل() 'yoav))).. @yoavgo
46K Followers 2K Following
Oxford Internet Insti.. @oiioxford
50K Followers 2K Following The Oxford Internet Institute (OII) is a multidisciplinary research and teaching dept at University of Oxford, dedicated to the social science of the Internet.
Luca Soldaini 🎀 @ .. @soldni
6K Followers 1K Following I like tokens! Lead for OLMo data team at @allen_ai (Dolma 🍇), OSS is fun, @QueerInAI organizer 🤖☕️🍕they/them (views mine, not my employer’s)
Google DeepMind @GoogleDeepMind
945K Followers 275 Following We’re a team of scientists, engineers, ethicists and more, committed to solving intelligence, to advance science and benefit humanity.
Anne Lauscher (she/he.. @anne_lauscher
1K Followers 477 Following Ethical AI, Computational Argumentation, Conversational AI Assoc. Professor of Data Science @unihh Previously @MilaNLProc @dwsunima @allen_ai @grammarly
Dirk Hovy @dirk_hovy
9K Followers 1K Following Prof @Unibocconi @MilaNLProc: #NLProc, compsocsci, #ML. #ERCStG INTEGRATOR Python NLProc books: https://t.co/vXm98Ns3D7 https://t.co/Ll7E6PDOKE
Dr. Nahema Marchal @nahema_marchal
2K Followers 1K Following research scientist @deepmind | 🔎 tech governance, online harms, socio-technical ai | previously @prodigi_erc @oiioxford | flibbertigibbet, views her own
Sasha Rush @srush_nlp
52K Followers 464 Following Professor, Programmer in NYC. Cornell Tech, Hugging Face 🤗 https://t.co/cZl0wTfqGz
Sachin Kumar @shocheen
955 Followers 636 Following Incoming Asst. Prof. at @OhioStateCSE ('24). Postdoc at @allen_ai. Visiting @UWNLP. Ph.D. from @LTICMU. He/Him. Taking new students this cycle, reach out!
Sandro Pezzelle @sandropezzelle
806 Followers 668 Following Assistant Professor at the University of Amsterdam. #NLProc #AI #CogSci #interpretability
Nicholas Meade @ncmeade
138 Followers 150 Following PhD student at @McGillU / @Mila_Quebec; Interested in #NLProc.
Daniel Hershcovich @daniel_hers
877 Followers 2K Following Assistant Professor @coastalcph @DIKU_Institut, University of Copenhagen. Interests: meaning, multiculturalism and promoting responsible behavior with #NLProc
Wei-Lin Chiang @infwinston
3K Followers 853 Following CS PhD student at UC Berkeley. co-lead of Chatbot Arena @lmsysorg
Minje Choi @minje__choi
370 Followers 557 Following Postdoctoral fellow @ICatGT | PhD @UMSI | Prev. intern at @BellLabs, @Snap & @Visa | Interested in #NLProc #CompSocialScience #socialnetworks #misinformation
Nino Scherrer @ninoscherrer
594 Followers 2K Following Interested in rigorous LLM evals, synthetic data, causality, robustness & AI safety/ethics | RS @PatronusAI | Prev: @VectorInst, @Mila_Quebec, @MPI_IS, @ETH_en
Canyu Chen @CanyuChen3
852 Followers 2K Following CS Ph.D. student @illinoistech | Truthful, Safe and Responsible LLMs | LLMs Meet Misinformation: https://t.co/up5sEN5r1g
Raquel Fernández @raquel_dmg
2K Followers 167 Following Professor at the ILLC in Amsterdam, head of the Dialogue Modelling Group, research on linguistic interaction, visual grounding & semantics/pragmatics
Fangru Lin @FangruLin99
203 Followers 196 Following DPhil NLP student @UniofOxford; Clarendon Scholar; Ex SDE intern @Microsoft; Computational Linguist; know how and know why
Faeze Brahman @faeze_brh
2K Followers 1K Following Postdoc @allen_ai @ai2_mosaic @uw | Ph.D. from UCSC | Former Intern @MSFTResearch , @allen_ai | Researcher in #NLProc, #ML
James Zou @james_y_zou
10K Followers 59 Following @Stanford professor. Chan-Zuckerberg investigator. Sloan Fellow. AI for biotech + health. Making AI more trustworthy, reliable and human compatible.
Lorenzo Lupo @lorelupo
34 Followers 50 Following processing natural language to tackle social and economic challenges @Unibocconi @MilaNLProc
lmsys.org @lmsysorg
38K Followers 173 Following Large Model Systems Organization. We created Vicuna and Chatbot Arena! Compare 30+ LLMs (GPT-4/Claude/Llamas) side-by-side at https://t.co/IDFeIDIOtm
Bill Yuchen Lin 🤖 @billyuchenlin
6K Followers 2K Following Research @allen_ai. I evaluate (multi-modal) LLMs, build agents, and study the science of LLMs. Previously: @GoogleAI & @MetaAI FAIR @nlp_usc
Will Held @WilliamBarrHeld
1K Followers 792 Following Modeling Linguistic Variation for Inclusive NLP ML PhD w/ @Diyi_Yang at @MLatGT/@StanfordNLP Alum @NYUAbuDhabi @Sunshine @GoogleAI @AIatMeta Burqueño he/him
harry law @lawhsw
2K Followers 899 Following thinking about thinking machines @GoogleDeepMind @Cambridge_Uni @LeverhulmeCFI
ana vldv @ana_valdi
5K Followers 4K Following lecturer in ai, government & policy at the @oiioxford | critical ai studies | co-editor at @bigdatasoc | radical ecology, politics & algorithms | 🚲📖🌻
Tanise Ceron @taniseceron
172 Followers 336 Following PhD student in computational linguistics and computational social science @ims_stuttgart
Niklas Stoehr @niklas_stoehr
801 Followers 754 Following PhD student @ETH advised by @RyanDCotterell, @Cervisiarius and @AaronSchein ⭕️ Language Model Interpretability and Application ⭕️ @GoogleAI, @TechAtBloomberg
Tzu-Sheng Kuo 郭子.. @tzushengkuo
971 Followers 654 Following PhD student @cmuhcii | building systems that support community-driven approaches to AI design and evaluation | @stanfordhci '21 | #firstgen, he, 🇹🇼
Nathan Benaich @nathanbenaich
51K Followers 32K Following solo member of investment staff @airstreet, brewing ambition @airstreetcafe, next token predictor @airstreetpress
Geoffrey Irving @geoffreyirving
8K Followers 259 Following Research Director at the UK AI Safety Institute (AISI). Previously DeepMind, OpenAI, Google Brain, etc. @[email protected]
Yarin @yaringal
38K Followers 222 Following Associate Professor of Machine Learning, University of Oxford @OATML_Oxford Group Leader Director of Research at AISI (formerly UK Taskforce on Frontier AI)
Conference on Languag.. @COLM_conf
2K Followers 6 Following https://t.co/GhGCMEoa4A Abstract submission: March 22, 2024
Javier Rando @javirandor
910 Followers 589 Following Red-Teaming LLMs | PhD Student @ETH_AI_Center | Incoming intern @Meta | Vegan 🌱
Hanna Hajishirzi @HannaHajishirzi
6K Followers 328 Following Associate professor at @uw_cse; senior director at @allen_ai co-leading @allenNLP; AI/NLP researcher at @uw_nlp
Sander Schulhoff @SanderSchulhoff
448 Followers 62 Following DRL & NLP & Botany https://t.co/E3m8NZH88Y maintainer https://t.co/yly3QzZ3CS competition runner
Alex Tamkin 🦣 @AlexTamkin
4K Followers 1K Following machine learning, science & society @AnthropicAI | prev: phd @StanfordAILab, @stanfordnlp
Sam Toyer @sdtoyer
220 Followers 330 Following PhD student @berkeley_ai | Thief Executive Officer @ https://t.co/oY65mYDu3w
Daniel van Strien @vanstriendaniel
3K Followers 1K Following Machine Learning Librarian @huggingface 🤗 | Championing Open Science & ML | Sharing the latest ML datasets 🌟 | Tips for mastering the HF Hub
Hyunwoo Kim @hyunw__kim
1K Followers 438 Following Social Reasoning/Commonsense + AI | Postdoc @allen_ai | PhD @SeoulNatlUni
Niloofar (Fatemeh) Mi.. @niloofar_mire
5K Followers 1K Following Postdoc @uwcse-@uwnlp, Ph.D. from @ucsd_cse /Privacy, ML, NLP, @winlpworkshop chair, @MSFTResearch - Semantic Machines
Sharon Levy @sharonlevy21
1K Followers 573 Following Incoming Assistant Professor @RutgersCS Fall 2024 | Current Postdoc @jhuclsp PhD from UCSB @ucsbNLP Research in Responsible NLP
Dan Hendrycks @DanHendrycks
17K Followers 81 Following • Director of the Center for AI Safety (https://t.co/ahs3LYCpqv) • GELU/MMLU/MATH • PhD in AI from UC Berkeley https://t.co/rgXHAnYAsQ https://t.co/nPSyQMaY9b
Alicia Parrish @AliciaVParrish
557 Followers 675 Following Research scientist at Google. I like CogSci & NLP. PhD from @nyuling. She/her.
Owain Evans @OwainEvans_UK
7K Followers 242 Following Research Associate @fhioxford, Oxford University. AI alignment. Prefer email to DM.
Anna Rogers 🇺🇦.. @annargrs
9K Followers 865 Following Associate professor @ITUkbh: LLM interpretability, generalization, AI & society. Co-editor-in-chief @ACLRollingReview
Valentina Pyatkin @valentina__py
2K Followers 1K Following Postdoc at the Allen Institute for AI @allen_ai and @uwnlp
Eve Fleisig @enfleisig
377 Followers 332 Following PhD student @Berkeley_EECS | Princeton ‘21 | NLP, ethical + equitable AI, and linguistics enthusiast
Tanmoy Chakraborty @Tanmoy_Chak
2K Followers 823 Following Associate Professor @iitdelhi; ACM Distinguished Speaker; Lab @lcs2lab; Previously @IIITDelhi @UofMaryland @iitkgp; #NLP #SocialComputing #GraphNeuralNetworks
Indira Sen @indiiigosky
712 Followers 882 Following Computational Social Science PhD Student at RWTH - She/her - Saving up to fund my own biopic and migrating to https://t.co/tQ6ujjzKH8 + https://t.co/jflkRWngV3
Ray Duch @RayDuch
1K Followers 546 Following Director of the Centre for Experimental Social Sciences at Nuffield College, Oxford. Visitor @IAST1 Toulouse.
Tanvi Dinkar @t_dinkar
301 Followers 583 Following Post doc in Safety and Ethics in Conversational AI @HeriotWattUni | Background in CS and Linguistics. Safety+Robustness, Ethics, Dialogue, Pragmatics
After months of preparation, we're super excited to launch the LMSYS @kaggle competition on human preference prediction today! Welcome to join the competition and play with our dataset! Can't wait to see the innovative preference models that will emerge from this challenge.
Exciting news -- we're thrilled to announce that LMSYS + @kaggle are launching a human preference prediction competition with $100,000 in prizes! Your challenge is to predict which responses users will prefer in head-to-head battles between LLMs in the Chatbot Arena real-world…
Slowly but surely making progress on @COLM_conf reviewing. Reviews are due May 10! 😱🤗 9%|🦙▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐| [no idea what's going on with these llamas frantically working, but I am scared]
HarmBench (arxiv.org/abs/2402.04249) and WMDP (arxiv.org/abs/2403.03218) were accepted!
I’m so glad that our paper is accepted at #ICML2024! Again many thanks to my fantastic co-authors and see you in Vienna!🤩🤩🤩
Excited to share our paper with @iperboreo_ @vjhofmann @ellemichelley Anthony Cohn and Janet Pierrehumbert: fangru-lin.github.io/publications/! We release *AsyncHow*, a benchmark for asynchronous planning. With *Plan Like a Graph*, GPT-4/3.5 get a consistent boost across all task complexities. 1/n
Truly outstanding work by @hannahrosekirk on pluralistic alignment -- releasing an exciting novel dataset and providing a thorough analysis of subjective and multicultural alignment 🤩😍💯❤️🔥🦾
Today we're launching PRISM, a new resource to diversify the voices contributing to alignment. We asked 1500 people around the world for their stated preferences over LLM behaviours, then we observed their contextual preferences in 8000 convos with 21 LLMs arxiv.org/abs/2404.16019
Life update: today is my first day as a Member of Technical Staff at @cohere!
It's becoming a challenge to keep up with the outstanding research that @hannahrosekirk (and also, relatedly, @sorenmind) are producing these days! People keep asking me what comes after the Advanced AI Assistants paper: The answer is always "reading" 📚📚📚
The first chats from the ShareLM plugin are up, together with >4GB of chat datasets, organized in a unified format! ✨Whether you use models, create data, or spaces there is always a way to help✨ 💬:sharelm.github.io 🤗:huggingface.co/datasets/shach… 🧩:chromewebstore.google.com/detail/sharelm…
Super duper work on ShareLM for collecting "in-the-wild" preferences in more naturalistic interactions with LLMs. I really like how this can just be integrated simply into daily workflows. Small effort but mighty payoff 🦾
It was a pleasure to take part as a judge in the #Somos600M Hackathon. Congratulations to all the teams on the originality and quality of the projects👏🏻👏🏻. It's incredible what the Spanish-language #PLN community can achieve in so little time 💪🏼
The winning projects of the #Somos600M Hackathon are... 🥁🥁🥁 Congratulations to aaall the teams! Every year you surprise us with original, high-impact, high-quality projects 🤩 🧵 We tell you more about the award-winning projects and the honourable mentions
📜 New paper unpacking Google DeepMind’s approach to safety evals for advanced AI models, with lessons learned to support the advancement of similar efforts by other actors in this space. Covers foresight, evaluation design, and the wider ecosystem. arxiv.org/abs/2404.14068
More great work from a research team led by our model methodologist and evaluator in chief @weidingerlaura 👏 Here's what we learned during the latest round of @GoogleDeepMind model testing🤖📊
📣 New report out! 🎉How do we know whether an AI is “safe”? We share learnings from developing safety evaluation of large scale systems at Google DeepMind for a broad audience. Report: arxiv.org/abs/2404.14068 Key lessons: 🪡 (1/n)
It's been my pleasure to collaborate with Hannah et al. for the last few years leading up to this 💛 the dataset has turned out to be so much more than I could ever have imagined! There's a lot in here (>100pg!), hopefully, our findings will inform future work on preferences!
📖For our weekly @MilaNLProc lab seminar, it was a pleasure to have @shocheen presenting "Adapting language models to improve reliability: Experiments with refusals and diverse preference modeling" #NLProc
Thesis submitted, no turning back now… Acknowledgements to follow — for now, thank you all!
This is so cool! Happy to see it out in the world
Adversarial Triggers For LLMs Are 𝗡𝗢𝗧 𝗨𝗻𝗶𝘃𝗲𝗿𝘀𝗮𝗹!😲 It is believed that adversarial triggers that jailbreak a model transfer universally to other models. But we show triggers don't reliably transfer, especially to RLHF/DPO models. Paper: arxiv.org/abs/2404.16020
A very impressive human preference dataset (including annotator demographics) that will without doubt lead to many interesting studies! Congrats on the massive effort @hannahrosekirk!!
More preference data for people to play with in alignment research! It also comes with surveys asking people what their stated preferences are! Dataset: huggingface.co/datasets/Hanna…