Paul Röttger @paul_rottger

Postdoc @MilaNLProc, working on evaluating and improving LLM safety. Previously PhD @oiioxford & CTO/co-founder @rewire_online paulrottger.com Joined July 2020

Tweets

274
Followers

2K
Following

455
Likes

2K

Paul Röttger @paul_rottger

a week ago

If you are working on AI alignment, you should really check out PRISM. It is hard to overstate how rich and exciting this dataset is. What a great week to be a co-author of @hannahrosekirk!

Hannah Rose Kirk @hannahrosekirk

a week ago

If you are working on AI alignment, you should really check out PRISM. It is hard to overstate how rich and exciting this dataset is. What a great week to be a co-author of @hannahrosekirk!

20 91 383 72K 186

Download Image

2 4 30 3K 7

Personalised LLMs are great, but should there be limits to personalisation? If so, who should set these limits? For answers to these questions and more, check out our paper on the risks and benefits of personalising LLMs, led by @hannahrosekirk 👇 out in @NatMachIntell today!

Hannah Rose Kirk @hannahrosekirk

a week ago

4 51 224 31K 117

Download Image

1 7 57 5K 15

Download Image

Janis Goldzycher @jagoldz

a month ago

New paper at #NAACL2024 🥳 We present GAHD, an 11k German Adversarial Hate speech Dataset 📜 and show that mixing annotator support strategies for finding adv. examples leads to a more effective dataset! Great collab with @paul_rottger and @center_text! Highlights below ⬇️

3 4 49 3K 8

Download Image

James Zou @james_y_zou

a month ago

How many safety examples do #LLMs need? What examples are most useful? Why is it unethical to kill Python processes?🤯 Our new #ICLR2024 paper studies these + more! openreview.net/pdf?id=gT5hALc… We analyze safey/utility tradeoff (100s safe demos suffice) and exaggerated safety. Great…

5 27 121 20K 77

Download Image

Paul Röttger @paul_rottger

2 months ago

This is very good, careful, and considerate work. Come for the headline results, stay for the many details!

Valentin Hofmann @vjhofmann

2 months ago

This is very good, careful, and considerate work. Come for the headline results, stay for the many details!

87 606 2K 398K 858

Download Image

2 2 22 2K 9

AI researcher trying to make sense of all things cyberspace 🤖 Uni of Ox PhD (loading…) @oiioxford & @OxfordAI. Prev @turinginst & @Cambridge_Uni. Visitor @ NYU

Hannah Rose Kirk @hannahrosekirk

3K Followers 689 Following AI researcher trying to make sense of all things cyberspace 🤖 Uni of Ox PhD (loading…) @oiioxford & @OxfordAI. Prev @turinginst & @Cambridge_Uni. Visitor @ NYU

Leverhulme Scholar @oiioxford | AI in news and journalism & Misinfo | Research Assistant @risj_oxford | Fellow @TowCenter | Affiliate @unc_citap | My views etc…

Felix M. Simon @_FelixSimon_

PhDing @oiioxford ✍🏼 // podcasting @skeptechs 🎙️// writing about India, gender, storytelling, postcolonialism, Internet. loves dogs. unusually tall 💅

Nayana Prakash @NayanaPrakash1

1K Followers 1K Following PhDing @oiioxford ✍🏼 // podcasting @skeptechs 🎙️// writing about India, gender, storytelling, postcolonialism, Internet. loves dogs. unusually tall 💅

Past: @SFU_DDI @SheffieldNLP @CoastalCPH. Reluctant Machine Learner. Researching hate online & AI ethics. Organising @woahworkshop.

Zeerak@{mastodon,bsky.. @ZeerakTalat

3K Followers 3K Following Past: @SFU_DDI @SheffieldNLP @CoastalCPH. Reluctant Machine Learner. Researching hate online & AI ethics. Organising @woahworkshop.

Dr. Mona Elswah من�.. @monaelswah

2K Followers 2K Following I research tech stuff @CenDemTech @oiioxford @OxDemTech @CarrCenter, @AUC @CairoUniv

The Oxford Internet Institute (OII) is a multidisciplinary research and teaching dept at University of Oxford, dedicated to the social science of the Internet.

Oxford Internet Insti.. @oiioxford

50K Followers 2K Following The Oxford Internet Institute (OII) is a multidisciplinary research and teaching dept at University of Oxford, dedicated to the social science of the Internet.

I like tokens! Lead for OLMo data team at @allen_ai (Dolma 🍇), OSS is fun, @QueerInAI organizer 🤖☕️🍕they/them (views mine, not my employer’s)

Luca Soldaini 🎀 @ .. @soldni

6K Followers 1K Following I like tokens! Lead for OLMo data team at @allen_ai (Dolma 🍇), OSS is fun, @QueerInAI organizer 🤖☕️🍕they/them (views mine, not my employer’s)

Ethical AI, Computational Argumentation, Conversational AI
Assoc. Professor of Data Science @unihh
Previously @MilaNLProc @dwsunima @allen_ai @grammarly

Anne Lauscher (she/he.. @anne_lauscher

1K Followers 477 Following Ethical AI, Computational Argumentation, Conversational AI Assoc. Professor of Data Science @unihh Previously @MilaNLProc @dwsunima @allen_ai @grammarly

Prof @Unibocconi @MilaNLProc: #NLProc, compsocsci, #ML.
#ERCStG INTEGRATOR
Python NLProc books:
https://t.co/vXm98Ns3D7
https://t.co/Ll7E6PDOKE

Dirk Hovy @dirk_hovy

9K Followers 1K Following Prof @Unibocconi @MilaNLProc: #NLProc, compsocsci, #ML. #ERCStG INTEGRATOR Python NLProc books: https://t.co/vXm98Ns3D7 https://t.co/Ll7E6PDOKE

research scientist @deepmind | 🔎 tech governance, online harms, socio-technical ai | previously @prodigi_erc @oiioxford | flibbertigibbet, views her own

Dr. Nahema Marchal @nahema_marchal

2K Followers 1K Following research scientist @deepmind | 🔎 tech governance, online harms, socio-technical ai | previously @prodigi_erc @oiioxford | flibbertigibbet, views her own

PhD @LTIatCMU Prev: @GoogleAI @MSFTResearch. Working on ethics and evaluation in #NLProc. Usually ranting, often about research & DEI.
📚 @readsndrants

Shaily @shaily99

5K Followers 2K Following PhD @LTIatCMU Prev: @GoogleAI @MSFTResearch. Working on ethics and evaluation in #NLProc. Usually ranting, often about research & DEI. 📚 @readsndrants

Spotify award-winning podcast hosted by Dr @JoshCowls & @NayanaPrakash1 featuring tech news and research from @oiioxford students. Views all our own

skeptechs @skeptechs

633 Followers 398 Following Spotify award-winning podcast hosted by Dr @JoshCowls & @NayanaPrakash1 featuring tech news and research from @oiioxford students. Views all our own

Researcher @DeepMind working on safer Conversational AI | Honorary Professor @HeriotWattUni | Co-founder @helloalana | mother of dragons | own opinions only

Verena Rieser @verena_rieser

4K Followers 1K Following Researcher @DeepMind working on safer Conversational AI | Honorary Professor @HeriotWattUni | Co-founder @helloalana | mother of dragons | own opinions only

CS PhD student @ EPFL — On the job market for 2023-2024!
🐘: @manoel@hci.social
Keywords: Computational Social Science, Platforms, Communities, Moderation

Manoel @manoelribeiro

3K Followers 1K Following CS PhD student @ EPFL — On the job market for 2023-2024! 🐘: @[email protected] Keywords: Computational Social Science, Platforms, Communities, Moderation

MilaNLP @MilaNLProc

4K Followers 447 Following The Milan Natural Language Processing Group #NLProc #ML #AI

PostDoc @cogcomp UPenn | Ph.D. CS @UUtah | @iitkanpur. @Bloomberg & @MSFTResearch Fellow | ex-@MetaAI, @IBM, @Verisk, @samsungresearch, @Synopsys #nlp #ml

Vivek Gupta @keviv9

2K Followers 5K Following PostDoc @cogcomp UPenn | Ph.D. CS @UUtah | @iitkanpur. @Bloomberg & @MSFTResearch Fellow | ex-@MetaAI, @IBM, @Verisk, @samsungresearch, @Synopsys #nlp #ml

Associate Professor and Senior Research Fellow, Oxford Internet Institute and Pembroke College, University of Oxford

Kathryn Eccles @KathrynEccles

2K Followers 2K Following Associate Professor and Senior Research Fellow, Oxford Internet Institute and Pembroke College, University of Oxford

PhD-ing @UniofOxford | Director @OxGenAI | cassidy.bereskin@oii.ox.ac.uk

Cassidy Bereskin @cassidybereskin

6K Followers 1K Following PhD-ing @UniofOxford | Director @OxGenAI | [email protected]

Preslav Nakov @preslav_nakov

2K Followers 2K Following Professor, MBZUAI LLMs, Jais-chat, "fake news", disinformation, propaganda, media bias Past: UC Berkeley, NUS, BAS, SU

Gender and Technology Researcher• PhDing @UniofOxford,@oiioxford• Views=mine• Sometimes Writer/Poet•TEDx2016 Speaker• Founder:@tedoexmedia

Jekanyika| @tedoex@te.. @tedoex

2K Followers 1K Following Gender and Technology Researcher• PhDing @UniofOxford,@oiioxford• Views=mine• Sometimes Writer/Poet•TEDx2016 Speaker• Founder:@tedoexmedia

Myah Dedaj @dedaj_my

89 Followers 5K Following

Salvatore Greco @_salvatoregreco

107 Followers 495 Following PhD Student | Explainable Artificial Intelligence and Natural Language Processing at Politecnico di Torino

Twin 2 @danielr4s

43 Followers 502 Following

The DAO investor. Early @Aleph__zero inv. Decentralization. Born on Vikings island called Jomsborg. Applied math. My posts are not financial advise.

Jacek (Jomsborg.eth) @timelessdev

1K Followers 5K Following The DAO investor. Early @Aleph__zero inv. Decentralization. Born on Vikings island called Jomsborg. Applied math. My posts are not financial advise.

Clare Schmiesing @cl_schmies

80 Followers 5K Following

huansong @huansong514

8 Followers 173 Following

Lj Miranda @ljvmiranda

780 Followers 431 Following processing language naturally • predoc at @allen_ai

james.near @jwaup

4K Followers 4K Following multidaomensional human @nearbuilders #OpenWeb

Senior Lecturer (Associate Professor) @UniofBath. Computational social science, social media, campaigns and elections. Ph.D. New York University

Iulia Cioroianu @IuliaCioroianu

686 Followers 2K Following Senior Lecturer (Associate Professor) @UniofBath. Computational social science, social media, campaigns and elections. Ph.D. New York University

liuyong @forrestbing

281 Followers 5K Following I am a researcher in AIGC, Multi-modality and VitrualHuman tech direction

Charleno Pires @charlenopires

2K Followers 5K Following Creative Man

Ece Takmaz @ecekt2

887 Followers 2K Following Postdoc at @UniUtrecht, previously PhD candidate at @UvA_Amsterdam

Torrey Snyder @Torrey_s2467

47 Followers 726 Following

PhD candidate at UvA with @wzuidema

NLP ∩ Interpretability ∩ Linguistics

PhD in 5 words: Finding structure in language models

Jaap Jumelet @JumeletJ

492 Followers 315 Following PhD candidate at UvA with @wzuidema NLP ∩ Interpretability ∩ Linguistics PhD in 5 words: Finding structure in language models

Expected Parrot @ExpectedParrot

268 Followers 218 Following AI-powered social science research.

et al. @basujindal

154 Followers 1K Following Grad @UCSanDiego | @iitmadras | https://t.co/NbicDWiTMT

Song Yifan @gooffanita

14 Followers 44 Following ling student @UniofOxford; interested in language, speech, gender and tech.

V. Gurucharan @GuruCharan4936

73 Followers 1K Following Chief AI Officer | PhD in String Theory

Capybara doing PhD@TsinghuaCS, checkout my blog @ https://t.co/2Iz05C84xd.
Interested in Reinforcement Learning, LLM-based Agents, Alignment.

Capybara ai @capybara_ai

49 Followers 505 Following Capybara doing PhD@TsinghuaCS, checkout my blog @ https://t.co/2Iz05C84xd. Interested in Reinforcement Learning, LLM-based Agents, Alignment.

PhD student social bias NLP @AmsterdamNLP @UvA_Amsterdam
@AiEleuther
Supervisors @wzuidema & Katrin Schulz
🟦@ovdw.bsky.social
🐘@oskarvanderwal@sigmoid.social

Oskar van der Wal @oskarvanderwal

298 Followers 350 Following PhD student social bias NLP @AmsterdamNLP @UvA_Amsterdam @AiEleuther Supervisors @wzuidema & Katrin Schulz 🟦@ovdw.bsky.social 🐘@[email protected]

CS PhD candidate @PrincetonCITP. I study the societal impact of AI. Currently writing a book on AI Snake Oil: https://t.co/tb2lXSP2gB

Sayash Kapoor @sayashk

5K Followers 1K Following CS PhD candidate @PrincetonCITP. I study the societal impact of AI. Currently writing a book on AI Snake Oil: https://t.co/tb2lXSP2gB

Assistant Professor @coastalcph @DIKU_Institut, University of Copenhagen. Interests: meaning, multiculturalism and promoting responsible behavior with #NLProc

Daniel Hershcovich @daniel_hers

877 Followers 2K Following Assistant Professor @coastalcph @DIKU_Institut, University of Copenhagen. Interests: meaning, multiculturalism and promoting responsible behavior with #NLProc

Associate Prof @oiioxford, Director of Research @meedan, Fellow @turinginst.・widening access to quality info・multilingualism・mobilization・NLP・agenda setting

Scott Hale @computermacgyve

2K Followers 1K Following Associate Prof @oiioxford, Director of Research @meedan, Fellow @turinginst.・widening access to quality info・multilingualism・mobilization・NLP・agenda setting

Yan Meng @vivian_yanmy

81 Followers 185 Following PhD student at Language Technology Lab, University of Amsterdam

Yizhong Wang @yizhongwyz

3K Followers 1K Following CS PhD student @uwcse @uwnlp. NLP/ML

Andreas Kirsch 🇮�.. @BlackHC

9K Followers 5K Following Past: 🧑‍🎓 DPhil @AIMS_oxford @ExeterCollegeOx @UniofOxford (4.5yr) 🧙‍♂️ RE @DeepMind (1yr) 📺 SWE @Google (3yrs) 🎓 @TU_Muenchen 👤 Fellow @nwspk

MoTS @normativeai. Ex @UMichCSE, 2x @MetaAI, @allen_ai | Speech & NLP | Robustness, Data & Annotations, Evaluation & Interpretability in LLMs

Mimansa Jaiswal @MimansaJ

1K Followers 3K Following MoTS @normativeai. Ex @UMichCSE, 2x @MetaAI, @allen_ai | Speech & NLP | Robustness, Data & Annotations, Evaluation & Interpretability in LLMs

Hunar Batra @hunarbatra

2K Followers 2K Following DPhil/MSc CS @UniOfOxford | prev: aligning LLMs @AnthropicAI @NYUDataScience @stanford | Tweets are my own, not through a bot 🤖

ByeRose @byerose365

0 Followers 520 Following

Jiaxing Cui @cuijiaxing

13 Followers 119 Following

NLP Researcher @ims_stuttgart & @Cambridge_NLP
Argument Mining, e-Deliberation, Bias, Meaning representations, Cognitive Modeling, #NLProc methodology

Eva Maria Vecchi @emvecchi

175 Followers 373 Following NLP Researcher @ims_stuttgart & @Cambridge_NLP Argument Mining, e-Deliberation, Bias, Meaning representations, Cognitive Modeling, #NLProc methodology

Congfeng Cao @Congfeng_Cao

11 Followers 73 Following NLP, Graph Learning CV, Remote Sensing PhD student @UvA_Amsterdam @illc_amsterdam

Joan @Joanvelja

24 Followers 149 Following Container of multitudes | MS AI @ University of Amsterdam

Tim Baumgärtner @timbmg

280 Followers 763 Following 🎓 PhD student at @UKPLab @TUDarmstadt

Luisa Fernanda Isaza @luisaza

2K Followers 3K Following Soy un montón de cosas humanas mezcladas con cosas santas | Free speech - Digital Tech - Human Rights https://t.co/dvYvixLssL

Postdoctoral fellow @ICatGT | PhD @UMSI | Prev. intern at @BellLabs, @Snap & @Visa | Interested in #NLProc #CompSocialScience #socialnetworks #misinformation

Minje Choi @minje__choi

370 Followers 557 Following Postdoctoral fellow @ICatGT | PhD @UMSI | Prev. intern at @BellLabs, @Snap & @Visa | Interested in #NLProc #CompSocialScience #socialnetworks #misinformation

Bringing together students, academics, and industry professionals to discuss research initiatives and future directions for inclusive gaming 🎮👾

oxford.games @GamesOxford

81 Followers 89 Following Bringing together students, academics, and industry professionals to discuss research initiatives and future directions for inclusive gaming 🎮👾

data head, python kid, prof @columbiajourn, director @ledeprog, co-founder @bkbrains + @catrepublicbk. collecting all cats, widening all gyres

j soma @dangerscarf

3K Followers 2K Following data head, python kid, prof @columbiajourn, director @ledeprog, co-founder @bkbrains + @catrepublicbk. collecting all cats, widening all gyres

Ryan Hauser @R__Hauser

2K Followers 4K Following Editor, https://t.co/2vFebqEBeg | HPS/STS + Political Economy | Grad Student @STS_VT | Usual caveats.

Master's student at the University of Tübingen. I wanted to be an artist, but have ended up with political science, machine learning and Spätzle.

Ana @alekseevaas21

44 Followers 153 Following Master's student at the University of Tübingen. I wanted to be an artist, but have ended up with political science, machine learning and Spätzle.

AI ethics & safety @googledeepmind / Politics of AI infrastructure @oiioxford @uniofoxford / organising + climate justice with ESEA Green Lions 🐉💚 they/them

Bóxī Wú 吴泊曦 @boxiwu_

490 Followers 1K Following AI ethics & safety @googledeepmind / Politics of AI infrastructure @oiioxford @uniofoxford / organising + climate justice with ESEA Green Lions 🐉💚 they/them

Pavel Goldman Kalaidi.. @facultyofwonder

399 Followers 777 Following Head of AI at a startup, borzoi dogs owner, language enthusiast (en-ru-de-jp-he) #IStandWithIsrael

testuser @testuser12331

11 Followers 51 Following

Link_to_Past @Link_to_Future

84 Followers 1K Following Data-Centric AI

Saurav Sahay @sauravsahay

507 Followers 1K Following Research Science Manager, Multimodal Dialogue and Interactions at Intel Labs

J.J. McElroy @JJMcElroy

240 Followers 1K Following What fresh hell will this new day bring?

🅱 @b_douik

2 Followers 70 Following

smith jane76 @AdebayoM44392

3 Followers 35 Following

Machine Learning Research Scholar @Apple Siri. Ph.D. student in computer science at Howard University. Research assistant for the Affective Biometric Lab.

Mikel K. Ngueajio @KNew_Mikel

241 Followers 82 Following Machine Learning Research Scholar @Apple Siri. Ph.D. student in computer science at Howard University. Research assistant for the Affective Biometric Lab.

PardisSzah @snowyubaba

52 Followers 204 Following سلاااام

Hannah Rose Kirk @hannahrosekirk

3K Followers 689 Following AI researcher trying to make sense of all things cyberspace 🤖 Uni of Ox PhD (loading…) @oiioxford & @OxfordAI. Prev @turinginst & @Cambridge_Uni. Visitor @ NYU

Interdisciplinary researcher focused on shaping AI towards long-term positive goals. ML & Ethics.

Same content in the Sky, Threads, & the Prehistoric Elephant

MMitchell @mmitchell_ai

80K Followers 1K Following Interdisciplinary researcher focused on shaping AI towards long-term positive goals. ML & Ethics. Same content in the Sky, Threads, & the Prehistoric Elephant

Felix M. Simon @_FelixSimon_

Nayana Prakash @NayanaPrakash1

1K Followers 1K Following PhDing @oiioxford ✍🏼 // podcasting @skeptechs 🎙️// writing about India, gender, storytelling, postcolonialism, Internet. loves dogs. unusually tall 💅

EMNLP 2024 - The 2024 Conference on Empirical Methods in Natural Language Processing, November 12 –16, 2024
Hashtag: #EMNLP2024

EMNLP 2024 @emnlpmeeting

12K Followers 41 Following EMNLP 2024 - The 2024 Conference on Empirical Methods in Natural Language Processing, November 12 –16, 2024 Hashtag: #EMNLP2024

Zeerak@{mastodon,bsky.. @ZeerakTalat

3K Followers 3K Following Past: @SFU_DDI @SheffieldNLP @CoastalCPH. Reluctant Machine Learner. Researching hate online & AI ethics. Organising @woahworkshop.

Douwe Kiela @douwekiela

10K Followers 380 Following @ContextualAI CEO, @Stanford Adjunct Prof.

Prof, Linguistics, UW // Faculty Director, CLMS // she/her // @emilymbender@dair-community.social & bsky // rep by @ianbonaparte

@emilymbender@dair-co.. @emilymbender

58K Followers 2K Following Prof, Linguistics, UW // Faculty Director, CLMS // she/her // @[email protected] & bsky // rep by @ianbonaparte

Senior Advisor, AI Accountability @Mozilla |Cognitive science PhD |Adjunct prof @tcddublinscss, @tcddublin |Ethiopian in Ireland |She/her

@abeba.bsky.social

Abeba Birhane @Abebab

53K Followers 2K Following Senior Advisor, AI Accountability @Mozilla |Cognitive science PhD |Adjunct prof @tcddublinscss, @tcddublin |Ethiopian in Ireland |She/her @abeba.bsky.social

NLP/ML/language/security. Principal research scientist @NVIDIA, & Prof @ITUkbh. Views ostensibly professional. llmsec stan acct

Leon Derczynski ✍�.. @LeonDerczynski

6K Followers 1K Following NLP/ML/language/security. Principal research scientist @NVIDIA, & Prof @ITUkbh. Views ostensibly professional. llmsec stan acct

Prof, Chair for AI & Computational Linguistics @LMU_Muenchen, Head of @MaiNLPlab & co-director @CisLMU
Prof at @ITUkbh @NLPnorth
@ELLISforEurope scholar

Barbara Plank @barbara_plank

9K Followers 1K Following Prof, Chair for AI & Computational Linguistics @LMU_Muenchen, Head of @MaiNLPlab & co-director @CisLMU Prof at @ITUkbh @NLPnorth @ELLISforEurope scholar

(((ل()(ل() 'yoav))).. @yoavgo

46K Followers 2K Following

Oxford Internet Insti.. @oiioxford

50K Followers 2K Following The Oxford Internet Institute (OII) is a multidisciplinary research and teaching dept at University of Oxford, dedicated to the social science of the Internet.

Luca Soldaini 🎀 @ .. @soldni

6K Followers 1K Following I like tokens! Lead for OLMo data team at @allen_ai (Dolma 🍇), OSS is fun, @QueerInAI organizer 🤖☕️🍕they/them (views mine, not my employer’s)

ruchowdh.bsky.social @ruchowdh

44K Followers 4K Following find me at https://t.co/hrk5quIFJI

We’re a team of scientists, engineers, ethicists and more, committed to solving intelligence, to advance science and benefit humanity.

Google DeepMind @GoogleDeepMind

945K Followers 275 Following We’re a team of scientists, engineers, ethicists and more, committed to solving intelligence, to advance science and benefit humanity.

Anne Lauscher (she/he.. @anne_lauscher

1K Followers 477 Following Ethical AI, Computational Argumentation, Conversational AI Assoc. Professor of Data Science @unihh Previously @MilaNLProc @dwsunima @allen_ai @grammarly

Dirk Hovy @dirk_hovy

9K Followers 1K Following Prof @Unibocconi @MilaNLProc: #NLProc, compsocsci, #ML. #ERCStG INTEGRATOR Python NLProc books: https://t.co/vXm98Ns3D7 https://t.co/Ll7E6PDOKE

Dr. Nahema Marchal @nahema_marchal

2K Followers 1K Following research scientist @deepmind | 🔎 tech governance, online harms, socio-technical ai | previously @prodigi_erc @oiioxford | flibbertigibbet, views her own

Sasha Rush @srush_nlp

52K Followers 464 Following Professor, Programmer in NYC. Cornell Tech, Hugging Face 🤗 https://t.co/cZl0wTfqGz

Incoming Asst. Prof. at @OhioStateCSE ('24). Postdoc at @allen_ai. Visiting @UWNLP. Ph.D. from @LTICMU. He/Him. Taking new students this cycle, reach out!

Sachin Kumar @shocheen

955 Followers 636 Following Incoming Asst. Prof. at @OhioStateCSE ('24). Postdoc at @allen_ai. Visiting @UWNLP. Ph.D. from @LTICMU. He/Him. Taking new students this cycle, reach out!

Sandro Pezzelle @sandropezzelle

806 Followers 668 Following Assistant Professor at the University of Amsterdam. #NLProc #AI #CogSci #interpretability

Nicholas Meade @ncmeade

138 Followers 150 Following PhD student at @McGillU / @Mila_Quebec; Interested in #NLProc.

Daniel Hershcovich @daniel_hers

877 Followers 2K Following Assistant Professor @coastalcph @DIKU_Institut, University of Copenhagen. Interests: meaning, multiculturalism and promoting responsible behavior with #NLProc

Wei-Lin Chiang @infwinston

3K Followers 853 Following CS PhD student at UC Berkeley. co-lead of Chatbot Arena @lmsysorg

Minje Choi @minje__choi

370 Followers 557 Following Postdoctoral fellow @ICatGT | PhD @UMSI | Prev. intern at @BellLabs, @Snap & @Visa | Interested in #NLProc #CompSocialScience #socialnetworks #misinformation

Interested in rigorous LLM evals, synthetic data, causality, robustness & AI safety/ethics | RS @PatronusAI | Prev: @VectorInst, @Mila_Quebec, @MPI_IS, @ETH_en

Nino Scherrer @ninoscherrer

594 Followers 2K Following Interested in rigorous LLM evals, synthetic data, causality, robustness & AI safety/ethics | RS @PatronusAI | Prev: @VectorInst, @Mila_Quebec, @MPI_IS, @ETH_en

Canyu Chen @CanyuChen3

852 Followers 2K Following CS Ph.D. student @illinoistech | Truthful, Safe and Responsible LLMs | LLMs Meet Misinformation: https://t.co/up5sEN5r1g

Professor at the ILLC in Amsterdam, head of the Dialogue Modelling Group, research on linguistic interaction, visual grounding & semantics/pragmatics

Raquel Fernández @raquel_dmg

2K Followers 167 Following Professor at the ILLC in Amsterdam, head of the Dialogue Modelling Group, research on linguistic interaction, visual grounding & semantics/pragmatics

DPhil NLP student @UniofOxford; Clarendon Scholar; Ex SDE intern @Microsoft; Computational Linguist; know how and know why

Fangru Lin @FangruLin99

203 Followers 196 Following DPhil NLP student @UniofOxford; Clarendon Scholar; Ex SDE intern @Microsoft; Computational Linguist; know how and know why

Faeze Brahman @faeze_brh

2K Followers 1K Following Postdoc @allen_ai @ai2_mosaic @uw | Ph.D. from UCSC | Former Intern @MSFTResearch , @allen_ai | Researcher in #NLProc, #ML

@Stanford professor. Chan-Zuckerberg investigator. Sloan Fellow. AI for biotech + health. Making AI more trustworthy, reliable and human compatible.

James Zou @james_y_zou

10K Followers 59 Following @Stanford professor. Chan-Zuckerberg investigator. Sloan Fellow. AI for biotech + health. Making AI more trustworthy, reliable and human compatible.

Lorenzo Lupo @lorelupo

34 Followers 50 Following processing natural language to tackle social and economic challenges @Unibocconi @MilaNLProc

Large Model Systems Organization. We created Vicuna and Chatbot Arena! Compare 30+ LLMs (GPT-4/Claude/Llamas) side-by-side at https://t.co/IDFeIDIOtm

lmsys.org @lmsysorg

38K Followers 173 Following Large Model Systems Organization. We created Vicuna and Chatbot Arena! Compare 30+ LLMs (GPT-4/Claude/Llamas) side-by-side at https://t.co/IDFeIDIOtm

Research @allen_ai. I evaluate (multi-modal) LLMs, build agents, and study the science of LLMs. Previously: @GoogleAI & @MetaAI FAIR @nlp_usc

Bill Yuchen Lin 🤖 @billyuchenlin

6K Followers 2K Following Research @allen_ai. I evaluate (multi-modal) LLMs, build agents, and study the science of LLMs. Previously: @GoogleAI & @MetaAI FAIR @nlp_usc

Modeling Linguistic Variation for Inclusive NLP

ML PhD w/ @Diyi_Yang at @MLatGT/@StanfordNLP
Alum @NYUAbuDhabi @Sunshine @GoogleAI @AIatMeta
Burqueño
he/him

Will Held @WilliamBarrHeld

1K Followers 792 Following Modeling Linguistic Variation for Inclusive NLP ML PhD w/ @Diyi_Yang at @MLatGT/@StanfordNLP Alum @NYUAbuDhabi @Sunshine @GoogleAI @AIatMeta Burqueño he/him

harry law @lawhsw

2K Followers 899 Following thinking about thinking machines @GoogleDeepMind @Cambridge_Uni @LeverhulmeCFI

lecturer in ai, government & policy at the @oiioxford | critical ai studies | co-editor at @bigdatasoc | radical ecology, politics & algorithms | 🚲📖🌻

ana vldv @ana_valdi

5K Followers 4K Following lecturer in ai, government & policy at the @oiioxford | critical ai studies | co-editor at @bigdatasoc | radical ecology, politics & algorithms | 🚲📖🌻

Taylor Sorensen @ma_tay_

455 Followers 374 Following #NLProc PhD student @uwcse/@uwnlp

Tanise Ceron @taniseceron

172 Followers 336 Following PhD student in computational linguistics and computational social science @ims_stuttgart

PhD student @ETH advised by @RyanDCotterell, @Cervisiarius and @AaronSchein ⭕️ Language Model Interpretability and Application ⭕️ @GoogleAI, @TechAtBloomberg

Niklas Stoehr @niklas_stoehr

801 Followers 754 Following PhD student @ETH advised by @RyanDCotterell, @Cervisiarius and @AaronSchein ⭕️ Language Model Interpretability and Application ⭕️ @GoogleAI, @TechAtBloomberg

Micah Carroll @MicahCarroll

762 Followers 624 Following AI PhD student at UC Berkeley @berkeley_ai

PhD student @cmuhcii | building systems that support community-driven approaches to AI design and evaluation | @stanfordhci '21 | #firstgen, he, 🇹🇼

Tzu-Sheng Kuo 郭子�.. @tzushengkuo

971 Followers 654 Following PhD student @cmuhcii | building systems that support community-driven approaches to AI design and evaluation | @stanfordhci '21 | #firstgen, he, 🇹🇼

Nathan Benaich @nathanbenaich

51K Followers 32K Following solo member of investment staff @airstreet, brewing ambition @airstreetcafe, next token predictor @airstreetpress

Research Director at the UK AI Safety Institute (AISI). Previously DeepMind, OpenAI, Google Brain, etc.

@irving@mastodon.social

Geoffrey Irving @geoffreyirving

8K Followers 259 Following Research Director at the UK AI Safety Institute (AISI). Previously DeepMind, OpenAI, Google Brain, etc. @[email protected]

Associate Professor of Machine Learning, University of Oxford
@OATML_Oxford Group Leader
Director of Research at AISI (formerly UK Taskforce on Frontier AI)

Yarin @yaringal

38K Followers 222 Following Associate Professor of Machine Learning, University of Oxford @OATML_Oxford Group Leader Director of Research at AISI (formerly UK Taskforce on Frontier AI)

Conference on Languag.. @COLM_conf

2K Followers 6 Following https://t.co/GhGCMEoa4A Abstract submission: March 22, 2024

Javier Rando @javirandor

910 Followers 589 Following Red-Teaming LLMs | PhD Student @ETH_AI_Center | Incoming intern @Meta | Vegan 🌱

Hanna Hajishirzi @HannaHajishirzi

6K Followers 328 Following Associate professor at @uw_cse; senior director at @allen_ai co-leading @allenNLP; AI/NLP researcher at @uw_nlp

Sander Schulhoff @SanderSchulhoff

448 Followers 62 Following DRL & NLP & Botany https://t.co/E3m8NZH88Y maintainer https://t.co/yly3QzZ3CS competition runner

Alex Tamkin 🦣 @AlexTamkin

4K Followers 1K Following machine learning, science & society @AnthropicAI | prev: phd @StanfordAILab, @stanfordnlp

Sam Toyer @sdtoyer

220 Followers 330 Following PhD student @berkeley_ai | Thief Executive Officer @ https://t.co/oY65mYDu3w

Machine Learning Librarian @huggingface 🤗 | Championing Open Science & ML | Sharing the latest ML datasets 🌟 | Tips for mastering the HF Hub

Daniel van Strien @vanstriendaniel

3K Followers 1K Following Machine Learning Librarian @huggingface 🤗 | Championing Open Science & ML | Sharing the latest ML datasets 🌟 | Tips for mastering the HF Hub

Hyunwoo Kim @hyunw__kim

1K Followers 438 Following Social Reasoning/Commonsense + AI | Postdoc @allen_ai | PhD @SeoulNatlUni

Niloofar (Fatemeh) Mi.. @niloofar_mire

5K Followers 1K Following Postdoc @uwcse-@uwnlp, Ph.D. from @ucsd_cse /Privacy, ML, NLP, @winlpworkshop chair, @MSFTResearch - Semantic Machines

Norman Mu @TheNormanMu

319 Followers 470 Following CS PhD student at @UCBerkeley

Emily Dinan @em_dinan

3K Followers 530 Following eng @ Meta GenAI (she/her)

Eric Michael Smith @ericsmithnyc

142 Followers 102 Following Researcher in generative AI at @MetaAI

Incoming Assistant Professor @RutgersCS Fall 2024 | Current Postdoc @jhuclsp PhD from UCSB @ucsbNLP Research in Responsible NLP

Sharon Levy @sharonlevy21

1K Followers 573 Following Incoming Assistant Professor @RutgersCS Fall 2024 | Current Postdoc @jhuclsp PhD from UCSB @ucsbNLP Research in Responsible NLP

• Director of the Center for AI Safety (https://t.co/ahs3LYCpqv)
• GELU/MMLU/MATH
• PhD in AI from UC Berkeley
https://t.co/rgXHAnYAsQ
https://t.co/nPSyQMaY9b

Dan Hendrycks @DanHendrycks

17K Followers 81 Following • Director of the Center for AI Safety (https://t.co/ahs3LYCpqv) • GELU/MMLU/MATH • PhD in AI from UC Berkeley https://t.co/rgXHAnYAsQ https://t.co/nPSyQMaY9b

Alicia Parrish @AliciaVParrish

557 Followers 675 Following Research scientist at Google. I like CogSci & NLP. PhD from @nyuling. She/her.

Owain Evans @OwainEvans_UK

7K Followers 242 Following Research Associate @fhioxford, Oxford University. AI alignment. Prefer email to DM.

Anna Rogers 🇺🇦�.. @annargrs

9K Followers 865 Following Associate professor @ITUkbh: LLM interpretability, generalization, AI & society. Co-editor-in-chief @ACLRollingReview

Valentina Pyatkin @valentina__py

2K Followers 1K Following Postdoc at the Allen Institute for AI @allen_ai and @uwnlp

Eve Fleisig @enfleisig

377 Followers 332 Following PhD student @Berkeley_EECS | Princeton ‘21 | NLP, ethical + equitable AI, and linguistics enthusiast

Myra Cheng @chengmyra1

496 Followers 198 Following PhD student @StanfordNLP

Associate Professor @iitdelhi; ACM Distinguished Speaker; Lab @lcs2lab; Previously @IIITDelhi @UofMaryland @iitkgp; #NLP #SocialComputing #GraphNeuralNetworks

Tanmoy Chakraborty @Tanmoy_Chak

2K Followers 823 Following Associate Professor @iitdelhi; ACM Distinguished Speaker; Lab @lcs2lab; Previously @IIITDelhi @UofMaryland @iitkgp; #NLP #SocialComputing #GraphNeuralNetworks

Computational Social Science PhD Student at RWTH - She/her - Saving up to fund my own biopic and migrating to https://t.co/tQ6ujjzKH8 + https://t.co/jflkRWngV3

Indira Sen @indiiigosky

712 Followers 882 Following Computational Social Science PhD Student at RWTH - She/her - Saving up to fund my own biopic and migrating to https://t.co/tQ6ujjzKH8 + https://t.co/jflkRWngV3

Ray Duch @RayDuch

1K Followers 546 Following Director of the Centre for Experimental Social Sciences at Nuffield College, Oxford. Visitor @IAST1 Toulouse.

Post doc in Safety and Ethics in Conversational AI @HeriotWattUni | Background in CS and Linguistics. Safety+Robustness, Ethics, Dialogue, Pragmatics

Tanvi Dinkar @t_dinkar

301 Followers 583 Following Post doc in Safety and Ethics in Conversational AI @HeriotWattUni | Background in CS and Linguistics. Safety+Robustness, Ethics, Dialogue, Pragmatics

Wei-Lin Chiang @infwinston

11 hours ago

After months of preparation, we're super excited to launch the LMSYS @kaggle competition on human preference prediction today! Welcome to join the competition and play with our dataset! Can't wait to see the innovative preference models that will emerge from this challenge.

lmsys.org @lmsysorg

11 hours ago

Exciting news -- we're thrilled to announce that LMSYS + @kaggle are launching a human preference prediction competition with $100,000 in prizes! Your challenge is to predict which responses users will prefer in head-to-head battles between LLMs in the Chatbot Arena real-world…

5 51 381 37K 150

Download Image

0 2 26 2K 3

Yoav Artzi @yoavartzi

13 hours ago

Slowly but surely making progress on @COLM_conf reviewing. Reviews are due May 10! 😱🤗 9%|🦙▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐| [no idea what's going on with these llamas frantically working, but I am scared]

0 2 17 3K 0

Download Image

Dan Hendrycks @DanHendrycks

a day ago

HarmBench (arxiv.org/abs/2402.04249) and WMDP (arxiv.org/abs/2403.03218) were accepted!

ICML Conference @icmlconf

a day ago

ICML decisions are out. See you in Vienna.

7 12 240 59K 7

6 1 33 5K 7

Fangru Lin @FangruLin99

22 hours ago

I’m so glad that our paper is accepted at #ICML2024! Again many thanks to my fantastic co-authors and see you in Vienna!🤩🤩🤩

Fangru Lin @FangruLin99

3 months ago

Excited to share our paper with @iperboreo_ @vjhofmann @ellemichelley Anthony Cohn and Janet Pierrehumbert: fangru-lin.github.io/publications/! We release a benchmark for asynchronous plan *AsyncHow*. When *Plan Like a Graph*, GPT-4/3.5 get consistent boost over all task complexities. 1/n

1 3 16 24K 27

Download Image

3 11 107 23K 31

Verena Rieser @verena_rieser

7 days ago

Truly outstanding work by @hannahrosekirk on pluralistic alignment -- releasing an exciting novel dataset and providing a through analysis subjective and multicultural alignment 🤩😍💯❤️‍🔥🦾

Hannah Rose Kirk @hannahrosekirk

a week ago

Today we're launching PRISM, a new resource to diversify the voices contributing to alignment. We asked 1500 people around the world for their stated preferences over LLM behaviours, then we observed their contextual preferences in 8000 convos with 21 LLMs arxiv.org/abs/2404.16019

20 91 383 72K 186

Download Image

2 0 21 2K 3

Tom Sherborne @tomsherborne

4 days ago

Life update: today is my first day as a Member of Technical Staff at @cohere!

25 5 261 17K 12

Download Image

Iason Gabriel @IasonGabriel

a week ago

It's becoming a challenge to keep up with the outstanding research the @hannahrosekirk – and also, relatedly @sorenmind – are producing these days! People keep asking me what comes after the Advanced AI Assistants paper: The answer is always "reading" 📚📚📚

Hannah Rose Kirk @hannahrosekirk

a week ago

20 91 383 72K 186

Download Image

1 3 33 3K 3

Shachar Don-Yehiya @Shachar_Don

3 days ago

The first chats from the ShareLM plugin are up, together with >4GB of chat datasets, organized in a unified format! ✨Whether you use models, create data, or spaces there is always a way to help✨ 💬:sharelm.github.io 🤗:huggingface.co/datasets/shach… 🧩:chromewebstore.google.com/detail/sharelm…

2 15 46 7K 24

Hannah Rose Kirk @hannahrosekirk

3 days ago

Super duper work on ShareLM for collecting "in-the-wild" preferences in more naturalistic interactions with LLMs. I really like how this can just be integrated simply into daily workflows. Small effort but mighty payoff 🦾

Shachar Don-Yehiya @Shachar_Don

3 days ago

2 15 46 7K 24

0 2 12 3K 5

Flor Plaza @florplaza22

6 days ago

Ha sido un placer participar como jurado del Hackathon #Somos600M. Enhorabuena a todos los equipos por la originalidad y calidad de los proyectos👏🏻👏🏻. Es increíble lo que la comunidad de #PLN en español puede conseguir en tan poco tiempo 💪🏼

SomosNLP @SomosNLP_

a week ago

Los proyectos ganadores del Hackathon #Somos600M son... 🥁🥁🥁 ¡Enhorabuena a tooodos los equipos! Cada año nos sorprendéis con proyectos originales, de gran impacto, y gran calidad 🤩 🧵 Os contamos más sobre los proyectos premiados y las menciones de honor

1 12 29 12K 3

Download Image

1 2 16 1K 0

Séb Krier @sebkrier

a week ago

📜 New paper unpacking Google DeepMind’s approach to safety evals for advanced AI models, with lessons learned to support the advancement of similar efforts by other actors in this space. Covers foresight, evaluation design, and the wider ecosystem. arxiv.org/abs/2404.14068

3 9 77 8K 50

Download Image

Iason Gabriel @IasonGabriel

a week ago

More great work from a research team led by our model methodologist and evaluator in chief @weidingerlaura 👏 Here's what we learned during the latest round of @GoogleDeepMind model testing🤖📊

Séb Krier @sebkrier

a week ago

3 9 77 8K 50

Download Image

0 5 32 7K 12

Laura Weidinger @weidingerlaura

6 days ago

📣 New report out! 🎉How do we know whether an AI is “safe”? We share learnings from developing safety evaluation of large scale systems at Google DeepMind for a broad audience. Report: arxiv.org/abs/2404.14068 Key lessons: 🪡 (1/n)

Iason Gabriel @IasonGabriel

a week ago

More great work from a research team led by our model methodologist and evaluator in chief @weidingerlaura 👏 Here's what we learned during the latest round of @GoogleDeepMind model testing🤖📊

0 5 32 7K 12

1 10 39 5K 27

Adina Williams @adinamwilliams

7 days ago

It's been my pleasure to collaborate with Hannah etc al. for the last few years leading up to this 💛 the dataset has turned out to be so much more than I could ever have imagined! There's a lot in here (>100pg!), hopefully, our findings will inform future work on preferences!

Hannah Rose Kirk @hannahrosekirk

a week ago

20 91 383 72K 186

Download Image

0 2 23 2K 4

MilaNLP @MilaNLProc

7 days ago

📖For our weekly @MilaNLProc lab seminar, it was a pleasure to have @shocheen presenting "Adapting language models to improve reliability: Experiments with refusals and diverse preference modeling" #NLProc

0 2 16 715 0

Download Image

Felix M. Simon @_FelixSimon_

a week ago

Thesis submitted, no turning back now… Acknowledgements to follow — for now, thank you all!

42 0 237 6K 5

Download Image

Cameron Raymond @CJKRaymond

a week ago

This is so cool! Happy to see it out in the world

Hannah Rose Kirk @hannahrosekirk

a week ago

20 91 383 72K 186

Download Image

2 0 4 579 0

Nicholas Meade @ncmeade

a week ago

Adversarial Triggers For LLMs Are 𝗡𝗢𝗧 𝗨𝗻𝗶𝘃𝗲𝗿𝘀𝗮𝗹!😲 It is believed that adversarial triggers that jailbreak a model transfer universally to other models. But we show triggers don't reliably transfer, especially to RLHF/DPO models. Paper: arxiv.org/abs/2404.16020

3 32 91 28K 32

Download Image

Nino Scherrer @ninoscherrer

a week ago

A very impressive human preference dataset (including annotator demographics) that will without doubt lead to many interesting studies! Congrats on the massive effort @hannahrosekirk!!

Hannah Rose Kirk @hannahrosekirk

a week ago

20 91 383 72K 186

Download Image

1 0 6 1K 1

Nathan Lambert @natolambert

a week ago

More preference data for people to play with in alignment research! It also comes with surveys asking people what their state preferences are! Dataset: huggingface.co/datasets/Hanna…