Terra Blevins @TerraBlvns
Grad student researching NLP at the University of Washington. she/her. blvns.github.io Seattle, WA Joined July 2016
Tweets: 67
Followers: 501
Following: 421
Likes: 480
I'm excited to present our work on "Translate to Disambiguate" at #EACL2024 🇲🇹. Come by the Multilingual Issues oral session tomorrow (03/19) at 10:30 in Marie Louise to learn more!!
Quality multilingual annotated data is always scarce, so I'm extra happy to see ✨Universal NER✨ has been accepted at #NAACL2024. We hope the project will help address the data gap and facilitate new multilingual/cross-lingual research! 🎉 Preprint: arxiv.org/pdf/2311.09122…
🔍Looking for some #multilingual #LLM reading for the holidays or just that last minute stocking filler? 🎅 👀Look no further! Our new #preprint explores what's needed to get your chat LLM speaking languages other than English! 📄arxiv.org/abs/2312.12683
Happy to share In-Context Pretraining 🖇️ is accepted as an #ICLR2024 spotlight. We study how to pretrain LLMs with improved context understanding ability paper📄: arxiv.org/pdf/2310.10638… code: github.com/swj0419/in-con…
Very cool paper on the curse of multilinguality. Those guys trained more than 10k models over 250 languages! arxiv.org/pdf/2311.09205… Cherry on the cake: they use the (Smith et al.) citation format!!! I so want to sing their praises all day :)
#NeurIPS2023 Join us at the RegML Workshop (📅 Sat, Dec 16, 1:00-1:35 PM, Room 215-216). @YangsiboHuang and @xiamengzhou will present our work "Detecting Pretraining Data in Large Language Models". 🔗: swj0419.github.io/detect-pretrai…
We will present QLoRA at NeurIPS! Come to our oral on Tuesday where @Tim_Dettmers will be giving a talk. If you have questions stop by our poster session! https://t.co/hvnP8Q3vQG
Excited to share this #EMNLP2023 Findings paper here in Singapore! I'll present this tomorrow (Dec. 7) at 11:30 in the East Foyer, and again on Dec. 9 at 9am in the East Foyer. Come chat about how we can 🪄demystify our prompts
1. Demystifying Prompts in Language Models via Perplexity Estimation East Foyer, December 09, 9am x.com/hila_gonen/sta…
Do multilingual language models (MultiLMs) have what it takes to reason across languages? Our #EMNLP2023 #NLProc paper proposes a new attention mechanism that considerably improves the cross-lingual generalization of MultiLMs!
If you want a respite from OpenAI drama, how about joining academia? I'm starting Conceptualization Lab, recruiting PhDs & Postdocs! We need new abstractions to understand LLMs. Conceptualization is the act of building abstractions to see something new. conceptualization.ai
This project has been so much fun to work on! Check out this thread to learn about our new multilingual NER dataset, 🪐Universal NER🌟
Ever wondered which data black-box LLMs like GPT are pretrained on? 🤔 We build a benchmark WikiMIA and develop Min-K% Prob 🕵️, a method for detecting undisclosed pretraining data from LLMs (relying solely on output probs). Check out our project: swj0419.github.io/detect-pretrai… [1/n]
📢 New Paper! Ever wondered why transformers are able to capture hierarchical structure of human language without incorporating an explicit 🌲 structure in their architecture? In this work we delve deep into understanding hierarchical generalization in transformers. (1/n)
@sarahookr You might be referring to arxiv.org/abs/2204.08110 by @TerraBlvns
@sarahookr Might be this one: arxiv.org/abs/2305.10266?
🚀 Introducing Pile-T5! 🔗 We (EleutherAI) are thrilled to open-source our latest T5 model trained on 2T tokens from the Pile using the Llama tokenizer. ✨ Featuring intermediate checkpoints and a significant boost in benchmark performance. Work done by @lintangsutawika, me…
@naacl Now that I know a bit more about it, and given that AACL is already taken, I’d go with AmerACL! 🪄🎉 🎆 (Pronounced like “a miracle”)
I'm excited to share that the journal version of our paper, "An archival perspective on pretraining data", is now available (open access) from Patterns! This project was led by @MeeraDesai18, along with @IrenePasquetto, @az_jacobs, and myself 1/n
Language models scale reliably with over-training and on downstream tasks Scaling laws are useful guides for developing language models, but there are still gaps between current scaling studies and how language models are ultimately trained and evaluated. For instance,
In beautiful Malta to attend #EACL2024 and to present our paper "CoDET: A Benchmark for Contrastive Dialectal Evaluation of Machine Translation". This was carried out during my time at George Mason #NLProc with my great colleagues @mahfuzibnalam and @anas_ant: arxiv.org/abs/2305.17267
Happy to share REPLUG🔌 is accepted to #NAACL2024 We introduce a retrieval-augmented LM framework that combines a frozen LM with a frozen/tunable retriever. Improving GPT-3 in language modeling & downstream tasks by prepending retrieved docs to LM inputs. 📄:…
Excited to share that our paper "Do multilingual language models think better in English?" has been accepted at the NAACL 2024 main conference! 🎉🎉🎉 Thanks to all coauthors! @gazkune @Aitor57 @oierldl @artetxem @IxaGroup @Hitz_zentroa
Do multilingual language models think better in English? 🤔 Yes, they do! We show that using an LLM to translate its input into English and performing the task over the translated input works better than using the original non-English input! 😯 arxiv.org/abs/2308.01223
Check out our work on "extracting distinguishing dialectal features via interpretable dialect classifiers" led by the amazing @ruoyuxyz ! Accepted to #NAACL2024
✨ Can we use interpretability methods to extract linguistic features that characterize dialects❓ 🎉 New preprint: arxiv.org/abs/2402.17914 (@ruoyuxyz, @orevaahia, @tsvetshop, @anas_ant) 👉Code & Data: github.com/ruoyuxie/inter… 🧵(1/6)
The project is still open to contributions! Congrats, @mayhewsw and all co-authors/contributors! @TerraBlvns @hila_gonen @josephimperial_ @ljvmiranda @nljubesic @lpq29743 @yuvalpi @mr__shu @barbara_plank @Shuheng_Liu @ChunyuanDeng @EmilStenstrom @ArijRiabi @peterkz_swe et al.
Quality multilingual annotated data is always scarce, so I'm extra happy to see ✨Universal NER✨ has been accepted at #NAACL2024. We hope the project will help address the data gap and facilitate new multilingual/cross-lingual research! 🎉 Preprint: arxiv.org/pdf/2311.09122…
Our paper 'Examining Modularity in Multilingual LMs via Language-Specialized Subnetworks' got accepted to #NAACL2024 findings. Preprint: arxiv.org/pdf/2311.08273… See you in Mexico City!⛱️
This has been accepted to #NAACL2024 main conference (w/ high review scores)! 🥳🥂 I'm happy to have contributed Tagalog and Cebuano languages 🇵🇭 to this project led by @mayhewsw 🦜
🚨 New Dataset Alert 🚨 I'm extremely excited to announce Universal NER v1, available now. It is gold-standard human annotations of 18 datasets covering 12 languages, based on Universal Dependencies texts. This is the first data release of the UNER project. 1/3
🚨 Introducing Branch-Train-miX (BTX) 🚨 BTX improves a generalist LLM on multiple fronts: - Train expert LLMs in parallel for new skills in domains such as math, code & world knowledge - Join (mix) them together & finetune as a Mixture-of-Experts arxiv.org/abs/2403.07816 🧵(1/4)
How can we build more reliable LM-based systems? Our new position paper advocates for retrieval-augmented LMs (RALMs) as the next gen. of LMs, exploring the promises, limitations, and a roadmap for wider adoption. arxiv.org/abs/2403.03187 🧵
New preprint with @tylerachang and Benjamin Bergen! We find that some languages need up to five times as much storage in bytes to convey the same amount of information arxiv.org/pdf/2403.00686…