Syeda Nahida Akter @SNAT02792153
PhD student at @LTIatCMU @SCSatCMU. Working on Multimodal Question Answering #NLProc snat1505027.github.io Dhaka, Bangladesh Joined May 2020-
Tweets135
-
Followers152
-
Following477
-
Likes634
Curious about socially-intelligent AI? Check out our paper on underlying technical challenges, open questions, and opportunities to advance social intelligence in AI agents: Work w/ @lpmorency, @pliang279 📰Paper: arxiv.org/abs/2404.11023 💻Repo: github.com/l-mathur/socia… 🧵1/9
Google presents Many-Shot In-Context Learning - Proposes many-shot ICL, i.e., adding up to thousands of examples in context with Gemini 1.5, which boosts the perf significantly - Using synthetic CoT is very effect in this setting. arxiv.org/abs/2404.11018
Excited to share our work on FlexCap! It can provide visual captions at varying level of granularity for any region in the image. Website: flex-cap.github.io See 🧵by @debidatta for more details!
Excited to share our work on FlexCap! It can provide visual captions at varying level of granularity for any region in the image. Website: flex-cap.github.io See 🧵by @debidatta for more details!
We are excited to share Large World Model (LWM), a general-purpose 1M context multimodal autoregressive model. It is trained on a large dataset of diverse long videos and books using RingAttention, and can perform language, image, and video understanding and generation.
📢New paper : "In-Context Principle Learning from Mistakes" Instead of prompting using only *correct* few-shot examples, we intentionally make *mistakes*, and then learn "principles" or "lessons" from them. Lead by @tianjun_zhang @aman_madaan @luyu_gao arxiv.org/pdf/2402.05403…
📢New paper : "In-Context Principle Learning from Mistakes" Instead of prompting using only *correct* few-shot examples, we intentionally make *mistakes*, and then learn "principles" or "lessons" from them. Lead by @tianjun_zhang @aman_madaan @luyu_gao arxiv.org/pdf/2402.05403… https://t.co/w0nY0KGU6s
Introducing Vision Arena! Inspired by the awesome Chatbot Arena, we built a web demo on @huggingface for testing Vision LMs (GPT-4V, Gemini, Llava, Qwen-VL, etc.). You can easily test two VLMs side by side and vote! It’s still a work-in-progress. Feedbacks are welcome! 🔗…
Researchers from @CarnegieMellon, BerriAI explore the translation capabilities of Google’s Gemini and suggest Gemini Pro could be a valuable tool for MT. @SNAT02792153 @yu_zichun52802 @AashiqMuhamed @tianyue_01 @a13xba @a_a_cabrera @krrish_dh @XiongChenyan slator.com/is-google-gemi…
We're excited about all the interest in our Gemini report and working to make it even better! This week we made major improvements, switching to the @MistralAI instruct model, and working with the Gemini team to reproduce their results. Updates below.
We're excited about all the interest in our Gemini report and working to make it even better! This week we made major improvements, switching to the @MistralAI instruct model, and working with the Gemini team to reproduce their results. Updates below.
Google’s Gemini recently made waves as a major competitor to OpenAI’s GPT. Exciting! But we wondered: How good is Gemini really? At CMU, we performed an impartial, in-depth, and reproducible study comparing Gemini, GPT, and Mixtral. Paper: arxiv.org/abs/2312.11444 🧵
I caught up with @abertsch72 at #NeurIPS2023, who was presenting Unlimiformer, a retrieval-augmentation method for encoder-decoder models allowing unlimited length inputs. Paper: Unlimiformer: Long-Range Transformers with Unlimited Length Input Work with @urialon1 @gneubig, and…
🚀 1/7 We are thrilled to launch LLM360 — pushing the frontier of open-source & transparent LLMs! Starting with Amber (7B) & CrystalCoder (7B), we are releasing brand new pre-trained LLMs with all training code, data, and up to 360 model checkpoints. 🔗 llm360.ai
Introducing LQ-LoRA Decomposing pretrained matrices into (fixed) quantized + (trainable) low-rank components enables more aggressive quantization. We can quantize LLaMA-2 70B to 2.5 bits with minimal degradation in instruction-tuning performance. arxiv.org/abs/2311.12023 🧵1/n
Huck Yang @huckiyang
568 Followers 526 Following Sr. Research Scientist @NVIDIAAI Generative Error Correction | Ph.D. @GeorgiaTech | Past: @GoogleAI @AmazonScience | 🗣️ educationJoe Stacey @_joestacey_
576 Followers 1K Following PhD student at Imperial and Apple Scholar. I love running, NLP and travelling (in no particular order). Ex teacher and PwC Consultant. #NLProcMaitraye Das @MaitrayeUrmi
1K Followers 1K Following Asst prof @KhouryCollege @NU_CAMD @Northeastern. Researching #HCI, #CSCW, #Accessibility. PhD @NorthwesternU; Prev @uwcreate @MSFTResearch. she/her. From 🇧🇩Shrimai @shrimai_
2K Followers 504 Following Senior Research Scientist @nvidia | PhD from @SCSatCMU | Prev @SFResearch @facebookai & @MSFTResearchSyed Mostofa Monsur @symos66
4 Followers 175 FollowingDevansh Jain @devanshrjain
97 Followers 689 Following MIIS @LTIatCMU | Economics and CS @bitspilaniindia | ex Research Intern @CIS_Penn, @unihh, @Cardiff_NLP | #NLProcSheikh Shafayat @shafayat_sheikh
87 Followers 186 Following a universe of atoms, an atom in the universe KAIST '24 🇰🇷Sue Hyun Park @suehpark
133 Followers 326 Following MS student @kaist_ai. BBA & BS @SeoulNatlUni. Interested in evaluating LLM behavior and aligning them to human context + causal reasoning #NLProcHarish Agrawal @agrawalharish63
27 Followers 573 Following Applied Scientist @ Amazon AGI | M Tech Research in CDS @ IISc. Curious about - Nature, Mathematics, AI, NeuroScience, Astronomy…Emily Li @EmilyLiJiayao
376 Followers 625 Following Researcher @carnegiemellon | Founder @acadiaai | Prev research @modern_ai, ML @ evolution_devices, Founder @ arquestssern | Data-centric & Multimodal AIHamid Naderi Yeganeh @naderi_yeganeh
35K Followers 32K Following Research Student @UCL Maths. Mathematical artist. Email: naderiyeganeh at gmail dot comHarsh Desai @dreamerharsh
1 Followers 3K Followingresearcher Gpt LLM @researchGptllm
238 Followers 4K FollowingShivam Rai @imsr282
332 Followers 5K Following Tech enthusiast 🚀 | Embarking on a journey through Machine Learning & Data Science 🤖📊 | Curious mind, coding heart ❤️ | Exploring the data-driven frontier 🌐MUHAMMAD ZAYYAN @MUHAMMADZA78472
450 Followers 1K Followingliuyong @forrestbing
243 Followers 5K Following I am a researcher in AIGC, Multi-modality and VitrualHuman tech directionCLS @ChengleiSi
2K Followers 3K Following vibing @stanfordnlp | real AGI is the friends we made along the wayStefan Popov @stfnpopopop
13 Followers 2K Followingcamron, a machine of .. @CamronBergh
192 Followers 1K Following artificially intelligent, idiot. deep learning. I retweet things. opinions expressed probably arent even mine. he/him.Shibo Hao @Ber18791531
730 Followers 504 Following Ph.D. student at UC San Diego @UCSanDiego. B.S. in Computer Science at Peking University @PKU1898mukesh kumar @mukeshkr165
52 Followers 2K Following Dropped out of college in just two months with zero credits taken(lol)Arbaaz Qureshi @arbaaz__qureshi
330 Followers 2K Following Data Scientist @Lowes | Previously @Google and @MSFTResearch| CS grad @UMassAmherst and undergrad @IITPatMorgan McGuire @morgymcg
2K Followers 4K Following Learning Machine Learning...came for the bants, stayed for the rants. | Growth ML Eng @weights_biases | ex-Facebook Safety | https://t.co/a7i7G5dkLG | 🇮🇪Pratik Joshi @Roprajo
2K Followers 478 Following Research Engineer @GoogleDeepMind | Teaching machines to code | Prev @LTIatCMU @GoogleAI, @MSFTResearch @BITSPilaniGoaAnanjay Goel @GoelAnanjay
5 Followers 165 FollowingMubashara Akhtar @akhtarmubashara
1K Followers 682 Following Final yr PhD student @KingsCollegeLon • NLProc • co-organizing @FEVERworkshop • prev @CambridgeNLP, @tu_wien, intern @GoogleDeepmindRobert Haase @haesleinhuepf
11K Followers 4K Following Image Data Scientist, Lecturer, Training Coordinator @Sca_DS / @UniLeipzig, also @NFDI4BioImage, @NEUBIAS_, @GloBIAS_, GPUs, LLMs, AI, #openscience, views:mineAshutosh Mehra @ashutoshmehra
1K Followers 5K Following Senior Principal Scientist at Adobe. Working on Acrobat AI Assistant, LLMs, and document ML.Ananjay Goel @AnanjayGoel
0 Followers 34 FollowingMehal Rashid @mehal_rashid
296 Followers 592 Following Creating value driven content, growing brands, driving results 🚀- Top Rated Plus Writer on UpWork | Social Media Marketer & Manager | Founder @QuranSekhoOmar Khattab @lateinteraction
11K Followers 2K Following CS PhD candidate @StanfordNLP. 2022 Apple Scholar in AI/ML. Author of ColBERT (https://t.co/2ZtgXoa1np), DSPy (https://t.co/BH7WmMKDXR), & various retrieval & LM systems.Maria Stasimioti @MStasimioti
329 Followers 527 Following AI Program Manager & Research Analyst ⏩ MTPE Expert ⏩ Researcher @myionio. Passionate about translation technology and all things novel and innovative.Rithvik Kolla @KollaRithvik
19 Followers 299 Following Research. RSDE at Microsoft Research India. Sports fan. NovelsRafiqul Rabin @mdrafiqulrabin
121 Followers 296 Following Postdoctoral Fellow at @CSatUH of @UHouston. Interested in Safe AI/ML and LLMs for Code Intelligence.Omar Nusrat @OmarNusrat
651 Followers 2K Following PhD Candidate in Medical Physics @TorontoMet @UnityHealthTO • Vice Chair @COMPTrainees • @OntarioTech_U AlumOfir Press @OfirPress
10K Followers 3K Following I build tough benchmarks for LMs and then I get the LMs to solve them. Postdoc @Princeton. PhD from @nlpnoah @UW. Ex-visiting researcher @MetaAI & @MosaicML.Mahmud Hasan Khan @thekhanzadeh
32 Followers 253 Following data analyst, ml researcher🧠, liberal🙋🏻♂️, have interests in international politics and animeGraham Neubig @gneubig
31K Followers 586 Following Associate professor at CMU, studying natural language processing and machine learning.Huck Yang @huckiyang
568 Followers 526 Following Sr. Research Scientist @NVIDIAAI Generative Error Correction | Ph.D. @GeorgiaTech | Past: @GoogleAI @AmazonScience | 🗣️ educationShubhra K. Karmaker (.. @karmake2
161 Followers 249 Following Assistant Professor of Computer Science at Auburn University. Communication Chair of ACL Rolling Reviews (ARR)Abhishek Das @abhshkdz
6K Followers 202 Following Prev: Research Scientist at FAIR @Meta & @OpenCatalyst, PhD at @GeorgiaTech.Cognition @cognition_labs
123K Followers 19 Following Makers of Devin, the first AI software engineer. We are an applied AI lab focused on reasoning, and code is just the beginning. Join us: https://t.co/tpfZwEwGiqAhmed Awadallah @AhmedHAwadallah
758 Followers 333 Following Partner Research Manager, AI Frontiers @MSFTResearchAnthropic @AnthropicAI
262K Followers 26 Following We're an AI safety and research company that builds reliable, interpretable, and steerable AI systems. Talk to our AI assistant Claude at https://t.co/aRbQ97uk4d.Xander Dunn @xanderai
1K Followers 459 Following Building LLMs for code gen. Past: Applied LLM Secret Sharer, Deep RL for industrial robotics, @Apple Software EngineerTianjun Zhang @tianjun_zhang
1K Followers 763 Following Project Lead of RAFT, Gorilla, Berkeley Function Calling Leaderboard, and member of LiveCodeBench, PhD student at Berkeley-AI-ResearchConference on Languag.. @COLM_conf
2K Followers 6 Following https://t.co/GhGCMEoa4A Abstract submission: March 22, 2024Benno Krojer @benno_krojer
2K Followers 2K Following PhDing in AI (Vision+Language) @Mila_Quebec and @mcgillu. Vanier Scholar. I try to see my research as an infinite game: I play so I get to continue playingNYRE @sleenyre
20 Followers 50 FollowingAshutosh Mehra @ashutoshmehra
1K Followers 5K Following Senior Principal Scientist at Adobe. Working on Acrobat AI Assistant, LLMs, and document ML.Omar Khattab @lateinteraction
11K Followers 2K Following CS PhD candidate @StanfordNLP. 2022 Apple Scholar in AI/ML. Author of ColBERT (https://t.co/2ZtgXoa1np), DSPy (https://t.co/BH7WmMKDXR), & various retrieval & LM systems.Chuang Gan @gan_chuang
4K Followers 455 Following Faculty Member at UMass Amherst; Principal researcher at MIT-IBM Watson AI Lab; Homepage: https://t.co/oXP6pqXCpoBindu Reddy @bindureddy
124K Followers 338 Following CEO of @abacusai, using Gen AI to build Applied AI and LLM agents and systems at scale, ex-AWS / Google, passionate about human behavior and open-source AGIDavis Liang @LiangDavis
328 Followers 219 Following NLProc Research Scientist @AbridgeHQ. Prev: Research Scientist (@MetaAI), Applied Scientist (@awscloud).Xin Eric Wang @xwang_lk
7K Followers 1K Following Multimodal and Embodied AI Researcher / Professor @UCSC. Director of https://t.co/Y4swOBag21. AI for Humanity in the long run. he/himYue Wang @yuewang314
5K Followers 929 Following Assistant Professor @ USC CS and part-time Research Scientist @ Nvidia Research. Previous: EECS PhD @ MIT CSAIL. Opinions are mine.Simran Khanuja @simi_97k
2K Followers 897 Following NLP | PhD Student @LTIatCMU | Predoctoral Researcher @Google | Microsoft Research | BITS Pilani, GoaMubashara Akhtar @akhtarmubashara
1K Followers 682 Following Final yr PhD student @KingsCollegeLon • NLProc • co-organizing @FEVERworkshop • prev @CambridgeNLP, @tu_wien, intern @GoogleDeepmindGuillaume Lample @GuillaumeLample
37K Followers 648 Following Cofounder & Chief Scientist https://t.co/hLfvKLkFHd (@MistralAI). Working on LLMs. Ex @MetaAI | PhD @Sorbonne_Univ_ | MSc @CarnegieMellon | X11 @PolytechniquePratik Joshi @Roprajo
2K Followers 478 Following Research Engineer @GoogleDeepMind | Teaching machines to code | Prev @LTIatCMU @GoogleAI, @MSFTResearch @BITSPilaniGoaMustafa Suleyman @mustafasuleyman
131K Followers 535 Following CEO, Microsoft AI | Author: The Coming Wave | Past: Co-founder, @InflectionAI & @GoogleDeepMindRoss Taylor @rosstaylor90
6K Followers 869 Following Something new 🥷. Previously: @paperswithcode, reasoning lead @metaai, Galactica LLM lead, Atlas ML (acq by Meta)David Sontag @david_sontag
9K Followers 309 Following CEO & Co-founder @layerhealth. Professor, MIT. Research on machine learning in health care. Part of @MIT_CSAIL, @MIT_IMES, @MITEECS, @AIHealthMITKaixin Ma @KaixinMa9
268 Followers 239 Following Senior NLP Researcher at Tencent AI Lab, ex-PhD student at LTI, CMUHaofei Yu @haofeiyu44
146 Followers 723 Following MS student @LTIatCMU | previously CS undergrad @ZJU_China | ex-intern @Apple @TencentGlobalsyeda tamzida akter @tamzida5591
2 Followers 22 FollowingShaily @shaily99
5K Followers 2K Following PhD @LTIatCMU Prev: @GoogleAI @MSFTResearch. Working on ethics and evaluation in #NLProc. Usually ranting, often about research & DEI. 📚 @readsndrantsAK @_akhaliq
310K Followers 3K Following AI research paper tweets, ML @Gradio (acq. by @HuggingFace 🤗) dm for promo follow on Hugging Face: https://t.co/q2Qoey80GxShelby Heinecke @shelbyh_ai
343 Followers 830 Following AI Leader, Researcher, & Engineer. AI Research Manager @SFResearch. Math PhD @thisisUIC, Math BS @MIT. On a mission! 🚀Pierluca D'Oro @proceduralia
1K Followers 356 Following Final-year PhD student at Mila and researcher at Meta, working on the science of AI agents. Made in Sicily.Ani Kembhavi @anikembhavi
2K Followers 297 Following Senior Director @allen_ai + Affiliate Assoc Prof @UW 📷 : Visual Prog, Unified-IO, BiDAF 🤖 : ProcTHOR, Objaverse, SPOC 🌎 : SATLAS All views my own.Jing Yu Koh @kohjingyu
3K Followers 486 Following Machine Learning PhD student @CarnegieMellon. Previously: fulltime vision-and-language research @GoogleAI, undergrad @sutdsg. 🇸🇬Faisal Mahmood @AI4Pathology
4K Followers 2K Following Associate Prof. @Harvard | Faculty @harvardmed @BWHPath @MGHPathology @broadinstitute @harvard_data | via @JohnsHopkins | Multimodal Computational PathologyJaemin Cho @jmin__cho
1K Followers 891 Following PhD student at @UNCCS @UNCNLP Previously at @GoogleAI, @MSFTResearch, @AdobeResearch, @Allen_AI, @official_naver, and @SeoulNatlUniAnjali Kantharuban @anjali_ruban
303 Followers 120 Following PhD in Language Technology @ CMU, working on NLP for Dialects | Formerly @ Cambridge & UC BerkeleyNing Yu @realNingYu
720 Followers 332 Following Research Scientist at Netflix Eyeline Studios. Ex-Salesforce. Joint PhD from UMD & MPI-INF. Leading efforts in visual and multimodal generative AI.Sireesh Gururaja @_sireesh
361 Followers 2K Following Trying to get to know my neighbors, both irl and online. PhD student @LTIatCMU, interested in NLP that lets people keep agency. Former: @kensho, @IBM, @ColumbiaXuhui Zhou @nlpxuhui
685 Followers 429 Following PhD student @LTIatCMU. Previously, @GeorgiaTech, @UWNLP, and @Apple. Social Intelligence in language +X. He/Him.🐳Yushi Hu @huyushi98
1K Followers 1K Following 🎓PhD student @uwnlp | Visiting Researcher @allen_ai Prev. @GoogleAI @UChicago @TTIC_Connect | NLP/CV/AI 📖🎹🪗📷⚽️Maitraye Das @MaitrayeUrmi
1K Followers 1K Following Asst prof @KhouryCollege @NU_CAMD @Northeastern. Researching #HCI, #CSCW, #Accessibility. PhD @NorthwesternU; Prev @uwcreate @MSFTResearch. she/her. From 🇧🇩Houda BOUAMOR @hbouamor
684 Followers 398 Following Associate Professor @CMU-Q, Associate Area Head of Information Systems, NLP and Machine Learning ExpertPuyuan Peng @PuyuanPeng
962 Followers 561 Following CS PhD student @UTAustin, working on speech and audio recognition, understanding, and generation. Previously @uchicago Stats, @BNU_Official MathA fundamental skill of human developers is to mentally simulate and reason about code execution in natural language. Can we teach LLMs this skill? Excited to share our recent work with @AnsongNi @miltos1 @armancohan @yinlin_deng @kensen_shi @RandomlyWalking at @GoogleDeepMind!
Turns out that even SOTA MLLMs achieve near random accuracy on these visual IQ questions 🧐
How good are MLLM at solving IQ (abstract visual reasoning) problems? Check our new benchmark paper! MARVEL: Multidimensional Abstraction and Reasoning through Visual Evaluation and Learning Paper: arxiv.org/pdf/2404.13591… Website: marvel770.github.io
Great resource from @shi_weiyan and team! This will be valuable for advancing LLMs that can operate across cultural contexts
🚨New Paper🚨 We propose 1⃣CultureBank🌎 dataset sourced from TikTok & Reddit 2⃣An extensible pipeline to build cultural knowledge bases 3⃣Evaluation of LLMs’ cultural awareness 4⃣Insights into culturally-aware LLMs Project: culturebank.github.io Data: shorturl.at/hrtwP
On May 2-3, we're going to have a big event in Pittsburgh about LLM Agents. We have invited talks from great speakers inside and outside CMU, student research presentations and posters, tutorials and discussions! Come join us at CMU campus, and register at cmu-agent-workshop.github.io
Join us in the Agent Workshop! Will have a lot of fun: talks, tutorials, hackathon, posters, ... 🥳
On May 2-3, we're going to have a big event in Pittsburgh about LLM Agents. We have invited talks from great speakers inside and outside CMU, student research presentations and posters, tutorials and discussions! Come join us at CMU campus, and register at cmu-agent-workshop.github.io
Curious about socially-intelligent AI? Check out our paper on underlying technical challenges, open questions, and opportunities to advance social intelligence in AI agents: Work w/ @lpmorency, @pliang279 📰Paper: arxiv.org/abs/2404.11023 💻Repo: github.com/l-mathur/socia… 🧵1/9
Llama 3 just changed the LLM game. People are finding wild use cases at GPT-4 level. There is a massive movement in the open source community. 10 examples (and ways to use Llama 3):
Llama3 was trained on 15 trillion tokens of public data. But where can you find such datasets and recipes?? Here comes the first release of 🍷Fineweb. A high quality large scale filtered web dataset out-performing all current datasets of its scale. We trained 200+ ablation…
We have just released 🍷 FineWeb: 15 trillion tokens of high quality web data. We filtered and deduplicated all CommonCrawl between 2013 and 2024. Models trained on FineWeb outperform RefinedWeb, C4, DolmaV1.6, The Pile and SlimPajama!
The multimodal version of this nit (for me) is when people say they use the "LLaVA architecture". I believe the original paper with this architecture is actually Frozen (arxiv.org/abs/2106.13884) from DeepMind in 2021. LLaVA uses the exact same architecture and training objective…
most solid architecture is the "Noam" architecture. stop calling it a llama or whatever. this is the Noam transformer. (you can call it PaLM architecture too!)
I've just finished Senior Area Chairing for the ARR February 2024 cycle. It was almost a disaster and I think we should talk about our experiences across the community.
Honored to receive the 2024 Jane Street Graduate Research Fellowship! Thank you @JaneStreetGroup for the award and for organizing an amazing workshop! The best part of this was getting to meet PhD students working on algebraic geometry, cosmology, quantum algorithms, and more!
Yes Paul (@pliang279), that bubbly is yours! 🥳🎉Congratulations on your very successful dissertation defense! (on the "Foundations of Multisensory Artificial Intelligence" in the @mldcmu ,@SCSatCMU, @CarnegieMellon ).
Congrats Dr. @pliang279!! That was a great defense 🎉🎉
Yes Paul (@pliang279), that bubbly is yours! 🥳🎉Congratulations on your very successful dissertation defense! (on the "Foundations of Multisensory Artificial Intelligence" in the @mldcmu ,@SCSatCMU, @CarnegieMellon ).
🎙️ Just wrapped up a talk at @MilaNLProc! Delighted to share our efforts in socially aware and interactional NLP systems: 🔗 Sotopia: openreview.net/forum?id=mM7Vu… 🔗 Agents vs Script: arxiv.org/abs/2403.05020 🔗 Cobra Frames: aclanthology.org/2023.findings-… #NLP #Research #MilaNLProc
📖For our weekly @MilaNLProc lab seminar, it was a pleasure to have @nlpxuhui presenting "Towards Socially Aware and Interactional NLP Systems". #NLProc
[p1] 🐕Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward🐕 Paper link: arxiv.org/pdf/2404.01258… page: github.com/RifleZhang/LLa… How to effectively train video large multimodal Model (LMM) alignment with preference modeling?
podcast episode where @_sireesh @claranahhh and I talk about our EMNLP paper on paradigm shifts! thanks for having us :)
🎙️New episode: Change, with Sireesh Gururaja, Amanda Bertsch and Clara Na pca.st/episode/d63166…
Interruptions make conversations feel natural. Much work has focused on AI voice assistants that can be interrupted by humans, but systems that know much more than us should be able to interrupt us too. At @AGIHouseSF's Launchathon today, I'm launching Interrupting Cow 🐮📢
🔥I will be joining @CarnegieMellon @LTIatCMU this upcoming Fall, working with @gneubig and @wellecks on evaluating LLMs & improving them with (human) feedback! Can't wait to explore what lies ahead during my Ph.D. journey☺️
Check out this new work studying LLM agent social simulations, enabled by the #Sotopia platform! And a great paper title 😄 @nlpxuhui 👏
Let’s talk about social simulations! Do you know that term could refer to various settings? Our new work suggests that you might want to double-check before being “amazed” by those simulations. 📜: arxiv.org/abs/2403.05020 🌐: agscr.sotopia.world 1/
Tools can empower LMs to solve many tasks. But what are tools anyway? github.com/zorazrw/awesom… Our survey studies tools for LLM agents w/ –A formal def. of tools –Methods/scenarios to use&make tools –Issues in testbeds and eval metrics –Empirical analysis of cost-gain trade-off