lmsys.org @lmsysorg
Large Model Systems Organization. We created Vicuna and Chatbot Arena! Compare 30+ LLMs (GPT-4/Claude/Llamas) side-by-side at https://t.co/IDFeIDIOtm lmsys.org US Joined March 2023-
Tweets369
-
Followers32K
-
Following170
-
Likes536
Yes, check out @RekaAILabs's strong Flash-21B model!
Yes, check out @RekaAILabs's strong Flash-21B model!
Aarti Ravariya @ravariaarti
0 Followers 33 FollowingTaulant @taulantdemaa
227 Followers 374 Followingmiston @miston27
22 Followers 9 Followingbadboy @badboyjohnny2
36 Followers 72 Following You wanna do what you just did, but you already did it, so you can't do thatSalvader trump @ziwangfu
610 Followers 5K FollowingSengshin Lee @sengshinlee
3 Followers 837 FollowingCaptspeedlol @hnxinsh44719996
5 Followers 118 FollowingGloire Tshimanga @TshimangaG1
31 Followers 58 FollowingMikhail Matveenko @MikhailMaTV
16 Followers 428 FollowingLion @Lion75654786512
0 Followers 8 Followingyu kki @yukki7055285380
1 Followers 23 FollowingUbon-Obong Jeremiah U.. @UbObong2341
305 Followers 2K Following Optimist|| Electrical/Electronics Engineering Student || Mathematics/Science Tutor ||TonyALLINAI @allinai85459
1 Followers 8 Followingirfreeman @_mutaga
293 Followers 781 Following ~ Verilog, C, and Rust at @yale Searching for the better Noryve? Follow @kaganwa_musoreadam ber @adamber11
64 Followers 303 Following Full time freelancer🙌🏻 Click the link to learn how to quit your 9-5 ✨alikaeid @1800Alikaeid
363 Followers 865 FollowingTim Vogel @TimVogelTesla
28 Followers 221 Following hi, I'm Tim and Samele are donating for a Telsa for my family Paypal: https://t.co/inyaUVuUhYShedrach Stephen @StephenShe68907
57 Followers 205 FollowingKhalid AlQahtani @KhalidAlqh55901
0 Followers 267 FollowingBadoo_jiraa @BadooJiraa
1 Followers 33 FollowingAlexis Urusoff @elurusoff
179 Followers 2K Following Professional Passionist Inflamer. Cordobés. Hijo de española y ruso-guaraní. @narkocibernético Krishna leads.Robertomixaudio @robertomix60433
3 Followers 45 FollowingAmbient Earth @AmbientEarthv
16 Followers 77 Following Ambient Earth - Improve focus, relaxation and tranquility with VirtuScapes that harness the soothing sounds of the earth. 🌿🎶 #ambientearth #focus #rest #peaceShishir Joshi @theshishirjoshi
15 Followers 518 FollowingRyudas @Syuudas
32 Followers 546 Followingkenjiiij @kenjiiij1
0 Followers 381 Followingkouseinen @kouseinen_real
2K Followers 2K Following Prompt Engineer / Manager / Product Manager ITの大企業PM→大企業の子会社PdM→ベンチャーでPdM&経営企画→IT上場企業でプロンプトエンジニア Produce @AInews_trend #AI #generative #ChatGPT #promptDaniel Neburagho @DNeburagho12135
45 Followers 159 FollowingVeeran @Veeran_Veerpoy
15 Followers 1K FollowingRaydonLiu @LiuRaydon
0 Followers 26 FollowingATL @JUUUBAAAAA
2 Followers 31 FollowingAmjad @Ad31571973
76 Followers 266 Following „Dein Verstand ist grenzenlos, es sind die Zweifel, die dich limitieren.“@hugoiarce @hugoiarce
21 Followers 386 Following Ceo y Fundador de https://t.co/3wdMl6qO2I +54 9 3434158123pretoria2020 @pretoria2020
346 Followers 2K Following Si vis pacem, para bellum If you want peace, prepare for war #NoJusticeNoPeace #Resistance2021AI at Meta @AIatMeta
532K Followers 255 Following Together with the AI community, we are pushing the boundaries of what’s possible through open science to create a more connected world.Hyperbolic @hyperbolic_labs
3K Followers 43 Following Realize your vision for AI with open access to more than just compute. Join our discord: https://t.co/SaGT3y9AtELisa Dunlap @lisabdunlap
487 Followers 154 Following PhD student & vibe curator @berkeley_ai and Sky Computing Lab -- for the love of god look at your dataNaman Jain @StringChaos
901 Followers 896 Following CS PhD @UCBerkeley | Projects - R2E, LiveCodeBench, Chatbot-Arena Coding, RAFT, Data Quality | Past: @AWS @MSFTResearch @iitbombaySimon Mo @simon_mo_
339 Followers 303 Following Working on System for ML @ucbrise. Happy to get in touch: https://t.co/ACIbL2HqBr at https://t.co/FWFXdUDDMp (ex-@anyscalecompute)Hao AI Lab @haoailab
356 Followers 136 Following Hao AI Lab at UCSD. Our mission is to democratize large machine learning models, algorithms, and their underlying systems.Jeff Dean (@🏡) @JeffDean
296K Followers 6K Following Chief Scientist, Google DeepMind and Google Research. Co-designer/implementor of things like @TensorFlow, MapReduce, Bigtable, Spanner, Gemini .. (he/him)Banghua Zhu @BanghuaZ
2K Followers 802 Following PhD @Berkeley_EECS, statistics, info theory, LLM, RL, Human-AI Interactions.Liangsheng Yin @lsyincs
47 Followers 153 Following Undergraduate in SJTU, ACM Honor Class 2021. Interested in mlsys | machine learning | distributed systemsZihao Ye @ye_combinator
830 Followers 454 FollowingLogan Kilpatrick @OfficialLoganK
92K Followers 2K Following Lead product for @Google AI Studio and working on the Gemini API, helping developers build with AI, my views!Tengyu Ma @tengyuma
25K Followers 512 Following Assistant professor at Stanford; Co-founder of Voyage AI (https://t.co/wpIITHLgF0) ; Working on ML, DL, RL, LLMs, and their theory.Christopher Manning @chrmanning
126K Followers 115 Following Director, @StanfordAILab. Assoc. Director, @StanfordHAI. Founder, @stanfordnlp. Prof. CS & Linguistics, @Stanford. IP @aixventureshq. 🇦🇺 Do #NLProc & #AI. 👋Anastasios Nikolas An.. @ml_angelopoulos
3K Followers 784 Following @Berkeley_EECS Ph.D. with Mike Jordan/Jitendra Malik. Conformal prediction, distribution-free uncertainty quantification, vision/imaging. Former @stanford_ee.Arthur Mensch @arthurmensch
40K Followers 872 Following Co-founder and CEO @MistralAI. Apply https://t.co/yHGRZAtjcxJames Zou @james_y_zou
10K Followers 59 Following @Stanford professor. Chan-Zuckerberg investigator. Sloan Fellow. AI for biotech + health. Making AI more trustworthy, reliable and human compatible.σ(W_hx * x_t + W_hh .. @QNixSynapse
94 Followers 85 Following Created this account to keep an eye of all things #ML Part time researcher of Natural stupidity in Artificial IntelligenceFireworks AI @FireworksAI_HQ
5K Followers 65 Following 🎆 Generative AI Platform built for developersHaotian Liu @imhaotian
6K Followers 396 Following building intelligence @xAI, creator of #LLaVA, cs @UWMadison, prev @MSFTResearchMistral AI @MistralAI
90K Followers 0 Following Fast, open-source and secure language models. Join us https://t.co/INALdNGvCPapolinario (multimoda.. @multimodalart
10K Followers 376 Following ML for Art and Creativity, working @HuggingFace ([email protected])Allen Institute for A.. @allen_ai
54K Followers 361 Following AI for the Common Good. › Join us: https://t.co/DqTs1G4bGO › Get our newsletter: https://t.co/tvb1VpySfLYang Song @DrYangSong
10K Followers 886 Following Leading the Strategic Explorations team @OpenAI. Score-Based Models. Diffusion Models. Consistency Models.Xuechen Li @lxuechen
2K Followers 900 Following Building intelligence @xai. PhD @Stanford. Undergrad @UofT. Worked at @GoogleAI @MSFTResearch @Vectorinst. I go by Chen.Andrew Ng @AndrewYNg
1.0M Followers 912 Following Co-Founder of Coursera; Stanford CS adjunct faculty. Former head of Baidu AI Group/Google Brain. #ai #machinelearning, #deeplearning #MOOCsJohn Schulman @johnschulman2
39K Followers 609 Following Cofounder @openai, lead post-training for ChatGPT and the API. Interested in reinforcement learning, alignment, birds, jazz musicYi-01.AI @01AI_Yi
5K Followers 8 Following A global company building AI 2.0 platform and applicationsShiyi Cao @shiyi_c98
396 Followers 361 Following PhD student @UCBerkeley, MSc @ETH, B.S @sjtu1896, systems, ml, and hpcmartin_casado @martin_casado
50K Followers 2K Following GP @ a16z ... questionable heuristics in a grossly underdetermined worldYang You @YangYou1991
8K Followers 386 Following Presidential Young Professor at @NUSingapore. @Forbes 30 under 30. Ph.D. from @UCBerkeley. Founder, President and Chairman of @HPCAITech and Colossal-AI.Song Han @songhan_mit
6K Followers 144 Following Assoc. Prof. @MIT, Distinguished Scientist @NVIDIA, cofounder of DeePhi (now part of AMD) and OmniML (now part of NVIDIA). PhD @Stanford. Efficient AI computingKai-Fu Lee @kaifulee
1.5M Followers 658 Following #AI Expert, CEO of @01ai_yi and Chairman of 创新工场 @sinovationvc, former President of Google China, Author of AI 2041 and NYT Bestseller AI SuperpowersKaichun Mo @KaichunMo
3K Followers 879 Following Research Scientist at NVIDIA Seattle Robotics Lab; Previously CS Ph.D. from StanfordPhilipp Schmid @_philschmid
16K Followers 651 Following Tech Lead and LLMs at @huggingface 👨🏻💻 🤗 AWS ML Hero 🦸🏻 | Cloud & ML enthusiast | 📍Nuremberg | 🇩🇪 https://t.co/l1ppq3q3hkTianyi Zhang @Tianyi_Zh
1K Followers 613 Following iterating ... I used to train more language models but am working on agents nowEric Wallace @Eric_Wallace_
6K Followers 1K Following Researcher at OpenAI working to make language models more trustworthy, secure, and private.Yuandong Tian @tydsh
16K Followers 801 Following Research Scientist and Senior Manager in Meta AI (FAIR). AI-guided Optimization and Representation Learning. Novelist in spare time. PhD in @CMU_Robotics.Zhijian Liu @zhijianliu_
695 Followers 600 Following PhD Student at @MIT. Focusing on efficient algorithms and systems for deep learning.Guido Appenzeller @appenz
7K Followers 198 Following At a16z investing in AI & Infra. 2x founder & CEO. CTO at Intel & VMware. CPO at Yubico. Tweets are my own.Zhuang Liu @liuzhuang1234
3K Followers 931 Following Research Scientist @MetaAI (FAIR, at NYC). machine learning, computer vision, neural networks. PhD from @Berkeley_EECSAudrey Cheng @audreyccheng
500 Followers 126 Following CS PhD Student @ucbrise, undergrad @Princeton. Excited about transactions and databases in general!Yangqing Jia @jiayq
12K Followers 263 Following Founder @leptonai. @UCBerkeley alumni. ex @google & @facebook. ex vp @AlibabaGroup. Open source work on caffe, @pytorch, @tensorflow, & @onnxai.Yann Dubois @yanndubs
4K Followers 1K Following PhD student @stanfordAILab | Prev: AI resident @metaai, @vectorinst, @CambridgeMLGOpenAI @OpenAI
3.4M Followers 0 Following OpenAI’s mission is to ensure that artificial general intelligence benefits all of humanity. We’re hiring: https://t.co/dJGr6LgzPAAfter a grueling few days of having to click accept to view the chatbot leaderboard, we put it back on HF :p huggingface.co/spaces/lmsys/c…
What a week since we released Llama 3! I couldn’t be more proud of the response. 🏆 Llama 3 70B is now the highest ranking open model on @lmsysorg leaderboard. 📈 1.2M+ downloads. 🤗 600+ derivative models on @huggingface. I'm excited for much more to come.
LMsys added phi-3-128K into the arena. Got it in my comparisons. Excited to see where it’ll be placed
Such a good service - open leaderboards!
LMsys added phi-3-128K into the arena. Got it in my comparisons. Excited to see where it’ll be placed
Gemini 1.5 Pro has entered the (LMSys) Arena! Some highlights: -The only "mid" tier model at the highest level alongside "top" tier models from OpenAI and Anthropic ♊️ -The model excels at multimodal, and long context (not measured here) 🐍 -This model is also state-of-the-art…
More exciting news today -- Gemini 1.5 Pro result is out! Gemini 1.5 Pro API-0409-preview now achieves #2 on the leaderboard, surpassing #3 GPT4-0125-preview to almost top-1! Gemini shows even stronger performance on longer prompts, in which it ranks joint #1 with the latest…
Gemini 1.5 Pro is now ranked #2 on @lmsysorg chat arena (and #1 for long context). More work to do, but excited we put this model into the hands of developers. The era of truly multimodal models has arrived 🚀
@profjoeyg @lmsysorg Something like this. But 1. Unsquish elo number and [-] 2. Order top ones at top and elo towards right 3. Given gold silver and bronze for good meaure and participation trophy for smallest model that shows up here
@profjoeyg @lmsysorg I think maybe rotating the image by 90 degrees would make a drastic difference. The aligning towards the right instead of the left. You could also the not need to use icons. And you can color the groups based on clusters you see.
@profjoeyg @lmsysorg Plot above but with the labels for the elo not squished by the bar. And perhaps icons of the companies or teams along with the elo number since reading vertical text is hard.
Llama3-70B has settled at #5. With 405B still to come next... I remember when GPT-4 released in March 2023, it looked like it was nearly-impossible to get to the same performance. Since then, I've seen @Ahmad_Al_Dahle and the rest of the GenAI org in a chaotic rise to focus,…
Exciting update -- Llama-3 full result is out, now reaching top-5 on the Arena leaderboard🔥 We've got stable enough CIs with over 12K votes. No question now Llama-3 70B is the new king of open model. Its powerful 8B variant has also surpassed many larger-size models. What an…
Good to see that on @lmsysorg, our feb version of Reka Flash 21B ⚡️from @RekaAILabs, despite only being 21B dense parameters has competitive performance to other much larger models like Mixtral 8x22 and Mistral medium 🚀🔥 We'll have much better Flash and Core soon! 🦾
@lmsysorg A "hard" category in the leaderboard could be very good.😀
Such a welcome addition to the benchmarking landscape! - Creating benchmarks that correlate well with humans and separate top models has become increasingly hard as they’ve become increasingly capable. - And knowing it will be refreshed reduces the incentive for organizations…
Introducing Arena-Hard – a pipeline to build our next generation benchmarks with live Arena data. Highlights: - Significantly better separability than MT-bench (22.6% -> 87.4%) - Highest agreement to Chatbot Arena ranking (89.1%) - Fast & cheap to run ($25) - Frequent update…
New Benchmark by @lmsysorg! 🏆 Arena-Hard is a new benchmark to automatically evaluate LLMs on 500 real-world use cases. Arena-Hard matches 89% of human preferences from the LMSYS chatbot arena using LLM-as-a-Judge. 🤯 TL;DR: 🥇 Outperforms other benchmarks like MT-Bench and…
2 other models worth highlighting 😉 @RekaAILabs Flash 21B is very strong for its size! 💪
How good is @AIatMeta Llama 3 in real-world user scenarios?🤔 The early votes in @lmsysorg are in, and Llama-3 is the best open LLM, even outscoring @OpenAI GPT-4 (March) or @AnthropicAI Claude 3 Haiku! 👑 Llama 3 currently scores at 1199 in #7, only behind the latest @OpenAI…
@lmsysorg This is a cool initiative. How about you also introduce equitable evaluation and leaderboard customization since users of lmsys may have their own requirement too? An example implementation is here: arxiv.org/pdf/2106.05532…
Llama-3 is closing the gap with GPT-4, but multimodal models gotta catch up. Vision capabilities of open models like LlaVA are far, far behind GPT-4V. Video models are even worse. They hallucinate all the time and fail to give detailed descriptions of complex scenes and actions.…
Give it a try!
Congrats @GoogleDeepMind on shipping Gemini 1.5 Pro to public review! Upon capacity & latency testing, we have now brought Gemini 1.5 Pro up to the Arena🤖 Big improvement from Pro 1.0 to 1.5 across the board, and exceptionally strong long context understanding. Come test and…
I get to work with incredible people and I love it ❤️
Congrats @GoogleDeepMind on shipping Gemini 1.5 Pro to public review! Upon capacity & latency testing, we have now brought Gemini 1.5 Pro up to the Arena🤖 Big improvement from Pro 1.0 to 1.5 across the board, and exceptionally strong long context understanding. Come test and…