Matei Zaharia @matei_zaharia
CTO at @Databricks and CS prof at @UCBerkeley. Working on data+AI, including @ApacheSpark, @DeltaLakeOSS, @MLflow, https://t.co/94gROE5Xa0. https://t.co/nmRYAKG0LZ people.eecs.berkeley.edu/~matei/ Berkeley, CA Joined October 2010-
Tweets2K
-
Followers39K
-
Following1K
-
Likes11K
When you invent a category, you expect to be the leader. We're really pleased to see that Forrester recognizes our commitment to excellence as THE leader in their new #ForresterWave for Data Lakehouses. sprou.tt/1eh2T9qU2jh
Super cool that this state-of-the-art medical NLP result was achieved using @lateinteraction's #DSPy. Beginning of the age of auto-optimized compound AI systems? IReRa is another example of an auto-optimized SOTA pipeline (arxiv.org/abs/2401.12178).
Super cool that this state-of-the-art medical NLP result was achieved using @lateinteraction's #DSPy. Beginning of the age of auto-optimized compound AI systems? IReRa is another example of an auto-optimized SOTA pipeline (arxiv.org/abs/2401.12178).
🔥Congratulations to @ugustintoma et al (@BoWang87 Lab) for winning the 1st place in *every* sub-task of the 2024 MEDIQA Clinical NLP competitions. They build & optimize extremely high-quality LM programs in DSPy, outperforming the next best participating system by up to 18 pt!
🔥Congratulations to @ugustintoma et al (@BoWang87 Lab) for winning the 1st place in *every* sub-task of the 2024 MEDIQA Clinical NLP competitions. They build & optimize extremely high-quality LM programs in DSPy, outperforming the next best participating system by up to 18 pt! https://t.co/TD1q9Qatn8
🆕💡🎧 DBRX Unpacked - featuring @hagay_lupesko @databricks: 🔥 Open LLM bridging quality & cost for AI apps ⚡️ Mixture-of-experts architecture for efficiency ⚙️ Optimized for enterprise needs: code gen, long context thedataexchange.media/dbrx-a-leap-fo…
Weekly LLM drops are cool, but real solutions are systems. Check out @matei_zaharia workshop on it!
Weekly LLM drops are cool, but real solutions are systems. Check out @matei_zaharia workshop on it!
Databricks Asset Bundles (DABs) is now GA. It lets you deploy data and AI projects as config-based packages and integrates great with Git, CI/CD and IDEs -- greatly streamlines data and AI deployment using modern software engineering practices. sprou.tt/1ubhaEia3Eu
Come see @infwinston @lmsysorg @charlespacker @MemGPT @abhi_venigalla talk about LLMs at this month's Berkeley LLM meetup -- hosted @databricks SF along with @andykonwinski! lu.ma/berkeleyllm
I don’t think everyone has comprehended the massive disruption and distortion that is going to happen in the Gen AI market due to Llama3. Moats will be destroyed and investments will go to zero. Just like everything in Gen AI, this will all happen fast.
Another major data provider on Databricks Marketplace — welcome @ICE_Markets!
Another major data provider on Databricks Marketplace — welcome @ICE_Markets!
In our upcoming webinar, join @rxin and Narinder Singh of @salesforce to learn how an AI-optimized data warehouse delivers better performance, governance, and user experience 👇 dbricks.co/49LTHqx
Check out the details of how @allen_ai OLMo 1.7-7B trained on @databricks jumped in accuracy since the last release - spoiler alert 🚨 data matters
Check out the details of how @allen_ai OLMo 1.7-7B trained on @databricks jumped in accuracy since the last release - spoiler alert 🚨 data matters
Super cool work, so excited that we got to support it!
My GTC talk on LLM inference performance is available on demand now. Watch it to learn about important factors affect LLM inference performance as well as how to think about LLM inference services.
My GTC talk on LLM inference performance is available on demand now. Watch it to learn about important factors affect LLM inference performance as well as how to think about LLM inference services.
Join @ankit_math and @dennylee as they dive into how DBRX was built! Learn about the open, high-quality #LLM at Open Source Summit North America in Seattle! The session will be on Thursday, April 18th, 11am-11:40am PST. See you there! 👋 sched.co/1bdIO #ossummit
#BigQuery via #BigLake now offers native support for #DeltaLake! This integration will make it easier for you to get insights from your data and make data-driven decisions. Learn more ➡️ cloud.google.com/blog/products/… #opensource #oss #linuxfoundation
That was a short run for the role of “prompt engineer” lol
Andy Pavlo (@andy_pav.. @andy_pavlo
29K Followers 205 Following Associate Prof. of Databases @CarnegieMellon. Co-Founder @OtterTuneAIDatabricks @databricks
70K Followers 1K Following Databricks is the data and AI company, helping data teams solve the world’s toughest problems.Erik Bernhardsson @bernhardsson
38K Followers 3K Following Building @modal_labs when I'm not posting bangers about data and software. Previously built the music rec sys at Spotify and ran the eng team at Better.Shreya Shankar @sh_reya
39K Followers 589 Following I study ML & AI engineers and try to make their lives a little better. PhD-ing in databases & HCI @Berkeley_EECS @UCBEPIC and MLOps-ing around town. She/they.Delta Lake @DeltaLakeOSS
8K Followers 66 Following Delta Lake is an open-source storage framework that enables building a Lakehouse architecture for Spark, Flink, Trino, Hive, Scala, Java, Rust, Python, & more!Rock the JVM @rockthejvm
8K Followers 213 Following Teaching #Scala, #Kotlin, #Spark, #Flink and tech on the JVM. 📹 Videos at https://t.co/1ODhzZCpb9 🔖 Articles at https://t.co/vwfzUXMF4CJoe Hellerstein @joe_hellerstein
15K Followers 894 Following Berkeley CS Prof, focused on data and computation.Percy Liang @percyliang
49K Followers 408 Following Associate Professor in computer science @Stanford @StanfordHAI @StanfordCRFM @StanfordAILab @stanfordnlp | cofounder @togethercompute | PianistMim @mim_djo
9K Followers 3K Following #Fabric Enthusiast, Small Data And self service, #Microsoftemployee since Nov 2023 , but my tweets are my ownAdi Polak @AdiPolak
14K Followers 803 Following DevX @ Confluent • Cloud • ML/AI & Data Platforms • Ex Microsoft, Akamai • Keynote Speaker • Author of Scaling ML Systems(O'Reilly) • Opinions are mineJacek Laskowski @jace.. @jaceklaskowski
7K Followers 874 Following Freelance Data Engineer | #ApacheSpark #DeltaLake #Databricks #ApacheKafka #KafkaStreams | Java Champion | @theASF | #DatabricksBeaconsAlex P @ifesdjeen
12K Followers 1K Following Distributed and Storage Systems. Apache Cassandra Committer and PMC member. Author of Database Internals @therealdatabass. Discord: https://t.co/8LwhZom9eQJo Kristian Bergum @jobergum
9K Followers 814 Following Distinguished Engineer @vespaengine. Tweets about Vespa, search, recommendation, ranking, and IR. CET. #StandWithUkraine 💙💛Gwen (Chen) Shapira @gwenshap
26K Followers 9K Following Co-founder of @niledatabase. Making SaaS global, elastic and chill. Find me at: https://t.co/uyuHg400cpArun Kumar @TweetAtAKK
5K Followers 256 Following Assoc Prof at UC San Diego CSE & HDSI. HDSI Faculty Fellow. Research on data management & ML systems. Wisconsin PhD. Freethinker. Poet. Memester. Gay. He/him.Pramod Bhatotia @pramod_bhatotia
3K Followers 595 Following Professor, Systems Research @TU_Muenchen and @EdinburghUniAlex Ratner @ajratner
5K Followers 551 Following @SnorkelAI @uwcse / prev @StanfordAILab – Interested in data management systems for machine learning, weak supervision, and impactful applications.billions of packets @justinesherry
10K Followers 2K Following Computer person. I like middleboxes, systems, and Internets. Assistant Prof @ AS9. she/her, [email protected], @[email protected], 🇺🇲❤️🇵🇹Ergest Xheblati @ergestx
8K Followers 990 Following Data + Business | Author: Minimum Viable SQL Patterns | Newsletter: Data Patterns (see links below)Amr Awadallah 🤖 @awadallah
36K Followers 14K Following Founder/CEO of @Vectara (Trusted GenAI for the Enterprise). Founder/ex-CTO @Cloudera, ex-VP at Yahoo & Google. PhD EE Stanford. IG, FB, LI: @awadallah.Kelly Estrada @kelly__estrada
26 Followers 92 Following Storyteller. Dreamer. Creator. Adventure Seeker.⚡️ Talking all things #employerbranding #employeestorytelling & #cultureAleksey Karasavov @threadend
5 Followers 68 FollowingRonald Simons @RonaldSimons
87 Followers 736 Following CEO building investor relationships at Treenia, an Early-Stage Data Point Domain Name Registrar Startup | Reader | Chess FIDE Legend | Views are my ownAnabell sussan @OketeKings38771
1 Followers 17 FollowingRaju @IitgRranjan
13 Followers 173 Followingk.b 👨💻 @jordandesi
54 Followers 265 Following Senior Software Engineer (Backend and Data Engineering). I like to share knowledge snippets on tech and quote lyrics.karthik @kmeesal
102 Followers 808 FollowingStefan Juang @StefanJuang
147 Followers 1K Following The final goal of AI is not just to create intelligent machines, but to understand intelligence itself.Acarin Inc @Acarin
39 Followers 112 Following Digital Strategy and Service firm specialized in Customer Engagement, Workplace Digitalization, IT Modernization, and Omni Channel Enablement/IntegrationVijay Kethana @v_kethana
112 Followers 217 Following Dead languages, spoken languages, computer languages | CS at UC Berkeleytruthseeker @truthseeker281
214 Followers 5K Following My whole life is thunder. Not a financial Advisor, Please do your own Damn research. "DD"Zhaoyang Chu @zhaoyang_c68411
9 Followers 365 Following CS Master@HUST. Interested in SE+ML, specifically focusing on building trustworthy and reliable AI-based software systems. Seeking PhD starting in 2025 Fall.h4ckt0p3 @h4ckt0p3
6 Followers 318 Following Tech-savvy software engineer and data scientist with a passion for undercover white-hat hacking. Unveiling secrets and debunking myths #DataScience #hackingChris de CostaVerde @ChrisDeCstVerde
4 Followers 96 FollowingAnikait Singh @Anikait_Singh_
126 Followers 264 Following PhD Student @StanfordAILab, Previously Student Researcher @GoogleDeepMind, Undergraduate @Berkeley_AI Deep Learning, Reinforcement Learning, Robotics.ctcsystems @ctcsystems
14 Followers 482 FollowingAMAN J◎HAR @wanderingscapes
2K Followers 2K Following web3 ecosystem builder @thenftbrewery | Creating new revenue streams for businesses as they cater to a digital native generation #BitcoinRock amanjyot.ethdksksiaia @dksksiaia50881
0 Followers 1 FollowingTate Estes @TateEstes1
42 Followers 838 Followingkcaverly @kcaverly_dev
142 Followers 969 Following Working on LLM Systems and DSPy. Pushing everyday to get better.Yet Another Financial.. @channel_yet
68 Followers 627 Following Maven for sharing financial information to reflect. Not investment advice, please do your own due diligenceryang1122 @ryang112211
0 Followers 54 FollowingAakar Sharma @Aakar__Sharma
1 Followers 56 Following ML Engineer at @healthifyme, Gold Medalist @iitjammuDanil Zvyagintsev @danzvyagintsev
155 Followers 2K Following 💻 Top Rated Power BI Developer @Upwork | I write about Data, Design and Analytics | 19K+ on LinkedIn, follow me (link in bio)Electronicsseeker @libertarian108
9 Followers 2K FollowingEdward @_edwardpraveen
206 Followers 711 Following Data Engineer & Scientist | Innovating Data Solutions for Real-World Challenges | DatapreneurDeepraj Bhosale @deepraj_bh43216
0 Followers 7 FollowingYijie Chen @YijieCh_benzo
9 Followers 101 Following Master student at BJTU. Focused on machine translation and code generation.Mindaugas Galvosas, M.. @MGalvosas
4K Followers 3K Following Cough AI and Digital Health solutions @hyfeapp @CoughPro | prev. founded @aichom_hello | interested in Healthtech, DTx | my opinionsDavid Chu @davidchuyaya
62 Followers 59 Following PhD student in distributed systems @UCBerkeley, advised by @joe_hellerstein and @siobhcrooSyed Tariq Ali @alishaw99
56 Followers 921 Following #Monitoring, Evaluation and Research Professional, #python, #pandas,#DataAnalyst,#Monitoring & Evaluation, #NLP, # Data Modeling, #Data EngineeringVasi Butnaru @vasile_butnaru_
40 Followers 310 Following Driven by a deep love for tech, I lead top-notch teams with joy and optimism. Eager to explore ML, I'm grateful for the chance to learn and innovate.Ingimar Tomasson @ingitom99
46 Followers 133 FollowingJen Wilkins @jen_wilkin69850
0 Followers 10 FollowingAndy Pavlo (@andy_pav.. @andy_pavlo
29K Followers 205 Following Associate Prof. of Databases @CarnegieMellon. Co-Founder @OtterTuneAIDatabricks @databricks
70K Followers 1K Following Databricks is the data and AI company, helping data teams solve the world’s toughest problems.Erik Bernhardsson @bernhardsson
38K Followers 3K Following Building @modal_labs when I'm not posting bangers about data and software. Previously built the music rec sys at Spotify and ran the eng team at Better.Shreya Shankar @sh_reya
39K Followers 589 Following I study ML & AI engineers and try to make their lives a little better. PhD-ing in databases & HCI @Berkeley_EECS @UCBEPIC and MLOps-ing around town. She/they.Delta Lake @DeltaLakeOSS
8K Followers 66 Following Delta Lake is an open-source storage framework that enables building a Lakehouse architecture for Spark, Flink, Trino, Hive, Scala, Java, Rust, Python, & more!Rock the JVM @rockthejvm
8K Followers 213 Following Teaching #Scala, #Kotlin, #Spark, #Flink and tech on the JVM. 📹 Videos at https://t.co/1ODhzZCpb9 🔖 Articles at https://t.co/vwfzUXMF4CJoe Hellerstein @joe_hellerstein
15K Followers 894 Following Berkeley CS Prof, focused on data and computation.Percy Liang @percyliang
49K Followers 408 Following Associate Professor in computer science @Stanford @StanfordHAI @StanfordCRFM @StanfordAILab @stanfordnlp | cofounder @togethercompute | PianistAdi Polak @AdiPolak
14K Followers 803 Following DevX @ Confluent • Cloud • ML/AI & Data Platforms • Ex Microsoft, Akamai • Keynote Speaker • Author of Scaling ML Systems(O'Reilly) • Opinions are mineJacek Laskowski @jace.. @jaceklaskowski
7K Followers 874 Following Freelance Data Engineer | #ApacheSpark #DeltaLake #Databricks #ApacheKafka #KafkaStreams | Java Champion | @theASF | #DatabricksBeaconsJo Kristian Bergum @jobergum
9K Followers 814 Following Distinguished Engineer @vespaengine. Tweets about Vespa, search, recommendation, ranking, and IR. CET. #StandWithUkraine 💙💛Gwen (Chen) Shapira @gwenshap
26K Followers 9K Following Co-founder of @niledatabase. Making SaaS global, elastic and chill. Find me at: https://t.co/uyuHg400cpArun Kumar @TweetAtAKK
5K Followers 256 Following Assoc Prof at UC San Diego CSE & HDSI. HDSI Faculty Fellow. Research on data management & ML systems. Wisconsin PhD. Freethinker. Poet. Memester. Gay. He/him.Pramod Bhatotia @pramod_bhatotia
3K Followers 595 Following Professor, Systems Research @TU_Muenchen and @EdinburghUniAlex Ratner @ajratner
5K Followers 551 Following @SnorkelAI @uwcse / prev @StanfordAILab – Interested in data management systems for machine learning, weak supervision, and impactful applications.billions of packets @justinesherry
10K Followers 2K Following Computer person. I like middleboxes, systems, and Internets. Assistant Prof @ AS9. she/her, [email protected], @[email protected], 🇺🇲❤️🇵🇹Amr Awadallah 🤖 @awadallah
36K Followers 14K Following Founder/CEO of @Vectara (Trusted GenAI for the Enterprise). Founder/ex-CTO @Cloudera, ex-VP at Yahoo & Google. PhD EE Stanford. IG, FB, LI: @awadallah.Simon Whiteley @MrSiWhiteley
3K Followers 589 Following Director of Engineering / Owner of @AdvAnalyticsUK, Speaker & Consultant. Spark Nerd. Londoner, foodie & gamer! Microsoft MVP. Databricks Beacon. He/Him.Eric Sammer @esammer
13K Followers 716 Following ceo at @decodableco! prev: @splunk, @rocanainc (acq'd), @cloudera. open source / dist systems / data. o'reilly author. [email protected]Modern Data Stack @moderndatastack
6K Followers 459 Following Everything that you need to know about building and operating a Modern Data Stack. Operated by team at @quantive_incUnsecured CCTV Camera.. @Unsecured_CCTV
88K Followers 0 Following Posting screenshots of unsecured CCTV/IP cameras once per hour using a bot. ** Locations are IP-based and may be inaccurate. **trevordarrell @trevordarrell
2K Followers 127 Following EECS, BAIR, UC Berkeley. Director, BAIR Commons Program.Bo Wang @BoWang87
8K Followers 2K Following Assistant Prof. CS,LMP @UofT; CIFAR AI Chair @VectorInst; Chief AI Scientist, @UHN; former PHD, CS @Stanford; opinions my own. #AI #healthcare #combioBill Chambers @bllchmbrs
1K Followers 806 Following 👷 https://t.co/ODHNO6YBx7 ✍️ https://t.co/cX04twkyJ5 1x indie exit. 1x O'Reilly author. 🦄 🚀 - Anyscale, Databricks, $PCOR Talks about Startups, Data, AIRich Lyons @richlyons
11K Followers 612 Following Assoc. Vice Chancellor for Innovation & Entrepreneurship, UC Berkeley, and former Dean, Berkeley’s Haas School.Rishi Yadav @rishiyadav
763 Followers 758 Following Founder & CEO @Roost || #ChatGPT || #LLMs || #AI || Vipassana || Founder @InfoObjects || Published Author : 2 books on Apache Spark @packtpub || #IITDelhiRichard Socher @RichardSocher
101K Followers 971 Following CEO @youSearchEngine Investing at @aixventuresHQ Before: Stanford Adj Prof in AI/NLP, Chief Scientist at Salesforce, MetaMindSijun Tan @sijun_tan
95 Followers 239 Following CS PhD student @UCBerkeley @BerkeleySky | Working on secure AI and applied crypto | Prev: @AIatMeta @AntGroup @uva | https://t.co/PUN3YitVsZHassan Hayat 🔥 @TheSeaMouse
5K Followers 4K Following Building the AI assistant for all @ https://t.co/D4gDyw97guAnna Tong @annatonger
4K Followers 943 Following @Reuters tech correspondent covering artificial intelligence and other things. Not leaving SF for NYC, LA or Miami! Signal: 6504683913 / [email protected]Yang Zhang @yaaang
3K Followers 432 Following Building @plasmicapp. Previously @MIT_CSAIL, @Google, @MSFTResearch, @InferInc.Artem Vysotsky @avysotsky
559 Followers 338 Following Founder of ChatLabs and @writingmateai Access to 20+ premium LLMs in one place https://t.co/8McOSyBzuw Ex Eng @Meta, Ex Dir Eng @ppl_aiMichele Catasta @pirroh
5K Followers 224 Following VP of AI @Replit | Head of Applied Research @Google | Research Scientist + Instructor @StanfordAILabTaka YAYOI / 弥生 �.. @taka_aki
638 Followers 287 Following 4/12にApache Spark徹底入門を出版しました! https://t.co/ZOs9PL7Lr6 Databricksで働いてます。料理、酒、本、音楽、ジョギング、モノづくりをこよなく愛する人間のつもり。Qiitaで記事書いてます。Eitan Turok @EitanTurok
170 Followers 892 Following AI Researcher @DbrxMosaicAI. Sorting in exponential time, training on the test set, and praying for geometric revelations.Bill Yuchen Lin 🤖 @billyuchenlin
6K Followers 2K Following Research @allen_ai. I evaluate (multi-modal) LLMs, build agents, and study the science of LLMs. Previously: @GoogleAI & @MetaAI FAIR @nlp_uscCameron R. Wolfe, Ph... @cwolferesearch
21K Followers 625 Following Director of AI @RebuyEngine • Writer @ Deep (Learning) Focus • PhD @optimalab1 • I make AI understandableNathan Lambert @natolambert
25K Followers 690 Following Figuring out AI @allen_ai, "rl boi" DM me papers. Writes @interconnectsai, talks @retortai Has phd and some credentialsInterconnects @interconnectsai
2K Followers 1 Following What you need to know about AI research trends, from @natolambert Wednesday mornings weekly, sometimes extra posts.Brett Larsen @_BrettLarsen
419 Followers 332 Following Sr. Research Scientist @DbrxMosaicAI | Guest Researcher @FlatironInst @NYU_CNS | Efficient deep learning + better algorithms for data sciencebilal2vec @bilaltwovec
2K Followers 781 Following ✨ research engineer • prev @googlebrain @cohere @dbrxmosaicai • se @uwaterlooUC Berkeley CTO @ucberkeley_cto
820 Followers 898 Following Bill Allison -- Official Twitter account of UC Berkeley's Chief Technology OfficerSophie @lebrechts
901 Followers 839 Following COO @allen_ai formerly AI/ML @Apple, SVP Strategy & Ops https://t.co/4Z5RuqSEkZ, PhD from @BrownUniversity, post-doc @CarnegieMellonDaniel Liden @danjliden
179 Followers 631 Following Developer Advocate @Databricks | Former @bitdotioinc @GuinnCenterSarah Wang @sarahdingwang
9K Followers 904 Following General Partner @a16z growth fund. “Excellence is the capacity to take pain”davidveuve @davidveuve
774 Followers 64 Following Head of Security Field Engineering for Databricks focused on Security Analytics. SF Bay Area Native, preferred NYC. Traveler, Tech Guy. Generally decent?Jared Quincy Davis @jaredq_
646 Followers 308 Following Founder and CEO, Foundry. @mlfoundry Orchestrating Compute. Fmr Research Scientist @DeepMind, Deep Learning Team. CS PhD @Stanford. ML, Distributed SystemsDaniel Smilkov @dsmilkov
7K Followers 993 Following Building @lilac_ai with @nsthorat. Past: Co-created Know Your Data & TensorFlow.js. Ex-PAIR/Google Brain. 🇲🇰🇺🇸 . ML & VisualizationNikhil Thorat @nsthorat
10K Followers 2K Following Co-founder of Lilac AI (@lilac_ai), now joining @databricks. Past: Co-created TensorFlow.js and Know Your Data. Google Brain // PAIR // Responsible AILilac, joining Databr.. @lilac_ai
2K Followers 3 Following Curate better data for LLMs. We are now joining @databricks. Github: https://t.co/DHtc0lOTiiFrancisco Romero @farllamas
101 Followers 279 Following @Stanford EE PhD (systems for ML and video analytics) @USC EE Alumni ✌️DBOS, Inc @DBOS_Inc
219 Followers 1 Following Born from research at MIT and Stanford, DBOS is revolutionizing the way people build cloud-native TypeScript applications - transactional serverless computing.Andy Palmer @andyhpalmer
5K Followers 3K Following serial entrepreneur who specializes in accelerating the growth of early-stage, mission-driven startups.Liana @lianapatel_
17 Followers 134 FollowingAditi Partap @AditiPartap97
251 Followers 341 Following CS Ph.D. Student @Stanford Applied Cryptography Group | M.S. from @Illinois_Alma | Undergrad from @iitdelhiStella Biderman @BlancheMinerva
15K Followers 748 Following Open source LLMs and interpretability research at @BoozAllen and @AiEleuther. My employers disown my tweets. She/herMax Marion @maxdoesresearch
323 Followers 98 Following my machine learning research account where i tell you abt all my sick experiments | pfp: me w/ https://t.co/XWwMkEg1a1 | personal account: @maxisawesome538Michael Ryan @michaelryan207
568 Followers 403 Following NLP Masters Student @stanfordnlp. || Working on DSPy 🧩 || Prev @GeorgiaTech @MicrosoftKrista Opsahl-Ong @kristahopsalong
655 Followers 352 Following CS PhD candidate @Stanford @StanfordAILab || Prev @Google @MicrosoftSuma Bailis @BailisSuma
19 Followers 122 FollowingParth Asawa @pgasawa
150 Followers 96 Following EECS + Business @ Berkeley | research @ucbepic | prev @databricks, @robusthq, @amazonErika Cardenas @ecardenas300
4K Followers 815 Following @weaviate_io | Interested in vector databases, LLM frameworks, and information retrievalNew paper from our team, led by @pat_verga Are you: * Doing evaluation with LLMs? * Using a huge model? * Worried about self-recognition? Try an ensemble of smaller LLMs. Use a PoLL: less biased, faster, 7x cheaper. Works great on QA & Arena-hard evals arxiv.org/abs/2404.18796
TIL of the bad.horse traceroute
I'm seeing really powerful iterative pairwise training approaches for LLMs. Ideas like that are timeless! Reminds me of ColBERT-QA (2020). Proposed an iterative pairwise supervision approach for retrievers. It also trained for 3 iterations on bootstrapped positives & negatives!
This guy discovered that you can use Midjourney’s “describe” feature to upload a photo of yourself and hear how the model sees you
Columbia University Gives Students Option To Finish Classes From Prison bit.ly/3xY3X1R
When you invent a category, you expect to be the leader. We're really pleased to see that Forrester recognizes our commitment to excellence as THE leader in their new #ForresterWave for Data Lakehouses. sprou.tt/1eh2T9qU2jh
Burying the lead... I think these are the first Llama3-based retrieval results? Llama3-8B looks way better than Mistral-7B.
Another banger from @cHHillee thonking.ai/p/strangely-ma…
👥 Be a part of our community-driven development at #MLflow. Share your thoughts and help us focus on the features that matter most to you. Take the MLflow feedback survey now: surveys.training.databricks.com/jfe/form/SV_3j… #opensource #oss #linuxfoundation #mlops
A wealth of results that many people will find valuable. GPT-3.5 vs GPT-4 with & without DSPy optimizers. Optimized GPT-3.5 calls are consistently more accurate here than uncompiled GPT-4 calls.
Authors @AugustinToma @RonaldXie1 @SPalayew @BoWang87 won all three sub-tasks of MEDIQA-CORR 2024 with their programs. Check out their thread (above) and paper (below). The image shows the added gains by optimizing the language programs in DSPy. arxiv.org/pdf/2404.14544
Just wrapped up an awesome convo with with @NaveenGRao, VP of GenAI at @databricks and former CEO at MosaicML. Not only is Naveen an elite operator, but he has a great nack for explaining complex topics in simple terms. We discussed: - Naveen's AI journey and background - Why…
Databricks named a Leader in the 2024 #ForresterWave for Data Lakehouses. Recognized for our strengths in several areas, including data storage and formats, security and governance, GenAI/LLM, scale, end-to-end integration, and more. Get the full report: dbricks.co/3wfpLp8
Databricks DBRX がAmazon SageMaker JumpStart で使えるようになったaws.amazon.com/blogs/machine-… via @awscloud
Sort of like this, but the gap is twice as big.
In our upcoming webinar, see how an AI-optimized data warehouse delivers better performance, governance and user experience. You'll learn to use #AI to build data pipelines faster, scale governance, and use natural language to democratize data and AI.
Llama 3 degrades more than Llama 2 when quantized. Probably because Llama 3, trained on a record 15T tokens, captures extremely nuanced data relationships, utilizing even the minutest decimals in BF16 precision fully. Making it more sensitive to quantization degradation.…
Talk: "OLMo: Findings of Training an Open LM" from Hanna Hajirshizi at AI2 from OSGAI. Extremely interesting overview of the 4 parts (Data, Training, Adaptation, Eval) of the OLMo open LLM project. Rare insight into how these processes work at scale. youtube.com/watch?v=qFZbu2…
@TimSweeneyEpic 100%! procedural engines like @sidefx Houdini walked so compound systems like @midjourney @ChatGPTapp could run. Pipelines are the magic, as @matei_zaharia shows here: x.com/matei_zaharia/…
Interesting trend in AI: the best results are increasingly obtained by compound systems, not monolithic models. AlphaCode, ChatGPT+, Gemini are examples. In this post, we discuss why this is and emerging research on designing & optimizing such systems. bair.berkeley.edu/blog/2024/02/1…