Jonas Mueller @jomulr

Co-Founder & Chief Scientist @CleanlabAI. Previously @awscloud, PhD @MIT_CSAIL

people.csail.mit.edu/jonasmueller

Joined April 2020

125 Tweets 101 Followers 6 Following 328 Likes

Curtis G. Northcutt @cgnorthcutt

6 days ago

Goodbye Hallucinations! Today, Cleanlab launches the Trustworthy Language Model (TLM 1.0), addressing the biggest problem in Generative AI: reliability. technologyreview.com/2024/04/25/109…

3 16 47 5K 12

Curtis G. Northcutt @cgnorthcutt

3 weeks ago

@CleanlabAI made the Forbes AI 50 list! It is an honor to be recognized alongside friends and inspirational companies like @OpenAI, @databricks, and @huggingface. We're just getting started... forbes.com/lists/ai50

0 3 18 349 2

Cleanlab @CleanlabAI

3 weeks ago

Open-Source AI aficionados: you've probably heard of the new Open-Source AI Cookbook from @huggingface At the top of this amazing resource, you'll now find a new notebook: Detecting Issues in a Text Dataset with Cleanlab 👇 huggingface.co/learn/cookbook… [...]

1 11 45 6K 30

Curtis G. Northcutt @cgnorthcutt

4 weeks ago

News! @CleanlabAI is ranked 18th globally among AI private companies. cbinsights.com/learn/ai-100-2…

4 7 20 2K 8

Jonas Mueller @jomulr

2 months ago

Launched today: Use Data-Centric AI to automatically catch erroneous values in any column of a structured/tabular dataset. No problem if your data is messy with missing values and heterogeneous {numeric, categorical, text, date, ...} fields.

Cleanlab @CleanlabAI

2 months ago

2 6 29 4K 14

0 0 3 105 0

Cleanlab @CleanlabAI

2 months ago

BIG news for open-source practitioners of Data-Centric AI: We just released major updates to cleanlab, the most popular software library for Data-Centric AI (with 8000 GitHub stars thanks to an amazing community) Check out the repo and read on ... github.com/cleanlab/clean…

2 14 56 6K 37

Jonas Mueller @jomulr

2 months ago

New major release of cleanlab open-source library is out! We aim to make Data-Centric AI more useful and accessible to all. Don’t do all your data checking manually – also use cleanlab's AI-automated algorithms to ensure you don’t miss any data problems github.com/cleanlab/clean…

Cleanlab @CleanlabAI

2 months ago

0 0 3 291 0

0 0 0 109 0

Jonas Mueller @jomulr

3 months ago

Every Instruction Tuning dataset inevitably has bad examples lurking within it that are harming your LLMs. Now you have a way to easily filter and correct them 👇 cleanlab.ai/blog/filter-ll… Code to reproduce results for the Dolly dataset is linked in our article.

0 0 0 44 0

Jonas Mueller @jomulr

3 months ago

Our Data-Centric AI system can catch the bad images in any dataset. See how it works in our new blogpost

Cleanlab @CleanlabAI

3 months ago

Our Data-Centric AI system can catch the bad images in any dataset. See how it works in our new blogpost

2 0 15 1K 2

0 0 1 74 0

Cleanlab @CleanlabAI

5 months ago

Everybody’s excited about @GoogleDeepMind's new LLM: Gemini #Gemini produces accurate outputs (that beat #GPT4) via an “uncertainty-routed chain-of-thought” algorithm. Cleanlab scientists previously invented a similar algorithm to boost the trustworthiness of outputs from any…

0 1 3 585 0

Cleanlab @CleanlabAI

5 months ago

To learn about practical advances in #DataCentricAI at #NeurIPS2023, check out our paper: arxiv.org/abs/2207.10062 This collaboration w/ @MLCommons, @GoogleAI, @AIatMeta, @kaggle + other institutions -- introduces a community benchmarking framework for data-centric AI innovation

0 2 6 877 0

Nick Erickson @innixma

5 months ago

AutoGluon 1.0 is live!! Shatters SOTA, wins 75% vs prior release, 63% win-rate vs best-in-hindsight combination of other methods. To our knowledge, this is the biggest leap forward in tabular ML in the past 4 years. See how we did it: github.com/autogluon/auto… #AutoML #AutoGluon

5 35 99 18K 17

Cleanlab @CleanlabAI

6 months ago

🤔 Did you know the famous ImageNet dataset has some very confusing classes in it? 💥 CHALLENGE: Go look at some examples and WITHOUT PEEKING AT LABELS see if you can determine which belongs to the “missile” vs the “projectile” class, or the “keyboard” vs “space bar” class.

1 1 2 626 0

Jonas Mueller @jomulr

6 months ago

Many #computervision folks work with image segmentation datasets, but never tried annotating the data. Labeling pixels right is hard! Most segmentation datasets are thus full of errors -- our new research paper shows how to automatically detect them. The code is open-source!

Cleanlab @CleanlabAI

6 months ago

2 1 12 1K 1

0 0 2 155 0

Matei Zaharia @matei_zaharia

6 months ago

We've launched some great industry-specific solution accelerators on Databricks Marketplace with @CleanlabAI, @Graphistry and @JohnSnowLabs. Check out these free resources with notebooks and sample data for healthcare, cybersecurity & communications. sprou.tt/18zyKvQGzyl

0 10 40 6K 6

Shawn Charles🎤🔥 @ShawnBasquiat

32K Followers 3K Following 🧑🏾‍💻Ex-FAANG Software Engineer 🥑Senior ML Developer Advocate @ Coming Soon 🏗️Building Tech Communities

Matheus @mathdesilva

126 Followers 357 Following

RobVaughanMS @RobVaughanMS

187 Followers 2K Following Private Wealth Advisor @MorganStanley For more information visit my website. NMLS: 1764845

Shubham Raj @Shubham16845113

1 Followers 104 Following

Wei Xin Chan @weixinnnnn

16 Followers 160 Following Research Fellow @ NTU. CS PhD. Working on mental health and batch effects in biological data.

Simone Lionetti @s_lionetti

11 Followers 81 Following Researcher at HSLU on ML and MedTech

TK @Eh_Tk

243 Followers 1K Following Helping founders navigate the path to Product Market Fit. Finding the Freedom to Flâneur.

LlamaLytics.ai @getllamalytics

5 Followers 96 Following Stop flying blind with your chatbot. Try LlamaLytics AI for free today

Basil @Basil_Trunov

185 Followers 1K Following VR/XR/UE5/ at @varjodotcom

Diogo Santos @diogosantosbr

402 Followers 2K Following Building IA-Based Products | Lead Data Scientist 📈

Yoooooog🔥 @iamjoelyan

194 Followers 4K Following Software Engineer

AIProductDB @AIProductDB

655 Followers 2K Following AI Product Database, a site dedicated to discovering and sharing the latest and greatest AI-powered products for every use case and industry.

Anirudh Dagar @gollum_here

451 Followers 1K Following ML | Open Source @aws @d2l_ai

Tamara Stanley @TStanley39457

0 Followers 51 Following

Pi @punifi86

42 Followers 321 Following

Joe Mayo @JoeMayo

14K Followers 6K Following Building @generellem - #AI, #opensource, and #startups. Writing my own content.

Gifty Ayoka @gifty_go

90 Followers 417 Following Speech and Language Therapist#disability advocate##books#caregiversempowerment 🇬🇭#inclusive education#assistivetechnology

Leire benito del vall.. @leirebeni98

0 Followers 90 Following

AI Deeply @AiDeeply

404 Followers 5K Following AI is reshaping the world. Who are the people and companies driving the change? Visit our website to search more than 5,000 profiles.

Fatma Tarlaci @coderphd

VP Engineering (AI/ML) @openteamsinc | PhD | Adj. Asst. Prof. @utcompsci | Former Scholar @OpenAI | CS @StanfordEng | ❤️ Dogs & Lifting Weights

F785 @F71550724

73 Followers 254 Following

Christoph Janz.. @chrija

42K Followers 20K Following SaaS VC @pointninecap. Seed investor @algolia, @contentful, @factorialhr, @incident_io, @loom, @nexhealthhq, @poolside, @typeform, @whereby, @zendesk

rosa.v @v_rosaviz

158 Followers 4K Following Partnerships @whatnot

Boxi Yu @BoshCavendish

96 Followers 660 Following PhD @ CUHK-Shenzhen, focusing on AI4SE, SE4AI, and building AGI Infra.

hanncx @hanncx

74 Followers 4K Following perpetual learning

Jeffrey Li @lijeffrey39

2K Followers 946 Following co-founder @paraformtalent | prev swe @cruise @carnegiemellon

Nick Erickson @innixma

779 Followers 74 Following Author & Lead Developer of @AutoGluon : https://t.co/foLauqqWWe Senior Scientist at AWS AI #autogluon #automl #opensource

Thomas Olenik @TomOlenik

4K Followers 5K Following Engineer, historian, inventor, political scientist, pellet grill master. On a quest for truth, justice, and preservation of freedom!

Aarav @Aaravmk2021

7 Followers 243 Following

SHAVIK @shavik_ai

172 Followers 593 Following Smarter Integrated Software

Aditya Parekh @AdityaParekh29

114 Followers 546 Following Early stage investments

Lauren Maffeo @LaurenMaffeo

2K Followers 2K Following #civictech Service Designer. Author of "Designing Data Governance from the Ground Up" @pragprog. Editorial Boards @magazine_cdo & @springernature. She/Her/Hi!

mmurph @mmurph

5K Followers 417 Following @MenloVentures invest in AI 1st Infra & SaaS @cartainc @benchling @harnessio @anthropicai @typefaceai @clarifai @cleanlabai Airbase, Envoy, Zylo, Vivun, Egnyte

Vikram Shukla @vshukla

278 Followers 2K Following Engineering Manager @linkedin - a @microsoft company. Manages awesome @trinodb team. Past: @oracle (stream processing, middleware, RDBMS), @netapp, @ibm

Suraj Rajwani e/acc @surajluke

2K Followers 5K Following General Partner at @DoubleRock, Jameson lover, Pilot, Skydiver. Investor in brilliant minds

Abhi Venigalla @abhi_venigalla

5K Followers 1K Following Researcher @Databricks. Former @MosaicML, @CerebrasSystems. Addicted to all things compute.

Adam @AbecidAdam

405 Followers 2K Following @berkeley_ai

Joseph Ravichandran @0xjprx

3K Followers 544 Following PhD Student studying Microarchitectural Security @MIT

Jeff Huber 🇺🇸 @jhuber

42K Followers 19K Following Founding CEO @GRAILbioㅣCo-founder @TriatomicCapㅣearlier: @Google Ads, Apps, Maps & ⟦x⟧ㅣf*ckcancerㅣPer aspera ad astra.

Mohamed Kari @m0k4r1

353 Followers 2K Following PhD Student in XR & CV @ Meta Reality Labs Research & University of Duisburg-Essen. Prev @ Apple, Porsche & ETH Zurich. 🇩🇪🇹🇳🇨🇭🇺🇸🏳️‍🌈

Diah Anggraeni Pitalo.. @dilovasket

117 Followers 295 Following

Pratham Savaliya @SavaliyaMbbs

194 Followers 925 Following Machine Learning Practitioner☘️!Community @hackthisfall

Gathnex @gathnexorg

43 Followers 350 Following 🤖 Exploring Generative AI & LLM. Join the Gathnex community for cutting-edge discussions and updates! 🌟 #AI #LLM #Gathnex

Abdullah Al Mamun @Abdulla61868968

98 Followers 732 Following I am a full-swing freelancer. It’s my profession and addiction, especially in digital marketing.

yaatehr @yaatehr

7 Followers 52 Following MIT 20’ 21’ and current MLE at Instagram with experience in Privacy Preserving ML, Ranking/Retrieval, and 🎷

MATTHEW TAKSA 🌉 @matthewtaksa

865 Followers 698 Following building something new in fintech | 2x founder, 1x acquired | ex @mastercard, @ucberkeley

Ashutosh Mehra @ashutoshmehra

1K Followers 5K Following Senior Principal Scientist at Adobe. Working on Acrobat AI Assistant, LLMs, and document ML.

abdul shaikh @abdulshaikh028

118 Followers 5K Following

Md. Rashed Miah @rashed_mahmud7

112 Followers 1K Following Artificial Intelligence Engineer 🧑‍💻 ML Research || DL || Computer Vision || NLP || Ex- Python Course Instructor || Frontend Developer || Problem Solving

Minh Nguyen @MinhNguyen89455

3 Followers 67 Following

Cleanlab @CleanlabAI

2K Followers 159 Following Data is the currency of AI. Cleanlab increases the value of your data, automatically. Come pioneer the future of Data-Centric AI: https://t.co/AXtvL5wegd

MIT Introduction to D.. @dcai_course

31 Followers 7 Following

Chris Mauck @cmauck10

151 Followers 542 Following Data Scientist @ Cleanlab, Car Enthusiast, and Food Connoisseur

Curtis G. Northcutt @cgnorthcutt

851 Followers 243 Following CEO & Co-Founder @CleanlabAI. PhD in ML @MIT. I am the @thephdrapper ~ Former @GoogleAI, @oculus, @amazon, @facebookai, @MSFTResearch

Anish Athalye @anishathalye

3K Followers 208 Following phd @mit_csail • cto @cleanlabai • research at https://t.co/MdknnUlVnY • blog at https://t.co/oGOMQxZogX • open-source at https://t.co/VawMWMIb6F

AutoGluon @autogluon

766 Followers 20 Following Fast and Accurate ML in 3 Lines of Code #AutoML #opensource

Anish Athalye @anishathalye

6 days ago

Using this new trustworthiness score to prioritize human review of LLM outputs can have large (up to 80%) cost savings over other popular methods like OpenAI’s log probs or asking the LLM to self-reflect: cleanlab.ai/blog/trustwort…

0 3 13 2K 6

Cris Dobbins @crisdobbins

6 days ago

Launch day @CleanlabAI! We are solving the biggest problem with productionizing GenAI: reliability/hallucinations. Check it out and give us some feedback! producthunt.com/posts/trustwor…

0 1 5 284 0

matt turk @TurkMatthew

6 days ago

Exciting day for our team at @CleanlabAI! Our Trustworthy Language Model (TLM) is now live (v1.0)! TLM helps solve the most significant problem with productionizing GenAI: reliability/hallucinations. With TLM, you can get more accurate outputs than GPT-4, along with…

Cleanlab @CleanlabAI

6 days ago

Announcing the Trustworthy Language Model, a solution to the biggest problems in productionizing GenAI: hallucinations and reliability. TLM provides a reliable trustworthiness score for every LLM output and can also produce more accurate outputs than GPT-4.

3 9 50 21K 40

0 0 3 120 0

Akshay 🚀 @akshay_pachaar

2 weeks ago

Here are all the relevant links: Research paper: arxiv.org/abs/2210.06812 Details blog post with code: docs.cleanlab.ai/stable/tutoria…

0 2 20 5K 11

Akshay 🚀 @akshay_pachaar

2 weeks ago

This Python library works like magic! ✨ With a single line of code & the two matrices you see in the image below, I can find: - a consensus label - quality score for individual & consensus labels - overall quality score for each annotator Introducing CrowdLab! 🚀 A weighted…

6 68 365 39K 277

Menlo Ventures @MenloVentures

3 weeks ago

@CleanlabAI provides no-code, automated data curation for LLMs and the modern AI stack.

0 0 5 85 0

Cleanlab @CleanlabAI

4 weeks ago

We are honored to be featured in @CBinsights 2024 list of the top 100 private AI companies in the world. Alongside @OpenAI, @AnthropicAI, @huggingface, @MistralAI, @databricks, and other friends.

1 0 9 623 4

CB Insights @CBinsights

4 weeks ago

Boom: Meet the 2024 AI 100 cbi.team/4aoH7Pu From new AI architectures to precision manufacturing, this year’s winners are tackling some of the hardest challenges across industries.

7 19 57 35K 29

Curtis G. Northcutt @cgnorthcutt

a month ago

Startup founders. The pandemic is over. GO MEET YOUR CUSTOMERS. Shake their hand. Hug them. Learn what's working and what's not. Free webinar TODAY with myself and BRG, an enterprise @CleanlabAI customer that 2x'd their customers using Cleanlab Studio. register.gotowebinar.com/register/27334…

0 1 16 599 2

Ruben Hassid @RubenHssd

2 months ago

Claude put together a prompt library. I've tested all of them. Here are 5 prompts you can try ↓ #1. Adaptive Editor

10 24 240 75K 599

Cleanlab @CleanlabAI

2 months ago

Bad data costs the U.S. $3 Trillion per year. Your company's structured data has errors due to data entry or measurement mistakes, sensor noise, pipeline bugs, etc. Announcing 📣 an AI solution to catch erroneous values in *any* tabular dataset: help.cleanlab.ai/tutorials/data…

2 6 29 4K 14

@levelsio @levelsio

2 months ago

Absolutely NOBODY is paying for cbt.chat, well actually one sign up Thousands of users that all got 10 messages free Either we can tweak this or we just invalidated the idea of an AI mental help life coach within a week which is fine too

@levelsio @levelsio

2 months ago

Slowly been sharing the cbt.chat chat bot username in its Telegram channel to soft launch and test it Payment integration from Telegram to Stripe works 700 people have now used it I will add @SimpleAnalytic server side event tracking soon to log activity better

19 1 72 349K 60

198 10 494 462K 180

Curtis G. Northcutt @cgnorthcutt

2 months ago

A common trend I've seen in AI startups: 1. startup makes a useful open-source project and raises Seed 2. invests in open-source, primarily user growth/ not new features 3. based on growth, raises a Series A 4. kills open source to focus 100% on SaaS sales to raise Series B

1 4 11 841 3

Cleanlab @CleanlabAI

2 months ago

Flawed data produces flawed AI, and real-world datasets have many flaws. With one line of code, you can run cleanlab on any dataset to automatically catch these flaws, and thus improve almost any ML model fit to this data. Don't just explore/check data manually, use automation!

1 0 2 239 1

Cleanlab @CleanlabAI

2 months ago

Today’s v2.6.0 release includes new capabilities like Data Valuation (via Data Shapley), detection of Underperforming Data Slices/Groups, + more. Our blogpost outlines the new cleanlab techniques to systematically increase the value of your existing data: cleanlab.ai/blog/cleanlab-…