Hugo Larochelle @hugo_larochelle
Google DeepMind researcher, machine learning professor, ex-Twitter Cortex, father of 4, wine/music/comedy enthusiast mila.quebec/en/person/hugo… Joined June 2015-
Tweets3K
-
Followers113K
-
Following625
-
Likes22K
DeepMind researchers discover impressive learning capabilities in long-context LLMs venturebeat.com/ai/deepmind-re…
We went through the peer review process at @TmlrOrg and it was quite helpful to improve the paper (better positioning wrt prior work, fixed a bug in an equation, comparisons to ReST in terms of transfer performance). See the accepted version here: arxiv.org/abs/2312.06585…
We went through the peer review process at @TmlrOrg and it was quite helpful to improve the paper (better positioning wrt prior work, fixed a bug in an equation, comparisons to ReST in terms of transfer performance). See the accepted version here: arxiv.org/abs/2312.06585…
Excited to share our work at @GoogleDeepMind! We propose Naturalized Execution Tuning (NExT), a self-training method that drastically improves the LLM's ability to reason about code execution, by learning to inspect execution traces and generate chain-of-thought rationales 🧵👇
Another TMLR Journal-to-Conference track partnership, this time with @RL_Conference! Submit your TMLR papers on reinforcement learning to be presented at the new Reinforcement Learning Conference.
Another TMLR Journal-to-Conference track partnership, this time with @RL_Conference! Submit your TMLR papers on reinforcement learning to be presented at the new Reinforcement Learning Conference.
While we can't drop a whole surprise album like @taylorswift13, we can at least drop a surprise track! If you've been itching to present a previously published journal paper, our new presentation track at #RLC2024 might be just for you: docs.google.com/forms/d/e/1FAI….
Excited to share a new blog on ML-based repair for build errors at Google! We found that automatically repairing build errors in the IDE increases productivity as measured by overall task completion with no detectable negative impact on code safety!
A new position paper @TmlrOrg on applications of continual learning, by 20 participants to our @dagstuhl seminar in March 2023. It wasn’t easy to agree on *how* we should be doing continual learning, but we did find common ground on *why* it’s important! openreview.net/forum?id=axBIM…
This #EarthDay, discover the AMI project that puts #AI at the service of biodiversity protection 🌱🌎 📽️ youtu.be/VXJ40hAmTZY @david_rolnick @UK_CEH @AarhusUni @EspacePourLaVie @eButterfly_org @VTEcostudies
Penzai is one of the coolest ML libraries out there. Not only can you inspect every weight matrix and attention head in a Colab, you can trivially knock out heads, skip or repeat layers, or extract intermediates with a one line change. A beautiful tool for interpretability.
Penzai is one of the coolest ML libraries out there. Not only can you inspect every weight matrix and attention head in a Colab, you can trivially knock out heads, skip or repeat layers, or extract intermediates with a one line change. A beautiful tool for interpretability.
when i said i'm humbled by my colleagues, @_ddjohnson is one of the people i was thinking of. he single-handedly built this amazing JAX toolkit that you should definitely check out. i'm already using it on one of my active research projects, and it was very easy to integrate!
when i said i'm humbled by my colleagues, @_ddjohnson is one of the people i was thinking of. he single-handedly built this amazing JAX toolkit that you should definitely check out. i'm already using it on one of my active research projects, and it was very easy to integrate!
Excited to share Penzai, a JAX research toolkit from @GoogleDeepMind for building, editing, and visualizing neural networks! Penzai makes it easy to see model internals and lets you inject custom logic anywhere. Check it out on GitHub: github.com/google-deepmin…
It's been quite interesting to study how Gemini 1.5 Pro scales its in-context learning (ICL) from few to many shots. I found our experiments that avoid using hand-labeled examples with Reinforced ICL, and Unsupervised ICL (i.e. shots are input examples only), particularly neat.
It's been quite interesting to study how Gemini 1.5 Pro scales its in-context learning (ICL) from few to many shots. I found our experiments that avoid using hand-labeled examples with Reinforced ICL, and Unsupervised ICL (i.e. shots are input examples only), particularly neat.
The Mila Techaide event last Friday was a real success! We're thrilled to announce that we raised more than $100k in support of @CentraideMtl. A huge thank you to everyone for your generous donations, and special thanks to all our panellists and speakers. See you next year!
Last week, I gave a talk at @Mila_Quebec. The talk should be of interest to anyone working on predictive models, particularly in latent space. In collab. with @MahanFathi @ClementGehring @J_Pilault @davidkanaa @pierrelux. See you at @iclr_conf in 🇦🇹! drive.google.com/file/d/1mQSXFa…
📢 Don't forget! TMLR papers can be submitted to #CoLLAs2024, too! 👇
📢 Don't forget! TMLR papers can be submitted to #CoLLAs2024, too! 👇
Many thanks to our generous sponsors whose support makes this year's Mila Techaide AI Conference possible: @Microsoft @GoogleDeepMind @hydroquebec @OVHcloud_CA Kinetik Solutions and @IVADO_Qc! The event is this Friday. Secure your spot – Register now: mila.quebec/en/mila-techai…
I am very happy to announce that Gemma 1.1 Instruct 2B and “7B” are out! Here are a few details about the new models: 1/11
I'll be giving a talk in Montreal next week! conference proceeds are donated to Centraide
C'est un véritable honneur d'être nommée parmi les entreprises les plus admirées au Québec, deux années de suite! On est fiers de servir les Québécois depuis maintenant 20 ans, et on a hâte de célébrer cet anniversaire marquant avec vous plus tard cette année. #FiersDetreIci…
C'est un véritable honneur d'être nommée parmi les entreprises les plus admirées au Québec, deux années de suite! On est fiers de servir les Québécois depuis maintenant 20 ans, et on a hâte de célébrer cet anniversaire marquant avec vous plus tard cette année. #FiersDetreIci…
📣📣📣 Please don't miss the TechAide AI Conference on April 12! Join us for presentations by Ian Goodfellow, Bruce Schneier, Hsiu-Chin Lin, Ryan Lowe, and Sara Sabour, as well as a panel discussion featuring Yoshua Bengio and Doina Precup. mila.quebec/en/mila-techai…
Soumith Chintala @soumithchintala
185K Followers 877 Following Cofounded and lead @PyTorch at Meta. Also dabble in robotics at NYU. AI is delicious when it is accessible and open-source.Alfredo Canziani @alfcnz
86K Followers 269 Following Musician, math lover, cook, dancer, 🏳️🌈, and an ass prof of Computer Science at New York University(((ل()(ل() 'yoav))).. @yoavgo
46K Followers 2K FollowingLucas Beyer (bl16) @giffmana
56K Followers 445 Following Researcher (Google DeepMind/Brain in Zürich, ex-RWTH Aachen), Gamer, Hacker, Belgian. Mostly gave up trying mastodon as [email protected]Kosta Derpanis @CSProfKGD
48K Followers 198 Following #CS Associate Prof @YorkUniversity, #ComputerVision Scientist Samsung #AI, @VectorInst Faculty Affiliate, TPAMI AE, #CVPR2024/#ECCV2024 Publicity Co-chairKyunghyun Cho @kchonyc
61K Followers 2K Following a combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre PC at @nyuniversity (@CILVRatNYU) & @genentech (@PrescientDesign).Eric Jang @ericjang11
69K Followers 3K Following physical AGI at 1X. Author of "AI is Good for You" https://t.co/eFg4WXhg0pRosanne Liu @savvyRL
33K Followers 966 Following Cofounded & running @ml_collective. Host of Deep Learning Classics & Trends. Research at Google DeepMind. DEI/DIA Chair of ICLR & NeurIPS. Writing https://t.co/IbycyGfnDRDan Roy @roydanroy
45K Followers 2K Following ML / AI researcher, emphasis on theory. Research Director and Canada CIFAR AI Chair, @VectorInst Professor, @UofT (Statistics/CS)Percy Liang @percyliang
49K Followers 408 Following Associate Professor in computer science @Stanford @StanfordHAI @StanfordCRFM @StanfordAILab @stanfordnlp | cofounder @togethercompute | PianistMichael Bronstein @mmbronstein
43K Followers 4K Following #DeepMind Professor of #AI @UniofOxford / Fellow @ExeterCollegeOx / ML Lead @ProjectCETI / https://t.co/kZpGpDzYeVGautam Kamath @thegautamkamath
44K Followers 504 Following Assistant Prof of CS @UWaterloo, Faculty @VectorInst, Canada @CIFAR_News AI Chair. Co-EiC @TmlrOrg. I lead @TheSalonML. Privacy, robustness, machine learning.Oriol Vinyals @OriolVinyalsML
166K Followers 82 Following VP of Research & Deep Learning Lead, Google DeepMind. Gemini co-lead. Past: AlphaStar, AlphaFold, AlphaCode, WaveNet, seq2seq, distillation, TF.Sasha Rush @srush_nlp
52K Followers 464 Following Professor, Programmer in NYC. Cornell Tech, Hugging Face 🤗 https://t.co/cZl0wTfqGzJia-Bin Huang @jbhuang0604
51K Followers 285 Following Associate Professor @umdcs; Part-time Research Scientist @Meta. I like pixels.Sergey Levine @svlevine
79K Followers 122 Following Associate Professor at UC Berkeley Co-founder, Physical IntelligenceThomas G. Dietterich @tdietterich
50K Followers 505 Following Distinguished Professor (Emeritus), Oregon State Univ.; Former President, Assoc. for the Adv. of Artificial Intelligence; Robust AI & Comput. SustainabilityFerenc Huszár @fhuszar
40K Followers 1K Following Secular Bayesian. Associate Professor in Machine Learning @Cambridge_CL. Talent aficionado at https://t.co/RbJkoLguey Alum of @Twitter, Magic Pony and @BaldertonSander Dieleman @sedielem
50K Followers 2K Following Research Scientist at Google DeepMind. I tweet about deep learning (research + software), music, generative models (personal account).thatse @edov_i
0 Followers 37 FollowingCirro @__Cirro
8 Followers 2 Following Fan Account || If my updates ruffle your feathers, I won't apologize because I'm all about keeping it real. @Manutd @Realmadrid @cristianoMicheal Harvey @MichealHar18252
10 Followers 68 Followingnymous @nymous985951
0 Followers 54 FollowingJin.Chen @jinqiang605
7 Followers 162 Followingtuan pho @tuanpho
1 Followers 110 FollowingDigitalDrip AI Newsle.. @getdigitaldrip
15 Followers 49 Following We curate, summarize, and bring impactful and interesting AI blogs directly to your inbox. https://t.co/GoBsdQIPM3Anita Kay @domevampire11
4 Followers 149 FollowingFranjo Ivancic @fivancic
339 Followers 783 Following Senior Staff Software Engineer & Manager at Google. https://t.co/GNlq6Pi68dPengcheng Yin @pengchengyin
563 Followers 123 Following @GoogleDeepMind. Formerly a Neulab member @LTIatCMU. Interested in machine learning for NLP and code, dog training and aviation.Jonas @Jonas94G
33 Followers 43 FollowingUmiltcaho @umiltcaho27712
13 Followers 291 Following kya dekhne aae ho. Bs itna smjh lo tumse thoda sa zyada smjhdar huHarsh Desai @dreamerharsh
1 Followers 3K FollowingMadaline Zannes @zanneslaw
40K Followers 24K Following Lawyer: Business, Entertainment, Sports, Tech & web3 Firm: @zanneslawfirm Partner: @metaverseBA @elatalent Seen in Forbes, MIT, CBC, AP 🌠HoshAI @hoshaicom
8 Followers 65 Following HoshAI: Your AI-powered companion for generating text, images, audio, and video. Sign Up at https://t.co/ppEPkf6VlT today!shivaio code @Shivaaiio
18 Followers 115 FollowingLiam @LiamAugmented
2 Followers 31 FollowingDhvani Kansara @dhvaniiiiii
5 Followers 57 Followingwz @unknown_entropy
7 Followers 36 Followinglingfeng zhou @LZhou92949
4 Followers 49 FollowingOpen @OpenXuu
1 Followers 96 FollowingFelo @wangzhi0467
9 Followers 82 Followingrc @RColab13962
0 Followers 10 FollowingPlacePython | Vivre d.. @PlacePython
353 Followers 325 Following Je partage avec vous 25 ans d'expérience avec Python pour vous aider à accélérer votre reconversion dans le web. Telegram: https://t.co/8z628oi5Uh谭硕 @tanshuo142758
2 Followers 138 FollowingUrvesh Dungrani @urrvesh
25 Followers 348 Following I am Nothing. Nothing Matters. Nothing, is everythingSid Sahu @siddhantsahu92
179 Followers 646 Following Spatial computing, deep learning, developer tools. Building a mixed reality developer ecosystem at Strivr, former product @ LinkedIn.George @naivepriest
2 Followers 244 Following Machine learning enthusiast Artificial Intelligence developerBob Schmidt @bobinorlando
10K Followers 6K Following Author. https://t.co/TupvvMTd2j Lists: Orlando, Web Analytics, TimeshareKw Esi @KwEsi1757358
33 Followers 52 FollowingEbreuhyton @Ebreuhyton
22 Followers 165 Followingpsy art @ddmmyyyy87
0 Followers 24 Followingdrobi @drobi283515
1 Followers 44 FollowingFarshid Abrar Labib @bestlabib
0 Followers 346 FollowingManu Gupta @ManuGup31501641
0 Followers 122 FollowingJonathan Cruz @cruzjonk
0 Followers 115 FollowingMarina Fuster @FusterMfuster
1 Followers 45 Following Software Engineer (BSc, MSc), interested in the intersection between engineering and research. Artificial Intelligence Systems Lecturer @ ITBASoumith Chintala @soumithchintala
185K Followers 877 Following Cofounded and lead @PyTorch at Meta. Also dabble in robotics at NYU. AI is delicious when it is accessible and open-source.Jürgen Schmidhuber @SchmidhuberAI
106K Followers 0 Following Invented principles of meta-learning (1987), GANs (1990), Transformers (1991), very deep learning (1991), etc. Our AI is used many billions of times every day.Kyunghyun Cho @kchonyc
61K Followers 2K Following a combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre PC at @nyuniversity (@CILVRatNYU) & @genentech (@PrescientDesign).NeurIPS Conference @NeurIPSConf
111K Followers 35 Following New Orleans, Dec 10-16, 23. https://t.co/ga8aOw615g Tweets to this account are not monitored. Please send feedback to [email protected].Kevin Patrick Murphy @sirbayes
42K Followers 334 Following Research Scientist at Google Brain / Deepmind. Interested in Bayesian Machine Learning.Percy Liang @percyliang
49K Followers 408 Following Associate Professor in computer science @Stanford @StanfordHAI @StanfordCRFM @StanfordAILab @stanfordnlp | cofounder @togethercompute | PianistGautam Kamath @thegautamkamath
44K Followers 504 Following Assistant Prof of CS @UWaterloo, Faculty @VectorInst, Canada @CIFAR_News AI Chair. Co-EiC @TmlrOrg. I lead @TheSalonML. Privacy, robustness, machine learning.Oriol Vinyals @OriolVinyalsML
166K Followers 82 Following VP of Research & Deep Learning Lead, Google DeepMind. Gemini co-lead. Past: AlphaStar, AlphaFold, AlphaCode, WaveNet, seq2seq, distillation, TF.Sasha Rush @srush_nlp
52K Followers 464 Following Professor, Programmer in NYC. Cornell Tech, Hugging Face 🤗 https://t.co/cZl0wTfqGzSergey Levine @svlevine
79K Followers 122 Following Associate Professor at UC Berkeley Co-founder, Physical IntelligenceThomas G. Dietterich @tdietterich
50K Followers 505 Following Distinguished Professor (Emeritus), Oregon State Univ.; Former President, Assoc. for the Adv. of Artificial Intelligence; Robust AI & Comput. SustainabilityFerenc Huszár @fhuszar
40K Followers 1K Following Secular Bayesian. Associate Professor in Machine Learning @Cambridge_CL. Talent aficionado at https://t.co/RbJkoLguey Alum of @Twitter, Magic Pony and @BaldertonSander Dieleman @sedielem
50K Followers 2K Following Research Scientist at Google DeepMind. I tweet about deep learning (research + software), music, generative models (personal account).François Fleuret @francoisfleuret
31K Followers 455 Following Prof. @Unige_en, Adjunct Prof. @EPFL_en, Research Fellow @idiap_ch, co-founder @nc_shape. AI and machine learning since 1994. I like reality.Sara Hooker @sarahookr
39K Followers 7K Following I lead @CohereForAI. Formerly Research @Google Brain @GoogleDeepmind. ML Efficiency at scale, LLMs, @trustworthy_ml. Changing spaces where breakthroughs happen.Zachary Lipton @zacharylipton
59K Followers 2K Following Professor: CMU/@acmi_lab, CTO / CSO: @AbridgeHQ, Creator: @d2l_ai & https://t.co/QQt98VNLUp, Relapsing 🎷Edward Grefenstette @egrefen
36K Followers 774 Following FR/US/GB AI/ML Person, Director of Research at @GoogleDeepMind, Honorary Professor at @UCL_DARK, @ELLISforEurope Fellow. All posts are personal.Christopher Manning @chrmanning
126K Followers 115 Following Director, @StanfordAILab. Assoc. Director, @StanfordHAI. Founder, @stanfordnlp. Prof. CS & Linguistics, @Stanford. IP @aixventureshq. 🇦🇺 Do #NLProc & #AI. 👋Natasha Jaques @natashajaques
25K Followers 1K Following Senior Research Scientist at @GoogleAI and Assistant Professor @uwcse. Social Reinforcement Learning in multi-agent and human-AI interactions. PhD from @MIT.Ethan Mollick @emollick
210K Followers 551 Following Professor @Wharton studying AI, innovation & startups. Democratizing education using tech Book: https://t.co/CSmipbJ2jV Substack: https://t.co/UIBhxu4bgqMax Schwarzer @max_a_schwarzer
942 Followers 282 Following Doing research at @OpenAI. Did my PhD with Aaron Courville and @marcgbellemare at @Mila_Quebec. Interned at @Apple, @DeepMind, Google Brain, @Numenta.Ashley Edwards @ashrewards
484 Followers 200 Following Research scientist @GoogleDeepMind. Past: Uber AI Labs, Georgia TechSylvain Carle @froginthevalley
11K Followers 6K Following Partner at Innovobot Resonance Ventures. Investing in seed stage #DeepTech #ForGood Startups. Lang=FR: @sylvain. Also: @[email protected]Dylan HadfieldMenell @dhadfieldmenell
2K Followers 2K Following Assistant Prof @MITEECS working on value (mis)alignment in AI systems; @[email protected] @[email protected] he/himAnna Hedström @anna_hedstroem
282 Followers 257 Following ML PhD student @UMI_Lab_AI at @TUBerlin | evaluation-centric interpretabilityEtienne Laliberté @etnlalib
667 Followers 58 Following Plant ecologist. Professor @Biologie_Umontreal @IRBV_Montreal PI @traitlab @CABO_science Canada Research Chair in Plant Functional Biodiversity He/himBenjamin Rosman @BenjaminRosman
2K Followers 394 Following Professor (machine learning and robotics) @WitsUniversity | Director @raillabwits | Co-founder @LelapaAI | Founder @DeepIndaba | Proudly South AfricanGabi Surita @gssurita
254 Followers 260 Following Building the Al counterculture. Sometimes converts coffee into code. (she/her) 🏳️🌈 🇧🇷David Ifeoluwa Adelan.. @davlanade
2K Followers 1K Following @DeepMind Academic Fellow @uclcs, incoming assistant Professor @mcgillu, Canada CIFAR AI Chair @CIFAR_News | interested in multilingual NLP | Disciple of JesusGEO BON @GEOBON_org
5K Followers 413 Following A global network of experts working together to understand biodiversity changeKatie Everett @_katieeverett
225 Followers 462 Following Machine learning researcher at @GoogleDeepMind (via Brain) + PhD student @MIT. Previously @chorus cofounder, @twitter, @MIT.Sangnie Bhardwaj @sangnie
423 Followers 352 Following ML researcher @GoogleAI. PhD student @Mila_Quebec.Pessimists Archive @PessimistsArc
91K Followers 65 Following Exploring technophobia and moral panic through the ages. A litany of shameful cynicism and spite. Curated by @louisanslowEmtiyaz Khan @EmtiyazKhan
11K Followers 234 Following Team leader at @RIKEN_AIP_EN. Opinions my own. Follow me at https://t.co/jXDOS1HKXEElijah Cole @eli_cole_
844 Followers 3K Following Machine learning for scientific discovery. AI/ML Scientist @altos_labs • PhD @Caltech • Prev. Google, Microsoft, AFRL, Duke • he/him 🏳️🌈Paul Vicol @PaulVicol
954 Followers 1K Following Research Scientist at Google DeepMind. PhD from @UofT and @VectorInst.Melisa Bok @melisabok
440 Followers 335 Following Software developer at @UmassAmherst. Currently working on @openreviewnetSara Beery @sarameghanbeery
11K Followers 3K Following Research on computer vision and the environment 🌍 Asst Prof at @MIT_CSAIL #QueerInAI 🏳️🌈 sarabeery on threads @[email protected]ladies and gentlemen,.. @CraigWeekend
578K Followers 1 Following daniel craig reminds you that the weekend is here, every friday eveningShek Azizi @AziziShekoofeh
7K Followers 997 Following Staff Research Scientist @Google @GoogleDeepMind 🧠 Opinions are my own.Sonia Joseph @soniajoseph_
4K Followers 537 Following AI researcher. Getting PhD @Mila_Quebec, prev @Princeton. Multimodal interpretability + alignment.Adam Roberts @ada_rob
7K Followers 646 Following ai researcher @ Google DeepMind :: ♫ (MusicVAE, NSynth, MusicLM, SingSong) & 📝 (T5, PaLM) & :: t5x & seqio // recovering comp biologistkermorvant @kermorvant
62 Followers 43 Following PhD in Machine Learning, building products and services for document analysis with AIMario Lucic @MarioLucic_
3K Followers 148 Following Staff Research Scientist @ https://t.co/pXedOGSgT3. Gemini Video and Audio-video understanding.Sanjana Basu @SanjanaBasu14
607 Followers 982 Following Investing in AI @radicalvcfund since 2019 | Previously investing in deeptech & consumertech @TataCompanies Venture Arm, IB @Barclays | Alum @IIM_BangaloreSabela @sabelaraga
1K Followers 596 Following I+D+me. Costa da Morte - A Coruña - Zürich. Opinions are my own.Geoffrey Cideron @CdrGeo
222 Followers 380 Following Research Engineer at Google DeepMind. Spent time at FAIR London, INRIA Lille, and Instadeep.Kelsey Allen @KelseyRAllen
1K Followers 370 Following Formerly: physicist turned cognitive scientist @MIT Presently: Research Scientist @DeepMind I like humans, crows, primates and robotsJoelle Pineau @jpineau1
10K Followers 352 Following AI researcher. VP AI Research (FAIR), @AIatMeta. Professor of Computer Science, @mcgillu. Core academic member, @Mila_Quebec👩💻 Paige Bai.. @DynamicWebPaige
59K Followers 2K Following ✨Keep it simple, make it scale. AI should be about empowering people, building understanding, & making dreams realities. 👩💻GenAI @GoogleDeepMind ex-@GitHubAwa Dieng @adoubleva
774 Followers 470 Following researcher at Google DeepMind • working on building fair machine learning systems using causality • organizer @afciworkshopBlaise Aguera @blaiseaguera
8K Followers 311 Following VP of Engineering at Google, working on basic problems and applications in AI, with a focus on privacy. Order my book 'Who Are We Now?' out now ⬇️Accepted papers at TM.. @TmlrPub
3K Followers 2 Followingrohan anil @_arohan_
12K Followers 2K Following Principal Engineer, @GoogleDeepMind Gemini. prev PaLM-2. Tinkering with optimization and distributed systems. opinions are my own.Jeremiah Harmsen @JeremiahHarmsen
1K Followers 488 Following Creator of #TensorFlowHub and @TensorFlow Serving. Lead in Google Brain.Laurence Therrien @Lau_Therrien
157 Followers 486 Following GR and Public Policy @GoogleCanada / MEB @UWaterloo / Montrealer / Unofficial account of Lupo the DooglerNew Submissions to TM.. @TmlrSub
2K Followers 3 Following Submissions to Transactions of Machine Learning ResearchTransactions on Machi.. @TmlrOrg
5K Followers 3 Following Transactions on Machine Learning Research (TMLR) is a new venue for dissemination of machine learning researchSamira E. Kahou @SamiraEKahou
1K Followers 228 Following Mother, Associate Professor at @etsmtl / Vision and RL Lab, Adjunct Professor at @mcgillu, Canada CIFAR AI Chair, member of @Mila_Quebec, @rllabmcgillWe went through the peer review process at @TmlrOrg and it was quite helpful to improve the paper (better positioning wrt prior work, fixed a bug in an equation, comparisons to ReST in terms of transfer performance). See the accepted version here: arxiv.org/abs/2312.06585…
Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models paper page: huggingface.co/papers/2312.06… Fine-tuning language models~(LMs) on human-generated data remains a prevalent practice. However, the performance of such models is often limited by the quantity…
Excited to share our work at @GoogleDeepMind! We propose Naturalized Execution Tuning (NExT), a self-training method that drastically improves the LLM's ability to reason about code execution, by learning to inspect execution traces and generate chain-of-thought rationales 🧵👇
If you are interested in robotics, we are excited to welcome the #ROS community to Mila on May 2nd. See you there!
🇨🇦 Montreal ROS Users! Join us next Thursday, May 2nd, at @Mila_Quebec for a very special #ROS meetup in Montreal. We're visiting Montreal for @ohsummit and worked with @clearpathrobots to put organize a little get together during our visit. eventbrite.com/e/ros-and-robo…
"Reinforcement learning for therapeutics is a really big, untapped area of research." Doina Precup, Core Academic Member at Mila, shared her insights on using AI to advance precision medicine during a talk at #WSAIAM24.
Today on the blog, read all about how automatically repairing non-building code increases productivity and appears to introduce no detectable negative impact on code safety, provided that high quality training data and responsible monitoring are employed →goo.gle/4b9Hm0w
Another TMLR Journal-to-Conference track partnership, this time with @RL_Conference! Submit your TMLR papers on reinforcement learning to be presented at the new Reinforcement Learning Conference.
While we can't drop a whole surprise album like @taylorswift13, we can at least drop a surprise track! If you've been itching to present a previously published journal paper, our new presentation track at #RLC2024 might be just for you: docs.google.com/forms/d/e/1FAI….
Really exciting to see this! The kicker here is that if you exclude extremely short queries (<5 tokens), 1.5 Pro ranks joint #1 unequivocally.
More exciting news today -- Gemini 1.5 Pro result is out! Gemini 1.5 Pro API-0409-preview now achieves #2 on the leaderboard, surpassing #3 GPT4-0125-preview to almost top-1! Gemini shows even stronger performance on longer prompts, in which it ranks joint #1 with the latest…
Gemini 1.5 Pro has entered the (LMSys) Arena! Some highlights: -The only "mid" tier model at the highest level alongside "top" tier models from OpenAI and Anthropic ♊️ -The model excels at multimodal, and long context (not measured here) 🐍 -This model is also state-of-the-art…
More exciting news today -- Gemini 1.5 Pro result is out! Gemini 1.5 Pro API-0409-preview now achieves #2 on the leaderboard, surpassing #3 GPT4-0125-preview to almost top-1! Gemini shows even stronger performance on longer prompts, in which it ranks joint #1 with the latest…
Congratulations to Allison Cohen, Senior Applied AI Projects Manager in Mila's AI for Humanity team, whose work is featured in @TechCrunch 's "Remarkable women who contributed to the AI revolution" series!
Women in AI: Allison Cohen on building responsible AI projects tcrn.ch/4b0psNM
A new position paper @TmlrOrg on applications of continual learning, by 20 participants to our @dagstuhl seminar in March 2023. It wasn’t easy to agree on *how* we should be doing continual learning, but we did find common ground on *why* it’s important! openreview.net/forum?id=axBIM…
@ylecun I am not saying it has to be like that, more that it is one of many possible implementations.
@YouTube I also look forward to presenting this recent research in a more condensed format at #WorldAISummit this week, where I will talk about how we can use generalist robots to learn the tech tree of the universe.
This #EarthDay, discover the AMI project that puts #AI at the service of biodiversity protection 🌱🌎 📽️ youtu.be/VXJ40hAmTZY @david_rolnick @UK_CEH @AarhusUni @EspacePourLaVie @eButterfly_org @VTEcostudies
Excellent video-tutorial on the curse of unrolling. There's no feeling like when others build and improve upon your work 🤗
Check out my latest video on the "Curse of Unrolling," a counter-intuitive phenomenon when you unroll differentiate ("piggyback AD") through an iterative algorithm: youtu.be/80w5wDxq26c Even if your primal converges exponentially linear, the Jacobian initially does not. 🧵🧵
🙌🏻Shout out to @jpineau1 for her remarkable work leading FAIR at Meta.
This surprises me, I didn't expect this to work arxiv.org/abs/2404.11018
deep learning infra is hard to get right but so important, advancements in it enable totally new lines of research
Excited to share Penzai, a JAX research toolkit from @GoogleDeepMind for building, editing, and visualizing neural networks! Penzai makes it easy to see model internals and lets you inject custom logic anywhere. Check it out on GitHub: github.com/google-deepmin…
Researchers in Canada got most of what they were hoping for in the country’s 2024 federal budget, with a big boost in postgraduate pay and more funding for research and scientific infrastructure go.nature.com/4aYSvRS
Wow. This looks amazing.
Excited to share Penzai, a JAX research toolkit from @GoogleDeepMind for building, editing, and visualizing neural networks! Penzai makes it easy to see model internals and lets you inject custom logic anywhere. Check it out on GitHub: github.com/google-deepmin…