Jiacheng Zhu @JiachengZhu_ML
Postdoc @MIT_CSAI, PhD from @CarnegieMellon | Prev. at @Apple AI/ML Health AI, @ATTResearch, | Statistical ML, Optimal Transport, ML for Health, and Robotics jiachengzhuml.github.io Joined October 2019-
Tweets59
-
Followers399
-
Following532
-
Likes1K
This was a cool project - our conclusion is that the A matrix in LoRA should just be random and not tuned! 🤯
This was a cool project - our conclusion is that the A matrix in LoRA should just be random and not tuned! 🤯
We don't need A&B for LoRA👇 Understanding the changes in each part (AB)👇 Proof (math!) that A&B learn differently 👇 Potential to new directions, new merging, new training, less parameters 👇 👇 👇 👇
We don't need A&B for LoRA👇 Understanding the changes in each part (AB)👇 Proof (math!) that A&B learn differently 👇 Potential to new directions, new merging, new training, less parameters 👇 👇 👇 👇
For x = (W_0 + BA)x, the A matrix projects input data (tokens) to features, and the B matrix uses these features to create the desired output. In other words, the B matrix is inherently and provably more effective! A simple trick to improve LoRA performance is to freeze the A…
For x = (W_0 + BA)x, the A matrix projects input data (tokens) to features, and the B matrix uses these features to create the desired output. In other words, the B matrix is inherently and provably more effective! A simple trick to improve LoRA performance is to freeze the A…
Ever wondered what LoRA fine-tuning actually does? Curious about the roles of matrices A and B in the LoRA adapter? Check our latest work, "Asymmetry in Low-Rank Adapters of Foundation Models," where we explore these differences in depth. The learned B matrics are significantly…
Announcing SGI 2024! Undergrads and MS students: Apply for 6 weeks of paid summer geometry processing research. No experience needed: 1 week tutorials + 5 weeks of projects. Mentors are top researchers in this emerging branch of graphics/computing/math. sgi.mit.edu
Looking for the right venue to showcase your groundbreaking research on AI/ML methods and applications in health settings? CHIL is the perfect platform! We solicit a diverse range of topics on ML/AI advancements in the health domain. ⚕️🧑💻🔍 Visit chilconference.org to submit
🚨 Deadline Extension Alert 🚨 We’re extending our submission deadline to Feb 16th EoD! 🎉 Don’t miss the opportunity to polish your work & submit the best version of your paper! chilconference.org/call-for-paper… #CHIL2024 #CallForPapers #ml4h #machinelearning #ai #health #MoreTimeToShine
📢Reminder: Papers are due in less than a month! This year, CHIL 2024 will accept submissions for 3⃣ distinct tracks: Models and Methods, Applications and Practice, and Policy, Impact and Society. ⌛️Submit papers by February 5th, 11:59pm EST!
Obligatory post-tenure news article 🗞️🥳 -- plus a very purple photograph! news.mit.edu/2023/justin-so…
Research roundtables are happening now! A great opportunity for meeting and chatting with experts in our field
👋💬Join Marinka Zitnik @marinkazitnik, Jiacheng Zhu @JiachengZhu_ML, and Rafal Dariusz Kocielnik at #ML4H2023 for their roundtable on Multimodal AI for Health.🌟Learn about integrating diverse data sources for ML healthcare applications.🧠Real-time insights, real-world impact!
We're back! 🙌 Announcing the 5th annual Conference on Health, Inference, and Learning (CHIL) to be held in-person from June 27-28, 2024 in New York City 🗽 Call for papers is up now! chilconference.org/call-for-papers ⏳Submission Deadline: Monday, February 5, 2024 #CHIL2024
Learn how to effortlessly segment moving objects in 4D given monocular videos, just like those from your iPhone 📱! Join us for our #ICCV2023 Poster presentation: 📅 Friday (6th) 🕝 02:30-04:30 PM 📍 Room "Nord" - 165 #ComputerVision #Segmentation Website:visual.cs.brown.edu/projects/seman…
The Shrödinger problem defines a dynamic interpolation between two distributions using Brownian bridges. The 0 temperature limit is Optimal Transport. hal.archives-ouvertes.fr/hal-00849930/d…
While most adversarial robustness studies focus on the local area of data samples, we use optimal transport to generate interpolation data distributions on the geodesic connecting different subpopulation distributions. Paper: proceedings.mlr.press/v202/zhu23i
While most adversarial robustness studies focus on the local area of data samples, we use optimal transport to generate interpolation data distributions on the geodesic connecting different subpopulation distributions. Paper: proceedings.mlr.press/v202/zhu23i https://t.co/izmfYizM69
Ziang @Ziang250193
19 Followers 81 FollowingPatrick Young -- e/ac.. @ConsumerRick
515 Followers 1K Following quantum sim wiz | way too self aware rn | travellerAileen @AnastasiaJ96311
21 Followers 394 Following I am a business investor currently running a beauty clinic and yoga studio in Chicago with my sisterLinwei Sang @sanglinwei21
164 Followers 942 Following Ph.d candidate in Tsinghua University @Tsinghua_Uni, visiting scholar in UC, Berkeley @UCBerkeley, power system analytics under the lens of OR and ML.Adina Yakup @AdeenaY8
2K Followers 454 Following @huggingface 🤗 | Contributing to Chinese ML community.AI Papers Podcast @aipaperspodcast
875 Followers 2K Following A digestible daily update on the latest AI Research Papers. Brought to you by @pocketpodappKarush Suri @karush_
392 Followers 629 Following Meta Learner @Theteamatx in @GoogleAI swimmer & comic book collector Past @borealisai @eceuoftSouth Island Holdings @SouthIslands01
7K Followers 750 Following South Island was established in 2010 and is involved in medium sized developments across the Asia Pacific regions.Pu Hua @THU_PuHua
68 Followers 270 FollowingAnkur Parikh @ank_parikh
3K Followers 3K Following Staff Research Scientist at Google DeepMind. Former adjunct assistant prof at @NYU_Courant. PhD at @mldcmu. ML for Bio/Chem (Prev. NLP). All opinions my own.HaoyueBai @haoyue_bai
932 Followers 838 Following Ph.D. student at Computer Science Department @UWMadisonCS, MPhil @HKUSTCSE.Kathy Yang @Kathy__Yang
47 Followers 335 Following #costumedesigner#reader#art#enjoying the weather and nature America, enjoy sea travel, food, sports, reading, music, healthLelCh @Yongsheng_Si
3 Followers 534 Following Aiming to be a researcher, writer, and startup founder.Kathy Chen @Kathy__chen9
321 Followers 345 Following Fashion designer, in his free time he likes to do sports, exercise, camping, barbecue, and readingHan Xue @HanXue012
120 Followers 488 Following Ph.D. student @sjtu1896 | Interesed in Robotics & 3D VisionZhengyuan Jiang @Zhengyuan22
302 Followers 229 Following Trustworthy AI Ph.D. @DukeU, previously undergrad @USTCTesla-Bennett @Chleonbennett
1K Followers 2K Following Tesla, playing live music, investing in Tesla, boating, getting Tesla updates, teaching, waiting for my Cyber truck, sports, friends and loved ones!!Todd Kueny — e/acc @techgazetteco
3K Followers 6K Following Empowering worlds where AI enriches lives, solves complex problems, and inspires continuous learning.W. Brian Byrd @BByrdFW
3K Followers 4K Following Husband to an amazing woman, father, entrepreneur, @UTAustin, physician, former Fort Worth councilman.Dmitry Alimov @dmitryalimov
6K Followers 7K Following Tech VC and entrepreneur. Curious. Investing and building in AI. Built companies in media and tech. Founder @frontiervc. Learned things @harvard, @stanfordArthi Suresh @callmearts
0 Followers 40 FollowingYi.S @yi_shen23
15 Followers 425 Following Soccer @shanghaishenhua @FCBarcelona; Basketball @sixers,@ShanghaiSharks; Cycling @vismaleaseabike; Complement @DukeMEMS.Albert Arnó @alarno
17K Followers 1K Following I used to be a MD, now ehealth man. Father of 2. Passionate about technology. Maybe geek. Genetically optimistic!!!!!! https://t.co/CQItYCt2rTParadise Retreats @ParadisRetreats
1K Followers 4K Following We are Santa Barbara based property management company. Our luxury properties are what make us the #1 vacation rental company in SB.Kyle @Kyle91205853
1 Followers 4 FollowingElgce @BenQingwei
66 Followers 219 Following Hey, everyone! I am a junior student of Tsinghua University & incoming Ph.D of MMLAB@CUHK. I am interested in Reinforcement Learning and Robotics.Chen Wang @chenwang_j
2K Followers 672 Following PhD student @StanfordSVL @StanfordAILab. Prev @NVIDIA @MIT_CSAIL. Robotics/ManipulationYves Mulkers @YvesMulkers
102K Followers 79K Following #Data strategist. Define & Design #Data #Strategy for Impact. Love Music and DJ-ing, Founder @7wDataElon Musk @rosel85965
75 Followers 829 FollowingYuanliang Ju @AveryJuuu0213
232 Followers 241 Following 鞠沅良🫠|RA@IIIS, Tsinghua University🔮|Advised by Prof. Li Yi🏆|Interested in 3D Vision,HCI🧩|ᠪᠢ ᠬᠣᠷᠢᠭᠯᠠᠬᠤ ᠶᠢᠨ ᠠᠷᠭᠠ ᠦᠭᠡᠢ᠃🎨AI Recapped @AiRecapped
85 Followers 833 Following The latest rumours, news and random thoughts about artificial intelligence.Yuchen Zeng @yzeng58
562 Followers 549 Following PhD student in Computer Science at University of Wisconsin-Madison | Large Language ModelsYiping Wang @ypwang61
132 Followers 433 Following Ph.D. @uwcse. undergraduate @ZJU_China. I'm interested in mathematics, agi, and physics.Zhengran Ji @Zhengran_Ji
26 Followers 101 Following Master's student of computer science at Duke University General Robotics Lab. Interested in Reinforcement Learning.Abolfazl Karimi @KarimiAbolfazl
140 Followers 3K FollowingMatthew T. Flavin @MattTFlavin
1K Followers 2K Following Postdoctoral fellow at Northwestern | @MIT PhD ‘21, EECS | Neural mechatronics and mixed reality for healthcare: https://t.co/N6mKjIN5uxFanchao Chen @FanchaoChen
220 Followers 1K Following PhD Student @WisconsinCS | Prev. @FudanUni, @NTUsg, @ucbrise, @MSFTResearch, Moonshot AI | Machine Learning SystemsYuanchen_Ju @ju_yuanchen
257 Followers 216 Following 鞠沅辰🧸|Currently RA @Tsinghua_IIIS 🌍|Advised by Prof. Huazhe Xu🍯|Interested in Multimodal learning🌵 & Robot Learning🤖️|INTJ ♒️Cornell BME @CornellBME
4K Followers 568 Following Catalyzing interactions between biologists, physical scientists, and engineers to benefit medicine and human health. #CornellBMEKarush Suri @karush_
392 Followers 629 Following Meta Learner @Theteamatx in @GoogleAI swimmer & comic book collector Past @borealisai @eceuoftAdina Yakup @AdeenaY8
2K Followers 454 Following @huggingface 🤗 | Contributing to Chinese ML community.HaoyueBai @haoyue_bai
932 Followers 838 Following Ph.D. student at Computer Science Department @UWMadisonCS, MPhil @HKUSTCSE.Ankur Parikh @ank_parikh
3K Followers 3K Following Staff Research Scientist at Google DeepMind. Former adjunct assistant prof at @NYU_Courant. PhD at @mldcmu. ML for Bio/Chem (Prev. NLP). All opinions my own.Freda Duan @FredaDuan
11K Followers 318 Following Investing @ Altimeter Capital. No investment adviceJoshua Elkington @elkingtonxy
25K Followers 3K Following Founder and General Partner at Axial @axialxyzHan Xue @HanXue012
120 Followers 488 Following Ph.D. student @sjtu1896 | Interesed in Robotics & 3D VisionZhengyuan Jiang @Zhengyuan22
302 Followers 229 Following Trustworthy AI Ph.D. @DukeU, previously undergrad @USTCDmitry Alimov @dmitryalimov
6K Followers 7K Following Tech VC and entrepreneur. Curious. Investing and building in AI. Built companies in media and tech. Founder @frontiervc. Learned things @harvard, @stanfordYi.S @yi_shen23
15 Followers 425 Following Soccer @shanghaishenhua @FCBarcelona; Basketball @sixers,@ShanghaiSharks; Cycling @vismaleaseabike; Complement @DukeMEMS.Albert Arnó @alarno
17K Followers 1K Following I used to be a MD, now ehealth man. Father of 2. Passionate about technology. Maybe geek. Genetically optimistic!!!!!! https://t.co/CQItYCt2rTMatthew T. Flavin @MattTFlavin
1K Followers 2K Following Postdoctoral fellow at Northwestern | @MIT PhD ‘21, EECS | Neural mechatronics and mixed reality for healthcare: https://t.co/N6mKjIN5uxElgce @BenQingwei
66 Followers 219 Following Hey, everyone! I am a junior student of Tsinghua University & incoming Ph.D of MMLAB@CUHK. I am interested in Reinforcement Learning and Robotics.Zhengran Ji @Zhengran_Ji
26 Followers 101 Following Master's student of computer science at Duke University General Robotics Lab. Interested in Reinforcement Learning.Chen Wang @chenwang_j
2K Followers 672 Following PhD student @StanfordSVL @StanfordAILab. Prev @NVIDIA @MIT_CSAIL. Robotics/ManipulationYves Mulkers @YvesMulkers
102K Followers 79K Following #Data strategist. Define & Design #Data #Strategy for Impact. Love Music and DJ-ing, Founder @7wDataYuanliang Ju @AveryJuuu0213
232 Followers 241 Following 鞠沅良🫠|RA@IIIS, Tsinghua University🔮|Advised by Prof. Li Yi🏆|Interested in 3D Vision,HCI🧩|ᠪᠢ ᠬᠣᠷᠢᠭᠯᠠᠬᠤ ᠶᠢᠨ ᠠᠷᠭᠠ ᠦᠭᠡᠢ᠃🎨Pavel Izmailov @Pavel_Izmailov
6K Followers 1K Following Incoming Assistant Professor @nyuniversity 🏙️ Previously @OpenAI #StopWar 🇺🇦Yiping Wang @ypwang61
132 Followers 433 Following Ph.D. @uwcse. undergraduate @ZJU_China. I'm interested in mathematics, agi, and physics.Yuanchen_Ju @ju_yuanchen
257 Followers 216 Following 鞠沅辰🧸|Currently RA @Tsinghua_IIIS 🌍|Advised by Prof. Huazhe Xu🍯|Interested in Multimodal learning🌵 & Robot Learning🤖️|INTJ ♒️Fanchao Chen @FanchaoChen
220 Followers 1K Following PhD Student @WisconsinCS | Prev. @FudanUni, @NTUsg, @ucbrise, @MSFTResearch, Moonshot AI | Machine Learning SystemsGu Zhang @Gu__Zhang
129 Followers 139 Following Incoming CS PhD @Tsinghua_IIIS | Advised by Prof. Huazhe Xu | Prev. Student Researcher @MIT @SJTU1896 | Research interest focus on robot manipulationEthan Wenjun Hou @houwenjun060
47 Followers 380 Following 🪫Phd Student @HongKongPolyU & @SUSTechSZ | Natural Language Processing & Medical Report Generation & Healthcare AgentsJohnSnowLabs @JohnSnowLabs
41K Followers 30K Following Helping healthcare and life science organizations put AI to work faster with state-of-the-art LLM & NLP.Shoubin Yu @shoubin621
338 Followers 494 Following Ph.D. Student at @unccs @uncnlp, advised by @mohitban47. Previously @sjtu1896. Interested in video understanding, video-language.Thomas Weng @thomas_weng
647 Followers 368 Following PhD Candidate @CMU_Robotics 🤖 | Visiting Researcher @MetaAIYunsheng Ma @yunshengmax
95 Followers 228 Following PhD Student @LifeAtPurdue | MSCS 22 @NYU_Courant | Working on #LLM, #VLM, #AutonomousDrivingColin Raffel @colinraffel
30K Followers 654 Following nonbayesian parameterics, sweet lessons, and random birds. Friend of @srush_nlpXiang Li @KirschVen
33 Followers 158 Following Postdoc at Upenn. Obtain PhD. and B.S. from Peking University. Interests in machine learning and statistics.Ryan Chan @ryanchankh
265 Followers 997 Following Machine Learning PhD at UPenn. NSF GRFP fellow. Interested in the theory and practice of interpretable machine learning.Tzu-Heng Huang @zihengh1
153 Followers 666 Following CS Ph.D. Student @WisconsinCS @UWMadison. Focusing on foundation models and data-centric AI.Kimia Hamidieh @kimiahmdh
136 Followers 128 Following PhD student at @MIT_CSAIL, previously @UofT/@VectorInstShuo Wang @ShuoWang_NLP
31 Followers 219 FollowingRichard Antonello @RichardAntone13
232 Followers 185 Following PhD student in the @HuthLab at UT Austin. Studying how the brain actually understands language by using machines that pretend to understand language.∬ Nazif Berat @nazifberat
3K Followers 5K Following a Humanizer of Industry and Design-Minded Tech LeaderJin Huang @JinHuang9306000
32 Followers 148 Following CS undergrad at University of Michigan, Ann ArborFei Wang @fwang_nlp
915 Followers 2K Following PhD candidate @USC. PhD Fellow @Amazon. Responsible LLM.Ahmad Beirami @abeirami
4K Followers 2K Following Building safe, helpful, and scalable generative AI @Google | ex-{@AIatMeta, @EA, @MIT, @Harvard, @DukeU} | @GeorgiaTech PhD | زن زندگی آزادی | opinions my ownZhangchen Xu @zhangchen_xu
96 Followers 112 Following UW PhD Student|Distributed Systems & Federated Learning & LLM Security | Looking for Summer Internships 🥲Kaixuan Huang @KaixuanHuang1
282 Followers 549 Following Intern @ GoogleDeepMind; PhD @ Princeton University. working on generative AIsKristjan Greenewald @KGreenewald
208 Followers 122 Following AI Research Scientist and PI @MITIBMLab, @IBMResearch. Applying statistics and information theory to generative AI. PhD @umich, postdoc @harvard.Ziqing Xu @ZiqingXu97
43 Followers 63 Following Hi, I’m a PhD student in Applied Mathematics and Statistics at Johns Hopkins University. My research interest is deep learning theory.Jeonghwan Kim @MasterJeongK
555 Followers 593 Following PhD student @IllinoisCS @UIUC_NLP | Previously @kaistpr, @HandongUniv | @Amazon for Summer 2024Rohan Pandey @rohan99pandey
64 Followers 143 Following CS PhD Student @UMass Amherst | Trying to make AI robust for Healthcare | Prev. @mckinsey @hiti_lab @ShivNadarUnivPrateek Yadav @prateeky2806
2K Followers 2K Following Ph.D. at @unccs Continual Model Adaptation and Composition Previously @MSFTResearch, @AmazonScience, @iitmadras. UG @iiscbangalore. Opinions are my own.Yong-Hyun Park @hagsaeng_bag
120 Followers 501 Following Love to utilize geometric insights to deepen our understanding of neural networks. Currently a Master's student @SNU and a research intern @official_naverNico Daheim @ndaheim_
204 Followers 391 Following @ELLISforEurope PhD student in NLP and ML at @UKPLab @TUDarmstadt and @ETH_en. Previously MSc. in Data Science @RWTH.Super cool work about the impact of #LLM in interpersonal communication and even potentially personality! Not too surprising that this paper won the Best Paper Award #CHI2024 🏆 Congrats @ChrisYueFu, Sami, and Alexis! Always proud of UW folks!
Can Al affect the way you communicate and potentially influence your personality? Please read our 2024 CHl 🏆 Best Paper Award paper: "From Text to Self. Users' Perceptions of Potential of AlMC on Interpersonal Communication and Self." Thanks to all the co-authors!#CHI2024
Hey Chinese friends, help? How do I reach out to your community? I know Weibo is a thing, and writing there in English is useless? Where else this huge community communicates? Ideas?
More work coming up & we are hiring: openai.com/careers/search…
Introducing the Instruction Hierarchy, our latest safety research to advance robustness for prompt injections and other ways of tricking LLMs into executing unsafe actions. More details: arxiv.org/abs/2404.13208
@roydanroy And acknowledge the students involved as generously as possible
I'm as excited as you are about your {lab, company, school}'s research, but perhaps rather than just hype, you can display a bit of scientific humility and tell me also what the challenges, gaps are.
@myamada0 Precisely! That's also why I always tell my students to try different things for their internships.
Want to play with attribution? What neuron and attention head attends to what? How does it change in different models and sentences? Go play: huggingface.co/spaces/faceboo… Kudo @AIatMeta Igor Tufanov @mahnerak @javifer_96 @lena_voita
It all began in a discussion of C. Zhang, S. Bengio, @mrtz, @beenwrekt, @OriolVinyalsML Fascinating work. arxiv.org/abs/1611.03530 More about their work x.com/ericjang11/sta…
Why is the paper “Understanding Deep Learning Requires Rethinking Generalization” important? by Eric Jang quora.com/Why-is-the-pap…
About generalization of different networks Main finding: Generalization in pretraining follows a single dimension Different networks, architectures, seeds, sizes but: Similar performance → similar linguistic capabilities @aclmeeting accepted (#NLProc) Summary & story 🧵
I got to spend a fun afternoon at Stanford today. Thanks @EmmaBrunskill for the invitation to join your group meeting and get to hang out!
DBRX, Mixtral 8x22B, Llama-3 all released within weeks. The open-source AI scene shows no sign of slowing down. 🔥
Excited to share our latest publication in @NatureMedicine! 🎉 Proud to have been part of this incredible team effort. We study the disparity and fairness in AI models for computational pathology, exploring a variety of modeling strategies. Check it out below! 👇
⚡️🔬📣Excited to share our new @NatureMedicine article, examining disparities in pathology AI models, assessing how modeling choices impact disparities, and evaluating the potential of self-supervised foundation models in mitigating these disparities. nature.com/articles/s4159… See…
🦎Can we teach Transformers to perform in-context Evolutionary Optimization? Surely! We propose Evolutionary Algorithm Distillation for pre-training Transformers to mimic teachers 🧑🏫 🎉 Work done @GoogleDeepMind 🗼with @alanyttian & @yujin_tang 🤗 📜: arxiv.org/abs/2403.02985
We do have interesting findings in the work w/ @yujin_tang @RobertTLange . Check it out :-D
🦎Can we teach Transformers to perform in-context Evolutionary Optimization? Surely! We propose Evolutionary Algorithm Distillation for pre-training Transformers to mimic teachers 🧑🏫 🎉 Work done @GoogleDeepMind 🗼with @alanyttian & @yujin_tang 🤗 📜: arxiv.org/abs/2403.02985
The technical report (arxiv.org/abs/2404.07413) for JetMoE is out! It includes a detailed description of our data mixture. The datasets are available on HF. Together with my modified version of Megablock (github.com/yikangshen/meg…). With a little effort, everyone should be able to…
🎨Spent some time refactoring the 2021 post on diffusion model with new content: lilianweng.github.io/posts/2021-07-… ⬇️ ⬇️ ⬇️ 🎬Then another short piece on diffusion video models: lilianweng.github.io/posts/2024-04-… (Yes, I had an intensive weekend🥹)
Hot take: I’m seeing folks at wealthy companies/schools wringing their hands over a goofy high school contest. Maybe instead, directing your time, energy, thought, and ample resources toward supporting programs in education, outreach, and inclusion you’re more passionate about.🤷
Driving down the same NYC highway 46 years later
sparsity + higher ranks for higher layers arxiv.org/abs/2401.11316 Claims Sota on GLUE