Xia “Ben” Hu @huxia
Associate Professor of CS@Rice working on AutoML, XAI and Network Analytics. Author of AutoKeras and NCF. cs.rice.edu/~xh37/index.ht… Houston, TX Joined September 2009-
Tweets91
-
Followers592
-
Following325
-
Likes209
📢Postdoc Position📢 Dr. Xia Hu @huxia and I are looking for a Chairman's Postdoctoral Fellow in Efficient and Trustworthy LLMs @RiceCompSci If you are interested, please do not hesitate to apply: docs.google.com/document/d/1Ks…
Without fine-tuning, Self-Extend has significantly improved performance of the Gemma-2b-it performance in the needle in the haystack task, increasing its capability from less than 8k (pretraining window) to over 90k!! lnkd.in/gD_DqJvi
Without fine-tuning, Self-Extend has significantly improved performance of the Gemma-2b-it performance in the needle in the haystack task, increasing its capability from less than 8k (pretraining window) to over 90k!! lnkd.in/gD_DqJvi
Excited to introduce KIVI🥝, the first 2bit KV cache quantization breakthrough! 🚀 KIVI can be directly integrated into existing LLMs without any tuning. 📄 Paper: arxiv.org/abs/2402.02750 💻 Code: github.com/jy-yuan/KIVI #KIVI #LLM #AI #MachineLearning
Welcome more people to trying Self-Extend/LongLM on more diverse tasks and models to see when it works and identify its limitations. Here is our implementation github.com/datamllab/Long… . Credits to @serendip410 !
Welcome more people to trying Self-Extend/LongLM on more diverse tasks and models to see when it works and identify its limitations. Here is our implementation github.com/datamllab/Long… . Credits to @serendip410 !
LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning With only four lines of code modification, the proposed method can effortlessly extend existing LLMs’ context window without any fine-tuning. arxiv.org/abs/2401.01325
Here you go: Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond arxiv.org/abs/2304.13712
Here you go: Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond arxiv.org/abs/2304.13712
Should we use LLMs or fine-tuned models for downstream tasks? If you are interested in this question, please take a look: Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond arxiv.org/abs/2304.13712
The success of LLMs relies on vast amounts of high-quality data. We've analyzed from a data-centric AI perspective. Stay tuned! @TDataScience towardsdatascience.com/what-are-the-d…
🤝 Companies and organizations seeking research solutions to their data science and artificial intelligence challenges can partner with the D2K Lab at @RiceUniversity to sponsor a capstone team. Learn about the D2K Capstone at bit.ly/3laOXHW
Curious about what "data-centric AI" is? Our comprehensive survey delves into the increasingly important role of data in building AI systems, including the recent waves of LLMs. Check it out! arxiv.org/abs/2303.10158 #AI #MachineLearning #LLMs
The paper submission deadline is in 5 days!⏳Check out the CFP and submit at chilconference.org by Feb 15th, 11:59pm ET
We recently wrote an article discussing the following questions: 1) Whether LLM-generated texts could be detected? 2) How to detect? 3) Opportunities and concerns moving forward. Please take a look at the Medium article, the paper will be out next week. medium.com/@rxtang/the-sc…
Check it out if you are interested in outlier detection lnkd.in/eAbz_ng
Check it out if you are interested in outlier detection lnkd.in/eAbz_ng
A recent work on Trojan Attack on Deep Nets, certainly I like the title :) Paper: arxiv.org/pdf/2006.08131… Codes: github.com/trx14/TrojanNet
After much consideration, the General Chairs, Executive Committee and Organizing Committee for KDD 2020 have decided to take the conference fully virtual. The events of the past few months and the continued safety concerns have led us to make this difficult decision.
Excellent data on COVID-19 just published in Nature: nature.com/articles/s4159…
Wei Jin @weisshelter
2K Followers 930 Following Assistant Professor @EmoryUniversity | #DataMining #MachineLearning #GraphNeuralNetwork | Previously @dse_msu @amazon @snapJundong Li @LiJundong
1K Followers 454 Following Assistant Professor of ECE, CS, and DS at University of Virginia; AI, Machine Learning, and Data ScienceJing Ma (at AAAI24) @JingMa77838617
764 Followers 428 Following Assistant Professor at CWRU @CaseEngineer PhD @CS_UVA| Research intern at MSR @MSFTResearch Causal inference, graph mining, trustworthy AIMeng Jiang @Meng_CS
1K Followers 488 Following Associate Professor with Tenure at Notre Dame CSE | Data Mining | Natural Language ProcessingLuay Nakhleh @NakhlehRice
969 Followers 245 Following William and Stephanie Sick Dean of Engineering at Rice UniversityShuiwang Ji @ShuiwangJi
3K Followers 3K Following Machine Learning, AI for Science, Professor and Presidential Impact Fellow, Texas A&M University Fellow, IEEE and AIMBEJiliang Tang @tangjiliang
2K Followers 918 Following University Foundation Professor at MSU. Deep Learning on Graphs book (https://t.co/1JyC3k5c0H…).Kai Shu @KaiShu0327
751 Followers 446 Following Assistant Professor @IITComputing @illinoistech; Ph.D. of CS in @ASUEngineering. Data science, AI, disinformation; Formerly @MSFTResearch, @YahooResearch.Rice Engineering @RiceEngineering
4K Followers 686 Following The George R. Brown School of Engineering at Rice University offers unparalleled engineering education grounded in social responsibility.Rice Computer Science @RiceCompSci
2K Followers 927 Following Excellence in computer science since 1984. The largest department at Rice University.DK Xu @DongkuanXu
2K Followers 2K Following Assistant Professor @NCState. Co-Founder @GentopiaAI. Artificial General Intelligence. Ex- @MSFTResearch, @ https://t.co/JuUn6gRp78, @NECLabsAmerica. Big Fan of @NFL.Vagelis Papalexakis @vagelispapalex
2K Followers 2K Following Computer Scientist working on #datascience #machinelearning #tensors Associate Professor @UCR_CSE, PhD @ScSatCMU,summer internships @MSFTResearch and @GoogleHanjie Chen @hanjie_chen
2K Followers 365 Following Incoming Assistant Professor @RiceCompSci, Postdoc @jhuclsp, working on Trustworthy AI/NLP/ML, PhD @CS_UVA, former intern @allen_ai, @MSFTResearch, @IBMDegui Zhi @zhizhid
926 Followers 390 Following Professor Biomedical informatics. MedAI (Co-creator of Med-BERT), AI imaging genetics (https://t.co/EIdKENaGOD), and popgen (Identity-by-Descent geek).'YZ' Yezhou Yang (杨.. @prof_yz
1K Followers 422 Following #ProfYZ with @APGASU RTesearching @SCAI_ASU Co-founder https://t.co/9E0eTskwB9 PhD @umdcs BE @ZJU_China Promoting #UBAA #ARA #ARFAL @ https://t.co/XMFpunFQONLu Cheng @luchengSRAI
490 Followers 263 Following Assistant Professor @UICCS. Responsible and Reliable AI, causal machine learning, AI for social good. Previously @ASU DMML and @IBMResearchElio (Keqiang) Yan @KeqiangY
1K Followers 874 Following CS Ph.D. student @TAMU. D.E. Shaw Research Doctoral Fellow. AI&LLMs for materials, molecules, and proteins. Ex @MSFTResearch. Previously @PKU1898.William Wang @WilliamWangNLP
14K Followers 719 Following UCSB NLP Lab + ML Center. https://t.co/6TOnqbk6YT https://t.co/KJYhnav3Et Mellichamp Chair Prof. at UCSB CS. PhD @ CMU SCS. Areas: #NLProc, Machine Learning, AI.Yanqiao ZHU @Zhu_Yanqiao
802 Followers 621 Following CS PhD at @UCLA, @NSF_CCAS. Organizer of @logconference. Graph & geometric representation learning / AI for ScienceNancy @JeanetteHa59972
0 Followers 175 FollowingMohan Krishna Sunkara @mk344567
440 Followers 2K Following Phd Candidate in @ODUcs at @ODU | HCI/AI Researcher | Works at @WebSciDL, @accessodu Research Lab | Worked at @HP R&D, @ProcterGamble | MS,BS from @MAHE_ManipalTatho @Tathox61q
0 Followers 124 Followingisynch @funnynoise
10 Followers 78 FollowingMingyu_Jin19 @fnruji316625
4 Followers 21 Following Phd student @RutgersU|Undergrad at @LivUni|Data Mining,LLM, XAI, NLPJaywon Koo @JaywonK17250
26 Followers 69 Following CS PhD Student @VisLang @RiceUniversity | Prev. MS CS @ColumbiaUniversityGenerative AI @generativeaihub
7K Followers 6K Following Inspired by Algorithms, Powered by Imagination: Unleashing the Potential of Generative AI. #GenerativeAI #deeplearning #AI #MachineLearningFSM @fsm_top
8 Followers 119 FollowingXvQ @XvQ51845748
0 Followers 14 FollowingZhengping JIANG @zhengping_jiang
51 Followers 397 Following PhD Student in Natural Language Processing at JHU-CLSPHowieHwong @HowieH36226
4 Followers 81 FollowingAbhas Kumar Shrivasta.. @abhas_rewcie
2K Followers 4K Following 23. I used to chase the Avatar, but now I'm chasing a life of purpose and honor. #Zuko #Avatar #TheLastAirbender (Capt. Kane Williamson Fan Account)Pedro🇺🇦 @PeterW2014
5 Followers 696 FollowingCollaborativeDynamics.. @CoDynamicsAI
22 Followers 853 Following Boost all aspects of your business with our bespoke B2B AI solutions in prompt engineering, personas and automation. #AI #Automation #GenerativeAI🚀Arid Hasan @_aridhasan
12 Followers 60 Following Actively Looking for PhD opportunities| GRA & GTA, University of New Brunswick. RI: LLMs, Disinformation, NLProc, CV, and Information Retrieval.Shruti Singh ⇾ @shr.. @shruti_rsingh
225 Followers 2K Following Representation Learning for Scientific Literature | #NLProc | Fulbright fellow @yale | CS Ph.D. Student @iitgn | Past @daiictofficialJack Wang @_zichaowang
53 Followers 169 Following Research Scientist @AdobeResearch. AI for human learning, productivity, and creativity. Prev. Ph.D. student @RiceECE. ex-@GoogleAI, @NVIDIA, @MSFTResearchYihua Zhang @zyh2022
245 Followers 239 Following Ph.D. Student at Michigan State University. Robustness, Scalable and Trustworthy AI. Previous applied scientist intern @Amazon.Huiwen Xu, PhD @Dr_HuiwenXu
1K Followers 3K Following 徐会文, Health Services Researcher, Assist Prof @UTMB_SPPH | Pepper Scholar @UTMB_SCOA. PhD @UofR. Funded by @NIHAging. @CHPAMS #Aging #LTC #ADRD #AI Twitter=MinesFei Wang @fwang_nlp
921 Followers 2K Following PhD candidate @USC. PhD Fellow @Amazon. Responsible LLM.Yupeng Zhang @YupengZhang7
1K Followers 271 Following Assistant Professor @ECEILLINOIS University of Illinois Urbana-ChampaignGautam Machiraju 🌺 @gmachiraju
644 Followers 4K Following PhD-ing @StanfordAILab w/ @ParagMallick @HazyResearch🌲 AI-driven data copilots for scientific discovery♟️🧬🔬🛰🔭 Powered by prog house, people, 3rd places 🪩✨Yuke Wang @YukeWang1
807 Followers 1K Following On Academic Job Market | CS Ph.D. at UCSB | Deep Learning System | ex-Microsoft Research | ex-NVIDIA Research | NVIDIA Graduate Fellowship’22.HaoyueBai @haoyue_bai
941 Followers 839 Following Ph.D. student at Computer Science Department @UWMadisonCS, MPhil @HKUSTCSE.Kaiqiao Han @kaiqiao19017
0 Followers 26 FollowingKazuko Minamoto @KM_Minamoto
380 Followers 2K Following Washington State. lawyer. Diving and golf enthusiastsAshutosh Mehra @ashutoshmehra
1K Followers 5K Following Senior Principal Scientist at Adobe. Working on Acrobat AI Assistant, LLMs, and document ML.hayate nogotoku @hayaku652
1 Followers 58 Followingcychang @cychang9
7 Followers 49 FollowingAndrew white @Andreww95636515
131 Followers 3K Following 3d modeling. Gaussian splatting, NeRF, Diffusion models, GANs.Zhenting Wang @wang1999_zt
82 Followers 234 Following PhD Student @RutgersCS. Trustworthy and Responsible Generative Artificial Intelligence. Intern @SonyAI_global (current) @Meta GenAI (incoming)JadenZhong @JadenZhong27
0 Followers 54 FollowingChipmunk @AlvinMesser
87 Followers 843 FollowingNinghao Liu @ninghao0
27 Followers 104 FollowingQuyen Tran @tranquyenbk173
24 Followers 249 Following AI Research Resident at @VinAI_Research (Looking for a PhD position in Fall2025) Interested in Continual Learning, OOD detection and Generative modelsYann LeCun @ylecun
712K Followers 719 Following Professor at NYU. Chief AI Scientist at Meta. Researcher in AI, Machine Learning, Robotics, etc. ACM Turing Award Laureate.Jundong Li @LiJundong
1K Followers 454 Following Assistant Professor of ECE, CS, and DS at University of Virginia; AI, Machine Learning, and Data ScienceMeng Jiang @Meng_CS
1K Followers 488 Following Associate Professor with Tenure at Notre Dame CSE | Data Mining | Natural Language ProcessingLuay Nakhleh @NakhlehRice
969 Followers 245 Following William and Stephanie Sick Dean of Engineering at Rice UniversityShuiwang Ji @ShuiwangJi
3K Followers 3K Following Machine Learning, AI for Science, Professor and Presidential Impact Fellow, Texas A&M University Fellow, IEEE and AIMBEFei Wang @feiwang03
2K Followers 283 Following Health Data Scientist @Cornell; Founding Director, @WCM_AIDH; FACMI, FIAHSI, FAMIA;Jiliang Tang @tangjiliang
2K Followers 918 Following University Foundation Professor at MSU. Deep Learning on Graphs book (https://t.co/1JyC3k5c0H…).Jim Fan @DrJimFan
230K Followers 3K Following @NVIDIA Sr. Research Manager & Lead of Embodied AI (GEAR Lab). Creating foundation models for Humanoid Robots & Gaming. @Stanford Ph.D. @OpenAI's first intern.Kai Shu @KaiShu0327
751 Followers 446 Following Assistant Professor @IITComputing @illinoistech; Ph.D. of CS in @ASUEngineering. Data science, AI, disinformation; Formerly @MSFTResearch, @YahooResearch.Rice Engineering @RiceEngineering
4K Followers 686 Following The George R. Brown School of Engineering at Rice University offers unparalleled engineering education grounded in social responsibility.Rice Computer Science @RiceCompSci
2K Followers 927 Following Excellence in computer science since 1984. The largest department at Rice University.DK Xu @DongkuanXu
2K Followers 2K Following Assistant Professor @NCState. Co-Founder @GentopiaAI. Artificial General Intelligence. Ex- @MSFTResearch, @ https://t.co/JuUn6gRp78, @NECLabsAmerica. Big Fan of @NFL.Vagelis Papalexakis @vagelispapalex
2K Followers 2K Following Computer Scientist working on #datascience #machinelearning #tensors Associate Professor @UCR_CSE, PhD @ScSatCMU,summer internships @MSFTResearch and @GoogleHanjie Chen @hanjie_chen
2K Followers 365 Following Incoming Assistant Professor @RiceCompSci, Postdoc @jhuclsp, working on Trustworthy AI/NLP/ML, PhD @CS_UVA, former intern @allen_ai, @MSFTResearch, @IBMGoogle DeepMind @GoogleDeepMind
944K Followers 275 Following We’re a team of scientists, engineers, ethicists and more, committed to solving intelligence, to advance science and benefit humanity.Degui Zhi @zhizhid
926 Followers 390 Following Professor Biomedical informatics. MedAI (Co-creator of Med-BERT), AI imaging genetics (https://t.co/EIdKENaGOD), and popgen (Identity-by-Descent geek).AI at Meta @AIatMeta
533K Followers 255 Following Together with the AI community, we are pushing the boundaries of what’s possible through open science to create a more connected world.'YZ' Yezhou Yang (杨.. @prof_yz
1K Followers 422 Following #ProfYZ with @APGASU RTesearching @SCAI_ASU Co-founder https://t.co/9E0eTskwB9 PhD @umdcs BE @ZJU_China Promoting #UBAA #ARA #ARFAL @ https://t.co/XMFpunFQONLu Cheng @luchengSRAI
490 Followers 263 Following Assistant Professor @UICCS. Responsible and Reliable AI, causal machine learning, AI for social good. Previously @ASU DMML and @IBMResearchYuke Wang @YukeWang1
807 Followers 1K Following On Academic Job Market | CS Ph.D. at UCSB | Deep Learning System | ex-Microsoft Research | ex-NVIDIA Research | NVIDIA Graduate Fellowship’22.Shizhe Diao @shizhediao
1K Followers 928 Following On job market actively seeking industry positions ML NLP PhD | Intern @BytedanceTalk @sinovationvc Finetune your own LLMs with LMFlow: https://t.co/UTykmQAYPTFeng Xia @fxia61
266 Followers 555 Following Artificial Intelligence, Graph Learning, Brain Science, Digital Health, and Robotics. Professor @RMIT. Fan of #films & #cars.Hua Wei @realhuawei
934 Followers 601 Following Assistant Professor @SCAI_ASU, Penn Stater, Intelligent decision making, reinforcement learning, and urban computing. He/him/his. [email protected]Huaxiu Yao @HuaxiuYaoML
3K Followers 527 Following Assistant Professor of Computer Science @UNC @unccs @uncsdss | Postdoc @StanfordAILab | Ph.D. @PennState | #foundationmodels, #AISafety, #AIforScience | he/himJie Huang @jefffhj
4K Followers 569 Following Ph.D. Candidate at UIUC🌽; Formerly @GoogleDeepmind @NVIDIAAI @AmazonScience. #NLProc Large Language ModelsSong Han @songhan_mit
6K Followers 144 Following Assoc. Prof. @MIT, Distinguished Scientist @NVIDIA, cofounder of DeePhi (now part of AMD) and OmniML (now part of NVIDIA). PhD @Stanford. Efficient AI computingKen aka Frosty 🔜 D.. @KenAKAFrosty
2K Followers 1K Following 💻 Web developer, 🔬 applied deep learning & AI researcher. 🔨Building cool stuff for streamers.Kun Efimov-Zhang 張�.. @GuihuZhang
113 Followers 649 Following Doctoral Student at Inria Saclay-Île-de-France and École PolytechniqueChao Jiang @chaojiang06
360 Followers 911 Following Ph.D. student in Computer Science @mlatgt @GeorgiaTech, and @OhioState @UVA alumnusZhaoyang Wang @wangwan83764204
323 Followers 4K Following CS PhD student at Uni of Birmingham in the United Kingdom. Research interests: Automated Machine Learning, Online Learning, and Reinforcement Learning 🏳️🌈Luciano da F. Costa @LdaFCosta
7K Followers 7K Following Connected to complex systems, networks, image/shape analysis, pattern recognition, systems biology, music, and electronics. Full Prof of physics.Alison O. Gaby @AlisonOGaby
429 Followers 3K Following Founder @BalzardA164 AI for Museums | Co-Founder & Lead Data Scientist 👩🏽💻 @CareCovr | Mom 👧🏻👶🏻 | @WilliamsCollege 💜🐮💛 | 🏔 New Englander in📍NYCXinyu Xing @xingxinyu
958 Followers 1K Following Associate Professor@Northwestern University. Many Ph.D./internship/visiting scholar openings in software/system security. DM/email me.Xin Eric Wang @xwang_lk
7K Followers 1K Following Multimodal and Embodied AI Researcher / Professor @UCSC. Director of https://t.co/Y4swOBag21. AI for Humanity in the long run. he/himHongyeJ @serendip410
227 Followers 165 FollowingFurong Huang @furongh
4K Followers 2K Following Assistant professor of @umdcs @umiacs @ml_umd at UMD. Researcher in #AI/#ML, #Trustworthy AI/ML, #EthicalAI, AI #Democratization, AI for ALL.Chen Luo @rackingroll
322 Followers 505 Following Sr. Scientist at @Amazon Search, CS Ph.D from @RiceUniversity. ML, IR, NLP. Love traveling, adventuring, and having fun. Warriors Fan.Yisong Miao @YisongMiao
585 Followers 1K Following 4th Year PhD Student at @wing_nus @nuscomputing. Studying discourse and emojis with a focus on interpretability of LMs . @Charles_Leclerc will win WDC.Jingfeng Yang @JingfengY
2K Followers 624 Following Applied Scientist @AmazonScience #LLMs #NLProc Formerly @SALT_NLP @Georgia_Tech @PKU1898 @Google @MSFTResearch . Opinions are my own.Anshumali @Anshumali_
858 Followers 227 Following CS Professor, Rice University. Founder and CEO: ThirdAI (https://t.co/SQgXXs29ct) #BigData, #machinelearning, #deeplearning, #AI, #hashingHaoran Li @lihr04
17 Followers 76 Following Rice cs phd student; Continual learning theory, dl + stat + optRuozhen (Catherine) H.. @cathyrzhe
21 Followers 261 Following CS PhD Student @vislang @RiceUniversity | Prev. BSc @CityUHongKong | #computervisionDaochen Zha @zdcfrank
533 Followers 635 Following MLE @Airbnb | CS Ph.D. @RiceUniversity | Former intern @Meta | AI | ML | Reinforcement learningYifei Sun @YifeiSu79476650
55 Followers 211 Following Ph.D. student @ ZJU | NUS. Graph Machine Learning.Chao Fan @Chao_Fan__
391 Followers 848 Following Assist Professor of Climate Change Adaptation, AI & Digital Twins; Editorial Board @HSScomms @NaturePortfolio; Previous @ucdaviscee @TAMUCVENChao Zhang @chaozhangcs
467 Followers 393 Following Assistant Professor @ Georgia Tech CSE LLM, Uncertainty, AI for scienceZhimeng Jiang @ZhimengJ
392 Followers 1K Following Staff Research Scientist@Visa Research | CS Ph.D. @tamu| Formerly, @Amazon & @Visa & @Samsung | Trustworthy ML & Graph Neural Network | Opinions are my ownTengyu Ma @tengyuma
26K Followers 512 Following Assistant professor at Stanford; Co-founder of Voyage AI (https://t.co/wpIITHLgF0) ; Working on ML, DL, RL, LLMs, and their theory.David Leebron @davidleebron
9K Followers 181 Following President Emeritus of Rice University and former dean of Columbia Law School. President-designate of Texas 2036. Opinions and comments my own.OEDK-Rice University @RICE_OEDK
1K Followers 231 Following The Oshman Engineering Design Kitchen (OEDK) is Rice's multidisciplinary innovation design lab used by engineers from their freshman through senior years.Rice D2K | Data to Kn.. @Rice_D2KLab
486 Followers 353 Following Innovative hub for data science education @RiceUniversity. Training exceptional students. Making an impact through #DataScience!Shiqian Ma @ShiqianMa
1K Followers 1K Following associate professor@Rice University. PhD from Columbia IEOR. Work on optimization and machine learning.Lisa Biswal @biswal_lisa
435 Followers 91 Following Professor, Soft Matter, Chemical Engineer, Rheology, Foams and Emulsion.César A. Uribe @CesarAUribe
1K Followers 728 Following @RiceECE 🦉. 🇨🇴 Sometimes control theorist, sometimes optimizer, and sometimes ML or data scientist. 🏐 setter. Zizek groupie.K. Ramesh @GAAPRamesh
130 Followers 302 Following From a family of accountants! Prof. at @RiceUniversity. research capital market info environment. teach corporate governance and regulation. Hobby mridangam.Dr. Marcie O'Malley @MarcieOMalley
1K Followers 457 Following robots. twin boys. the academy. @RiceUniversity @RiceEngineering @RiceMECH Prof of Mech Eng, ECE, BIOE, and CS; Chair, Dept of Mechanical Engineering📢Postdoc Position📢 Dr. Xia Hu @huxia and I are looking for a Chairman's Postdoctoral Fellow in Efficient and Trustworthy LLMs @RiceCompSci If you are interested, please do not hesitate to apply: docs.google.com/document/d/1Ks…
We updated #SmoothQuant to accelerate newer LLMs:
🚀 Updated our #SmoothQuant paper! Now showing it works on newer LLMs like Llama-2, Falcon, Mistral, & Mixtral with W8A8 quantization & negligible loss. Check our paper if you want to reduce LLM serving costs! 📄: arxiv.org/pdf/2211.10438…
Thanks for sharing! It's a great reminder that, despite the challenges, our work is gaining attention. Recently, we released FlashAttention support for SelfExtend (github.com/datamllab/Long…, credits to @qingquan_song!). We're also working diligently to implement triton-based…
Thanks @LangChainAI for giving a shout-out to our self-extend! arxiv.org/abs/2401.01325 Excellent work led by @serendip410
FlashAttention Support for Self-extend! Give it a try!
Thanks for sharing! It's a great reminder that, despite the challenges, our work is gaining attention. Recently, we released FlashAttention support for SelfExtend (github.com/datamllab/Long…, credits to @qingquan_song!). We're also working diligently to implement triton-based…
Thanks @LangChainAI for giving a shout-out to our self-extend! arxiv.org/abs/2401.01325 Excellent work led by @serendip410
RAG for long context LLMs: Video Will long context LLMs really kill RAG? This is a talk @RLanceMartin gave at a few recent meetups that pulls together threads from a few different projects related to this question. Multi-needle in a haystack shows limitations in long-context…
Despite the mixed feelings about Google's latest Gemma model, we're big fans! @GoogleAI Why? Coz we found it pairs incredibly well with our SelfExtend 🤣🤣🤣 - like, perfectly! With Self-Extend, no fine-tuning needed, we effortlessly expanded Gemma's window from 8k to 90k+! On…
Amazing results for Gemma model with Self-Extend!!!
Despite the mixed feelings about Google's latest Gemma model, we're big fans! @GoogleAI Why? Coz we found it pairs incredibly well with our SelfExtend 🤣🤣🤣 - like, perfectly! With Self-Extend, no fine-tuning needed, we effortlessly expanded Gemma's window from 8k to 90k+! On…
Excited to introduce KIVI🥝, the first 2bit KV cache quantization breakthrough! 🚀 KIVI can be directly integrated into existing LLMs without any tuning. 📄 Paper: arxiv.org/abs/2402.02750 💻 Code: github.com/jy-yuan/KIVI #KIVI #LLM #AI #MachineLearning
llama.cpp now supports Self-Extend basic fact extraction tests with ~8k context and base LLaMA 7B v2 (train context of 4096) and the context extension seems to work. github.com/ggerganov/llam…
With the recent release of #TinyLlama, SLMs have attracted a lot of attention. I re-released my previously trained SLM - LiteLlama under the MIT license, which has 460M parameters trained with 1T tokens. I hope to contribute a bit to the community. huggingface.co/ahxt/LiteLlama…
So cool - this is one of the simplest, clearest, cleverest papers I've seen in a while. In 4 lines of code, the authors extend context windows by remapping OOC token positions at inference. arxiv.org/abs/2401.01325 LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning
LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning With only four lines of code modification, the proposed method can effortlessly extend existing LLMs’ context window without any fine-tuning. arxiv.org/abs/2401.01325
We conducted 45,079 experiments in total to investigate the most representative fairness method (maybe not the most recent method). Check out our paper for details and findings. Joint work with @jianfengchi, hugo chen, Qifan Wang,@hanzhao_ml, Na Zou, @huxia.
After speeding through the faculty tenure process in only six years — while launching his successful startup @ThirdAILab — @Anshumali_ Shrivastava has been recognized by the deans of Rice University with the Charles W. Duncan Jr. Achievement Award for Outstanding Faculty.
Just landed Kigali🇷🇼. See you soon at #ICLR2023 in person.
Very proud that my PhD students win two major graduate research award at the ECE department award ceremony. Congratulate Yushun Dong for receiving the Louis T. Rader Graduate Research Award and Song Wang for receiving the Ann Lee Brown Rookie of the Year Award @uvaece
This is our original thread
Should we choose to use LLMs or smaller finetuned models in practical use cases? Take a look at our survey arxiv.org/abs/2304.13712 , which covers NLU tasks, Generation tasks, Knowledge-intensive tasks, abilities regarding scaling, some miscellaneous and real-world tasks.
We've updated the still version of the LLM evolutionary tree. Add more models and correct several typos. Go for a check! github.com/Mooler0410/LLM… Our survey: Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond arxiv.org/abs/2304.13712
@DrJimFan thanks. The authors are @JingfengY @serendip410 @RuixiangT Qizhang Feng Bing Yin @huxia .
@DrJimFan Welcome to check out paper as well 😃 arxiv.org/abs/2304.13712