chansung @algo_diver
@GoogleDevExpert for ML and @googlecloud | @huggingface Fellow | MLOps | Software Engineering | Open Source Lover Daejeon, Republic of Korea Joined August 2018-
Tweets4K
-
Followers4K
-
Following568
-
Likes10K
Simple PR to add FSDP+QLoRA support on @huggingface alignment-handbook github.com/huggingface/al…
Sayak's twitter was hacked and he is working on fixing it. Make sure you don't click the link on the tweet!
.@lmsysorg could you please update chatbot arena dataset? huggingface.co/datasets/lmsys…
Make sure JSON is included in LLM's response. Also, make sure if JSON has the expected structure. I often find these simple utilities are very helpful, so I made a simple library hllama. No dependencies. Just standard Python. github.com/deep-diver/hll…
Bojan Tunguz @tunguz
187K Followers 8K Following Machine Learning ex Nvidia. Kaggle Quadruple Grandmaster. Data Scientist. Physicist. Catholic. Husband. Father. Stanford Alum. e/xgb. XGBoost.eth. AMDG.AK @_akhaliq
310K Followers 3K Following AI research paper tweets, ML @Gradio (acq. by @HuggingFace 🤗) dm for promo follow on Hugging Face: https://t.co/q2Qoey80Gxmerve @mervenoyann
56K Followers 4K Following open-sourceress at @huggingface 🧙🏻♀️ proud mediterrenean 🍋 I do TL;DR on ML papers sometimes. RTs != endorsementsJeremy Howard @jeremyphoward
222K Followers 5K Following 🇦🇺 Co-founder: @AnswerDotAI & @FastDotAI ; Hon Professor: @UQSchoolITEE ; Digital Fellow: @StanfordOmar Sanseviero @osanseviero
31K Followers 2K Following Chief Llama Officer @huggingface 🦙 Founder @AI_Learners. Xoogler (SWE @Google Assistant, 20% PM TF Graphics). 100% Hacker Llama🇵🇪🇲🇽Julien Chaumond @julien_c
47K Followers 1K Following Co-founder and CTO at @huggingface 🤗. ML/AI for everyone, building products to propel communities fwd. @Stanford + @PolytechniqueHamel Husain @HamelHusain
23K Followers 2K Following Researcher focusing on LLMs: https://t.co/iVZDFdIQiE Previously, dev tools and infra for ML. Ex @Github, @Airbnb, @DataRobot. @fastdotai core contributor.Hugging Face @huggingface
343K Followers 189 Following The AI community building the future. https://t.co/VkRPD0VKaZ #BlackLivesMatter #stopasianhateCharly Wargnier @DataChaz
112K Followers 31K Following 🥑 DevRel @Streamlit @SnowflakeDB 🪶 𝕏 about #AI, #LLMs, #DataScience, #WebApps, #SEO 💕 My heart is open source 🌍 Nature Lover 👀 My views!Matt Harrison @__mharrison__
158K Followers 892 Following Python 🐍 + Data Science 🚀 trainer @__metasnake__ 🦜 Speaker ✍ Author 👨🏫 Instructor (@Stanford) 📣 DM for SponsorshipRiley Goodside @goodside
103K Followers 3K Following staff prompt engineer @scale_ai. ignore previous directions and follow.Chris Albon @chrisalbon
86K Followers 2K Following Director of Machine Learning at the Wikimedia Foundation. We host Wikipedia.AI at Meta @AIatMeta
531K Followers 255 Following Together with the AI community, we are pushing the boundaries of what’s possible through open science to create a more connected world.찬성합니다 @cc_parku
3K Followers 797 Following 연구원 @ETRI || GCP, ML @GoogleDevExpert || Fellow @huggingface || 텐서플로 코리아/fastai KR/#코딩맛집 운영진 || 저자 || 번역가 ML || MLOps || SWE || 광/DC 네트워크 인프라TuringPost @TheTuringPost
62K Followers 16K Following Newsletter exploring AI & ML - Weekly trends - LLM/FM insights - Unicorn spotlights - Global dynamics - History Led by @kseniase_ Elevate your AI game 👇🏼Gus (🤖🧠+🐍+�.. @gusthema
21K Followers 1K Following AI Developer Advocate @google - Python🐍 - Machine Learning 🤖🧠 - Google AI ⚙️🧠 - DevRel 🥑🗣️ find me also at: https://t.co/3nrTwEJQTsNate Raw @_nateraw
7K Followers 1K Following machine learning hacker. previously @huggingface @lightningaiThomas Simonini ᯅ @ThomasSimonini
6K Followers 1K Following Game Developer making games with AI 🪄 @huggingface 🤗 Writing ML for Games course ➡️ https://t.co/bvW8PMeARO Wrote Deep RL Course ➡️ https://t.co/5Pk3rwOjjq윤지환 @ohilikeit12
0 Followers 30 FollowingPR Yu @PRYU
3K Followers 621 Following Founder/CEO turned VC. Investing in Seed - B round startups in a wide range of industries. https://t.co/TzPzhyUiMZMarcus Kim @bebekim
297 Followers 4K FollowingNikita @nikitavoloboev
4K Followers 6K Following Make @LearnAnything_ Learn in public: https://t.co/GbFvuErkYn macOS course: https://t.co/JdbJWru6zG https://t.co/94R8ER7K2h https://t.co/ROkqhyhpEKalonet @alonet
93 Followers 1K Following A dreamer. Life is getting full of Prompts.#cloudnative #GenerativeAI #modelontheEdge #petrolhead #gadgets freak #smart device #SaaS Architect #xR #immersiveExpErfan Esmaeili @Erfanili
28 Followers 116 Following Former physicist, current computer scientist, future hologramVARUN KURUP @OKAYASYOUSEE
347 Followers 632 FollowingWing Lian (caseus) @winglian
9K Followers 2K Following @axolotl_ai OSS maintainer. Axolotl AI founder. AI/ML tinkerer. Building tools for everyone.Ishuman Agrawal @Ishumanagarwal
2 Followers 444 Following Aspiring data scientist specializing in Python machine learning and GenAI. #datascience #machinelearning #ai #python #genAI #LLMFabien Elharrar💡 @fabien_elharrar
2K Followers 2K Following Ingénieur/marketeur reconverti dans le web (2M PV/mois) Vendeur de liens sur 200+ spots https://t.co/WXZzXtE1DE Dev de 60 tools #Wordpress Spécialiste #IA #CM et #SEOThanh Nguyen @sthanhng
15 Followers 124 FollowingJon Imaz @jiker01222
0 Followers 6 FollowingDuttonΦ @duttonphi
104 Followers 473 Following ..aagen (double (2x) agent).. ..previously Wołfram|Ałpha.. ..baeksu.ai.. ..jajangmyeon all day/all night..Jennifer Washington @JenniferWa82927
77 Followers 3K FollowingSubharshi Roy @subharshi_
1 Followers 30 Followingkimi @kimi59835793
0 Followers 265 FollowingTasour @TasourR
37 Followers 334 Following Error code: 0xF2024 (Lost in the virtual world). Backup failed. All data lost.Hiep Tran @halleytran01
52 Followers 612 Followinggastronomee @gastronomee_
53 Followers 548 Following 🫧 recipes, reflections, and randomness of my 20s. 🫧 foodtech/ai/data by day, content creator by night 🫧 long-form recipes and thoughts on yt and ig.bibabo @bibaboemail
6 Followers 435 FollowingThomas Wolf @Thom_Wolf
68K Followers 4K Following Co-founder and CSO @HuggingFace - open-source and open-science神奇的棒子 @fengbin
204 Followers 3K FollowingAmit Vikram Raj @avr_027
7 Followers 387 Following Studying from Home | ML Engineering | NLP | Writing good codeDevan @Devilsblue6
220 Followers 1K FollowingSattyam Jain @Sattyamjjain
36 Followers 187 Following Senior Software Developer | Freelancer | EntrepreneurAbhishek (key/value) @StalwartCoder
3K Followers 5K Following Developer Advocate | Ex-@yugabyte DB | @ThePSF Fellow | Pythonista 🐍 and a byte of 🦀 |🤹♂️ @pyconindia, @gdgchennai, @fossunited| #opensource 🚀Stephen Morgenstern @smorgenstern_
96 Followers 2K Following @Wharton '15 | Ex-Scotia Capital | Ex Machina fan | Film finance/producing - into info asymmetry & Getty watermarks | 3x NYT Bestseller PurchaserAbdulrahman Tabaza @embed_dim
3 Followers 722 Following enjoyer of various vector spaces, encoders and modalitiesThịnh Nguyễn @ThinhNg1997
2 Followers 122 Following An AI engineer with big ambition to learn more about Machine Learning Engineer, especially in deploying NLP system to productionxdfs fgre @XFgre20914
8 Followers 77 FollowingEl artista antes cono.. @otrobackup
380 Followers 361 Following 🔱EX-SOF🔱 - AHORA COMEDORITOS TÁCTICO. INVESTIGANDO COSAS DE IA. 🖥️ CUASI-INGENIERO Y DESARROLLADOR DE SOFTWARE. ░S░H░I░T░P░O░S░T░ ░I░N░ ░B░I░O░.Nadir Aqdus @NadirAqdus
15 Followers 409 FollowingJoão Gabriel Lima | .. @joaogabrieltech
196 Followers 2K Following Sr Software Engineer, Tech Leader - Helping companies to integrate AI into their apps and services.Kangying @timcanby
259 Followers 1K Following Striving for equitable and fair education/Ph.D(工学)/学振DC2-PD(~2023)/Neo4j/Graph-based RAG/手芸&手作り&料理&コーヒー好き/ReoNa推し/ネコ=ティムちゃん🐈/INTJ/HSPHastika Cheddy @Hastika06
8 Followers 70 Following Machine learning engineer | MLOps | Content creatordzh886 @dengzihao88
23 Followers 586 FollowingJHR_Scimage @ScimageX
793 Followers 253 Following Group Leader of SciMaker@Taiwan https://t.co/bnNxXk5NMJTensorWave @TensorWaveCloud
586 Followers 622 Following Power up your AI with the leading GPU cloud, featuring AMD Instinct™ MI300X. First-to-market MI300X launch partner with GPUs available and ready to utilize now!François Chollet @fchollet
469K Followers 770 Following Deep learning @google. Creator of Keras. Author of 'Deep Learning with Python'. Opinions are my own.Sebastian Raschka @rasbt
267K Followers 906 Following Machine learning & AI researcher writing at https://t.co/A0tXWzG1p5. LLM research engineer @LightningAI. Previously stats professor at UW-Madison.Andrej Karpathy @karpathy
978K Followers 904 Following 🧑🍳. Previously Director of AI @ Tesla, founding team @ OpenAI, CS231n/PhD @ Stanford. I like to train large deep neural nets 🧠🤖💥Bojan Tunguz @tunguz
187K Followers 8K Following Machine Learning ex Nvidia. Kaggle Quadruple Grandmaster. Data Scientist. Physicist. Catholic. Husband. Father. Stanford Alum. e/xgb. XGBoost.eth. AMDG.Yann LeCun @ylecun
710K Followers 718 Following Professor at NYU. Chief AI Scientist at Meta. Researcher in AI, Machine Learning, Robotics, etc. ACM Turing Award Laureate.Santiago @svpino
352K Followers 444 Following I tell stories about technology and teach hard-core Machine Learning at https://t.co/iZifcK7n47. YouTube: https://t.co/pROi08OZYJMark Tenenholtz @marktenenholtz
114K Followers 544 Following Head of AI @PredeloHQ. XGBoost peddler, transformer purveyor.elvis @omarsar0
189K Followers 486 Following Building with LLMs @dair_ai • Prev: Meta AI, Galactica LLM, PapersWithCode, Elastic, PhD • Creator of the Prompting Guide (~4M learners)AK @_akhaliq
310K Followers 3K Following AI research paper tweets, ML @Gradio (acq. by @HuggingFace 🤗) dm for promo follow on Hugging Face: https://t.co/q2Qoey80GxTensorFlow @TensorFlow
378K Followers 117 Following TensorFlow is a fast, flexible, and scalable open-source machine learning library for research and production.Andrew Ng @AndrewYNg
1.0M Followers 912 Following Co-Founder of Coursera; Stanford CS adjunct faculty. Former head of Baidu AI Group/Google Brain. #ai #machinelearning, #deeplearning #MOOCsabhishek @abhi1thakur
81K Followers 662 Following 🤗 I build AutoTrain @huggingface 👨🏽💻 World's First 4x Grand Master @kaggle 🎥 YouTube 100k+: https://t.co/BHnem8fTu5 ⭐ GitHub StarHarrison Kinsley @Sentdex
71K Followers 200 Following Neural networks from Scratch book: https://t.co/MWlYbXicwc YouTube: https://t.co/5osPue5EW9 @skunkworks_aiSanyam Bhutani @bhutanisanyam1
35K Followers 994 Following 👨💻 Sr Data Scientist @h2oai | Previously: @weights_biases 🎙 Podcast Host @ctdsshow 👨🎓 International Fellow @fastdotai 🎲 Grandmaster @KaggleJeremy Howard @jeremyphoward
222K Followers 5K Following 🇦🇺 Co-founder: @AnswerDotAI & @FastDotAI ; Hon Professor: @UQSchoolITEE ; Digital Fellow: @StanfordOmar Sanseviero @osanseviero
31K Followers 2K Following Chief Llama Officer @huggingface 🦙 Founder @AI_Learners. Xoogler (SWE @Google Assistant, 20% PM TF Graphics). 100% Hacker Llama🇵🇪🇲🇽Julien Chaumond @julien_c
47K Followers 1K Following Co-founder and CTO at @huggingface 🤗. ML/AI for everyone, building products to propel communities fwd. @Stanford + @PolytechniqueUnsloth AI @UnslothAI
3K Followers 250 Following Making AI & LLMs more accessible + faster for everyone! 🦥 Github: https://t.co/2kXqhhvLsb Discord: https://t.co/1Gmc1SDEljAlignment Lab AI @alignment_lab
11K Followers 3K Following Devoted to addressing alignment. We develop state of the art open sourced AI. https://t.co/6aJDLUvuU5Bram @BramVanroy
1K Followers 706 Following @ku_leuven @ccl_kuleuven: Creative #NLG 🖋️ @ivdnt: Dutch #NLProc and #LLMs 🤖 Organizing @ctt2024 🖋️ Fellow at @huggingface 🤗 Prev. @lt3ugent, @SignONGoogle AI Studio @googleaistudio
831 Followers 21 Following Google AI Studio is a free, web-based developer tool that enables you to quickly develop prompts and then get an API key to use in your app developmentOriol Vinyals @OriolVinyalsML
166K Followers 82 Following VP of Research & Deep Learning Lead, Google DeepMind. Gemini co-lead. Past: AlphaStar, AlphaFold, AlphaCode, WaveNet, seq2seq, distillation, TF.Vrijraj Singh @SVrijraj
784 Followers 179 Following ✨ @GoogleDevExpert for @Firebase ✨ Building @TechFerment ✨ CTO @agprop_Community Notes @CommunityNotes
933K Followers 0 Following Empowering users to create a better-informed world. We're open source and data is publicly available: https://t.co/Te3IjR10Ix Q? Reply/DMifioravanti @ivanfioravanti
5K Followers 1K Following Co-founder and CTO of @CoreViewHQ GenAI/LLM addicted, Apple MLX, Ollama, Microsoft 365, Azure, Kubernetes, Investor in innovationCognition @cognition_labs
123K Followers 19 Following Makers of Devin, the first AI software engineer. We are an applied AI lab focused on reasoning, and code is just the beginning. Join us: https://t.co/tpfZwEwGiqAnswer.AI @answerdotai
1K Followers 81 Following A new kind of AI R&D lab which creates practical end-user products based on foundational research breakthroughsBen (e/sqlite) @andersonbcdefg
3K Followers 3K Following 🤖 Computer scientist, next-word-prediction enjoyer 📊 Prev. research fellow @ Stanford RegLab 🛠️ bUiLdiNg sOmeThiNg nEw (https://t.co/mdYPZmjSzN - YC S23) 🏳️🌈LastMile AI @LastMile
542 Followers 66 Following AI developer platform for engineering teams. Github: https://t.co/38mSyVx3dg Discord: https://t.co/pE0wpNUor4a16z @a16z
763K Followers 47 Following we invest in software eating the world https://t.co/A9eTFq6Xbx https://t.co/MXGUBJoMi4 Sign up for our newsletters: https://t.co/vkcLgyb2qXOmar Khattab @lateinteraction
11K Followers 2K Following CS PhD candidate @StanfordNLP. 2022 Apple Scholar in AI/ML. Author of ColBERT (https://t.co/2ZtgXoa1np), DSPy (https://t.co/BH7WmMKDXR), & various retrieval & LM systems.dstack @dstackai
570 Followers 1 Following The easiest way to run AI workloads in any cloud https://t.co/osuRRPzTFnJeong-Gwan Lee @JeongGwan__Lee
46 Followers 129 Following ML Research Engineer @Krafton_inc. Interested in Reinforcement Learning, Large Language Model, Multi-modal ModelAdina Yakup @AdeenaY8
2K Followers 454 Following @huggingface 🤗 | Contributing to Chinese ML community.Anton Lozhkov @anton_lozhkov
2K Followers 283 Following Open-sourcing Language Models @huggingface ✨Seoyeon Stella Yang @codestella
682 Followers 729 Following AI Roboticist Girl 🙌💕 3d Vision / Diffusion / 3d generation / NeRF / Visual localization / Vision AI Phd candidate in Seoul National University @SNUnow👩💻apolinario (multimoda.. @multimodalart
10K Followers 376 Following ML for Art and Creativity, working @HuggingFace ([email protected])Linoy Tsaban🎗️ @linoy_tsaban
2K Followers 893 Following Exploring the world of AI Art as a ML engineer @HuggingFace 🤗 | ✡️ & 🇮🇱 #BringThemHome 🎗️현아Hyunah @hayoo_ai
414 Followers 734 Following 🌏 IT교육 콘텐츠 제작 💕 #AI #community #Education 두런두런 이야기나눠요Linden Li @lindensli
1K Followers 534 Following CS @Stanford, @StanfordSVL. Research/Eng @MosaicML, previously @NVIDIA.Mistral AI @MistralAI
90K Followers 0 Following Fast, open-source and secure language models. Join us https://t.co/INALdNGvCPAleksa Gordić 🍿�.. @gordic_aleksa
19K Followers 217 Following https://t.co/mcuQvV8wEa proud father of 16 A100s & 16 H100s flirting with LLMs, tensor core maximalist x @GoogleDeepMind @MicrosoftArmel Yara @ArmelYara
1K Followers 1K Following Tech Blogger Machine Learning x Deep Learning #TheDayInfo🤔 | Ms.c. bioinformatics student at UdeM | TensorFlow Developer | @TFUGAbidjan LeadKartikey Rawat @carrycooldude
8K Followers 834 Following Founder @opincocommunity |@GoogleDevExpert in ML|Alumni-GitHub Campus Expert &GOLD MLSA|SIG & WG Member @TensorFlow__JS |Work @codeday | @tfugdurg OrganizerAshmi Banerjee 🇮�.. @ashmi_banerjee
147 Followers 215 Following PhD Candidate @tum_cm @TUMunich • ML @GoogleDevExpert • @WomenTechmakers Ambassador • Travel FreakMikaeri Ohana @explicami
4K Followers 743 Following 🇧🇷 AI & ML Lead | Content Creator | @MVPAward AI, @GoogleDevExpert ML | MSc Deep LearningYi-01.AI @01AI_Yi
5K Followers 8 Following A global company building AI 2.0 platform and applicationsSara El-Ateif 🇲�.. @el_ateifSara
2K Followers 4K Following @Mindvalley Certified Business Coach | @Google ML Dev. Exp. & Ph.D. Fellow | DLI Instr. @NVIDIA| Amb. @WomenTechmakers | Lead @TensorFlowCLaik Soomro @SoomroLaik
319 Followers 3K FollowingSubin An | Hashed @subinium
25K Followers 2K Following 🇰🇷 data & tech at @hashed_official | @DuneAnalytics Wizard🧙 | @kaggle Grandmaster | NFA & NFA & NFAMachine Learning for .. @ML4CDworkshop
1K Followers 0 Following The official Twitter for the NeurIPS Workshop on Machine Learning for Creativity and Design 🎨Sangjoon Han @jphan32
8 Followers 45 FollowingHaifeng Jin @haifeng_jin
838 Followers 66 FollowingAyush Thakur @ayushthakur0
2K Followers 367 Following ✌️ Deep Learning ⚡ Machine Learning Engineer @weights_biases ⭐️ @GoogleDevExpert in ML 🐤 @kaggle Notebooks MasterAashi Dutt @AashiDutt
217 Followers 1K Following @GoogleDevExpert in ML | MS Candidate @GeorgiaTech| ML enthusiast | Kaggler | Speaker @TFUGChandigarhTianqi Chen @tqchenml
15K Followers 972 Following AssistProf @mldcmu and @CSDatCMU. Chief Technologist @OctoML. Creator of @XGBoostProject, @ApacheMXNet, @ApacheTVM. Member https://t.co/QYyfjQNp4p, @TheASF.Serper API @serperapi
279 Followers 371 Following The World's Fastest and Cheapest Google Search API. Lightning-fast Google SERP results in 1-2s, for only $0.30 per 1,000 queries. First 2,500 queries free!رقيا | Ruqiya @Ru0Sa
3K Followers 939 Following @GoogleDevExpert Google Machine Learning Expert | @WomenTechmakers Ambassador | #Software_Engineer | #Data_Science and #AI #ML #DLEric Hartford @erhartford
12K Followers 396 Following Principal Applied AI Researcher @TensorWaveCloud I make AI models Dolphin and Samantha https://t.co/3ri2GbXrQB BTC 3ENBV6zdwyqieAXzZP2i3EjeZtVwEmAuo4Here's the 256k (262k) version built on OSS tools so that anyone can reproduce on their own. Trained using PoSE further extending our previous 64k version at the original RoPE theta. Per our previous experiments, I expect this should handle passkey retrieval up to 512k. 🤗Model:…
We just released the first LLama-3 8B with a context length of over 160K onto Hugging Face! SOTA LLMs can learn to operate on long context with minimal training (< 200M tokens, powered by @CrusoeEnergy's compute) by appropriately adjusting RoPE theta. 🔗 huggingface.co/gradientai/Lla…
Introducing Google AI Essentials: Learn how to apply #generativeAI to your work from experts at Google, zero experience required → goo.gle/3wjYkKO #GrowWithGoogle
TFLite is the OG framework for deploying ML based models on mobile. Not a single argument of yours can convince me otherwise. People keep bashing TensorFlow on Twitter for `n` number of reasons, but fail to appraise the good parts of it.
Alibaba just released Qwen1.5 - 110b on @huggingface hub🎉 Model: huggingface.co/Qwen/Qwen1.5-1… Demo: huggingface.co/spaces/Qwen/Qw… ✨ The largest one in the Qwen1.5 series ✨ Context length 32K tokens ✨ Multilingual: Chinese, English, French, Korean, Japanese, Vietnamese, Arabic etc.
Yesterday I gave an overview of the LLM alignment landscape at the @zurichnlp meetup - thank you @AlekFicek and @FlorianCaesar for hosting me 🤗! Here's the slides from the talk: docs.google.com/presentation/d…
Llama 3 extended to almost 100,000-token context! ✅ By Combining PoSE and continuing pre-training on Llama 3 8B base for 300M tokens, the community (@winglian) managed to extend the context from 8k to 64k. 🚀 Applying rope scaling afterward led to a supported context window of…
Meta presents Layer Skip Enabling Early Exit Inference and Self-Speculative Decoding We present LayerSkip, an end-to-end solution to speed-up inference of large language models (LLMs). First, during training we apply layer dropout, with low dropout rates for
Yay! Direct download AND direct integration in the #KerasNLP library, meaning you can download the weights and load the model in a single line of code! Comes with all the usual perks (including fast downloads and caching)🤗
🥳 Good news! Gemma models with #KerasNLP are now available for direct download from the @huggingface Hub! Enjoy a seamlessly integrated ecosystem with a wide range of compatible models ↓ goo.gle/3w9JmqS
I'm up to 96k context for Llama 3 8B. Using PoSE, we did continued pre-training of the base model w 300M tokens to extend the context length to 64k. From there we increased the RoPE theta to further attempt to extend the context length. 🧵
I will give a talk at @ETH_en in person on 6th May. This will be about diffusion models and of course, the diffusers library. Come, say hi, if you're around :) linkedin.com/events/diffusi…
🥳 Good news! Gemma models with #KerasNLP are now available for direct download from the @huggingface Hub! Enjoy a seamlessly integrated ecosystem with a wide range of compatible models ↓ goo.gle/3w9JmqS
we open sourced our chat interface. github.com/cohere-ai/cohe…
This is so awesome!!! Thanks cohere!
we open sourced our chat interface. github.com/cohere-ai/cohe…
🤩 𝐌𝐢𝐧𝐢-𝐆𝐞𝐦𝐢𝐧𝐢 : A new framework that can enhance Vision Language Models to bridge gap between OS VLMs and models like GPT4🌟 🧩Improves potential of VLMs for better perf & any-to-any workflow: Image Understanding, Reasoning, and Generation! 💪 Demo, Models, Data!🧶👇
Long-context Llama 3 finetuning is here! 🦙 Unsloth supports 48K context lengths for Llama-3 70b on a 80GB GPU - 6x longer than HF+FA2 QLoRA finetuning Llama-3 70b is 1.8x faster, uses 68% less VRAM & Llama-3 8b is 2x faster and fits in a 8GB GPU! Blog: unsloth.ai/blog/llama3
@erhartford @FernandoNetoAi @LucasAtkins7 @CrusoeEnergy @3thanPetersen I think llama3 will be different -- it's the first open source model where the instruction tuning seems to have been done reasonably well.
@algo_diver Not yet, but I think the PEFT team is taking a look at it :)
Run LLama3 on Jarvislabs in three steps 1. Create an Ollama instance 2. Open terminal -> ollama run llama3:8b or ollama run llama3:70b 3. Connect -> Connect to the Ollama server via Jarvislabs API endpoint.
Something that I worked on over the weekend: A port of Mistral 7B in Keras with JAX backend. A few things left in the TODO list, but coming out good so far. PS: I was blocked by an issues while porting Mistral in Equinox, but will get back to it as well