Ydn. @akuyudanta
ML+NLP Engineer ~ Dvorak typist |+62 🇮🇩 currently living as a registered Alien in +1 🇺🇸 | I heart Coffee, Traveling and Code for living(fun) 🌿 yudanta.github.io Pittsburgh Joined July 2009-
Tweets65K
-
Followers908
-
Following2K
-
Likes11K
Visual Prompt Injection 💉🛑 IRL
And the last good news of the week. Our work got accepted to the @ICSSIConference. It will be my first attendance at ICSSI. Thanks to my wonderful collaborators, @SarahHBana, @renzheyu, @danielrock, and @mrfrank5790, as well as the reviewers and organizers. See you soon at ICSSI!
Tian (@xie_tian), Principal Research Manager at @MSFTResearch, told us: "I always recommend data scientists/ML engineers read the most basic textbooks about ML" We had an interview with him: turingpost.com/p/mattergen Here are 2 books recommended by Tian:
resource alert! A minimal GPU design in Verilog to learn how GPUs work from the ground up github.com/adam-maj/tiny-…
RESEARCH OPPORTUNITY ALERT. If you're interested in synthetic data, we're recruiting for a Research Scholar to collaborate with Cohere For AI for a 6-month internship. Must be available full-time starting ASAP. DM if you're interested 🥰.
We're having a big event on agents at CMU on May 2-3 (one week from now), all are welcome! cmu-agent-workshop.github.io It will feature: * Invited talks from @alsuhr @ysu_nlp @xinyun_chen_ @MaartenSap and @chris_j_paxton * Posters of cutting edge research * Seminars and hackathons
We're having a big event on agents at CMU on May 2-3 (one week from now), all are welcome! cmu-agent-workshop.github.io It will feature: * Invited talks from @alsuhr @ysu_nlp @xinyun_chen_ @MaartenSap and @chris_j_paxton * Posters of cutting edge research * Seminars and hackathons
I've just released llamafile v0.8 which features LLaMA3, Mixtral 8x22b, and Grok support. It goes 25x faster than ollama at running LLaMA3 70B on CPU. My new tensor multiplication kernels let llamafile eval MoE models 2x faster than llama.cpp github.com/Mozilla-Ocho/l…
Dolphin-2.9-Llama3-70b is released - created by myself, @FernandoNetoAi, @LucasAtkins7, and Cognitive Computations under llama3 license. Much gratitude to my compute sponsor @CrusoeEnergy and personal thanks to @3thanPetersen for quantizing it! And much thanks to the dataset…
There is a really nice community of researchers developing transformer alternatives. Want to highlight these impressive folks. Simran Arora (@simran_s_arora), Chunting Zhou (@violet_zct), Dan Fu (@realDanFu), and Songlin Yang (@SonglinYang4)
Online live preview: storm.genie.stanford.edu Open-source code: github.com/stanford-oval/… Check out `examples/` to see how to configure open LMs like Mistral!
SnapKV LLM Knows What You are Looking for Before Generation Large Language Models (LLMs) have made remarkable progress in processing extensive contexts, with the Key-Value (KV) cache playing a vital role in enhancing their performance. However, the growth of the KV
Last week I gave my lecture on Parameter-Efficient Finetuning (PEFT) at the @UvA_IvI MSc in AI - Foundation Models course! Find the slides here: dropbox.com/scl/fi/gngd55q… and the rest of our course with @cgmsnoek: uvafomo.github.io
Our team in FAIR (at Meta) is hiring researchers (RS & PostDoc) to work on the broad topics of text and multimodal LLMs. Location: NY, Seattle or Menlo Park for RS, and Seattle for PostDocs. PostDoc: metacareers.com/jobs/968496244… Research Scientist, AI (PhD): metacareers.com/jobs/752169417…
Noor✨ @lynxluna
14K Followers 1K Following 𝚝𝚊𝚕𝚎𝚗𝚝 | 𝚕𝚎𝚜𝚜 Brotek Penggagas https://t.co/sXoMuYckAa. "Peak of mount stupid" Dunning-Kruger dweller. Sutan Sjahrir is my programming role model.Asep Bagja 🍍 @bepituLaz
5K Followers 558 Following Show don’t tell. 🎹📷💶 I post in English and Bahasa Indonesia.jogjaupdate.com @JogjaUpdate
1.0M Followers 62K Following akun resmi dari https://t.co/VHDcafzaMy | berbagi cerita, informasi dan berita | redaksi/promo: [email protected]alfaridi @alfaridi
1K Followers 599 Following Tukang ketik, bisa ditunggu. Seringnya nyampah. Lagi gembokan, kalo mau stalking kasih tau aja, ntar aku bukain.Ismail Sunni @ismailsunni
925 Followers 783 Following Math, football, writing, open source geospatial, and will code for sego kucingLeksa @leksa
7K Followers 1K Following Persistent. Everyone has a past, and will be in the future. But never in the present. Digital product. https://t.co/u91IXTjHH5Terry Perdanawati @tey_saja
14K Followers 952 Following Yogyakartan roaming around California & Minnesota. @jay_afrisando’s partner in rhyme, ugal-ugalan with @gatrawardaya, #WatikWirobrajanNilta @asanilta
5K Followers 1K Following Reminder that if you get mad at my tweets due to your own lack of reading comprehension and critical thinking, that's on you 🙏Serena📚 @DS_Serena_
13K Followers 2K Following yoga🧘♀️ travel✈️ books📖 data science📊 live for experiences not things but love buying things tho 🛍️🤑Rusman Wahab @rusman_plat_d
345 Followers 3K Following Lorem ipsum dolor sit amet; Alice, Bob, Charlie, John Doe, Jane Doe; Foo, Bar, Baz, Qux;Vada Calderara @VaCalderar
81 Followers 5K FollowingTwanna Arbeiter @ArbeiterTw36567
34 Followers 5K FollowingErlene Huguet @ErleneH50027
89 Followers 5K FollowingClaudia Pfotenhauer @ClaudiaPfo97176
39 Followers 5K FollowingCandida Costen @cos_candi
72 Followers 5K FollowingEkue @ekpodar
1K Followers 1K Following I am interested in Tech/AI, Marketing, and complex systems, I will posts random stuff in those categoriesShakira Hoff @HofShakir
46 Followers 5K FollowingAdriana Rabkin @adria_rabk
52 Followers 5K FollowingBirgit Giangregorio @BirgitGian68547
85 Followers 5K FollowingKamryn Ohaver @kamr_ohav
38 Followers 5K Followingluffynas @luffynas
11 Followers 76 FollowingKate Lepetich @KLepetich40965
88 Followers 5K Followingowshxx @freakydick
340 Followers 785 Following Lebih baik jodoh yang tertunda daripada jodoh yang tertukar.Robynn Miessler @miess_roby
11 Followers 2K Followingstrywbw @r0mance
1K Followers 5K Following Indonesian. https://t.co/ErrlNZnTAK ⚽ 「Content Ambassador @TopGoal_NFT」 https://t.co/UJqowCZI9x @tonnel_networkTori Brodine @t_brodi
52 Followers 5K FollowingAlexandrina Olejarski @AlexandrOlejar
45 Followers 5K FollowingHilariousHarper @HarperHila16779
16 Followers 2K FollowingWid @widnyana_
874 Followers 751 Following Cloud Infrastructure & Software Engineer | Meme driven development afficionadoWesley Neverman @NevermWesl
29 Followers 5K FollowingJayde Spickler @jayd_spickl
44 Followers 5K FollowingNel Sorbera @sorbe_n
39 Followers 5K FollowingPolly Willian @willi_pol
50 Followers 5K FollowingAmira Gutting @guttin_am
47 Followers 5K FollowingUnsloth AI @UnslothAI
3K Followers 250 Following Making AI & LLMs more accessible + faster for everyone! 🦥 Github: https://t.co/2kXqhhvLsb Discord: https://t.co/1Gmc1SDEljpolygon @yummyguccy
19 Followers 56 FollowingNikola R. Hristov @NikolaRHristov
1K Followers 2K Following 🪴 CEO ⁄ Open ⁄ Source — 📦 Founder → CEO ⁄ ⛈️ @PlayFormCloud — 📦 Founder → Member ⁄ 🌆 @CodeEditorLand — ☁️ https://t.co/djLsQ16Bwq —Ana Rojo-Echeburúa @arojomaths
721 Followers 3K Following Data Science & AI || PhD in Applied Mathematics || Spanish living in Scotland || Crossfit Athlete || Content CreatorVladimir Blagojevic @vladblagoje
622 Followers 304 Following Natural Language Processing; Stanford AI SCPD, MSc York University, Software Engineer @deepset_aiAkshay🚀 @akshay_pachaaer
396 Followers 2K Following Simplifying LLMs, MLOps, Python & Machine Learning for you! • Lead Data Scientist TomTom • BITS Pilani• 3 Patents • Join 2k+readers-›https://t.co/Fh7rvwwp5bFemke Plantinga @femke_plantinga
822 Followers 550 Following Fun stuff ✨ @weaviate_io Simply explaining: https://t.co/Ejfb4iz2BXSmeshiez @smeshiez37960
5 Followers 1K Following After walking for a long time, I realized that the distance is still full of tea, rice, oil and salt.Joanne @gregg_joanne31
220 Followers 3K FollowingSamson @Nethys57711
459 Followers 5K Following See the world on the road, and get to know yourself on the way!Junn @algnza
6 Followers 133 Following • tempat berkeluh kesah karena gak ada yg mau dengerin keluh kesahkuYode Arliando @yodearliando
251 Followers 540 Following Linux and Android Super User | Belajar Mengajar | https://t.co/5DY6eMIWn7Haihao Shen @HaihaoShen
3K Followers 3K Following Creator of Intel Neural Compressor/Speed/Coder, Intel Ext. for Transformers, AutoRound; HF Optimum-Intel Maintainer; Founding member of OPEA; Opinions my ownKaren Ali @KarenAli182861
115 Followers 3K FollowingShannon @busesooth47442
363 Followers 5K Following See the world on the road, and get to know yourself on the way!Ainun Najib @ainunnajib
165K Followers 5K Following Data & Tech | Cofounded @KawalCOVID19 & @KawalPemilu2019 & @kawalmasadepan | Arek NU asli Gresik | 🇮🇩 @ 🇸🇬 | ga sopan = blocklantip @lantip
40K Followers 1K Following pernah kesaspen. pernah tertipu. tapi tetap desainer, programmer, pencinta kopi, pencinta komikNoor✨ @lynxluna
14K Followers 1K Following 𝚝𝚊𝚕𝚎𝚗𝚝 | 𝚕𝚎𝚜𝚜 Brotek Penggagas https://t.co/sXoMuYckAa. "Peak of mount stupid" Dunning-Kruger dweller. Sutan Sjahrir is my programming role model.UdaImre🦉 @imrenagi
30K Followers 1K Following I tweet (funny) things about software engineering and our beloved govt. Ngefans sama IU. Twit serius ditandai dengan -IN lolPinot @pinotski
199K Followers 320 Following Tweeting in Indonesian. Pemalas yg demen kerajinan tangan dibantu keluarganya. Pembuat content di VeeFriends. Stroke Survivor. #SemestaHwarakadahMona @nmonarizqa
41K Followers 820 Following Orang Jogja tapi KTP bukan. Sometimes do data, sometimes code, sometimes do viz, and sometimes lie horizontally on the couch. Kalo DM jgn hi/salam doang 💗Zen RS @zenrs
184K Followers 1K FollowingIsmail Fahmi @ismailfahmi
181K Followers 2K Following Founder of Drone Emprit and Media Kernels Indonesia | https://t.co/L2Ibn7Ffmf | https://t.co/zcTAtTJm6x | #datascience #OSINTKalis Mardiasih @mardiasih
187K Followers 2K Following An Indonesian Female Moslem Writer. Storytelling with gender perspectives. Bersuratlah ke [email protected]Asep Bagja 🍍 @bepituLaz
5K Followers 558 Following Show don’t tell. 🎹📷💶 I post in English and Bahasa Indonesia.jogjaupdate.com @JogjaUpdate
1.0M Followers 62K Following akun resmi dari https://t.co/VHDcafzaMy | berbagi cerita, informasi dan berita | redaksi/promo: [email protected]alfaridi @alfaridi
1K Followers 599 Following Tukang ketik, bisa ditunggu. Seringnya nyampah. Lagi gembokan, kalo mau stalking kasih tau aja, ntar aku bukain.Agus Mulyadi @AgusMagelangan
142K Followers 794 Following Kadang blogger, kadang netizen, sesekali nulis buku | Penjaga toko @akalbuku | Duta @kecapmbahjoyo | Email: [email protected] 📲: 087722271000 |Elon Murz | SVP of Me.. @ecommurz
99K Followers 242 Following 😺 Indo biggest Tech Execs & Workers Community on Instagram 🍵 Teas, Memes and Shitpost 🔴 MURZ ON @bloomberg! 👇Wing Lian (caseus) @winglian
9K Followers 2K Following @axolotl_ai OSS maintainer. Axolotl AI founder. AI/ML tinkerer. Building tools for everyone.Matt Shumer @mattshumer_
51K Followers 1K Following CEO @HyperWriteAI, @OthersideAI - I make AIs do the impossible.Lintang Sutawika @lintangsutawika
383 Followers 565 Following Incoming Ph.D. student @LTIatCMU. Researcher at @AIEleuther. Maintainer of LM-Eval Harness. Here for machine learning papers and discussion.Simon Prince @SimonPrinceAI
9K Followers 331 Following Professor of Computer Science, University of BathJunyang Lin @JustinLin610
5K Followers 1K Following Chief Evangelist Officer of Qwen Team & OpenDevin, building LLM and LMM. Now @Alibaba_Qwen . Previously @PKU1898 LANCO group. ❤️ 🍵 ☕️ 🍷 🥃KDKA @KDKA
234K Followers 343 Following Breaking news, weather and sports from KDKA-TV and CBS News Pittsburgh. Expect More.Dave DiCello @DaveDiCello
129K Followers 946 Following Born, raised and proudly live in the great Steel City of Champions. Pittsburgh cityscape/wedding photographer. Pens fan. Marathon runner. Husband and dad.Pittsburgh Scanner @pgh_scanner
54K Followers 61 Following Two guys with a couple radios. Always listening to Pittsburgh Police/Fire/EMS. Voted #1 Pittsburgh Twitter account in 2023 by City Paper readers!Rui Zhang, PhD, FAMIA @RuiZhang1229
343 Followers 193 Following Founding Chief of Division of Computational Health Sciences @UMNSurgery, Director of NLP research program, McKnight Presidential Fellow, University of MinnesotaAmanda Lazar @Amanda_Lazar
909 Followers 756 Following Assistant Professor in the College of Information Studies in Human-Computer Interaction and Health Informatics. Studying technology, dementia, ageism.PennAITech @pennaitech
246 Followers 85 Following The Penn Artificial Intelligence and Technology Collaboratory for Healthy AgingUniversity of Pittsbu.. @PittTweet
66K Followers 2K Following The official Twitter account of the University of Pittsburgh. #H2PAMIA NLP WG @amiaNLPwg
298 Followers 115 Following American Medical Informatics Association (AMIA) Natural Language Processing (NLP) workgroup.João Moura @joaomdmoura
8K Followers 1K Following Founder of @crewAIInc / prev @clearbit (acc by @hubspot) Open Source enthusiast | Creator of Machinery | Public Speaker | My viewsReuben Ng @DrReubenNg
467 Followers 339 Following Behavioural and Data Scientist at the Lee Kuan Yew School of Public PolicyRichard Everts @rich_everts
102 Followers 79 Following Co-Founder, CEO Bestie Bot | AGI Researcher for 20+ years | Author, "Light up the Grind", "Terran Liberty" seriesBram @BramVanroy
1K Followers 707 Following @ku_leuven @ccl_kuleuven: Creative #NLG 🖋️ @ivdnt: Dutch #NLProc and #LLMs 🤖 Organizing @ctt2024 🖋️ Fellow at @huggingface 🤗 Prev. @lt3ugent, @SignONMetaGPT @MetaGPT_
4K Followers 117 Following The Multi-Agent Framework Github: https://t.co/nJEEwdSWBy Discord: https://t.co/CQDk1XH0bz Doc Site: https://t.co/UyljGj3ImwXavier Bresson @xbresson
13K Followers 859 Following Prof @NUSingapore Distinguished Researcher @DiscoverElement #NRF Fellow, #GraphNNs #LLMs #DeepLearningTheory #MolecularMaterialScience #Teaching Opinions my ownsamsja @samsja19
735 Followers 931 Following Research Engineer at @PrimeIntellect, previously training llm at @NyonicAI, maintainer of @docarrayEric Hartford @erhartford
12K Followers 396 Following Principal Applied AI Researcher @TensorWaveCloud I make AI models Dolphin and Samantha https://t.co/3ri2GbXrQB BTC 3ENBV6zdwyqieAXzZP2i3EjeZtVwEmAuo4Teknium (e/λ) @Teknium1
29K Followers 3K Following Cofounder @NousResearch, prev @StabilityAI Github: https://t.co/LZwHTUFwPq HuggingFace: https://t.co/sN2FFU8PVE Support me on Github SponsorsGroq Inc @GroqInc
45K Followers 468 Following Creator of the LPU™ Inference Engine, providing the fastest speed for AI applications, designed & engineered in N. America https://t.co/DsEqVAC5Dpjack morris @jxmnop
10K Followers 762 Following getting my phd in nlp @cornell_tech 🚠 // academic optimist // tweeting from the snack aisle at trader joesVincent Abbott | Deep.. @vtabbott_
3K Followers 200 Following Maker of *those* diagrams for deep learning algorithms | 🇦🇺 | https://t.co/kE2BIGCiMYOmar Khattab @lateinteraction
11K Followers 2K Following CS PhD candidate @StanfordNLP. 2022 Apple Scholar in AI/ML. Author of ColBERT (https://t.co/2ZtgXoa1np), DSPy (https://t.co/BH7WmMKDXR), & various retrieval & LM systems.Amanda Bertsch @abertsch72
1K Followers 673 Following PhD student @LTIatCMU / @SCSatCMU, researching text generation + summarization | she/her | also @ abertsch on bsky or https://t.co/L4HBUh0R9f or by email (https://t.co/bsHqwIMFPL)Akari Asai @AkariAsai
11K Followers 650 Following Ph.D. student @uwcse & @uwnlp. NLP. IBM Ph.D. fellow (2022-2023). Meta student researcher (2023-) . ☕️ 🐕 🏃♀️🧗♀️🍳Axolotl @axolotl_ai
837 Followers 18 Following Axolotl is the premier open source LLM fine tuning framework. find us on discord https://t.co/wlcE2wlJa9Probability and Stati.. @probnstat
33K Followers 332 Following Probability. Statistics. Machine Learning. Data Science.BAAI @BAAIBeijing
2K Followers 215 Following Beijing Academy of Artificial Intelligence. AI Science and Technology Innovation for Global Sustainable Development.RWKV @RWKV_AI
2K Followers 3 Following AI model built by the community, for everyone in this world Part of the Linux Foundation, Apache 2 licensed An RNN scaled to 14B params with GPT-level of perfJMIR Publications @jmirpub
37K Followers 16K Following #OpenScience publisher of #oa journals. J Med Internet Res (IF 7.4, JMIR #PublicHealth (IF 8.5), #overlay #JMIRx #planp #digitalhealth #digitalscienceMedARC @MedARC_AI
4K Followers 10 Following Medical AI Research Center (MedARC) Unlocking new possibilities in medical AI research. Founded by @iScienceLuvrWenhao Yu @wyu_nd
2K Followers 621 Following Senior Research Scientist at @TencentGlobal AI Lab in Seattle | Bloomberg PhD Fellow | Ex. @MSFTResearch @allen_ai @NotreDame @Bloombergrabbit inc. @rabbit_hmi
84K Followers 1 Following rabbit brings the future of human-machine interface. order r1, your pocket companion, now.Maxime Labonne @maximelabonne
12K Followers 432 Following Author of Hands-On Graph Neural Networks https://t.co/Q8victWUmR • Machine Learning ScientistUnsloth AI @UnslothAI
3K Followers 250 Following Making AI & LLMs more accessible + faster for everyone! 🦥 Github: https://t.co/2kXqhhvLsb Discord: https://t.co/1Gmc1SDEljGeronimo @Geronimo_AI
773 Followers 381 Following LLM enthusiast 🚀 failing fast, learning fast. sharing it all on X and MediumAlara Dirik @alaradirik
1K Followers 242 Following PhD candidate and @GoogleDeepMind scholar at @imperialcollege, previously at @huggingface and @unibogaziciKatharina Stögmülle.. @KStogmuller
17K Followers 1K Following Tulung bukuku ditumbas ben iso ganti sendal https://t.co/9B0n1ttNuuHarper Carroll @HarperSCarroll
11K Followers 610 Following Master’s + Bachelor’s in Computer Science specializing in AI & Machine Learning @ Stanford | AI @ Meta | Head of AI/ML @ brev | see Highlights for tutorialsJames Briggs @jamescalam
9K Followers 172 Following 👾 AI engineering: https://t.co/rLYkOCH5gb 🥑 Dev advocate @Pinecone ✏️ Learning and talking about everything https://t.co/aydfSKEar9qnguyen3 @stablequan
3K Followers 1K Following Multimodal | Synthetic Data | Multimodal Lead at Ontocord AIUCL DARK @UCL_DARK
3K Followers 186 Following UCL Deciding, Acting, and Reasoning with Knowledge (DARK) Lab at @AI_UCL led by @_rockt, @egrefen, @robertarail, and @jparkerholder.Bill Yuchen Lin 🤖 @billyuchenlin
6K Followers 2K Following Research @allen_ai. I evaluate (multi-modal) LLMs, build agents, and study the science of LLMs. Previously: @GoogleAI & @MetaAI FAIR @nlp_uscdolphin-2.9-llama3-8b-256k is released. It is dolphin-2.9-llama3-8b with @winglian's awesome 256k context adapter applied. I will get the model card done today.
🎉 Exciting news in #HealthcareAI! We're thrilled to introduce Hippocrates, an open-source LLM framework tailored for the medical field! 🚀 #AI #NLProc (1/9)
This is the prompt they are using for Llama-3 function calling (seems to work well even though it's not specifically fine tuned for that): github.com/ShishirPatil/g…
📊Delighted to welcome Command-R-Plus, Llama-3, and and Gemini-Pro-1.5 into the Berkeley Function Calling Leaderboard. Check out how they stack up across different categories, P95 latency, and costs at gorilla.cs.berkeley.edu/leaderboard.ht… Congratulations to @cohere, @AIatMeta, and…
📢 My blog post about fully open synthetic data generation pipelines with Llama3 is out! Wanna learn how to build preference datasets from scratch using OSS models with the learnings from the @argilla_io team? Check out the post, including code & data! huggingface.co/blog/dvilasuer…
🙋❓ Curious about the differences between DPO, KTO, ORPO, and other preference alignment algorithms? Check out a comprehensive overview in the latest blog post of our series with MantisNLP. argilla.io/blog/mantisnlp…
It turns out I had some misunderstandings about how Mixture of Experts really works, and the 128 experts seems more justifiable This blogpost by @huggingface was helpful: huggingface.co/blog/moe And: blog.javid.io/p/mixtures-of-… I had likened MoE to Random Forest which turned out to…
Good morning: @SnowflakeDB’s new 480B parameter #LLM is made of 128 experts! It’s bigger than #Grok and is now the largest *fully open source (Apache 2.0* LLM! 🧵👇 how does it compare to Llama 3, Mixtral, and GPT4?
If you're working with LLM, it's mandatory that you read this article 😱 "Evaluating a RAG System: Part 1 of 3" Article link: medium.com/@codegpt/evalu… This is the first research article released by the CodeGPT team. In this initial part, we employ the RAGAS framework to…
🤖🏆LangGraph: Can Language Models Solve Olympiad Programming? 🤖🏆 Last week, Princeton researchers released the USACO benchmark dataset and showed that a zero-shot GPT-4 agent only passes 8.7% of the questions. We've implemented this paper in LangGraph and created a tutorial…
Run an AI Town locally, powered by llama3 🎉 No cloud signups needed. Make your own world, and then talk to it :) Runs the open-source @convex_dev backend locally. Use @ollama locally or @togethercompute for cloud LLM. @realaitown
waktu ditinggal tangan kanan (sebelum stroke) diterusin tangan kiri (setelah stroke)
Psst. 😮 Big news to share! Today, @SnowflakeDB released a fully open-source foundation #LLM. It’s called #SnowflakeArctic, and it’s super smart and efficient. The really cool part? It writes beautiful ✨Streamlit code.🎈 Check out deets on our blog!👇 blog.streamlit.io/introducing-sn…
Just read Apple's "OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework". Similar to the OLMo, it's refreshing to see an LLM paper that shares details discussing the architecture, training methods, and training data. Let's start with the…
Mozilla published a nice writeup on how llamafile helped them evaluate the quality of AI models and quantization formats. The writeup goes into detail on the code, costs, and has a heartwarming story on some history with the local LLaMA community. blog.mozilla.ai/local-llm-as-j…
☹️Google Scholar is a great tool. But it doesn't show how papers are connected with each other. 😀Here's how to fast-track your literature review with a "visual search." And export your papers to Zotero, Mendeley, or EndNote. You can learn this workflow in 15 min:
we open sourced our chat interface. github.com/cohere-ai/cohe…
The distributional hypothesis states that words that appear in similar contexts tend to have similar meaning. The co-occurrence matrix counts how many times a word appears in a given context. Its rows are sparse word vectors that can be squeezed into dense word embeddings.
The dataset is everything. Great read: nonint.com/2023/06/10/the…
kena phk. gak punya income. sebelah kanan badan ngga bergerak. ngga bisa nggambar lagi. kalo kata orang "udah jatuh, tertipa tangga pula" kalo kami, tetep cengengesan "be the light!"
First open LLM from @SnowflakeDB! Arctic is 480B Dense-MoE with a 10B dense transformer model and a 128x3.66B MoE MLP designed specifically for enterprise AI. 🤔 TL;DR: 🧠 480B parameters with 17B active during generation 👨🏫 128 experts with 2 active in generation 2️⃣ Instruct…
I would like to invite you to try phi-3-mini: aka.ms/try-phi3-hf-ch…. You can also download the weights from HF with more model weights on the way. Besides what was described in technical report, one specific thing I want to mention is the 128K context support. It takes us a…