State of AI @StateofAIbyGPT4
Weekly newsletter summaries of the most cited and discussed ML papers by GPT4. stateofaigpt.substack.com Joined March 2023
Tweets: 285 · Followers: 10K · Following: 97 · Likes: 429
When it comes to #timeseries #forecasting, transformers are what you do not need. The new architecture 'SOFTS: Efficient Multivariate Time Series Forecasting with Series-Core Fusion' outperforms all transformers, including the latest ones such as iTransformer and…
Phi-3, Groma, PhysDreamer, Reka Core & TextSquare in the new State of AI issue! Read here 👇 open.substack.com/pub/stateofaig…
phi-3-mini: 3.8B model matching Mixtral 8x7B and GPT-3.5. Plus a 7B model that matches Llama 3 8B in many benchmarks. Plus a 14B model. arxiv.org/abs/2404.14219
3D Gaussian is great, but how can you interact with it 🌹👋? Introducing #PhysDreamer: Create your own realistic interactive 3D assets from only static images! Discover how we do this below👇 🧵1/: Website: physdreamer.github.io
Someone just dropped a dataset of 15 trillion tokens (as many as were used to train Llama 3)!!! Download this now before it gets taken down for “copyright reasons” Breakdown in thread 🧵 👇👇
Private LLM for iOS v1.7.8 is now live on the App Store. 🎉 Experience the power of the latest Llama 3 8B Instruct model from @AIatMeta, running privately, fully on-device with no internet connection or telemetry. Works on all Pro, Pro Max iPhones and Apple Silicon iPads. Also,…
Google presents Many-Shot In-Context Learning - Proposes many-shot ICL, i.e., adding up to thousands of examples in context with Gemini 1.5, which boosts performance significantly - Using synthetic CoT is very effective in this setting. arxiv.org/abs/2404.11018
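The many-shot idea in the tweet above can be sketched as plain prompt construction: concatenate a large number of solved examples, then append the unanswered query. This is only an illustrative sketch, not the paper's code. The function name `build_many_shot_prompt`, the `Q:`/`A:` template, and the toy arithmetic examples are all assumptions made for the illustration.

```python
from typing import List, Tuple


def build_many_shot_prompt(
    examples: List[Tuple[str, str]],
    query: str,
    max_shots: int = 1000,
) -> str:
    """Join up to max_shots solved (question, answer) pairs, then the open query.

    The "Q:/A:" layout is a hypothetical template chosen for this sketch.
    """
    shots = examples[:max_shots]
    blocks = [f"Q: {question}\nA: {answer}" for question, answer in shots]
    # The final block leaves the answer empty for the model to complete.
    blocks.append(f"Q: {query}\nA:")
    return "\n\n".join(blocks)


# Toy data: 500 solved additions stand in for real in-context examples.
examples = [(f"What is {i} + {i}?", str(2 * i)) for i in range(1, 501)]
prompt = build_many_shot_prompt(examples, "What is 7 + 8?")
```

The resulting string would be sent to a long-context model as a single prompt; how many shots fit depends on the model's context window.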
The model card has some more interesting info too: github.com/meta-llama/lla… Note that Llama 3 8B is actually somewhere in the territory of Llama 2 70B, depending on where you look. This might seem confusing at first but note that the former was trained for 15T tokens, while the…
We've just uploaded a GGUF of the 8b llama-3 instruct model on @NousResearch's huggingface org: huggingface.co/NousResearch/M…
a single 4090 that's insane https://t.co/fHjb2y1hQD
We just released Mixtral-8x22B-v0.1 and Mixtral-8x22B-Instruct-v0.1: - Free to use under Apache 2.0 license - Outperforms all open models - Native function calling - Masters English, French, Italian, German and Spanish. - Seq_len = 64K mistral.ai/news/mixtral-8…
State of AI Week 3, April 2024 - OmniFusion - MegaLodon - OSWorld - Rho-1 - Infini-attention open.substack.com/pub/stateofaig…
Ollama now supports WizardLM-2! 7B model: ollama run wizardlm2 Learn more: ollama.com/library/wizard… 8x22B model is uploading.
AI Agents are an Amazing Hack!! Here are some @crewAIInc tips from some recent adventures - Use @GroqInc with Mixtral 8x7B for quick prototyping and testing; it will save you $$$, especially on OpenAI API spend. Explained in the video. - Add a rating or validator agent as a…
TimeGPT is the first foundation model specifically designed for time series analysis. It excels at generating precise forecasts across a diverse range of datasets and domains. Here's what you need to know about it: 1/8
🔥State of AI #53🔥 Read about: - Octopus v2 - MiniGPT-4-Video - Mixture-of-Depths - LLMs as compilers - Visual Autoregressive Modeling 👇👇👇 open.substack.com/pub/stateofaig…
God bless @JustineTunney's llamacpp kernels, Mixtral8x22b running CPU ONLY at ~9 tokens per sec. Yep that's GPT4 class AI. I'll push out cpu-optimized 4bit/8bit EdgeQuants after benchmarking.
Introducing Parler-TTS: an inference and training library for high-quality, controllable text-to-speech (TTS) models 🗣️ To fuel the development of open-source TTS research, we are open-sourcing all datasets, training code and our first iteration checkpoint: Parler-TTS Mini v0.1
GPU Poor? **Friends** is all you need. i built a distributed training module from scratch that performs model training over a cluster of M-series macs connected over the same network. this was inspired by @anyscalecompute's Ray, @hnasr's work, @ShopifyEng's merlin article and…
GPT-4 Turbo with Vision is now generally available in the API. Vision requests can now also use JSON mode and function calling. platform.openai.com/docs/models/gp… Below are some great ways developers are building with vision. Drop yours in a reply 🧵
Cool new work from some colleagues at Apple: more accurate LLMs with fewer parameters and fewer pre-training tokens. Also has MLX support out of the box! Code here: github.com/apple/corenet/…
Apple presents OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework. The reproducibility and transparency of large language models are crucial for advancing open research, ensuring the trustworthiness of results, and
Llama 3 on @GroqInc is incredible The 70b model beat opus on my financial RAG tests. Llama 3 RAG results: • speed: 2.59s • correctness: 81.33% This is the highest score I have seen on financial RAG. • 7 secs faster than opus • 4% more correct than opus With insane…
Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing: AlphaLLM for the self-improvement of LLMs, which integrates Monte Carlo Tree Search (MCTS) with LLMs to establish a self-improving loop, thereby enhancing the capabilities of LLMs without additional…
Incredible stuff! Microsoft's new model can produce a deepfake from 1 photo and 1 audio clip!!!!
🔥Today we are announcing WizardLM-2, our next generation state-of-the-art LLM. New family includes three cutting-edge models: WizardLM-2 8x22B, 70B, and 7B - demonstrates highly competitive performance compared to leading proprietary LLMs. 📙Release Blog:…
🚀Excited to introduce ResearchAgent🤖, your new partner in crafting the next best papers!📰 ResearchAgent is designed to revolutionize your research process: it automatically identifies problems, develops methods, and designs experiments. Paper: arxiv.org/abs/2404.07738
Kinda wild that you can merge models with SoTA techniques at the click of a button! 🤯 Presenting MergeKit UI - Drop in your config, access token and voila, you get a merged model back! Supported merging methods: 1. Model Soups 2. SLERP 3. Task Arithmetic 4. TIES 5. DARE TIES…
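Of the merging methods listed in the tweet above, SLERP (spherical linear interpolation) is the easiest to show in a few lines. The sketch below interpolates two flat weight vectors along the arc between them rather than along a straight line. It is a generic textbook formulation under stated assumptions, not MergeKit's implementation; the function name `slerp`, the list-of-floats representation, and the fallback to linear interpolation for near-parallel or zero vectors are choices made for this illustration.

```python
import math
from typing import List


def slerp(w0: List[float], w1: List[float], t: float, eps: float = 1e-8) -> List[float]:
    """Spherically interpolate between weight vectors w0 and w1 at fraction t in [0, 1]."""
    dot = sum(a * b for a, b in zip(w0, w1))
    norm0 = math.sqrt(sum(a * a for a in w0))
    norm1 = math.sqrt(sum(b * b for b in w1))

    def lerp() -> List[float]:
        # Plain linear interpolation, used when the angle is undefined or tiny.
        return [(1.0 - t) * a + t * b for a, b in zip(w0, w1)]

    if norm0 < eps or norm1 < eps:
        return lerp()

    # Angle between the two vectors, clamped against rounding error.
    cos_omega = max(-1.0, min(1.0, dot / (norm0 * norm1)))
    omega = math.acos(cos_omega)
    sin_omega = math.sin(omega)
    if sin_omega < eps:
        return lerp()

    c0 = math.sin((1.0 - t) * omega) / sin_omega
    c1 = math.sin(t * omega) / sin_omega
    return [c0 * a + c1 * b for a, b in zip(w0, w1)]


# Halfway between two orthogonal unit vectors stays on the unit circle.
merged = slerp([1.0, 0.0], [0.0, 1.0], 0.5)
```

In a real merge this would be applied per tensor after flattening, with `t` controlling how much of each parent model is kept.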