This paper sheds new light on the 5% of non-English data in Llama3, and on the opportunity it presents. The presence of small quantities of multilingual data apparently helps generalization between languages. That's good news for Llama3.
I am betting that within weeks, we will have…
Llama3 on @GroqInc: 300 tokens/sec
Claude Opus: 18 tokens/sec
GPT-4: 36 tokens/sec
All three models are seemingly comparable in capability.
If you are a builder, which one will you choose?
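To make the gap concrete, here is a rough back-of-the-envelope sketch of what those rates mean for a single response. It assumes a constant decode rate and ignores time-to-first-token, network latency, and rate limits; the 900-token response length is a hypothetical figure picked only for illustration.

```python
# Decode rates quoted in the post above, in tokens per second.
RATES_TOK_PER_SEC = {
    "Llama3 on Groq": 300,
    "Claude Opus": 18,
    "GPT-4": 36,
}


def seconds_to_generate(num_tokens: int, tokens_per_sec: float) -> float:
    """Time to stream num_tokens at a constant decode rate."""
    return num_tokens / tokens_per_sec


# Hypothetical response length, chosen only for illustration.
RESPONSE_TOKENS = 900

for name, rate in RATES_TOK_PER_SEC.items():
    print(f"{name}: {seconds_to_generate(RESPONSE_TOKENS, rate):.1f} s")
```

Under these assumptions the same response takes about 3 seconds at 300 tokens/sec, 25 seconds at 36 tokens/sec, and 50 seconds at 18 tokens/sec.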
It's going to be hard to adapt Llama3 for Indic languages, in my opinion.
Here are a few reasons why:
👉🏼 The tokenizer is tiktoken-based, which is not very efficient at tokenizing Indic text despite a vocabulary size of 128k.
👉🏼 Unlike SentencePiece-based models,…
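One way to see where the inefficiency can come from: tiktoken-style tokenizers run BPE over UTF-8 bytes, and Devanagari code points take three bytes each where ASCII takes one. The sketch below measures only that raw byte overhead. It does not run the Llama3 tokenizer, and real token counts depend on the learned merges, so treat it as an illustration of the starting disadvantage rather than a measurement.

```python
def utf8_bytes_per_char(text: str) -> float:
    """Average number of UTF-8 bytes per Unicode code point in text."""
    return len(text.encode("utf-8")) / len(text)


# Illustrative sample strings (not taken from the original post).
english = "hello"
hindi = "नमस्ते"  # "namaste" written in Devanagari

print(f"English: {utf8_bytes_per_char(english):.1f} bytes per character")
print(f"Hindi:   {utf8_bytes_per_char(hindi):.1f} bytes per character")
```

A byte-level BPE therefore needs enough Indic-specific merges in its vocabulary just to get back to one token per character, and a vocabulary trained mostly on English text is unlikely to spend many entries on them.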
Early results are not in favor of Llama3 :-( Looks like it has trouble following instructions. Again, this is early. Anything can happen, including a sudden dip in the loss curve and my house getting nuked. https://t.co/lZTRKZKgeb
Llama 3 is officially the fastest model from release to #1 trending on Hugging Face - in just a few hours.
30,000 new models based on Llama 1 & 2 have been released, so I can't wait to see the impact that the third and most powerful version will have on the ecosystem! 🚀🚀🚀