I'm launching a free data engineering boot camp on YouTube on 11/15!
It will be a new data engineering video every day from 11/15 until 12/31! I'm excited to share my hard work with you in a way that is accessible!
The launch video is here:
youtube.com/watch?v=myhe0L…
Please
There are so many different roles in the data domain. Deciding which role is best for you depends on your preferences.
One dimension you can look at these roles is what percent is spent building infrastructure versus digging into the data to find issues.
I personally love #dataengineering because I’m a builder at heart but love diagnosing things with SQL.
I’ve seen myself gravitating both directions towards #softwareengineering and #analyticsengineering depending on what I’m currently frustrated with.
Do you like building stuff? Do you like finding root causes of business issues? This core differentiator between these roles might help you understand where in the data value you chain you want to sit!
We're big tennis fans here. For all the Rafa fans, here's Rafael Nadal's heartfelt retirement speech translated into English.
What a legend, @RafaelNadal 🎾🏆
Crawl4AI is an open-source web-crawling and data-extraction tool built to integrate well with LLM and AI applications.
It has a comprehensive list of features ranging from simultaneous URL crawling to advanced extraction strategies based on LLMs.
And most importantly, choose the model based on what you need.
👉 o1 is perfect for complex algorithms, coding, and planning.
👉 GPT-4 is better for quick answers and working with text and images.
𝗞𝗲𝗲𝗽 𝗶𝗻 𝗺𝗶𝗻𝗱:
✅ Simplify. Short, direct prompts work best.
✅ Don't use "chain-of-thought". o1 already does the reasoning without being told.
✅ Limit context in RAG. Less is more.
🔍 If you have complex challenges that require more analysis, try the o1 model and let me know how it goes.
And if you have more info about o1... drop the post in the comments, and we can make a thematic megathread 🍓🍓
⚠️ But beware, o1 isn't for everything.
If you're looking to generate or edit text, you won't notice much difference compared to GPT-4o. However, if you're dealing with mathematical or analytical problems… that's where o1 really shines. 🌟
Haha we've all been there. I stumbled by this tweet earlier today and tried to write a little utility that auto-generates git commit message based on the git diff of staged changes. Gist:
gist.github.com/karpathy/1dd02…
So just typing `gcm` (short for git commit -m) auto-generates a one-line commit message, lets you to accept, edit, regenerate or cancel. Might be fun to experiment with.
Uses the excellent `llm` CLI util from @simonwllm.datasette.io/en/stable/
Crawl4AI is an open-source web-crawling and data-extraction tool built to integrate well with LLM and AI applications.
It has a comprehensive list of features ranging from simultaneous URL crawling to advanced extraction strategies based on LLMs.
286K Followers 8K FollowingFounder and CEO @tabul_ai. Creator of @trainxgb. ML ex Nvidia. Data Scientist. Physicist. Catholic. Husband. Father. Stanford Alum. Memelord. e/xgb. AMDG.
1.6M Followers 1K FollowingCo-Founder of Coursera; Stanford CS adjunct faculty. Former head of Baidu AI Group/Google Brain. #ai #machinelearning, #deeplearning #MOOCs
460K Followers 1K FollowingML/AI research engineer. Ex stats professor.
Author of "Build a Large Language Model From Scratch" (https://t.co/O8LAAMRzzW) & reasoning (https://t.co/5TueQKx2Fk)
250K Followers 2K FollowingThe world's leading publication for data science and artificial intelligence professionals.
Submit an Article ✍️ https://t.co/57pIMegK1o
27K Followers 744 FollowingData Scientist, Consultant & Tech Writer in @wandb and @DataCamp
University Professor at @UNAV
Work with me on 👉🏻 https://t.co/VdsUvbaqA2
22K Followers 26 FollowingA cutting-edge framework for orchestrating role-playing, autonomous AI agents that work together seamlessly to tackle complex tasks.
30K Followers 402 FollowingThe retreat where curious programmers recharge and grow. 🌱
Work at the edge of your abilities, develop your volitional muscles, and learn generously.
359 Followers 1K FollowingMetaverse artist creator of: Cozomo Obsessione; NFT Cards Tribute to; Los Fruittispunks; and everything that flies on my mind and can be painted.
3K Followers 1K FollowingHead of AI for EMEA Digital Natives at @Microsoft | Cofounder at @ClibrainAI | Artificial Intelligence Expert | Board Member | @IEuniversity Professor
239K Followers 4K FollowingAI Evangelist & Optimist. Latest AI News, Trends and learn how to use AI tools to augment your abilities. [email protected] for partnerships 🤝
1.3M Followers 2 FollowingWe're an AI safety and research company that builds reliable, interpretable, and steerable AI systems. Talk to our AI assistant @claudeai on https://t.co/FhDI3KQh0n.
1.1M Followers 172 FollowingNobel Laureate. Co-Founder & CEO @GoogleDeepMind - working on AGI. Solving disease @IsomorphicLabs. Trying to understand the fundamental nature of reality.
98K Followers 0 FollowingThe world's most powerful general-purpose agent and all-in-one AI platform. ChatLLM from Abacus AI is your AGI control center across all SOTA AI models and LLMs
13K Followers 166 FollowingEnd to end CUDA accelerated #datascience libraries built on @apachearrow, scaled with @dask_dev & Dask-SQL for ETL, ML, graph analytics, and DL preprocessing
15K Followers 544 Followingdata analytics consultant. technical writer. love stats. write threads on data science and analytics. reading non-fiction in my free time. always learning.
652K Followers 116 FollowingLets Learn #Python with tips and tricks. Free Python Course: https://t.co/l9NKxZWrh7 biz : [email protected] AI Community Partner. DM for Everything.