Mitchell van Rijkom @MitchellvRijkom
Data Engineer @ The.NextGen | Tips & Tricks about PySpark, Databricks, and Data Engineering 💡| I Help Companies Build Scalable Big-Data Solutions 🚀 mitchellvanrijkom.com Joined April 2016-
Tweets1K
-
Followers605
-
Following263
-
Likes468
The logical next step after you perfect writing your prompt is to embrace AI tools into your daily workflows. Consider using GitHub Copilot to speed up your coding process, or Grammarly to correct your grammar mistakes. I understand that doing it yourself is powerful and you'll…
AI development isn't just important, it's crucial for our future. To get this right, we need: Public Funds: More investment Public Awareness: More people paying attention Public Participation: Everyone's involvement Let’s shape a better future together!
Is Data Really Neutral? Think Again! Data isn't neutral. When AI learns from data, it picks up our biases too. For instance, an AI for recruiting might favor men for tech jobs if most data points to men in those roles. What to do? Check for Bias: Understand that all data can…
Why Every Data Professional and Business Should Use What-If Risk Scenario Mapping ⚠️ What-if scenarios are more than just guesses. They're key tools that help us think ahead about possible risks and prepare better for the future. Here’s why they matter: Get Ready in Advance:…
Bad data can lead to more problems since more decisions are now based on data. With more focus on data, AI systems are working more on improving the datasets rather than just making better predictions. This shows a change from just wanting big data to wanting good data!
"Sometimes all you need for exceptional results is average effort repeated for an above-average amount of time" - James Clear Almost 2 years in, nearly 1000 posts deep: snippets, insights, detailed explanations, videos! The journey shows the power of consistency 🚀 What's next?
Programming languages have come a long way from the detailed assembly code to user-friendly languages like Python. This shift has transformed coding into a more accessible and intuitive process. Here’s what we’ve gained: - A shift from machine-focused to human-centric coding -…
Databricks + Delta Lake is making the life of a (big) data engineer so much easier these days! - Easily ingest data with autoloader - Maintain a data model with merge & update commands I think I can never go back now... #Databricks #DeltaLake
As a data professional, understanding how to model data and implement slowly changing dimensions and data marts is crucial. These skills help you store and manipulate data effectively, supporting your organization's data needs and driving business insights 🚀
A surrogate key uniquely identifies database records. It's a random integer with no business meaning, distinct from the source data. Used as primary keys, like in customer databases, it links records across tables without revealing business details.
Data investments? Check. Infrastructure? Check. Insights? Check. But if you can't trust your data, it's all for nothing! 🤷♂️
Accuracy is key for high-quality data! Data Engineers ensure data integrity and reliability through validation and quality checks, flagging discrepancies and inaccuracies.
Level up your data engineering skills! 📊 Data modeling, big data technologies, data quality, and data governance - learn what you need to know to succeed in data engineering! 💪🚀 #DataEngineering
🚀 Hey everyone! What are your favorite AI SaaS tools? Share your top picks! 💡
Andrew Ng, a respected figure in the field, believes that AI will revolutionize every industry, much like electricity did 100 years ago. I am confident that this transformation will indeed occur! While there are concerns about AI replacing jobs, Ng emphasizes that AI is still…
Everyone should learn SQL! While PySpark excels at dynamically creating pipelines, SQL's staying power is unmatched. Tools change, PySpark may be replaced, but SQL remains crucial. With Databricks leading the charge in templating and syntax, SQL is democratizing data…
AGI could be just 3 years away! 🚀 What plans do you have for when it arrives?
If you think you have the best idea in the world, write it out in as much detail as you can. Then comes the hardest part... Wait for 2 months. Go back to your idea; if you still think it's perfect, execute it right away!
Andreas Kretz @andreaskayy
6K Followers 270 Following I teach #DataEngineering & make YouTube videosDarshil | Data Engine.. @parmardarshil07
20K Followers 437 Following Freelance Data Engineer • YouTube (100k+) • Making data easier for everyone • @AWS Community Builder • Ex- @WayfairSimon Späti 🏔️ @sspaeti
3K Followers 1K Following Dad. Technical Author, Data Engineer and Educator https://t.co/49Ty3GXkqs, https://t.co/7r8pihWPQz. Tweets mostly: #dataengineering, #opensource, #writing, #pkm and #neovimData Girl @hanuna_ma_data
5K Followers 3K Following Cloud and Big Data Engineer Tweeting about daily problems & solutions of an engineer Learning #kubernetesUmesh Bhati @Umesh_bhati_
2 Followers 10 FollowingNico Acosta @acossta
9K Followers 1K Following Co-founder & CEO at @PropelDataCloud. Ex-product lead @ TwilioCatherinaCully @CatherinaC30011
10 Followers 725 FollowingChristopher Ogude @OgudeChris
8 Followers 108 FollowingAaditya ; @Aaditya26082004
566 Followers 7K Following CS'26 • Machine Learning • Open-Source • Web Dev. • Algorithms • Jai Shree Krishna 🦚🪈Erika Hoss @ErikaHoss64387
52 Followers 5K FollowingReba Aldape @re_aldape
16 Followers 3K FollowingRuna Symonds @RunaSymon
64 Followers 5K FollowingOnlineMoneyStystem @OMS2024
693 Followers 3K FollowingClarisa Cieslak @cies_clar
52 Followers 5K FollowingAzariah Shreckengost @AShreckeng95030
31 Followers 5K FollowingGenerative AI @generativeaihub
7K Followers 7K Following Inspired by Algorithms, Powered by Imagination: Unleashing the Potential of Generative AI. #GenerativeAI #deeplearning #AI #MachineLearningKunal @Kunalc232
6 Followers 114 FollowingRaghul @Raghul03669900
17 Followers 127 Following Nothing is permanent ☺️ Fan of @actorvijay ❤️🔥and @KeerthyOfficial ♥️Tahir @aslanmtahir
287 Followers 256 Following 🤖 Coding and joking - that's my style! Data engineer turning life's quirks into funny tweets. Need laughs and coding tips? You're in the right place!Chinmaya Kr Puhan @PuhanKr
3 Followers 47 FollowingAurelius @Franz_Kafka_111
55 Followers 122 FollowingTom Kirkman @tomkirkman3003
0 Followers 55 FollowingNamoo @Orduh
327 Followers 3K Following From the Land of the Rising Sun ⛅, Artificial Intelligence and Data Analytics, budding infosecNikhil Sharma @NikhilS46942797
1 Followers 50 FollowingGuruprasad Tandlekar @gurutandlekar27
41 Followers 163 Following "90% of our life revolves around uncertainties and if you land here it must be the other 10%"numb @numbbsense
14 Followers 56 FollowingDileep Bharati @dileepbharati2
28 Followers 477 Following Data Analyst | Machine Learning | Data scienceRk @Rk1050063
1 Followers 47 FollowingAnshul Negi @anshulspider
4 Followers 31 Following Being public keeps you in reality check Standing on the shoulder of giants. Project based learningNaruto uzmaki @All_hail_tiger
296 Followers 646 Following Bezawada, Retired ICT fan, @tarak9999, @msdhoni, @imvkohli, @chennaiIPL @ncbnTech Booster @tech_booster
49 Followers 214 Followingஇளவேனில.. @Ilavenilmaaran
206 Followers 2K Following பிறப்பொக்கும் எல்லா உயிர்க்கும். (All living things are same in nature and by birth). Belongs to the Great Dravidian Stock. ஓரிறை கொள்கை உடையவன்.AYUSH SINGH @AYUSHSI08217440
307 Followers 4K FollowingIlemona @atawodimona
137 Followers 152 Following Trainee at Wallbreakers, Coffee, Linux Mint, Cinnamon and C++..Diego G. Teixeira @diegogteixeira
8 Followers 394 FollowingR Lopez @1louvro
70 Followers 284 FollowingKhalid @khalidzaheer_
43 Followers 178 Following Data Engineer. They plan. And Allah plans. And Allah is the best of planners. ~ Qur'an 8:30Franzcsia @franzcsia
2 Followers 7 FollowingCaptain Pascal 👲�.. @_Alphamide
315 Followers 3K Following Student Leader @aspire_leaders. A para-discipline individual 🪄Mohit P @realmohitp
16 Followers 23 Following Data Enthusiast | Transforming chaos into insight with every byte....Vikram Krishna @vikram__krishna
131 Followers 250 Following Python Loving Data Science and Machine Learning Practitioner | Life long learner 🤍 | 10K + on LinkedIn https://t.co/om22UXfCwgThomas Kara @thomaskara87
41 Followers 181 Following I design data products to integrate technology with human needs & I'm passionated about my personal fitness and health ...Something exciting is coming soon!Siddharth Nahata Jain @SidNahataJain
13 Followers 245 Following I posts value adding content #daily #Facts | #Finance | #growth | #motivation |#realestate |#jobs | Hare Krishna🙏Ric @Ricardojr48
108 Followers 70 FollowingSantiago @svpino
354K Followers 447 Following I tell stories about technology and teach hard-core Machine Learning at https://t.co/iZifcK7n47. YouTube: https://t.co/pROi08OZYJZach Wilson @EcZachly
30K Followers 897 Following Founder @ https://t.co/CWvLDHTuVZ $1m ARR | ADHD | 700k+ followers on all platforms | 10 yrs DE experience |ex @facebook, @netflix, and @airbnbStart Data Engineerin.. @startdataeng
8K Followers 31 Following I write about data engineering | SQL | Python | Distributed systems. Get my free data engineering course at https://t.co/sZTEcV0Q9WMike Driscoll @driscollis
118K Followers 1K Following I tweet about everything #Python Writing about Python @mousevspython @realpython Teaching at @TeachMePy Author of multiple books - https://t.co/MdP25zw5zQCharly Wargnier @DataChaz
112K Followers 31K Following Developer Advocate @Streamlit (acq. @SnowflakeDB) | prev. Samsung • 𝕏 about LLMs, AI, Data Science, Web Apps and SEO • My heart is open source • Views mine!Akshay 🚀 @akshay_pachaar
138K Followers 419 Following Simplifying LLMs, MLOps, Python & Machine Learning for you! • AI Engineering @LightningAI • Lead DataScientist • BITS Pilani • 3 PatentsData Science Dojo @DataScienceDojo
188K Followers 968 Following We make learning data science and LLMs easy! Join the community of 10,000+ professionals. #DSDojoPatrick Loeber @patloeber
55K Followers 891 Following Software Engineer • YouTube 250K+ • Helping you to learn Python and Machine Learning • AI Developer Advocate @AssemblyAI • @python_engineer founderDatabricks @databricks
70K Followers 1K Following Databricks is the data and AI company, helping data teams solve the world’s toughest problems.Modern Data Stack @moderndatastack
6K Followers 459 Following Everything that you need to know about building and operating a Modern Data Stack. Operated by team at @quantive_incAndreas Kretz @andreaskayy
6K Followers 270 Following I teach #DataEngineering & make YouTube videosMichael Kahan @kahandata
735 Followers 38 Following Helping small data teams build modern architectures faster | Founder of The Modern Data Community | Former corporate employee turned independent consultantErgest Xheblati @ergestx
8K Followers 991 Following Data + Business | Author: Minimum Viable SQL Patterns | Newsletter: Data Patterns (see links below)SeattleDataGuy @SeattleDataGuy
23K Followers 8K Following Data Engineer/Data Science Consultant and All Around Data Guy | Angel Investor - Now In Denver https://t.co/KZ9mVAg37vNick Singh | The Data.. @NickSinghTech
30K Followers 4K Following Author of Ace the Data Science Interview. Free Book Preview 👇 https://t.co/1izgOFy1Kt Founder of https://t.co/yyE4B5Ltpf (SQL Interview Prep) Ex-FacebookDarshil | Data Engine.. @parmardarshil07
20K Followers 437 Following Freelance Data Engineer • YouTube (100k+) • Making data easier for everyone • @AWS Community Builder • Ex- @WayfairSebastian Raschka @rasbt
268K Followers 885 Following Machine learning & AI researcher writing at https://t.co/A0tXWzG1p5. LLM research engineer @LightningAI. Previously stats professor at UW-Madison.Simon Späti 🏔️ @sspaeti
3K Followers 1K Following Dad. Technical Author, Data Engineer and Educator https://t.co/49Ty3GXkqs, https://t.co/7r8pihWPQz. Tweets mostly: #dataengineering, #opensource, #writing, #pkm and #neovimSwapna Kumar Panda @swapnakpanda
175K Followers 167 Following | Tech Writer, Educator | prev. Architect | Python, JavaScript, SQL | Programming, Development, Databases, AI Tools, Remote Jobs | Building @JabardastDEV |Generative AI @generativeaihub
7K Followers 7K Following Inspired by Algorithms, Powered by Imagination: Unleashing the Potential of Generative AI. #GenerativeAI #deeplearning #AI #MachineLearningRuben Hassid @RubenHssd
12K Followers 362 Following Daily LLMs benchmarks & prompt engineering. Founder at https://t.co/n6tTy5Q7uX (bootstraped)Johannes Vink @JohannesVink
486 Followers 247 Following Business Intelligence/Data Engineering Architect. ETL. DWH. Data Platform. SQL. Python. Fun.AGI - Tech Gone Wild .. @AGItechgonewild
2K Followers 1K Following Follow the Wild Tech towards AGI - Artificial General Intelligence, Superintelligence and Beyond ✨Norwegian Engineer & Tech Director 🫡 🇳🇴Techno Optimist 🦾Tanthong Nguyen @Machine1235
118 Followers 532 Following PhD, AlphaZero in JAX, text-to-speech models, mechanistic interpretability, exploring neural nets' internal circuits...David Jayatillake @DSJayatillake
1K Followers 386 Following Co-Founder & CEO @ Delphi | I write every week at https://t.co/5HPzmIyuPs | https://t.co/K6IGvlqp76Matthew Housley @doctorhousley
398 Followers 170 Following Co-founder at Ternary Data, co-author of Fundamentals of Data Engineering (https://t.co/TP2uGUL94p).DuckDB @duckdb
13K Followers 3 Following DuckDB is an in-process SQL OLAP database management system. "DuckDB" and the DuckDB logo are registered trademarks of the DuckDB Foundation.Hilary @AghasiliHilary
303 Followers 1K Following Co-Founder @ei_volv || Cloud&Devops Engineer || Data Engineering enthusiastJohn Rush @johnrushx
21K Followers 3K Following 20 bootstrapped Tools For Busy Founders. Sharing lessons on Startups & Growth. ⑴https://t.co/PJscUxOC4Z ⑵https://t.co/wxaRNYF9F5 ⑶https://t.co/hS4xMWThHi … ⒇⇢https://t.co/Fpjq9yZPMZData Engineer @techtonic190427
2K Followers 2K Following Data Engineer | Pythonist | Only data can judge on behalf of God; all else is mere opinion. Those who control the data control both you and me.Hypefury - Simple aud.. @hypefury
73K Followers 99 Following Simple social automation & content creation for entrepreneurs who dream big 🚀 Free Twitter growth tips in your 👉 📩 https://t.co/KWuQw0Dos8Siraj Raval @sirajraval
63K Followers 947 Following Founder of WagerGPT & GPT School. Youtube Creator. AI Engineer. Author of "DApps" (O'Reilly 2016). Investor. Ex-Twilio, Meetup, Polygon & Columbia. AGI by 2025LlamaIndex 🦙 @llama_index
64K Followers 25 Following The way to connect LLMs to your data. Github: https://t.co/HC19j7vMwc Docs: https://t.co/QInqg2zksh Discord: https://t.co/3ktq3zzYII https://t.co/UXeIlwvvbABob Haffner @bobhaffner
259 Followers 120 Following Data Engineer | Host of @EngSideOfData #dataengineering #dataengineer podcast: https://t.co/07UIkRSRciRivery @RiveryData
2K Followers 1K Following Unify your ELT pipelines, workflow orchestration, & #data operations with Rivery's complete #SaaS platform. #WinningWithDataJoseph Tsar @joseph_tsar_
4K Followers 74 Following Sharing what I learn as I improve my verbal athleticism. Building @nounceaiStepan Hlinka @stepanhlinka
54K Followers 975 Following Building 8-Figure Saas company. Generated over $11mil in sales for clients using direct response marketing. Sharing tips on marketing, sales & business growthJoel Konye || Data Pl.. @JoelKonye
291 Followers 151 Following Data Engineer || Data Scientist || Technical Writer || Sharing practical insights and strategies to simplify your data engineering journeySandeep Pawar @PawarBI
4K Followers 1K Following Data Analytics | Data Science | Power BI | Microsoft Fabric | Data Platform MVP 🇮🇳🇺🇲 https://t.co/ZDAcUyygbo | Home Barista ☕ , Cricket fan 🏏Nitish ⚡️ @nitishmutha
3K Followers 327 Following Co-founder and CTO @GenieAI - Building the world’s best AI Legal Assistant. @UCL alum.MinIO @Minio
7K Followers 370 Following MinIO is pioneering high-performance, Kubernetes-native object storage. Enterprise grade + Amazon S3 compatible, its the #1 choice for multi-cloud deployments.Matthew Powers @neapowers
307 Followers 167 Following Spark / DataFrame nerd. Helping Latin@s / Brasilians in tech. Blog and write open source to make ppl more productive.Nathan Sundararajan @SqlNathan
684 Followers 355 Following Databricks, Fabric, Pyspark, Python, Machine learning , AI,Powerbi, Delta lake.Bryan Johnson /dd @bryan_johnson
257K Followers 435 Following Death is no longer inevitable. Founder Blueprint & Braintree Venmo.dennylee @dennylee
3K Followers 2K Following Sparkitect on Delta Force One (tweets are my own). @[email protected]dbt Labs @dbt_labs
8K Followers 1 Following The creators and maintainers of @getdbt. We’re hiring! https://t.co/HKiRwQXTHe…Sarah Floris @ADutchEngineer
2K Followers 421 Following Senior Data and ML Platform Engineer | https://t.co/Xu2Bi73Bwk | demystifying data and machine learning engineering to accelerate your careerOpenBCI @OpenBCI
20K Followers 2K Following Open source tools for neuroscience since 2014. Makers of the OpenBCI biosensing boards (EEG, EMG, ECG), the @Ultracortex, and @Galea_XR. https://t.co/zXfZswNdSDSDN Cast @SDNCast
117 Followers 6 Following Informele webcast waarbij we het tech nieuws en software wetenswaardigheden van de afgelopen week bespreken.PyData Amsterdam @pydataamsterdam
2K Followers 123 Following Brings together users and developers of open source data analysis tools in Amsterdam through meetups and a conference!Andrei Khobnia @AI_Kho_
3K Followers 3K Following Machine Learning🧠, Natural Language Processing📖 and Information Retrieval🔍 Search Engines, Recommenders, Chat-Bots💬 Empowering Researchers and EngineersAlex Hormozi @AlexHormozi
632K Followers 109 Following Day Job: I invest and scale companies at https://t.co/gQN7OehYd2 | Co-Owner, Skool. Side Hustle: I make content showing how we do it. Grab my new book here ⬇️bodo.ai @bodo_ai
217 Followers 117 Following Lightning fast, extremely efficient SQL and Python data processing | The next-gen compute engine for your data stackDelta Lake @DeltaLakeOSS
8K Followers 66 Following Delta Lake is an open-source storage framework that enables building a Lakehouse architecture for Spark, Flink, Trino, Hive, Scala, Java, Rust, Python, & more!Murali Krishna @VMuraliKRaju
2K Followers 3K Following TOGAF Certified Architect - DW, Bigdata,Analytics, Datascience,IOT,Cloud - Azure, AWS & GCP Cloud. Opinions are my own and do not represent my employer.Alexander Wagner 🇺.. @Triamus1
232 Followers 324 Following Data engineer @VattenfallGroup. Previously risk quant @DeutscheBank. Mostly here for the tech stuff. Views are my own.Phil Hartman @gvillearchitect
989 Followers 1K Following Director and #Data #Architect at @InfoWorksTN. #integration #HealthIT #cloud #HIPAA #leadership #innovation #nashville & more. My Tweets are mine alone.Hevo Data @HevoData
837 Followers 741 Following A no-code data pipeline platform for new age businesses. Try Hevo to integrate your disparate data & build a single source of truth. #ETL #ELT #DataPipelineKarim Jedda @KarimJDDA
872 Followers 528 Following Senior Tech Lead (Infra & Data) @ Parity Technologies | Data engineering with Rust: https://t.co/3mBsS2pQcnWhen I was 15 years old, I had a stroke. Due to aggressively self-adjusting my cervical (neck) vertebrae hundreds of times daily, I had impinged critical nerves and inflamed muscle tissue which restricted blood flow to the brain. Among other side effects, I experienced…
@MitchellvRijkom Love using @databricks the user experience is amazing
@MitchellvRijkom Fully agree on that. That is why we put our data mart transformations mostly in SQL. And put most logic in the data mart, not in the reporting tool. 7-8 years ago we started with Power BI, don't know what kind of reporting tools we use 7-8 years in the future.
Folks who do unit tests on SQL queries--how are you doing them?
a good article explaining how Databricks Vector Search works .. medium.com/@tsiciliani/ha…
got some data from one place to match with some data from another place and now i can start my weekend riding that high
Don't Stop at Pandas and Sklearn! Get Started with Spark DataFrames and Big Data ML using PySpark dailydoseofds.com/dont-stop-at-p…
Is this why LLM agents are called “agents”? We may want to choose less scary names for AI stuff, no idea how associations like this will affect peoples’ subconscious thinking. We really don’t want a new model to be called SkyNet 🤦♂️.
In a sea of job applicants, stand out by crafting a compelling public profile. Uncover the secret sauce of personal branding where it's all about authentically being the engineer you are💡 #DataEngineer #DataEngineering #LearnDataEngineering #EngineeringCareer #PersonalBranding
Today I'm ecstatic to introduce Nounce! Nounce (@nounceai) is a platform designed to help cultivate articulate and intelligent speech. This platform was built to fill I void I identified in myself - finding a way to efficiently practice developing clear and creative speech.…
This is scary - ETL pipelines and ORMs are likely going away - or at least I shouldn't be getting paid for doing them anymore. This is AI generating thousands of lines of typespecs and DDLs (with no more context than the dataset), and somehow it's all 100% correct. Rant?👇
RAG with LLMs seems deceptively simple but is extraordinarily hard to do well. Building an intelligent ChatGPT-like tool with a custom knowledge base requires multiple non-trivial components. A simple vector database for retrieval is rarely enough; you need a semantic…
If only someone told me this before my 1st startup: 1. Validate idea first. I wasted at least 5 years building stuff nobody needed. 2. Kill your EGO. It's not about me, but the user. I must want what the user wants, not what I want. 3. Don't chaise investors, chase users, and…
Remove the generic illustration from your landing page They don't support your copy or help your visitor Increase clarity and conversions with one of these instead 1/ The annotated visual
Notion shared how they replaced #Fivetran and #Snowflake with #ApacheHudi, #ApacheSpark and #PineconeDB. ~$1.25M/yr cost savings. They now generate 100M+ AI vector embeddings per batch w/ Hudi and Pinecone. 🎬 RECORDING: 👉 linkedin.com/events/ahudili… #datalakehouse #vectordatabase
@luminousmen You reminded me about this classic
@figment2156021 @MitchellvRijkom Yes but I’d recommend using other approaches like change data capture. I’m also very wary of where your surrogate keys end up downstream bounded context and all that.