Here is a sneak peek of something that I have been working on. @dremio reads @apachehudi tables. This is another example of "interoperability" with open lakehouse table formats.
Have general Hudi questions? Wondering about Hudi's best practices or tips for troubleshooting? We are happy to start hosting additional 1-1 office hours every week! Book it now at calendly.com/apache-hudi/of…
Saddle up for #DataCouncil 🤠. Let's corral a fireside chat on #lakehouse table formats #ApacheHudi, #DeltaLake, and #ApacheIceberg. You don't want to miss this 🌶 discussion 3/27 11:30am. I will also intro the brand new @apachextable (prev known as OneTable) #apachextable
We made lots of memories at #Supercloud6 📸 Did you miss the fun? Here are some top moments with #AIinnovators, and catch up anytime on-demand! events.cube365.net/supercloud/sup… #theCUBE #theCUBEresearch #AIinfra
While the name and links may have changed, the principles of the project remain unwavering. XTable creates "cross-table" interop between #ApacheIceberg, #ApacheHudi, and #DeltaLake. Incubating into ASF is an exciting step for the project to strengthen its neutrality and grow 🚀
🎉 Excited to bring the Apache Hudi Meetup to Bangalore! Join us at Navi Office to hear about the challenges in data ingestion, improving time-to-insight & innovations in Hudi 1.0. 📅 Date: May 11, 2024 📍 Location: Navi Office, Bangalore 🔗 Link: forms.gle/wpd9gbKmC99GxK…
Looking forward to speaking at LinkedIn’s Big Data Meetup next week in Mountain View. Come to our “Table Talk: Decoding Table Format Adoption & Operational Excellence”. This hour-long session will be all about #lakehouse open table formats!
[Blogged]: Interoperability between lakehouse table formats with Apache XTable. Link: onehouse.ai/blog/dremio-la…
Query Optimization with 'Clustering' in Apache Hudi. Today I presented how the clustering service in Hudi makes a huge impact on overall query performance. To highlight the difference, I ran the same query in Presto before and after clustering on a 1 TB TPC-DS dataset.
🎤 Just like the microphone after a great set, the Data Council talk from our own Kyle Weller has finally dropped - and check out the cool comments to Kyle’s post on LinkedIn. lnkd.in/gdBR5szu #onehouse #opensource #nolockin #datalakehouse #dataengineering #apachehudi…
☯️ “The only constant in life is change,” said a philosopher. How can a database keep up? 👨🏫 Onehouse product manager Andy Walner explains it all, in a magisterial exposition as to how Onehouse manages schema evolution. onehouse.ai/blog/schema-ev… #onehouse #opensource…
Want to read about Apache XTable in depth? Check out the paper which goes deep into: ✅ Need for interoperability ✅ LST overview ✅ XTable architecture ✅ Implementation Overview Excellent collaboration from @Microsoft, @Onehousehq & Google. arxiv.org/pdf/2401.09621…
Let's do an ⚡️Alt Data⚡️ lesson investigating Snowflake $SNOW claims now that Lacework is back in the news. My goal here with this X post is to teach you how to do quick content scans of company claims and how to do basic sizing with Fermi estimation. I'm way deeper. I'll give…
This Lacework "deal" at ~$200M down from an over $8B valuation is one of the best short cases for Snowflake $SNOW. Here are the two main questions that actually matter for Lacework: 1) What is Lacework's total spend on Snowflake per year? As in, how much do they spend, per…
Bin Packing Algorithm for the "Small File" Issue in Lakehouses. The small file problem is one of the critical problems in a data lake: it hurts query performance when compute engines read the files. The problem occurs when data is written in small chunks 🧵
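For readers who have not seen bin packing applied to small files, here is a minimal sketch. It assumes a first-fit-decreasing heuristic and a hypothetical 128 MB target file size. The function name `pack_files` and its parameters are illustrative only. This is not how Hudi or any other table format actually implements file sizing.

```python
from typing import List


def pack_files(sizes_mb: List[int], target_mb: int = 128) -> List[List[int]]:
    """Group small-file sizes into bins of at most target_mb each.

    Uses the first-fit-decreasing heuristic: sort files from largest to
    smallest, then place each file in the first bin that still has room.
    target_mb is an assumed target output file size, not a real default.
    """
    bins: List[List[int]] = []   # file sizes assigned to each output file
    free: List[int] = []         # remaining capacity of each bin

    for size in sorted(sizes_mb, reverse=True):
        for i, capacity in enumerate(free):
            if size <= capacity:
                bins[i].append(size)
                free[i] -= size
                break
        else:
            # No existing bin has room, so open a new one. A file that is
            # already larger than the target simply gets a bin to itself.
            bins.append([size])
            free.append(max(target_mb - size, 0))

    return bins


if __name__ == "__main__":
    # Six small files are grouped into three larger outputs.
    print(pack_files([100, 60, 40, 30, 20, 10], target_mb=128))
```

Each inner list represents the small files that would be rewritten together into one larger file, so the reader opens fewer files per query.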
Had time to watch Vendor X's keynotes: one copy, open table format, multiple engines. That sounds very familiar 😁
Super excited to be speaking at @LinkedIn Big Data Meetup happening on 3rd May at HQ, Sunnyvale! I will be talking about @apachextable & interoperability in open table formats such as Apache Hudi, Iceberg & Delta Lake in collab with Microsoft folks! Join us on 3rd May.
Apache XTable provides omni-directional interoperability between lakehouse table formats such as Apache Hudi, Apache Iceberg & Delta Lake. A 🧵
Source from LI: linkedin.com/posts/lakehous…
From Data Council, @KyleJWeller's survey: This survey is kinda strange. EMR and DB are both essentially Spark. I’m not surprised Spark is still out there. Flink for general querying is not something I’ve seen. Presto and Trino being listed separately shows how far they have drifted.
Compression Codecs for Apache Parquet. Compression algorithms reduce the size of data files, making storage & data transfer more efficient. This is especially important when dealing with larger data volumes, as it can have a significant impact on performance and costs.
📰 NOW Insurance is a pioneer in the insurtech space. Their rapid growth required them to quickly evolve their data infrastructure. 🏡 They chose the Universal Data Lakehouse architecture - and partnered with Onehouse to make it happen, fast. 🏎️ You can too! Read all about it…