Real-time, operational data lineage tracing that helps data engineers and data scientists find, fix, and prevent operational issues.datakin.comJoined January 2020
2/2 To see @rossturk himself dive deeper into OpenLineage (the OSS tool for data lineage), along with Staff Software Engineer @PeladoCollado, join them on Tuesday at 2 PM (ET) for our next webinar.
hubs.ly/Q016N5Tq0
1/2 To operate in today’s distributed #data ecosystems, you need a complete and up-to-date picture of your data. And you can’t have one without #dataLineage. Learn why it matters, expertly explained by @rossturk, our Senior Director of Community. #bloghubs.ly/Q0177vcx0
The contributions we make to @OpenLineage and @MarquezProject will continue. In fact, we will be able to dedicate more resources to this important work.
When we started Datakin, we had an ambitious goal: to bring data lineage to the modern data stack.
This goal has not changed; our new friends at Astronomer share it with us!
“Our job duration tab enables a more detailed inspection of how a given job fits into the overall pipeline. You can evaluate the execution times of all the upstream jobs in the most recent run cycle.” datak.in/3n8VocI
Without a real time #datalineage graph “not only is it hard to identify duration issues, it’s costly and time-consuming to diagnose their cause.” datak.in/30kpNfj
Sometimes a long-running data pipeline job is more than just an inconvenience, it is the sign of a more complex problem. In this post, Peter Hicks shows how to identify data pipeline bottlenecks with #datalineage and Datakin: datak.in/3D6PVbU
The team at Northwestern Mutual has begun to capture and trace the lineage of key datasets, creating a real-time map of their pipelines. Over on the OpenLineage blog, Kevin Mellott explains how real-time #datalineage helps them stay ahead.
datak.in/3Cip0JY
"If you use Datakin to observe @getdbt models as they run, you can always know exactly where your datasets came from and how they were created."
datak.in/3vgLZlQ
Using @OpenLineage, the team at Northwestern Mutual has begun to capture and trace the #datalineage of key datasets, creating a real-time map of their pipelines. Register now for the session on Oct 21 at datak.in/3lJZBTz and learn how it helps them stay ahead of the game.
In this tutorial, we show how to use Datakin to observe @getdbt jobs. Learn how to trace #datalineage, observe changes across the pipeline, and troubleshoot performance bottlenecks:
datak.in/3BL0XTB
"Increasingly, we will find data science at the edges, decentralized and largely ungoverned. We think that data lineage is the key. It can establish a 'chain of custody' for key datasets, contextualize fragmented work, and build organzational trust."
datak.in/3Bfigfg
#DataLineage tip:
“Be sure to use the `{{ ref() }}` and `{{ source() }}` jinja functions when referring to data sources if you want to accurately capture @getdbt data lineage with @MarquezProject.”
datak.in/3FkfbNe
4K Followers 1K FollowingWe help data teams have confidence in their data, no matter what. GX Cloud, our end-to-end SaaS data quality platform, is powered by the open source GX Core.
93 Followers 340 FollowingAshraf | Resume Roasts
Software Engineer building a Resume scanner for Data Engineers
Built in public → scan for free
https://t.co/QYTOkO2T1J
16K Followers 18K FollowingEx-Goldman Sachs partner. Internet analyst since 1994. I've covered every tech wave. This AI one is different. Daily takes → AI: Reset to Zero ↓
124 Followers 3K FollowingDjPizza™ at night, Deviloper at rest, Anti-pattern Architect, advocate at B.D.S.M Business Development Sales Marketing. Always Habibis ❤️
509 Followers 2K FollowingIT recruitment. Search & selection specialists for permanent & contract roles across the Data Analytics & BI sector. Our business is to improve your business 🔍
89 Followers 507 FollowingQualytics is the complete solution to instill trust and confidence in your enterprise data ecosystem. Do you trust your data? Let us give you #DataConfidence.
194 Followers 2K FollowingAI & Data-driven Culture: AI automation and enabling, Data Engineering, DataOps & MLOps.
Big Data & Cloud, Genomics Pipelines
4K Followers 1K FollowingWe help data teams have confidence in their data, no matter what. GX Cloud, our end-to-end SaaS data quality platform, is powered by the open source GX Core.
174 Followers 29 FollowingJoin us on October 27th as we come together again to shorten the distance between data and the decision-making that empowers businesses to be insight-driven
1K Followers 77 FollowingStoryteller, marketer, open source advocate, community strategist, gadget lover, guitar player, maker of things, he/him. Find me anywhere but here.
229 Followers 420 FollowingStaff Eng @Astronomer via @DatakinHQ; big data nerd; formerly @ Cruise and @ Amazon; Christian; Husband; Father; Seattle lover; cyclist; geek; 🇩🇴🇵🇷🇺🇸