Collect, aggregate, and visualize a data ecosystem's metadata
-
Updated
Jul 1, 2024 - Java
Collect, aggregate, and visualize a data ecosystem's metadata
First open-source data discovery and observability platform. We make a life for data practitioners easy so you can focus on your business.
Egeria core
Reference implementation for real-time Data Lineage tracking for BigQuery using Audit Logs, ZetaSQL and Dataflow.
HiveMQ Edge is an MQTT gateway that enables interoperability between OT devices and IT systems. It translates diverse protocols into MQTT for streamlined communication and helps organize data into a unified namespace, making managing and streaming data across your infrastructure easier.
POC to demonstrate how to alter incoming/outgoing records in Kafka. It's a toy, don't use it in production.
Data-Export支持将链上数据导出到MySQL、ES等便于进行大数据处理的存储介质中,解决区块链数据复杂查询、分析、可视化和处理的问题。
Identify and tokenize sensitive data automatically using Cloud DLP and Dataflow
Data-Stash是基于FISCO-BCOS的数据仓库组件,通过解析节点的binlog日志,生成该节点状态的全量备份,从而使节点能够实现冷热数据分离和数据裁剪。
Data-Reconcile是一款基于区块链的对账组件,提供基于区块链智能合约账本的通用化数据对账解决方案,并提供了一套可动态扩展的对账框架,支持定制化开发。
an open source dataworks platform
System Design, Solution Architecture, Data Systems Practice
Historical metadata of your data warehouse is a treasure trove to discover not just insights about changing data patterns, but also quality and user behaviour. This solution creates Data Catalog Tags history in BigQuery since Data Catalog keeps only the latest version of metadata for fast searchability.
Describe your Data Protection rules and Personal Identifying Information as part of your schema
DataSphere is the first open-source cloud-native data observability platform that helps you trace the whole data infrastructure in your warehouses, lakes and databases.
Classify Confluence pages using existing SKOS and RDFS controlled vocabularies, improve search, build better tables of contents, capture structured data alongside other content and integrate Confluence into knowledge graphs using SPARQL.
Add a description, image, and links to the data-governance topic page so that developers can more easily learn about it.
To associate your repository with the data-governance topic, visit your repo's landing page and select "manage topics."