Orchestration and automation platform to execute millions of scheduled and event-driven workflows declaratively in code and from the UI
Updated Oct 1, 2024 - Java
Scalable data preprocessing and curation toolkit for LLMs
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.
Always know what to expect from your data.
The open-source tool for building high-quality datasets and computer vision models
Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observability. Configure data quality checks from the UI or in YAML files, and let DQOps run them daily to detect data quality issues.
Synthetic data quality evaluation & visualization
The Open Source Feature Store for Machine Learning
Automated Preprocessing Pipeline - DataFrame
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
A demo of Bufstream, a drop-in replacement for Apache Kafka that's 10x less expensive to operate
Home of the Open Data Contract Standard (ODCS).
Intelligent Data Analysis (IAU_B) @ FIIT STU in Bratislava
lakeFS - Data version control for your data lake | Git for data
DataDP is a comprehensive Python package designed for data quality detection and processing.
Example API implementation for Data Caterer
KGHeartBeat is a community-shared open-source knowledge graph quality assessment tool to perform quality analysis on a wide range of freely available knowledge graphs registered on the LOD cloud and DataHub. Web-App: http://www.isislab.it:12280/kgheartbeat/
Source-available data quality tool
Possibly the fastest DataFrame-agnostic quality check library in town.
Watchmen Platform is a low-code data platform for data pipelines, metadata management, analysis, indicator objective analysis, and quality management