Skip to content

Should a table be profiled first before adding data quality tests? #5456

Discussion options

You must be logged in to vote

OpenMetadata has two types of workflows: Ingestion, and Profiler.

Ingestion is the fast-paced one to extract metadata and make entities available. It is maxed at one workflow per service. On the other hand, from the profiler, you can schedule as many profiler workflows as required, with different schedules and relating to different batches of entities. You can disable the profiler during the metadata ingestion and use the profiler workflows directly. You can deploy multiple of them with different filter patterns.

The profiler workflow is heavier as it runs metrics for the table and columns. Data quality tests are based on those results, so for now running a fresh profile is required to co…

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by ShilpaVernekar
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
1 participant