Sage-Bionetworks-Workflows/py-dcqc

py-dcqc

Python package for performing quality control (QC) for data coordination (DC)

This Python package provides a framework for performing quality control (QC) on data files. Quality control can range from low-level integrity checks (e.g. MD5 checksum, file extension) to high-level checks such as conformance to a format specification and consistency with associated metadata.
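The low-level checks mentioned above can be illustrated with a minimal, standard-library-only sketch. This is not py-dcqc's API: the function names `md5_matches` and `has_extension`, and their parameters, are hypothetical and chosen only for illustration.

```python
import hashlib
from pathlib import Path


def md5_matches(path: Path, expected_md5: str, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file's MD5 hex digest equals the expected value.

    Hypothetical helper for illustration; not part of py-dcqc.
    """
    digest = hashlib.md5()
    with path.open("rb") as handle:
        # Read in chunks so large data files are never fully loaded into memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    # hexdigest() is lowercase, so normalize the expected value before comparing.
    return digest.hexdigest() == expected_md5.lower()


def has_extension(path: Path, allowed: tuple[str, ...]) -> bool:
    """Return True if the file name ends with one of the allowed extensions.

    Hypothetical helper for illustration; not part of py-dcqc.
    """
    # str.endswith accepts a tuple, which handles compound suffixes like ".fastq.gz".
    return path.name.lower().endswith(allowed)
```

For example, `has_extension(Path("reads.fastq.gz"), (".fastq", ".fastq.gz"))` evaluates to `True`. Higher-level checks, such as conformance to a format specification, would typically delegate to a format-specific validator rather than to helpers like these.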

Early versions of this package were developed to be used by its sibling, the nf-dcqc Nextflow workflow. You can see examples of how to leverage py-dcqc there. Note that the initial command-line interface (CLI) was developed with nf-dcqc in mind, so smaller steps were favored to enable parallelism in Nextflow. Future iterations of this package will include user-friendly, high-level CLI commands.

PyScaffold

This project was set up using PyScaffold 4.3 with the command shown below. For details and usage information on PyScaffold, see https://pyscaffold.org/.

putup --name dcqc --markdown --github-actions --pre-commit --license Apache-2.0 py-dcqc
