Skip to content

Latest commit

 

History

History
32 lines (20 loc) · 944 Bytes

README.md

File metadata and controls

32 lines (20 loc) · 944 Bytes

Error Detection and Classification

This codebase detects errors in data and generates predictions for the labels of the data using machine learning. The code uses the Poetry package manager for dependency management.

Installation

Clone the repository: git clone https://github.com/edbezci/error-detection.git
Navigate into the repository: cd error-detection
Install dependencies: poetry install
Activate poetry: poetry shell

Usage

Place the input data CSV files in the data folder.
Navigate into 'src': cd src
Run the code: python error_detect.py
The output will be generated in the out_data folder as a CSV file with a unique file name.

Logging

The code uses the logging module to log events and errors to a log file and the console.

Testing

The code uses pytest for unit testing.

Navigate into 'src': cd src
Run the code: pytest ..\test

License

This code is released under the MIT License.