CSV Data Handler

A console utility that selects the N cheapest products from the input CSV files, with no more than M products sharing the same ID. It uses parallel processing to improve performance, and it reads and processes each file in parts to save memory.
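The selection rule can be illustrated with a minimal sketch. This is not the repository's implementation: the names `CheapestSelector`, `Product`, and `select` are hypothetical, and the sketch sorts everything in memory for clarity, whereas the utility itself reads files in parts and in parallel.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical names for illustration; not taken from the repository.
final class CheapestSelector {

    /** Minimal product holder: only the fields the selection rule needs. */
    static final class Product {
        final int id;
        final double price;

        Product(int id, double price) {
            this.id = id;
            this.price = price;
        }
    }

    /**
     * Returns up to n products in ascending price order,
     * taking at most m products for any single ID.
     */
    static List<Product> select(List<Product> products, int n, int m) {
        // Copy before sorting so the caller's list is left untouched.
        List<Product> sorted = new ArrayList<>(products);
        sorted.sort(Comparator.comparingDouble((Product p) -> p.price));

        Map<Integer, Integer> takenPerId = new HashMap<>();
        List<Product> result = new ArrayList<>();
        for (Product p : sorted) {
            if (result.size() >= n) {
                break; // already have the N cheapest
            }
            int taken = takenPerId.getOrDefault(p.id, 0);
            if (taken < m) { // enforce the per-ID cap M
                result.add(p);
                takenPerId.put(p.id, taken + 1);
            }
        }
        return result;
    }
}
```

Because the products are visited from cheapest to most expensive, skipping a product whose ID has already been taken M times never excludes a cheaper candidate.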

Initial Data:

Several CSV files. The number of files can be quite large (up to 100,000).
The number of rows within each file can reach up to several million.
Each file contains 5 columns:
- Product ID (int),
- Name (String),
- Condition (String),
- State (String),
- Price (double).
The same product ID may occur more than once, both across different CSV files and within a single file.
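A row in this format could be mapped to an object roughly as follows. This is a sketch under assumptions: the class `ProductRow` and its `parse` method are hypothetical, the columns are assumed to appear in the order listed above, and fields are assumed not to contain the delimiter (quoted fields are not handled).

```java
import java.util.regex.Pattern;

// Hypothetical class; the fields follow the column list above.
final class ProductRow {
    final int id;
    final String name;
    final String condition;
    final String state;
    final double price;

    ProductRow(int id, String name, String condition, String state, double price) {
        this.id = id;
        this.name = name;
        this.condition = condition;
        this.state = state;
        this.price = price;
    }

    /** Parses one CSV line; throws IllegalArgumentException on malformed input. */
    static ProductRow parse(String line, String delimiter) {
        // Pattern.quote treats the delimiter literally (e.g. "|" is not a regex OR);
        // the -1 limit keeps trailing empty columns.
        String[] cols = line.split(Pattern.quote(delimiter), -1);
        if (cols.length != 5) {
            throw new IllegalArgumentException(
                "Expected 5 columns but found " + cols.length + ": " + line);
        }
        // NumberFormatException is a subclass of IllegalArgumentException.
        return new ProductRow(
            Integer.parseInt(cols[0].trim()),
            cols[1],
            cols[2],
            cols[3],
            Double.parseDouble(cols[4].trim()));
    }
}
```

For example, `ProductRow.parse("42,Phone,New,Active,19.99", ",")` yields a row with `id` 42 and `price` 19.99; the sample values are made up.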

How to use the program

1. Run main.Main

Pass the following arguments:

directoryPath
delimiter (defaultValue: ,)
productResultRowsCount (defaultValue: 1000)
duplicateProductsMaxCount (defaultValue: 20)

Example:

directoryPath=C:\Users\Tiran\Desktop\files\csv delimiter=, productResultRowsCount=1000 duplicateProductsMaxCount=20
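The `key=value` argument format with defaults could be handled along these lines. This is only a sketch: the class `ArgsSketch` is hypothetical, and treating `directoryPath` as required is an assumption based on it having no default value listed.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical helper; not taken from the repository.
final class ArgsSketch {

    /** Parses key=value arguments, falling back to the documented defaults. */
    static Map<String, String> parse(String[] args) {
        Map<String, String> options = new HashMap<>();
        options.put("delimiter", ",");
        options.put("productResultRowsCount", "1000");
        options.put("duplicateProductsMaxCount", "20");

        for (String arg : args) {
            // Split on the first '=' only, so a value may itself contain '='.
            int eq = arg.indexOf('=');
            if (eq <= 0) {
                throw new IllegalArgumentException("Expected key=value but got: " + arg);
            }
            options.put(arg.substring(0, eq), arg.substring(eq + 1));
        }

        // Assumption: directoryPath has no default, so it must be supplied.
        if (!options.containsKey("directoryPath")) {
            throw new IllegalArgumentException("directoryPath is required");
        }
        return options;
    }
}
```

Numeric options are kept as strings here; a caller would convert them with `Integer.parseInt` where needed.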

2. After processing completes, specify the path to the output file.

tiran-manukyan/csv-data-handler

Folders and files

Latest commit

History

Repository files navigation

CSV Data Handler

How to use the program

1. Run main.Main

Pass the following arguments :

2. After the process, you must specify the path to the output file.

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages