Skip to content

Simple CSV parser for huge volumes of data with the use of the library Pandas for Python for getting specific columns of a CSV file and putting the extracted data into one or more files (each column in a separated file or all of them in the same output) in a short amount of time.

License

Notifications You must be signed in to change notification settings

dawntech/pandas-simple-csv-parser

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Pandas Simple CSV Parser

Simple CSV parser for huge volumes of data with the use of the library Pandas for Python for getting specific columns of a CSV file and putting the extracted data into one or more files (each column in a separated file or all of them in the same output) in a short amount of time.

building

Author: Levindo Gabriel Taschetto Neto.

Python Environment

Python Version: Python 3.6.5 :: Anaconda, Inc.

Pre-Requisites

$ conda install pandas

How to Use

$ python pandasParser.py <INPUT> <[COLUMN_NUMBERS(comma-separated)]> <[OUTPUTS_IN_ORDER_OF_COLUMNS(comma-separated)]> <multipleOutputsBoolean> <DELIMITER_SYMBOL> <createQuotedFilesBoolean>

Examples

In this example, the columns NU_CNPJ (0) and NO_PRODUTO (5) are extracted from the file medicines.csv and placed into the outputs cnpj.csv and productNames.csv respectively.

$ python pandasParser.py tests/in/medicines.csv [0,5] [tests/out/cnpj.csv,tests/out/productNames.csv] true ; true

License

MIT License. Click here for more information about this license.

About

Simple CSV parser for huge volumes of data with the use of the library Pandas for Python for getting specific columns of a CSV file and putting the extracted data into one or more files (each column in a separated file or all of them in the same output) in a short amount of time.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages