This repository contains sample Comma-Separated Values (CSV) files. CSV is a generic flat-file format used to store structured data. The datasets are split into 3 categories: Customers, Users and Organizations. For each category, the sample CSV files range from 100 to 2 million records. These files can be used for testing purposes. They can be opened with any CSV-compatible application or with a CSV editor.
The datasets are generated from random values, mostly using the Python Faker package.
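As a rough illustration of how a file of random records could be produced, here is a minimal sketch that uses only the Python standard library. It does not use Faker and is not the repository's generator. The value pools, the reduced column set, and the `make_rows` and `write_csv` helpers are all assumptions made for the example.

```python
import csv
import io
import random

# Hypothetical value pools; the real datasets draw from Faker instead.
FIRST_NAMES = ["Ada", "Grace", "Linus", "Alan"]
LAST_NAMES = ["Lovelace", "Hopper", "Torvalds", "Turing"]
COUNTRIES = ["France", "Japan", "Brazil", "Kenya"]

# Reduced, illustrative column set (not the full schema of any dataset).
COLUMNS = ["Index", "First Name", "Last Name", "Country"]


def make_rows(count, seed=0):
    """Yield `count` random rows as dicts keyed by COLUMNS."""
    rng = random.Random(seed)  # seeded so repeated runs give the same rows
    for index in range(1, count + 1):
        yield {
            "Index": index,
            "First Name": rng.choice(FIRST_NAMES),
            "Last Name": rng.choice(LAST_NAMES),
            "Country": rng.choice(COUNTRIES),
        }


def write_csv(rows, stream):
    """Write the rows to a text stream, preceded by a header line."""
    writer = csv.DictWriter(stream, fieldnames=COLUMNS)
    writer.writeheader()
    writer.writerows(rows)


if __name__ == "__main__":
    buffer = io.StringIO()
    write_csv(make_rows(5), buffer)
    print(buffer.getvalue())
```

Because `make_rows` is a generator, rows are produced one at a time, so the same approach should scale to large record counts without holding every row in memory.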
- customers-100.csv - Zip version - Customers CSV with 100 records
- customers-1000.csv - Zip version - Customers CSV with 1000 records
- customers-10000.csv - Zip version - Customers CSV with 10000 records
- customers-100000.csv - Zip version - Customers CSV with 100000 records
- customers-500000.csv - Customers CSV with 500000 records
- customers-1000000.csv - Customers CSV with 1000000 records
- customers-2000000.csv - Customers CSV with 2000000 records
Columns in the customers files:
- Index
- Customer Id
- First Name
- Last Name
- Company
- City
- Country
- Phone 1
- Phone 2
- Subscription Date
- Website
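To show how the customers columns might be consumed, here is a minimal sketch that parses a customers CSV with `csv.DictReader` and checks that the header contains the columns listed above. The `read_customers` helper and the one-row sample are hypothetical; the sample values are made up and do not come from the real files.

```python
import csv
import io

# Column names copied from the list above.
CUSTOMER_COLUMNS = [
    "Index", "Customer Id", "First Name", "Last Name", "Company", "City",
    "Country", "Phone 1", "Phone 2", "Subscription Date", "Website",
]


def read_customers(stream):
    """Return the rows of a customers CSV as a list of dicts.

    Raises ValueError if any expected column is missing from the header.
    """
    reader = csv.DictReader(stream)
    header = reader.fieldnames or []
    missing = [name for name in CUSTOMER_COLUMNS if name not in header]
    if missing:
        raise ValueError(f"missing columns: {missing}")
    return list(reader)


# Made-up one-row sample standing in for a real file.
SAMPLE = (
    ",".join(CUSTOMER_COLUMNS) + "\n"
    "1,abc123,Ada,Lovelace,Example Ltd,London,United Kingdom,"
    "555-0100,555-0101,2021-01-01,http://example.com/\n"
)

if __name__ == "__main__":
    rows = read_customers(io.StringIO(SAMPLE))
    print(rows[0]["First Name"], rows[0]["Country"])
```

With a downloaded file, the stream would typically come from something like `open("customers-100.csv", newline="", encoding="utf-8")`; the UTF-8 encoding is an assumption, not something the source states.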
- people-100.csv - Zip version - People CSV with 100 records
- people-1000.csv - Zip version - People CSV with 1000 records
- people-10000.csv - Zip version - People CSV with 10000 records
- people-100000.csv - Zip version - People CSV with 100000 records
- people-500000.csv - People CSV with 500000 records
- people-1000000.csv - People CSV with 1000000 records
- people-2000000.csv - People CSV with 2000000 records
Columns in the people files:
- Index
- User Id
- First Name
- Last Name
- Sex
- Phone
- Date of birth
- Job Title
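The largest files hold up to 2 million records, so reading them row by row may be preferable to loading everything at once. The sketch below streams a people CSV and tallies the values of one column. The `count_by_column` helper is hypothetical, and the sample rows (including the date format) are made up for illustration.

```python
import csv
import io
from collections import Counter


def count_by_column(stream, column):
    """Tally the values of one column, reading one row at a time."""
    counts = Counter()
    for row in csv.DictReader(stream):
        counts[row[column]] += 1
    return counts


# Made-up sample using the people columns listed above.
SAMPLE = (
    "Index,User Id,First Name,Last Name,Sex,Phone,Date of birth,Job Title\n"
    "1,u001,Ada,Lovelace,Female,555-0100,1815-12-10,Mathematician\n"
    "2,u002,Alan,Turing,Male,555-0101,1912-06-23,Logician\n"
    "3,u003,Grace,Hopper,Female,555-0102,1906-12-09,Engineer\n"
)

if __name__ == "__main__":
    print(count_by_column(io.StringIO(SAMPLE), "Sex"))
```

Only the counter is kept in memory, so memory use depends on the number of distinct values rather than on the number of rows.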
- organizations-100.csv - Zip version - Organizations CSV with 100 records
- organizations-1000.csv - Zip version - Organizations CSV with 1000 records
- organizations-10000.csv - Zip version - Organizations CSV with 10000 records
- organizations-100000.csv - Zip version - Organizations CSV with 100000 records
- organizations-500000.csv - Organizations CSV with 500000 records
- organizations-1000000.csv - Organizations CSV with 1000000 records
- organizations-2000000.csv - Organizations CSV with 2000000 records
Columns in the organizations files:
- Index
- Organization Id
- Name
- Website
- Country
- Description
- Founded
- Industry
- Number of employees
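Several of the smaller files are also listed with a Zip version. As a sketch of how a zipped CSV could be read without extracting it first, the example below uses the standard `zipfile` module. The archive layout and member name are assumptions, since the source does not say how the zip files are structured; `read_csv_from_zip` and `build_demo_zip` are hypothetical helpers, and the demo archive uses a reduced, made-up column set.

```python
import csv
import io
import zipfile


def read_csv_from_zip(zip_source, member_name):
    """Return the rows of one CSV member of a zip archive as dicts."""
    with zipfile.ZipFile(zip_source) as archive:
        with archive.open(member_name) as raw:
            # archive.open yields bytes; wrap it so csv receives text.
            text = io.TextIOWrapper(raw, encoding="utf-8", newline="")
            return list(csv.DictReader(text))


def build_demo_zip():
    """Create a tiny in-memory archive standing in for a real download."""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w") as archive:
        archive.writestr(
            "organizations-demo.csv",
            "Index,Name,Country\n1,Example Ltd,France\n",
        )
    buffer.seek(0)
    return buffer


if __name__ == "__main__":
    print(read_csv_from_zip(build_demo_zip(), "organizations-demo.csv"))
```

`zip_source` can be a file path or a binary file-like object, so the same function should work on a downloaded archive once the actual member name is known.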
Create a Python virtual env:
python3 -m venv venv/sample-csv
Activate it:
source venv/sample-csv/bin/activate
Then install the dependencies:
pip install -r requirements.txt
Finally, run the main script:
python src/main.py