Playground project that transforms visa issuance data from travel.state.gov into datasets

hminaya/visa

Visa Issuance Dataset Playground

This is a small Python project I'm using to experiment with datasets. It reads publicly available data from travel.state.gov on visa issuances by consulates abroad.

I've broken the process into four Python scripts that you can run individually to extract, transform, and load the data.

Project Organization

├── data
│   ├── catalog                 <- Lookup tables for visa types
│   ├── raw                     <- 
│   ├── interim                 <- 
│   ├── processed               <- 
│   └── consolidate             <- 
│
├── models                      <- Trained and serialized models, model predictions, or model summaries (WIP)
│      
├── notebooks                   <- Jupyter notebooks. Playground for data, charts, reports
│      
├── requirements.txt            <- The requirements file for reproducing the analysis environment, e.g.
│                                  generated with `pip freeze > requirements.txt`
│      
├── setup.py                    <- Makes the project pip installable (pip install -e .) so src can be imported
└── src                         <- Source code for use in this project.
    │
    └── data                    <- Scripts to download and generate data
        ├── raw_import.py       <- Imports PDF files from travel.state.gov
        ├── pdf_to_csv.py       <- Converts PDF files into CSV
        ├── clean_csv.py        <- Cleans up csv files (Code is messy!)
        └── consolidate_csv.py  <- Consolidates data into 4 csv files
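
The four scripts in the tree above form a pipeline: download, convert, clean, consolidate. Here is a minimal sketch of running them in that order from the repository root. It assumes the scripts live in `src/data` as the tree suggests and take no command-line arguments, which this README does not confirm. `build_commands` and `run_pipeline` are hypothetical helper names, not part of the project.

```python
import subprocess
import sys
from pathlib import Path

# Step order follows the project tree: download, convert, clean, consolidate.
# Assumption: each script runs without command-line arguments.
PIPELINE = [
    "raw_import.py",
    "pdf_to_csv.py",
    "clean_csv.py",
    "consolidate_csv.py",
]


def build_commands(script_dir="src/data", python=sys.executable):
    """Return one command (as an argument list) per pipeline step, in order."""
    return [[python, str(Path(script_dir) / name)] for name in PIPELINE]


def run_pipeline(script_dir="src/data"):
    """Run each step in turn; check=True stops at the first failing script."""
    for cmd in build_commands(script_dir):
        subprocess.run(cmd, check=True)
```

Calling `run_pipeline()` would run all four steps. Because each script is also meant to be run on its own, a single step can be re-run directly, for example `python src/data/clean_csv.py`, if that path matches your checkout.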
