Disaster Response Pipeline Project

This repository is the work for my second project from the Udacity Data Scientist Nanodegree Program. In this project, I applied data engineering skills to build an ETL pipeline to process the raw data then the data will go through an ML pipeline to classify data.

The classification model will help the people from disaster organizations classify the message into related categories so they can respond to the event more accurately and faster.

Prerequisites

These are libraries that is used in this project:

pandas
numpy
sklearn

Project Structure

.
├── README.md
├── app
│   ├── run.py # Flask file that runs app
│   └── template
│       ├── go.html # Classification result page of web app
│       └── master.html # Main page of web app
├── data
│   ├── DisasterResponse.db # Database to save clean data
│   ├── disaster_categories.csv # Input data to process
│   ├── disaster_messages.csv # Input data to process
│   └── process_data.py # ETL pipeline
├── models
│   └── train_classifier.py # ML pipeline
│    └── classifier.pkl # Saved model. Please run the ML pipeline to create this file.
└── notebook # notebook used for preparing the code
    ├── ETL Pipeline Preparation.ipynb
    └── ML Pipeline Preparation.ipynb

Instructions

Run the following commands in the project's root directory to set up your database and model.
- To run ETL pipeline that cleans data and stores in database
```
python data/process_data.py \
    data/disaster_messages.csv \
    data/disaster_categories.csv \
    data/DisasterResponse.db
```
- To run ML pipeline that trains classifier and saves the model as pickle file
```
python models/train_classifier.py \
    data/DisasterResponse.db \
    models/classifier.pkl
```
Go to app directory: cd app
Run your web app: python run.py
Go to http://0.0.0.0:3000/ to access the website.

Acknowledgements

This project use disaster data from Appen (formally Figure 8).

The code is inspired by Udacity Data Scientist Nanodegree Program.

🔨 Contributing

Contributions are what make the open source community such an amazing place to be learn, inspire, and create. Any contributions you make are greatly appreciated.

Fork the Project.
Create your Feature Branch (git checkout -b feature/Feature).
Commit your Changes (git commit -m 'Add some feature').
Push to the Branch (git push origin feature/Feature).
Open a Pull Request.

📫 Contact

Huy Tran (dhuy237) - d.huy723@gmail.com

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Disaster Response Pipeline Project

🚀 Table of contents

Prerequisites

Project Structure

Instructions

Acknowledgements

🔨 Contributing

📫 Contact

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
app		app
data		data
models		models
notebook		notebook
.gitignore		.gitignore
README.md		README.md

dhuy237/disaster-response-pipeline

Folders and files

Latest commit

History

Repository files navigation

Disaster Response Pipeline Project

🚀 Table of contents

Prerequisites

Project Structure

Instructions

Acknowledgements

🔨 Contributing

📫 Contact

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages