Investigate-a-Dataset-Gapminder-Health

Udacity Data Analytics Nanodegree Project 1 - Exploring Gapminder Datasets using Python and Jupyter Notebook

Project Overview

In this project, I will analyze a dataset and then communicate my findings about it. I will use the Python libraries NumPy, pandas, and Matplotlib to make my analysis easier.

What do I need to install?

I will need an installation of Python, plus the following libraries:

pandas
NumPy
Matplotlib
csv (part of the Python standard library, so no separate installation is needed)
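
To confirm the environment is ready before opening the notebook, a quick check like the following can help. This is only a minimal sketch: the helper name missing_packages is hypothetical and is not part of the project code.

```python
import importlib.util


def missing_packages(names):
    """Return the names from `names` that cannot be located for import."""
    return [name for name in names if importlib.util.find_spec(name) is None]


# Top-level import names of the libraries listed above.
required = ["pandas", "numpy", "matplotlib", "csv"]

missing = missing_packages(required)
if missing:
    print("Missing packages:", ", ".join(missing))
else:
    print("All required packages are available.")
```

Anything reported as missing can then be installed with the package manager of your choice.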

What will I learn?

After completing the project, I will:

Know all the steps involved in a typical data analysis process
Be comfortable posing questions that can be answered with a given dataset and then answering those questions
Know how to investigate problems in a dataset and wrangle the data into a format I can use
Have practice communicating the results of my analysis
Be able to use vectorized operations in NumPy and pandas to speed up my data analysis code
Be familiar with pandas' Series and DataFrame objects, which let me access my data more conveniently
Know how to use Matplotlib to produce plots showing my findings
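
As an illustration of what vectorized operations on pandas Series look like, here is a minimal sketch. The numbers and the labels A, B and C are made up for the example and are not taken from the Gapminder files.

```python
import numpy as np
import pandas as pd

# Made-up values for illustration only.
gdp_per_capita = pd.Series([1000.0, 5000.0, 20000.0], index=["A", "B", "C"])
population_millions = pd.Series([2.0, 4.0, 1.0], index=["A", "B", "C"])

# Vectorized arithmetic: element-wise and aligned on the index, with no explicit loop.
total_gdp = gdp_per_capita * population_millions * 1_000_000

# NumPy universal functions can be applied to a whole Series at once.
log_gdp = np.log10(gdp_per_capita)

print(total_gdp)
print(log_gdp)
```

The same expressions written as Python for-loops would give the same values, but they are usually slower and harder to read.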

Project details

Step One - Choose the data

I chose four datasets from Gapminder.

Female salaried employee
Female literacy rate
GDP per capita
Life expectancy at birth
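
Loading one of these indicators might look like the sketch below. It assumes the CSV uses a wide layout with a country column followed by one column per year, which is how Gapminder downloads are usually structured; check the actual file before relying on it. The sample rows and the column name life_expectancy are placeholders made up for the example.

```python
import io

import pandas as pd

# Tiny made-up sample in the assumed wide layout; the values are placeholders.
sample_csv = io.StringIO(
    "country,2000,2001\n"
    "Country A,70.1,70.4\n"
    "Country B,65.0,65.3\n"
)
wide = pd.read_csv(sample_csv)

# Reshape to one row per (country, year) pair, which is easier to merge and plot.
long = wide.melt(id_vars="country", var_name="year", value_name="life_expectancy")
long["year"] = long["year"].astype(int)

print(long)
```

With a real file, the io.StringIO object would be replaced by the path to the downloaded CSV.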

Step Two - Get organized

Create a single folder (repository) that contains:

The report communicating my findings
Any Python code I wrote as part of my analysis
The datasets I used

Step Three - Analyse the data

Pose questions that encourage looking at relationships between multiple variables. I aimed to analyze at least one dependent variable and three independent variables in my investigation, using NumPy and pandas wherever they are appropriate.
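
One simple way to look at such relationships is to compute the correlation between the dependent variable and each independent variable. The sketch below assumes life expectancy is treated as the dependent variable, which is an assumption made for the example; the column names and values are also made up and are not results from the notebook.

```python
import pandas as pd

# Made-up values for illustration only; one row per hypothetical country.
df = pd.DataFrame({
    "life_expectancy": [55.0, 62.0, 70.0, 78.0],
    "gdp_per_capita": [800.0, 2500.0, 9000.0, 30000.0],
    "female_literacy_rate": [40.0, 65.0, 85.0, 99.0],
    "female_salaried_employees": [10.0, 30.0, 55.0, 85.0],
})

dependent = "life_expectancy"

# Pearson correlation of every independent variable with the dependent one.
correlations = df.corr()[dependent].drop(dependent)

print(correlations)
```

A correlation only describes how the variables move together in the data; it does not show that one causes the other.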

Step Four - Share your findings

Create a report that shares the findings I found most interesting.
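
A finding is often easiest to share as a plot. The following is a minimal sketch of a Matplotlib scatter plot written out as a PNG image; the values are made up for the example, and the Agg backend is chosen only so that the code runs without a display.

```python
import io

import matplotlib

matplotlib.use("Agg")  # non-interactive backend, so no display is required
import matplotlib.pyplot as plt

# Made-up values for illustration only.
gdp_per_capita = [800.0, 2500.0, 9000.0, 30000.0]
life_expectancy = [55.0, 62.0, 70.0, 78.0]

fig, ax = plt.subplots()
ax.scatter(gdp_per_capita, life_expectancy)
ax.set_xscale("log")
ax.set_xlabel("GDP per capita (log scale)")
ax.set_ylabel("Life expectancy at birth (years)")
ax.set_title("Illustrative values only")

# savefig accepts a file path or a file-like object; a buffer is used here.
buffer = io.BytesIO()
fig.savefig(buffer, format="png")
plt.close(fig)
```

In a Jupyter Notebook the figure is normally displayed inline, so the savefig step is only needed when the image has to be reused outside the notebook.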

Reference

https://www.gapminder.org/data/

Khaled259/Investigate-a-Dataset-Gapminder-Health
