Skip to content

This is a repository for all of my projects on Exploratory Data Analysis, including Data cleaning, data analyzing and data visualization

Notifications You must be signed in to change notification settings

lutang123/Data-Science-Projects

Repository files navigation

Data Science Projects:

This is a repository for a summary of most of my projects on Data Science, including Data cleaning, data analyzing and data visualizing and machine learning.

There are 11 projects in this repository, each project has its own repository, please click on each project link to see the code and analyzing result in Jupyter notebook.

Project 1: Artificial Financial Advisor on Peer-to-Peer lending

Analyzed Peer-to-peer loan risk with millions of loan data from Lending Club and created a machine learning model that can reduce investor’s risk by 14% while still give 8.04% of return.

Project includes data cleaning, data analyzing and testing with different machine learning models, logistic regression, random forest and neural networks etc.

GitHub link: GitHub Repo

Links for code and analysis result:

Part 1: Data Wrangling

Part 2: Testing models

Project 2: Data Visualization on The World Happiness Report

Created an animated bubble chart that displays changes in happiness level on country level and a dashboard on Tableau.

Programming language: Python, main libraries used: Numpy, Pandas and Plotly.

GitHub link: GitHub Repo

Link for code and analysis result: Animated Bubble Plot

(tip: go down to the bottom of this page, and click Autoscale for the best view of the chart).

For dashboard: in the file of Happy Country Dashboard with Tableau.twb (need Tabeleau account to display)

Project 3: Analyzing-Fuel-Economy-Data-for-2008-and-2018

This project will compare Fuel Economy Data for 2008 and 2018, and analyze the changes in vehicles and its fuel efficiency

GitHub link: GitHub Repo

Link for code and analysis result: Analyzing Fuel Economy Data

Project 4: Analyzing eBay Car Sales

This Project analyzes more than 37000 data from eBay. What are the most popular car brands in German and their prices? What are the factors affecting the car price? This project will give you the answer.

GitHub link: GitHub Repo

Link for code and analysis result: Analyzing eBay Car Sales

Project 5: Investigate The Movie Database

This is a comprehensive EDA project analyzing about 10,000 movies collected from The Movie Database (TMDb). The project includes data cleaning, analyzing and visualization using Python and its libraryies: Pandas, Matplotlib and Seaborn.

GitHub link: GitHub Repo

Link for code and analysis result: Investigate The Movie Database

Project 6: Exploring Weather Trends

This project will compare Fuel Economy Data for 2008 and 2018, and analyze the changes in vehicles and its fuel efficiency

GitHub link: GitHub Repo

Link for code and analysis result: Exploring Weather Trends

Data Science side projects on other various topics:

This Repo includes 5 side projects done by myself, and it aims to present my data science learning journey.#### GitHub link: GitHub Repo

Links for code and analysis result:

About

This is a repository for all of my projects on Exploratory Data Analysis, including Data cleaning, data analyzing and data visualization

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published