This repository summarizes most of my data science projects, covering data cleaning, data analysis, data visualization, and machine learning.
There are 11 projects in this repository. Each project has its own repository; click a project's link to see the code and analysis results in a Jupyter notebook.
Analyzed peer-to-peer loan risk using millions of loan records from Lending Club and created a machine learning model that can reduce investors' risk by 14% while still giving a return of 8.04%.
The project includes data cleaning, data analysis, and testing of different machine learning models, such as logistic regression, random forest, and neural networks.
GitHub link: GitHub Repo
Created an animated bubble chart that displays changes in happiness at the country level, plus a dashboard in Tableau.
Programming language: Python. Main libraries used: NumPy, Pandas, and Plotly.
GitHub link: GitHub Repo
Link for code and analysis result: Animated Bubble Plot
(Tip: scroll to the bottom of the page and click Autoscale for the best view of the chart.)
For the dashboard: see the file Happy Country Dashboard with Tableau.twb (a Tableau account is needed to display it).
This project compares fuel economy data for 2008 and 2018 and analyzes the changes in vehicles and their fuel efficiency.
GitHub link: GitHub Repo
Link for code and analysis result: Analyzing Fuel Economy Data
This project analyzes more than 37,000 records from eBay. What are the most popular car brands in Germany, and what do they cost? What factors affect car prices? This project answers these questions.
GitHub link: GitHub Repo
Link for code and analysis result: Analyzing eBay Car Sales
This is a comprehensive EDA project analyzing about 10,000 movies collected from The Movie Database (TMDb). The project includes data cleaning, analysis, and visualization using Python and its libraries Pandas, Matplotlib, and Seaborn.
GitHub link: GitHub Repo
Link for code and analysis result: Investigate The Movie Database
GitHub link: GitHub Repo
Link for code and analysis result: Exploring Weather Trends
This repo includes 5 side projects I completed on my own and aims to present my data science learning journey.
GitHub link: GitHub Repo