Skip to content

This project explores data from Prosper which is America's first marketplace lending platform.

License

Notifications You must be signed in to change notification settings

F-Zarian/Exploration-of-Prosper-Loan-Dataset

Repository files navigation

Exploration of Prosper Loan Dataset Using Python

Project Overview

This project explores data from Prosper which is America's first marketplace lending platform. Prosper has funded over 12 billion dollars in loans. On porsper, borrowers list their loan requests between 2,000 and 40,000 dollars and individual investors can invest in as little as 25 dollars in their listing of choice.

Data Set

The dataset was provided by Udacity as part of the Data Analyst Nanodegree Program certification in January 2021. The dataset can be found here with feature documentation available here

Preliminary Wrangling & Exploratory Data Analysis

This exploration will contain statistics with visualizations to build understanding of Prosper dataset. The dataset consists of 81 variables and 113,937 observations. Visualizations will include univariate, bivariate and multivariate visualizations of several variables in the dataset, allowing the reader to gain understanding of variable distributions as well as their relationships.

Summary of Findings

Prosper provides a reliable platform for investors to lend and borrow money. The loans provided through Prosper show extremely low historical rates for the borrower with negative service fees for the majority of loans. More than 99 percent of the loan listings are fully funded. The default rates of the loans are less than 5 percent.

Key Insights for Presentation

For the presentation, I focus on the influence of Original Loan Amount, Income Range, EmploymentStatus, Credit Score and Loan Term on the Borrower APR. I start by introducing these variables and their distributions to the pairwise relationships of the variables in bivariate plots, followed by introduction of multivariate relationships among the variables of intereset by use of multivariate plots. Each plot is followed by detailed analysis of the findings and the next step(s).

The plots used for this explotration include histograms, heatmaps, violin plots, scatter plots and several plot matrices.

Technologies Used

  • Python, Pandas, Numpy, Matplotlib, Seaborn
  • Jupyter Notebook

Resources

License

The contents of this repository are covered under the MIT License.

About

This project explores data from Prosper which is America's first marketplace lending platform.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published