This repository contains the "Online Payment Fraud Detection" project implemented using Python, machine learning, and exploratory data analysis.
In the fast growth of online shopping and transactions, the need for strong security to stop fake activities is very important. The "Online Payment Fraud Detection" project addresses this critical concern by leveraging advanced data science techniques to identify and prevent fraudulent transactions in real-time.
The primary objective of this project is to train machine learning models for identifying fraudulent and non-fraudulent payments.
The dataset consists of 10 variables:
step
: represents a unit of time where 1 step equals 1 hourtype
: type of online transactionamount
: the amount of the transactionnameOrig
: customer starting the transactionoldbalanceOrg
: balance before the transactionnewbalanceOrig
: balance after the transactionnameDest
: recipient of the transactionoldbalanceDest
: initial balance of recipient before the transactionnewbalanceDest
: the new balance of the recipient after the transactionisFraud
: indicates whether the transaction is fraudulent
- pandas
- numpy
- seaborn
- matplotlib
- tabulate
- scikit-learn (sklearn)
Random Forest and Logistics Regression were used to identify online payment fraud due to the large dataset.
Read the complete Online Payment Fraud Detection project here
The dataset used in this project is available on Kaggle. You can access it here.
The best-performing model for identifying fraudulent and non-fraudulent payments is Random Forest.