Skip to content

The problem statement is to predict fraudulent credit card transactions with the help of machine learning models. In this project, we need to analyse customer-level data that has been collected and analysed during a research collaboration of Worldline and the Machine Learning Group.

Notifications You must be signed in to change notification settings

bagishmehrotra/creditcardFraud

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 

Repository files navigation

creditcardFraud

The problem statement is to predict fraudulent credit card transactions with the help of machine learning models.Here we will analyse customer-level data that has been collected and analysed during a research collaboration of Worldline and the Machine Learning Group. The data set is taken from the Kaggle website (https://www.kaggle.com/mlg-ulb/creditcardfraud) and has a total of 2,84,807 transactions; out of these, 492 are fraudulent. Since the data set is highly imbalanced, it needs to be handled before model building. Data dictionary: The data set includes credit card transactions made by European cardholders over a period of two days in September 2013. Out of a total of 2,84,807 transactions, 492 were fraudulent. This data set is highly unbalanced, with the positive class (frauds) accounting for 0.172% of the total transactions. The data set has also been modified with principal component analysis (PCA) to maintain confidentiality. Apart from ‘time’ and ‘amount’, all the other features (V1, V2, V3, up to V28) are the principal components obtained using PCA. The feature 'time' contains the seconds elapsed between the first transaction in the data set and the subsequent transactions. The feature 'amount' is the transaction amount. The feature 'class' represents class labelling, and it takes the value of 1 in cases of fraud and 0 in others.

Note: Used Google Colab to process the data

About

The problem statement is to predict fraudulent credit card transactions with the help of machine learning models. In this project, we need to analyse customer-level data that has been collected and analysed during a research collaboration of Worldline and the Machine Learning Group.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages