Machine Learning for Prediction of Imbalanced Data: Credit Fraud Detection
Online transactions have increased drastically over the past decades. Credit card transactions account for a large percentage of these transactions. This leads to rise activities of credit card fraud transactions, causing losses in the finance industry. Therefore, it is vital to create reliable frau...
Saved in:
Published in | 2021 15th International Conference on Ubiquitous Information Management and Communication (IMCOM) pp. 1 - 7 |
---|---|
Main Authors | , |
Format | Conference Proceeding |
Language | English |
Published |
IEEE
04.01.2021
|
Subjects | |
Online Access | Get full text |
DOI | 10.1109/IMCOM51814.2021.9377352 |
Cover
Summary: | Online transactions have increased drastically over the past decades. Credit card transactions account for a large percentage of these transactions. This leads to rise activities of credit card fraud transactions, causing losses in the finance industry. Therefore, it is vital to create reliable fraud detection systems, including two labels of fraud and no-fraud. However, there are highly unbalanced data between these two labels. In this paper, we use two resampling approaches of synthetic minority oversampling technique (SMOTE) and adaptive synthetic (ADASYN) to handle an imbalanced dataset to obtain the balanced dataset. The machine learning (ML) algorithms, named random forest, k nearest neighbors, decision tree, and logistic regression are applied to this balanced dataset. The comprehensive classification measurements, including fundamental, combined, and graphical measurements are used to evaluate the performances of these models. We observe that after resampling the dataset, the ML algorithms mentioned show the positive results of classification for fraudulent activities. |
---|---|
DOI: | 10.1109/IMCOM51814.2021.9377352 |