Machine Learning for Prediction of Imbalanced Data: Credit Fraud Detection

Online transactions have increased drastically over the past decades. Credit card transactions account for a large percentage of these transactions. This leads to rise activities of credit card fraud transactions, causing losses in the finance industry. Therefore, it is vital to create reliable frau...

Full description

Saved in:

Bibliographic Details
Published in	2021 15th International Conference on Ubiquitous Information Management and Communication (IMCOM) pp. 1 - 7
Main Authors	Tran, Thanh Cong, Dang, Tran Khanh
Format	Conference Proceeding
Language	English
Published	IEEE 04.01.2021
Subjects	ADASYN Classification Classification algorithms classification measurements Credit cards Fraud Detection Imbalanced Data Machine learning Machine learning algorithms Prediction algorithms Random forests Regression tree analysis Reliability SMOTE
Online Access	Get full text
DOI	10.1109/IMCOM51814.2021.9377352

Cover

More Information
Summary:	Online transactions have increased drastically over the past decades. Credit card transactions account for a large percentage of these transactions. This leads to rise activities of credit card fraud transactions, causing losses in the finance industry. Therefore, it is vital to create reliable fraud detection systems, including two labels of fraud and no-fraud. However, there are highly unbalanced data between these two labels. In this paper, we use two resampling approaches of synthetic minority oversampling technique (SMOTE) and adaptive synthetic (ADASYN) to handle an imbalanced dataset to obtain the balanced dataset. The machine learning (ML) algorithms, named random forest, k nearest neighbors, decision tree, and logistic regression are applied to this balanced dataset. The comprehensive classification measurements, including fundamental, combined, and graphical measurements are used to evaluate the performances of these models. We observe that after resampling the dataset, the ML algorithms mentioned show the positive results of classification for fraudulent activities.
DOI:	10.1109/IMCOM51814.2021.9377352