Machine Learning for Prediction of Imbalanced Data: Credit Fraud Detection

Bibliographic Details
Published in: 2021 15th International Conference on Ubiquitous Information Management and Communication (IMCOM), pp. 1-7
Main Authors: Tran, Thanh Cong; Dang, Tran Khanh
Format: Conference Proceeding
Language: English
Published: IEEE, 04.01.2021
DOI: 10.1109/IMCOM51814.2021.9377352

Summary: Online transactions have increased drastically over the past decades, and credit card transactions account for a large percentage of them. This has led to a rise in fraudulent credit card transactions, causing losses in the finance industry. It is therefore vital to create reliable fraud detection systems that classify transactions under two labels, fraud and no-fraud. However, the data are highly imbalanced between these two labels. In this paper, we use two resampling approaches, the synthetic minority oversampling technique (SMOTE) and adaptive synthetic sampling (ADASYN), to turn an imbalanced dataset into a balanced one. Four machine learning (ML) algorithms, namely random forest, k-nearest neighbors, decision tree, and logistic regression, are applied to this balanced dataset. Comprehensive classification measurements, including fundamental, combined, and graphical measurements, are used to evaluate the performance of these models. We observe that, after resampling the dataset, these ML algorithms show positive classification results for fraudulent activities.
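The resampling step named in the summary can be illustrated with a minimal sketch of the core SMOTE idea: each synthetic minority sample is placed at a random point on the line segment between an existing minority sample and one of its nearest minority neighbours. This is not the authors' implementation, which the record does not include. The function name `smote`, its parameters, and the toy data are assumptions made for illustration only, and the sketch uses just the Python standard library.

```python
import math
import random


def smote(minority, n_new, k=5, seed=0):
    """Return n_new synthetic points interpolated between minority samples.

    minority: list of equal-length feature tuples from the minority class.
    k: number of nearest minority neighbours considered per base sample.
    """
    if len(minority) < 2:
        raise ValueError("SMOTE needs at least two minority samples")
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)
    synthetic = []
    for _ in range(n_new):
        # Pick a random minority sample as the base point.
        i = rng.randrange(len(minority))
        base = minority[i]
        # Rank the other minority samples by Euclidean distance to the base.
        others = [p for j, p in enumerate(minority) if j != i]
        others.sort(key=lambda p: math.dist(base, p))
        neighbour = rng.choice(others[:k])
        # Place the new point at a random position on the segment
        # from the base point to the chosen neighbour.
        gap = rng.random()
        synthetic.append(
            tuple(b + gap * (n - b) for b, n in zip(base, neighbour))
        )
    return synthetic


# Toy data (invented for illustration): 8 no-fraud rows, 3 fraud rows.
no_fraud = [(float(x), float(y)) for x in range(4) for y in range(2)]
fraud = [(10.0, 10.0), (11.0, 10.0), (10.0, 12.0)]

# Generate enough synthetic fraud rows to match the majority class size.
new_fraud = smote(fraud, n_new=len(no_fraud) - len(fraud), k=2)
balanced_fraud = fraud + new_fraud
print(len(no_fraud), len(balanced_fraud))
```

In practice a maintained library implementation would normally be preferred over a hand-written one. Resampling is usually applied to the training split only, so that evaluation still reflects the original class imbalance.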