Financial Statement Fraud Detection: An Analysis of Statistical and Machine Learning Algorithms

This study compares the performance of six popular statistical and machine learning models in detecting financial statement fraud under different assumptions of misclassification costs and ratios of fraud firms to nonfraud firms. The results show, somewhat surprisingly, that logistic regression and...

Full description

Saved in:
Bibliographic Details
Published inAuditing : a journal of practice and theory Vol. 30; no. 2; pp. 19 - 50
Main Author Perols, Johan
Format Journal Article
LanguageEnglish
Published Sarasota Assoc 01.05.2011
American Accounting Association
Subjects
Online AccessGet full text
ISSN0278-0380
1558-7991
DOI10.2308/ajpt-50009

Cover

More Information
Summary:This study compares the performance of six popular statistical and machine learning models in detecting financial statement fraud under different assumptions of misclassification costs and ratios of fraud firms to nonfraud firms. The results show, somewhat surprisingly, that logistic regression and support vector machines perform well relative to an artificial neural network, bagging, C4.5, and stacking. The results also reveal some diversity in predictors used across the classification algorithms. Out of 42 predictors examined, only six are consistently selected and used by different classification algorithms: auditor turnover, total discretionary accruals, Big 4 auditor, accounts receivable, meeting or beating analyst forecasts, and unexpected employee productivity. These findings extend financial statement fraud research and can be used by practitioners and regulators to improve fraud risk models. Data Availability: A list of fraud companies used in this study is available from the author upon request. All other data sources are described in the text.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:0278-0380
1558-7991
DOI:10.2308/ajpt-50009