Financial Statement Fraud Detection: An Analysis of Statistical and Machine Learning Algorithms

This study compares the performance of six popular statistical and machine learning models in detecting financial statement fraud under different assumptions of misclassification costs and ratios of fraud firms to nonfraud firms. The results show, somewhat surprisingly, that logistic regression and...

Full description

Saved in:

Bibliographic Details
Published in	Auditing : a journal of practice and theory Vol. 30; no. 2; pp. 19 - 50
Main Author	Perols, Johan
Format	Journal Article
Language	English
Published	Sarasota Assoc 01.05.2011 American Accounting Association
Subjects	Algorithms Auditing Auditing standards Auditors Bilanzdelikt Classification Comparative analysis Costs Datasets Discriminant analysis Efficient markets Financial statements Fraud Fraud prevention Klassifikation Machine learning Mustererkennung Neural networks Probability Regression analysis Studies Support vector machines USA Wirtschaftsprüfung
Online Access	Get full text
ISSN	0278-0380 1558-7991
DOI	10.2308/ajpt-50009

Cover

More Information
Summary:	This study compares the performance of six popular statistical and machine learning models in detecting financial statement fraud under different assumptions of misclassification costs and ratios of fraud firms to nonfraud firms. The results show, somewhat surprisingly, that logistic regression and support vector machines perform well relative to an artificial neural network, bagging, C4.5, and stacking. The results also reveal some diversity in predictors used across the classification algorithms. Out of 42 predictors examined, only six are consistently selected and used by different classification algorithms: auditor turnover, total discretionary accruals, Big 4 auditor, accounts receivable, meeting or beating analyst forecasts, and unexpected employee productivity. These findings extend financial statement fraud research and can be used by practitioners and regulators to improve fraud risk models. Data Availability: A list of fraud companies used in this study is available from the author upon request. All other data sources are described in the text.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14
ISSN:	0278-0380 1558-7991
DOI:	10.2308/ajpt-50009