Evaluating multiple classifiers for stock price direction prediction

•We predict long term stock price direction.•We benchmark three ensemble methods against four single classifiers.•We use five times twofold cross-validation and AUC as a performance measure.•Random Forest is the top algorithm.•This study is the first to make such an extensive benchmark in this domai...

Full description

Saved in:
Bibliographic Details
Published inExpert systems with applications Vol. 42; no. 20; pp. 7046 - 7056
Main Authors Ballings, Michel, Van den Poel, Dirk, Hespeels, Nathalie, Gryp, Ruben
Format Journal Article
LanguageEnglish
Published Elsevier Ltd 15.11.2015
Subjects
Online AccessGet full text
ISSN0957-4174
1873-6793
DOI10.1016/j.eswa.2015.05.013

Cover

More Information
Summary:•We predict long term stock price direction.•We benchmark three ensemble methods against four single classifiers.•We use five times twofold cross-validation and AUC as a performance measure.•Random Forest is the top algorithm.•This study is the first to make such an extensive benchmark in this domain. Stock price direction prediction is an important issue in the financial world. Even small improvements in predictive performance can be very profitable. The purpose of this paper is to benchmark ensemble methods (Random Forest, AdaBoost and Kernel Factory) against single classifier models (Neural Networks, Logistic Regression, Support Vector Machines and K-Nearest Neighbor). We gathered data from 5767 publicly listed European companies and used the area under the receiver operating characteristic curve (AUC) as a performance measure. Our predictions are one year ahead. The results indicate that Random Forest is the top algorithm followed by Support Vector Machines, Kernel Factory, AdaBoost, Neural Networks, K-Nearest Neighbors and Logistic Regression. This study contributes to literature in that it is, to the best of our knowledge, the first to make such an extensive benchmark. The results clearly suggest that novel studies in the domain of stock price direction prediction should include ensembles in their sets of algorithms. Our extensive literature review evidently indicates that this is currently not the case.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:0957-4174
1873-6793
DOI:10.1016/j.eswa.2015.05.013