Evaluating multiple classifiers for stock price direction prediction

•We predict long term stock price direction.•We benchmark three ensemble methods against four single classifiers.•We use five times twofold cross-validation and AUC as a performance measure.•Random Forest is the top algorithm.•This study is the first to make such an extensive benchmark in this domai...

Full description

Saved in:

Bibliographic Details
Published in	Expert systems with applications Vol. 42; no. 20; pp. 7046 - 7056
Main Authors	Ballings, Michel, Van den Poel, Dirk, Hespeels, Nathalie, Gryp, Ruben
Format	Journal Article
Language	English
Published	Elsevier Ltd 15.11.2015
Subjects	Benchmark Benchmarking Ensemble methods Industrial plants Kernels Logistics Mathematical models Neural networks Raw materials Regression Single classifiers Stock price direction prediction Stock price direction prediction Benchmark Ensemble methods Single classifiers
Online Access	Get full text
ISSN	0957-4174 1873-6793
DOI	10.1016/j.eswa.2015.05.013

Cover

More Information
Summary:	•We predict long term stock price direction.•We benchmark three ensemble methods against four single classifiers.•We use five times twofold cross-validation and AUC as a performance measure.•Random Forest is the top algorithm.•This study is the first to make such an extensive benchmark in this domain. Stock price direction prediction is an important issue in the financial world. Even small improvements in predictive performance can be very profitable. The purpose of this paper is to benchmark ensemble methods (Random Forest, AdaBoost and Kernel Factory) against single classifier models (Neural Networks, Logistic Regression, Support Vector Machines and K-Nearest Neighbor). We gathered data from 5767 publicly listed European companies and used the area under the receiver operating characteristic curve (AUC) as a performance measure. Our predictions are one year ahead. The results indicate that Random Forest is the top algorithm followed by Support Vector Machines, Kernel Factory, AdaBoost, Neural Networks, K-Nearest Neighbors and Logistic Regression. This study contributes to literature in that it is, to the best of our knowledge, the first to make such an extensive benchmark. The results clearly suggest that novel studies in the domain of stock price direction prediction should include ensembles in their sets of algorithms. Our extensive literature review evidently indicates that this is currently not the case.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23
ISSN:	0957-4174 1873-6793
DOI:	10.1016/j.eswa.2015.05.013