Experimenting with prequential variations for data stream learning evaluation

Processing data streams requires new demands not existent on static environments. In online learning, the probability distribution of the data can often change over time (concept drift). The prequential assessment methodology is commonly used to evaluate the performance of classifiers in data stream...

Full description

Saved in:
Bibliographic Details
Published inComputational intelligence Vol. 35; no. 4; pp. 670 - 692
Main Authors Hidalgo, Juan I. González, Maciel, Bruno I. F., Barros, Roberto S. M.
Format Journal Article
LanguageEnglish
Published Hoboken Blackwell Publishing Ltd 01.11.2019
Subjects
Online AccessGet full text
ISSN0824-7935
1467-8640
DOI10.1111/coin.12208

Cover

More Information
Summary:Processing data streams requires new demands not existent on static environments. In online learning, the probability distribution of the data can often change over time (concept drift). The prequential assessment methodology is commonly used to evaluate the performance of classifiers in data streams with stationary and non‐stationary distributions. It is based on the premise that the purpose of statistical inference is to make sequential probability forecasts for future observations, rather than to express information about the past accuracy achieved. This article empirically evaluates the prequential methodology considering its three common strategies used to update the prediction model, namely, Basic Window, Sliding Window, and Fading Factors. Specifically, it aims to identify which of these variations is the most accurate for the experimental evaluation of the past results in scenarios where concept drifts occur, with greater interest in the accuracy observed within the total data flow. The prequential accuracy of the three variations and the real accuracy obtained in the learning process of each dataset are the basis for this evaluation. The results of the carried‐out experiments suggest that the use of Prequential with the Sliding Window variation is the best alternative.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:0824-7935
1467-8640
DOI:10.1111/coin.12208