Upgrades of Genetic Programming for Data-Driven Modeling of Time Series

In many engineering fields and scientific disciplines, the results of experiments are in the form of time series, which can be quite problematic to interpret and model. Genetic programming tools are quite powerful in extracting knowledge from data. In this work, several upgrades and refinements are...

Full description

Saved in:
Bibliographic Details
Published inEvolutionary computation Vol. 31; no. 4; pp. 401 - 432
Main Authors Murari, A., Peluso, E., Spolladore, L., Rossi, R., Gelfusa, M.
Format Journal Article
LanguageEnglish
Published United States MIT Press Journals, The 01.12.2023
Subjects
Online AccessGet full text
ISSN1530-9304
1063-6560
1530-9304
DOI10.1162/evco_a_00330

Cover

More Information
Summary:In many engineering fields and scientific disciplines, the results of experiments are in the form of time series, which can be quite problematic to interpret and model. Genetic programming tools are quite powerful in extracting knowledge from data. In this work, several upgrades and refinements are proposed and tested to improve the explorative capabilities of symbolic regression (SR) via genetic programming (GP) for the investigation of time series, with the objective of extracting mathematical models directly from the available signals. The main task is not simply prediction but consists of identifying interpretable equations, reflecting the nature of the mechanisms generating the signals. The implemented improvements involve almost all aspects of GP, from the knowledge representation and the genetic operators to the fitness function. The unique capabilities of genetic programming, to accommodate prior information and knowledge, are also leveraged effectively. The proposed upgrades cover the most important applications of empirical modeling of time series, ranging from the identification of autoregressive systems and partial differential equations to the search of models in terms of dimensionless quantities and appropriate physical units. Particularly delicate systems to identify, such as those showing hysteretic behavior or governed by delayed differential equations, are also addressed. The potential of the developed tools is substantiated with both a battery of systematic numerical tests with synthetic signals and with applications to experimental data.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
ISSN:1530-9304
1063-6560
1530-9304
DOI:10.1162/evco_a_00330