Time Series FeatuRe Extraction on basis of Scalable Hypothesis tests (tsfresh – A Python package)

Time series feature engineering is a time-consuming process because scientists and engineers have to consider the multifarious algorithms of signal processing and time series analysis for identifying and extracting meaningful features from time series. The Python package tsfresh (Time Series FeatuRe...

Full description

Saved in:
Bibliographic Details
Published inNeurocomputing (Amsterdam) Vol. 307; pp. 72 - 77
Main Authors Christ, Maximilian, Braun, Nils, Neuffer, Julius, Kempa-Liehr, Andreas W.
Format Journal Article
LanguageEnglish
Published Elsevier B.V 13.09.2018
Subjects
Online AccessGet full text
ISSN0925-2312
1872-8286
1872-8286
DOI10.1016/j.neucom.2018.03.067

Cover

More Information
Summary:Time series feature engineering is a time-consuming process because scientists and engineers have to consider the multifarious algorithms of signal processing and time series analysis for identifying and extracting meaningful features from time series. The Python package tsfresh (Time Series FeatuRe Extraction on basis of Scalable Hypothesis tests) accelerates this process by combining 63 time series characterization methods, which by default compute a total of 794 time series features, with feature selection on basis automatically configured hypothesis tests. By identifying statistically significant time series characteristics in an early stage of the data science process, tsfresh closes feedback loops with domain experts and fosters the development of domain specific features early on. The package implements standard APIs of time series and machine learning libraries (e.g. pandas and scikit-learn) and is designed for both exploratory analyses as well as straightforward integration into operational data science applications.
ISSN:0925-2312
1872-8286
1872-8286
DOI:10.1016/j.neucom.2018.03.067