Open Access Journal ArticleDOI

The great time series classification bake off: a review and experimental evaluation of recent algorithmic advances

TLDR
This work implements 18 recently proposed algorithms in a common Java framework and compares them against two standard benchmark classifiers (and each other) via 100 resampling experiments on each of the 85 datasets; the results indicate that only nine of these algorithms are significantly more accurate than both benchmarks.
Abstract
In the last 5 years there have been a large number of new time series classification algorithms proposed in the literature. These algorithms have been evaluated on subsets of the 47 data sets in the University of California, Riverside time series classification archive. The archive has recently been expanded to 85 data sets, over half of which have been donated by researchers at the University of East Anglia. Aspects of previous evaluations have made comparisons between algorithms difficult. For example, several different programming languages have been used, experiments involved a single train/test split and some used normalised data whilst others did not. The relaunch of the archive provides a timely opportunity to thoroughly evaluate algorithms on a larger number of datasets. We have implemented 18 recently proposed algorithms in a common Java framework and compared them against two standard benchmark classifiers (and each other) by performing 100 resampling experiments on each of the 85 datasets. We use these results to test several hypotheses relating to whether the algorithms are significantly more accurate than the benchmarks and each other. Our results indicate that only nine of these algorithms are significantly more accurate than both benchmarks and that one classifier, the collective of transformation ensembles, is significantly more accurate than all of the others. All of our experiments and results are reproducible: we release all of our code, results and experimental details and we hope these experiments form the basis for more robust testing of new algorithms in the future.
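
For readers unfamiliar with this style of evaluation, the sketch below illustrates the general resampling protocol the abstract describes: repeatedly resample a dataset into train and test portions, score a classifier on each resample, and average the accuracies. It is only an illustration, not the released framework; the 1-NN Euclidean classifier and the random (unstratified) resampling of a toy dataset are stand-ins for the actual benchmarks and the archive's resamples.

```java
import java.util.*;

/** Minimal sketch of a resampling evaluation loop. Assumptions: plain 1-NN with
 *  Euclidean distance as a stand-in benchmark and random resamples of a toy
 *  dataset; the paper's framework uses its own classifiers and the archive's
 *  stratified resamples. */
public class ResampleEval {

    static int predict1NN(double[][] trainX, int[] trainY, double[] query) {
        int best = 0;
        double bestDist = Double.MAX_VALUE;
        for (int i = 0; i < trainX.length; i++) {
            double d = 0;
            for (int j = 0; j < query.length; j++) {
                double diff = trainX[i][j] - query[j];
                d += diff * diff;
            }
            if (d < bestDist) { bestDist = d; best = trainY[i]; }
        }
        return best;
    }

    /** Mean accuracy of 1-NN over 'resamples' random train/test splits of a pooled dataset. */
    static double meanAccuracy(double[][] X, int[] y, int trainSize, int resamples, long seed) {
        Random rng = new Random(seed);
        double total = 0;
        for (int r = 0; r < resamples; r++) {
            // shuffle indices to form one resample
            Integer[] idx = new Integer[X.length];
            for (int i = 0; i < idx.length; i++) idx[i] = i;
            Collections.shuffle(Arrays.asList(idx), rng);
            double[][] trX = new double[trainSize][];
            int[] trY = new int[trainSize];
            for (int i = 0; i < trainSize; i++) { trX[i] = X[idx[i]]; trY[i] = y[idx[i]]; }
            int correct = 0, tested = 0;
            for (int i = trainSize; i < X.length; i++, tested++)
                if (predict1NN(trX, trY, X[idx[i]]) == y[idx[i]]) correct++;
            total += (double) correct / tested;
        }
        return total / resamples;
    }

    public static void main(String[] args) {
        // toy two-class data (real experiments load the 85 archive datasets)
        Random rng = new Random(1);
        double[][] X = new double[60][20];
        int[] y = new int[60];
        for (int i = 0; i < 60; i++) {
            y[i] = i % 2;
            for (int j = 0; j < 20; j++) X[i][j] = rng.nextGaussian() + (y[i] == 0 ? 0 : 1);
        }
        System.out.printf("mean accuracy over 100 resamples: %.3f%n",
                meanAccuracy(X, y, 30, 100, 42));
    }
}
```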


Citations
Proceedings ArticleDOI

Exploring the Detection of Spontaneous Recollections during Video-viewing In-the-Wild using Facial Behavior Analysis

Bernd Dudzik et al.
TL;DR: In this paper, the authors explore the feasibility of detecting whether a video clip has triggered personal memories in a viewer based on the analysis of their Head Rotation, Head Position, Eye Gaze, and Facial Expressions.
Posted Content

A systematic review of Python packages for time series analysis

TL;DR: In this article, a systematic review of Python packages for time series analysis is presented; the review is based on a search of literature databases as well as GitHub repositories, and 40 packages were analyzed.
Proceedings ArticleDOI

A Novel Parameter-Free Energy Efficient Fuzzy Nearest Neighbor Classifier for Time Series Data

TL;DR: In this article, the authors propose a novel fuzzy nearest neighbour classifier for time series data that transforms a very large training set into a much smaller representative training set, which is then used to label a test instance via a new fuzzy distance measure known as Ravi.
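
The summary above does not define the Ravi measure, so the sketch below only illustrates the general pattern it describes: compress a large training set into a few representatives per class, then label a query by its nearest representative. The per-class mean series and Euclidean distance are substitutions chosen for the sketch, not the cited paper's method.

```java
import java.util.*;

/** Sketch of data reduction plus nearest-representative classification.
 *  Assumptions: per-class means as representatives and Euclidean distance;
 *  the fuzzy "Ravi" measure from the cited paper is not reproduced here. */
public class NearestRepresentative {

    static Map<Integer, double[]> classMeans(double[][] X, int[] y) {
        Map<Integer, double[]> sums = new HashMap<>();
        Map<Integer, Integer> counts = new HashMap<>();
        for (int i = 0; i < X.length; i++) {
            double[] s = sums.computeIfAbsent(y[i], k -> new double[X[0].length]);
            for (int j = 0; j < X[i].length; j++) s[j] += X[i][j];
            counts.merge(y[i], 1, Integer::sum);
        }
        for (Map.Entry<Integer, double[]> e : sums.entrySet()) {
            int n = counts.get(e.getKey());
            for (int j = 0; j < e.getValue().length; j++) e.getValue()[j] /= n;
        }
        return sums;
    }

    static int classify(Map<Integer, double[]> reps, double[] query) {
        int best = -1;
        double bestDist = Double.MAX_VALUE;
        for (Map.Entry<Integer, double[]> e : reps.entrySet()) {
            double d = 0;
            for (int j = 0; j < query.length; j++) {
                double diff = e.getValue()[j] - query[j];
                d += diff * diff;
            }
            if (d < bestDist) { bestDist = d; best = e.getKey(); }
        }
        return best;
    }

    public static void main(String[] args) {
        double[][] X = { {0, 0, 0}, {0.1, 0, 0.1}, {1, 1, 1}, {0.9, 1.1, 1} };
        int[] y = { 0, 0, 1, 1 };
        Map<Integer, double[]> reps = classMeans(X, y);
        System.out.println("predicted class: " + classify(reps, new double[]{0.05, 0.02, 0.0}));
    }
}
```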
Book ChapterDOI

Using Topic Modelling to Improve Prediction of Financial Report Commentary Classes

TL;DR: This work proposes using topic modelling, namely Latent Dirichlet Allocation (LDA), for automated extraction of the classes of the commentaries and uses feature selection strategies to improve the accuracy of the prediction models.
References
Journal ArticleDOI

The WEKA data mining software: an update

TL;DR: This paper provides an introduction to the WEKA workbench, reviews the history of the project, and, in light of the recent 3.6 stable release, briefly discusses what has been added since the last stable version (Weka 3.4) released in 2003.
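
As context for why WEKA is cited here (the bake off's common Java framework builds on it), a minimal example of the standard WEKA workflow, load an ARFF file, pick a classifier, cross-validate, looks roughly like the following. The file path and the J48 classifier are placeholders, not anything specific to the bake off.

```java
import java.util.Random;
import weka.classifiers.Classifier;
import weka.classifiers.Evaluation;
import weka.classifiers.trees.J48;
import weka.core.Instances;
import weka.core.converters.ConverterUtils.DataSource;

/** Minimal WEKA workflow: load an ARFF file, set the class attribute,
 *  and estimate accuracy by 10-fold cross-validation.
 *  Assumptions: "data.arff" is a placeholder path and J48 is just an example classifier. */
public class WekaSketch {
    public static void main(String[] args) throws Exception {
        Instances data = new DataSource("data.arff").getDataSet();
        data.setClassIndex(data.numAttributes() - 1);   // class is the last attribute

        Classifier cls = new J48();                      // any WEKA classifier fits here
        Evaluation eval = new Evaluation(data);
        eval.crossValidateModel(cls, data, 10, new Random(1));
        System.out.println(eval.toSummaryString());
    }
}
```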
Journal Article

Statistical Comparisons of Classifiers over Multiple Data Sets

TL;DR: A set of simple, yet safe and robust non-parametric tests for statistical comparisons of classifiers is recommended: the Wilcoxon signed ranks test for comparison of two classifiers and the Friedman test with the corresponding post-hoc tests for comparisons of more classifiers over multiple data sets.
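
The Friedman test recommended above is what underlies the bake off's significance claims: rank the classifiers on each dataset, average the ranks, and form a chi-square statistic from those averages. A hedged sketch follows; the toy accuracy matrix is invented for illustration, ties get averaged ranks, and the post-hoc tests are omitted.

```java
import java.util.Arrays;

/** Sketch of the Friedman test for comparing k classifiers over N datasets.
 *  Input: acc[i][j] = accuracy of classifier j on dataset i.
 *  Computes average ranks (rank 1 = most accurate) and the Friedman chi-square
 *  statistic plus the Iman-Davenport F correction; post-hoc tests are omitted. */
public class FriedmanTest {

    static double[] averageRanks(double[][] acc) {
        int n = acc.length, k = acc[0].length;
        double[] avg = new double[k];
        for (double[] row : acc) {
            // rank within one dataset: sort indices by descending accuracy
            Integer[] order = new Integer[k];
            for (int j = 0; j < k; j++) order[j] = j;
            Arrays.sort(order, (a, b) -> Double.compare(row[b], row[a]));
            int pos = 0;
            while (pos < k) {
                int end = pos;
                while (end + 1 < k && row[order[end + 1]] == row[order[pos]]) end++;
                double rank = (pos + end) / 2.0 + 1;    // average rank for ties
                for (int t = pos; t <= end; t++) avg[order[t]] += rank;
                pos = end + 1;
            }
        }
        for (int j = 0; j < k; j++) avg[j] /= n;
        return avg;
    }

    public static void main(String[] args) {
        // toy accuracies: 4 datasets x 3 classifiers (real studies use far more datasets)
        double[][] acc = {
            {0.80, 0.85, 0.83},
            {0.70, 0.72, 0.72},
            {0.90, 0.95, 0.91},
            {0.60, 0.66, 0.65}
        };
        int n = acc.length, k = acc[0].length;
        double[] r = averageRanks(acc);
        double sumSq = 0;
        for (double rj : r) sumSq += rj * rj;
        double chi2 = 12.0 * n / (k * (k + 1)) * (sumSq - k * (k + 1) * (k + 1) / 4.0);
        double imanDavenport = (n - 1) * chi2 / (n * (k - 1) - chi2);
        System.out.println("average ranks: " + Arrays.toString(r));
        System.out.printf("Friedman chi^2 = %.3f, Iman-Davenport F = %.3f%n", chi2, imanDavenport);
    }
}
```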
Book ChapterDOI

Domain-adversarial training of neural networks

TL;DR: In this article, a new representation learning approach for domain adaptation is proposed, where data at training and test time come from similar but different distributions; the approach promotes the emergence of features that are discriminative for the main learning task on the source domain but cannot discriminate between the training (source) and test (target) domains.
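
The adversarial objective can be summarised, in my own notation rather than the paper's, roughly as follows: the feature extractor is trained to minimise the label-prediction loss while maximising the domain-classifier loss (in practice via a gradient reversal layer), driving the two terms to a saddle point.

```latex
% Hedged paraphrase of the domain-adversarial objective (notation mine):
% L_y is the label-prediction loss on source data, L_d the domain-classification
% loss on source and target data, and \lambda trades the two terms off.
E(\theta_f, \theta_y, \theta_d) = L_y(\theta_f, \theta_y) - \lambda\, L_d(\theta_f, \theta_d),
\qquad
(\hat\theta_f, \hat\theta_y) = \arg\min_{\theta_f,\,\theta_y} E(\theta_f, \theta_y, \hat\theta_d),
\quad
\hat\theta_d = \arg\max_{\theta_d} E(\hat\theta_f, \hat\theta_y, \theta_d)
```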
Journal ArticleDOI

Experiencing SAX: a novel symbolic representation of time series

TL;DR: The utility of the new symbolic representation of time series is demonstrated: it allows dimensionality/numerosity reduction, and it allows distance measures to be defined on the symbolic representation that lower bound corresponding distance measures defined on the original series.
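
As a rough illustration of the SAX idea (not the authors' code): z-normalise the series, reduce it with piecewise aggregate approximation, and map each segment mean to a symbol using breakpoints that make the symbols roughly equiprobable under a standard normal. The alphabet size of 4 and its breakpoints below are just an example choice.

```java
/** Sketch of the SAX transform: z-normalise a series, average it into w PAA
 *  segments, then map each segment mean to a symbol via Gaussian breakpoints.
 *  Assumption: alphabet size 4 with breakpoints {-0.6745, 0, 0.6745}. */
public class SaxSketch {

    static final double[] BREAKPOINTS = { -0.6745, 0.0, 0.6745 };   // alphabet size 4

    static char[] sax(double[] series, int w) {
        // z-normalise
        double mean = 0, sq = 0;
        for (double v : series) mean += v;
        mean /= series.length;
        for (double v : series) sq += (v - mean) * (v - mean);
        double std = Math.sqrt(sq / series.length);

        char[] word = new char[w];
        int segLen = series.length / w;                  // assumes length divisible by w
        for (int s = 0; s < w; s++) {
            double segMean = 0;
            for (int j = s * segLen; j < (s + 1) * segLen; j++)
                segMean += (series[j] - mean) / (std == 0 ? 1 : std);
            segMean /= segLen;
            int symbol = 0;
            while (symbol < BREAKPOINTS.length && segMean > BREAKPOINTS[symbol]) symbol++;
            word[s] = (char) ('a' + symbol);
        }
        return word;
    }

    public static void main(String[] args) {
        double[] series = { 0, 1, 2, 3, 4, 5, 4, 3, 2, 1, 0, -1 };
        System.out.println(new String(sax(series, 4)));   // prints the 4-symbol SAX word
    }
}
```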
Journal ArticleDOI

Querying and mining of time series data: experimental comparison of representations and distance measures

TL;DR: An extensive set of time series experiments is conducted, re-implementing 8 different representation methods and 9 similarity measures (and their variants) and testing their effectiveness on 38 time series data sets from a wide variety of application domains, providing a unified validation of some of the existing achievements.
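
To make concrete the kind of measures such comparisons cover, the sketch below implements two of the most commonly benchmarked ones, Euclidean distance and full-window dynamic time warping; it is illustrative only, not the experimental code from the cited study.

```java
/** Sketch of two commonly benchmarked time series distance measures:
 *  Euclidean distance and full-window dynamic time warping (DTW). */
public class DistanceMeasures {

    static double euclidean(double[] a, double[] b) {
        double sum = 0;
        for (int i = 0; i < a.length; i++) sum += (a[i] - b[i]) * (a[i] - b[i]);
        return Math.sqrt(sum);
    }

    static double dtw(double[] a, double[] b) {
        double[][] cost = new double[a.length + 1][b.length + 1];
        for (double[] row : cost) java.util.Arrays.fill(row, Double.POSITIVE_INFINITY);
        cost[0][0] = 0;
        for (int i = 1; i <= a.length; i++) {
            for (int j = 1; j <= b.length; j++) {
                double d = (a[i - 1] - b[j - 1]) * (a[i - 1] - b[j - 1]);
                cost[i][j] = d + Math.min(cost[i - 1][j - 1],
                              Math.min(cost[i - 1][j], cost[i][j - 1]));
            }
        }
        return Math.sqrt(cost[a.length][b.length]);
    }

    public static void main(String[] args) {
        double[] a = { 0, 0, 1, 2, 1, 0 };
        double[] b = { 0, 1, 2, 1, 0, 0 };  // same shape as a, shifted by one step
        System.out.printf("Euclidean = %.3f, DTW = %.3f%n", euclidean(a, b), dtw(a, b));
    }
}
```

The example pair shows why the distinction matters for time series: the shifted shape gives a non-zero Euclidean distance but a DTW distance of zero, since warping realigns the two series.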