Journal ArticleDOI

The great time series classification bake off: a review and experimental evaluation of recent algorithmic advances

01 May 2017, Data Mining and Knowledge Discovery (Springer US), Vol. 31, Iss. 3, pp. 606-660
TL;DR: Eighteen recently proposed algorithms were implemented in a common Java framework and compared against two standard benchmark classifiers (and each other) through 100 resampling experiments on each of 85 datasets; the results indicate that only nine of the algorithms are significantly more accurate than both benchmarks.
Abstract: In the last 5 years there have been a large number of new time series classification algorithms proposed in the literature. These algorithms have been evaluated on subsets of the 47 data sets in the University of California, Riverside time series classification archive. The archive has recently been expanded to 85 data sets, over half of which have been donated by researchers at the University of East Anglia. Aspects of previous evaluations have made comparisons between algorithms difficult. For example, several different programming languages have been used, experiments involved a single train/test split and some used normalised data whilst others did not. The relaunch of the archive provides a timely opportunity to thoroughly evaluate algorithms on a larger number of datasets. We have implemented 18 recently proposed algorithms in a common Java framework and compared them against two standard benchmark classifiers (and each other) by performing 100 resampling experiments on each of the 85 datasets. We use these results to test several hypotheses relating to whether the algorithms are significantly more accurate than the benchmarks and each other. Our results indicate that only nine of these algorithms are significantly more accurate than both benchmarks and that one classifier, the collective of transformation ensembles, is significantly more accurate than all of the others. All of our experiments and results are reproducible: we release all of our code, results and experimental details and we hope these experiments form the basis for more robust testing of new algorithms in the future.
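
The evaluation protocol described in the abstract is straightforward to emulate. Below is a minimal, hypothetical sketch (in Python with scikit-learn, not the paper's Java framework) of the resample-then-score loop for a single dataset; `X`, `y`, and `train_size` are assumed inputs, and the 1-nearest-neighbour classifier is only a stand-in for the paper's benchmark classifiers.

```python
import numpy as np
from sklearn.model_selection import StratifiedShuffleSplit
from sklearn.neighbors import KNeighborsClassifier

def resample_accuracies(X, y, train_size, n_resamples=100, seed=0):
    # One stratified train/test resample per experiment, keeping the
    # train size fixed across all resamples.
    splitter = StratifiedShuffleSplit(n_splits=n_resamples,
                                      train_size=train_size,
                                      random_state=seed)
    accs = []
    for train_idx, test_idx in splitter.split(X, y):
        clf = KNeighborsClassifier(n_neighbors=1)  # stand-in benchmark
        clf.fit(X[train_idx], y[train_idx])
        accs.append(clf.score(X[test_idx], y[test_idx]))
    return np.array(accs)  # one test accuracy per resample

```

Averaging such per-dataset accuracy vectors across all 85 datasets, and testing the differences between classifiers for significance, yields comparisons of the kind reported in the paper.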
Citations
Journal ArticleDOI
TL;DR: This article presents the most exhaustive study of DNNs for TSC to date, training 8730 deep learning models on 97 time series datasets, and provides an open source deep learning framework to the TSC community.
Abstract: Time Series Classification (TSC) is an important and challenging problem in data mining. With the increasing availability of time series data, hundreds of TSC algorithms have been proposed. Among these methods, only a few have considered Deep Neural Networks (DNNs) to perform this task. This is surprising, as deep learning has seen very successful applications in recent years. DNNs have indeed revolutionized the field of computer vision, especially with the advent of novel deeper architectures such as Residual and Convolutional Neural Networks. Apart from images, sequential data such as text and audio can also be processed with DNNs to reach state-of-the-art performance for document classification and speech recognition. In this article, we study the current state-of-the-art performance of deep learning algorithms for TSC by presenting an empirical study of the most recent DNN architectures for TSC. We give an overview of the most successful deep learning applications in various time series domains under a unified taxonomy of DNNs for TSC. We also provide an open source deep learning framework to the TSC community where we implemented each of the compared approaches and evaluated them on a univariate TSC benchmark (the UCR/UEA archive) and 12 multivariate time series datasets. By training 8730 deep learning models on 97 time series datasets, we propose the most exhaustive study of DNNs for TSC to date.

1,833 citations
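
As a concrete illustration of the kind of architecture this study benchmarks, here is a hypothetical Keras sketch of a fully convolutional network (FCN) for univariate TSC; the filter counts and kernel sizes (128/256/128 with kernels 8/5/3) follow the FCN configuration commonly used in this literature, but treat the sketch as illustrative rather than the authors' exact code.

```python
import tensorflow as tf
from tensorflow.keras import layers

def build_fcn(series_length, n_classes):
    # Stacked Conv1D blocks, then global average pooling over time,
    # so the network handles the temporal axis directly.
    inputs = layers.Input(shape=(series_length, 1))
    x = inputs
    for filters, kernel in [(128, 8), (256, 5), (128, 3)]:
        x = layers.Conv1D(filters, kernel, padding="same")(x)
        x = layers.BatchNormalization()(x)
        x = layers.Activation("relu")(x)
    x = layers.GlobalAveragePooling1D()(x)  # collapse the time axis
    outputs = layers.Dense(n_classes, activation="softmax")(x)
    model = tf.keras.Model(inputs, outputs)
    model.compile(optimizer="adam",
                  loss="sparse_categorical_crossentropy",
                  metrics=["accuracy"])
    return model
```

A residual network of the kind mentioned in the citing contexts below differs mainly in adding shortcut connections around such convolutional blocks.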


Cites background or methods from "The great time series classificatio..."

  • ...– This type of method is mainly proposed for tasks other than classification, or as part of a larger classification scheme (Bagnall et al., 2017);...


  • ...Several variants of CNNs have been proposed and validated on a subset of the UCR/UEA archive (Chen et al., 2015b; Bagnall et al., 2017), such as Residual Networks (ResNets) (Wang et al., 2017b; Geng and Luo, 2018), which add linear shortcut connections to the convolutional layers, potentially…


  • ...Classifiers of this type have been referred to as Model-based classifiers in the TSC community (Bagnall et al., 2017)....


  • ...…cannot cover an empirical study of all approaches validated in all TSC domains, we decided to only include approaches that were validated on the whole (or a subset of) the univariate time series UCR/UEA archive (Chen et al., 2015b; Bagnall et al., 2017) and/or on the MTS archive (Baydogan, 2015)....


  • ...To achieve its high accuracy, HIVE-COTE becomes hugely computationally intensive and impractical to run on a real big data mining problem (Bagnall et al., 2017)....


Journal ArticleDOI
TL;DR: This work takes an important step towards finding the AlexNet of TSC by presenting InceptionTime, an ensemble of deep Convolutional Neural Network models inspired by the Inception-v4 architecture, which matches HIVE-COTE's accuracy while being far more scalable.
Abstract: This paper brings deep learning to the forefront of research into time series classification (TSC). TSC is the area of machine learning tasked with the categorization (or labelling) of time series. The last few decades of work in this area have led to significant progress in the accuracy of classifiers, with the state of the art now represented by the HIVE-COTE algorithm. While extremely accurate, HIVE-COTE cannot be applied to many real-world datasets because of its high training time complexity of $O(N^2 \cdot T^4)$ for a dataset with $N$ time series of length $T$. For example, it takes HIVE-COTE more than 8 days to learn from a small dataset with $N = 1500$ time series of short length $T = 46$. Meanwhile, deep learning has received enormous attention because of its high accuracy and scalability. Recent approaches to deep learning for TSC have been scalable, but less accurate than HIVE-COTE. We introduce InceptionTime, an ensemble of deep Convolutional Neural Network models inspired by the Inception-v4 architecture. Our experiments show that InceptionTime is on par with HIVE-COTE in terms of accuracy while being much more scalable: not only can it learn from 1500 time series in one hour but it can also learn from 8M time series in 13 h, a quantity of data that is fully out of reach of HIVE-COTE.

377 citations
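
To make the architecture concrete, here is a hypothetical Keras sketch of a single Inception-style module for time series: parallel convolutions with different kernel lengths over a shared bottleneck, plus a max-pool branch, concatenated. Sizes are illustrative, not the authors' exact configuration.

```python
import tensorflow as tf
from tensorflow.keras import layers

def inception_module(x, n_filters=32, kernel_sizes=(10, 20, 40)):
    # Bottleneck 1x1 convolution limits the channel count before the
    # more expensive long-kernel convolutions.
    bottleneck = layers.Conv1D(n_filters, 1, padding="same",
                               use_bias=False)(x)
    branches = [layers.Conv1D(n_filters, k, padding="same",
                              use_bias=False)(bottleneck)
                for k in kernel_sizes]
    # Parallel max-pool branch, projected to the same channel count.
    pool = layers.MaxPooling1D(pool_size=3, strides=1, padding="same")(x)
    branches.append(layers.Conv1D(n_filters, 1, padding="same",
                                  use_bias=False)(pool))
    x = layers.Concatenate(axis=-1)(branches)
    x = layers.BatchNormalization()(x)
    return layers.Activation("relu")(x)
```

InceptionTime itself stacks several such modules (with residual connections) and ensembles multiple networks; this sketch shows only the core building block.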


Cites background or methods or result from "The great time series classificatio..."

  • ...This is considered common best practice before classifying time series data (Bagnall et al., 2017)....


  • ...A comprehensive detailed review of recent methods for TSC can be found in Bagnall et al. (2017)....


  • ...Similarly to Ismail Fawaz et al. (2019b), when comparing with the state-of-the-art results published in Bagnall et al. (2017) we used the deep learning model’s median test accuracy over the different runs....


  • ...These problems, known as time series classification (TSC), differ significantly from traditional supervised learning for structured data, in that the algorithms should be able to handle and harness the temporal information present in the signal (Bagnall et al., 2017)....


  • ...5 illustrates the critical difference diagram with InceptionTime added to the mix of the current state-of-the-art classifiers for time series data, whose results were taken from Bagnall et al. (2017)....


Journal ArticleDOI
TL;DR: This paper shows that simple linear classifiers using random convolutional kernels achieve state-of-the-art accuracy with a fraction of the computational expense of existing methods for time series classification.
Abstract: Most methods for time series classification that attain state-of-the-art accuracy have high computational complexity, requiring significant training time even for smaller datasets, and are intractable for larger datasets. Additionally, many existing methods focus on a single type of feature such as shape or frequency. Building on the recent success of convolutional neural networks for time series classification, we show that simple linear classifiers using random convolutional kernels achieve state-of-the-art accuracy with a fraction of the computational expense of existing methods. Using this method, it is possible to train and test a classifier on all 85 ‘bake off’ datasets in the UCR archive in under 2 h, and it is possible to train a classifier on a large dataset of more than one million time series in approximately 1 h.

341 citations
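
The core idea is simple enough to sketch. Below is a hypothetical, heavily simplified Python illustration of the random-kernel transform (omitting dilation, padding, and the authors' other kernel parameters): convolve each series with many fixed random kernels, summarise each convolution with its maximum and the proportion of positive values, then fit a linear classifier on the resulting features.

```python
import numpy as np
from sklearn.linear_model import RidgeClassifierCV

def make_kernels(n_kernels=1000, seed=0):
    rng = np.random.default_rng(seed)
    # Each kernel: random weights of random length, plus a random bias.
    return [(rng.normal(size=rng.choice([7, 9, 11])), rng.uniform(-1, 1))
            for _ in range(n_kernels)]

def transform(X, kernels):
    # Two features per kernel: max response and proportion of
    # positive values ("ppv") of the convolution output.
    feats = np.empty((len(X), 2 * len(kernels)))
    for j, (w, b) in enumerate(kernels):
        for i, x in enumerate(X):
            conv = np.convolve(x, w, mode="valid") + b
            feats[i, 2 * j] = conv.max()
            feats[i, 2 * j + 1] = (conv > 0).mean()
    return feats

# Usage (X_train/X_test are assumed 2-D arrays of equal-length series);
# the same kernels must be reused at test time:
# kernels = make_kernels()
# clf = RidgeClassifierCV().fit(transform(X_train, kernels), y_train)
# accuracy = clf.score(transform(X_test, kernels), y_test)
```

Because the kernels are never trained, all of the learning is in the linear classifier, which is what makes the method so fast.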


Cites background or methods from "The great time series classificatio..."

  • ...To this end, Bagnall et al. (2017) used resamples of the datasets to assess performance....


  • ...A seminal paper, Bagnall et al. (2017), conducted thorough comparative benchmarking of a large number of methods for time series classification on the 85 datasets in the archive as of 2017....


  • ...Different methods for time series classification represent different approaches for extracting useful features from time series (Bagnall et al. 2017)....


  • ...BOSS is one of several dictionary-based methods which use a representation based on the frequency of occurrence of patterns in time series (Bagnall et al. 2017)....


  • ...There are other, more scalable, shapelet methods, but these are less accurate (Bagnall et al. 2017)....


Journal ArticleDOI
TL;DR: The UCR time series archive, as discussed by the authors, has become an important resource in the time series data mining community, with at least one thousand published papers making use of at least one data set from the archive.
Abstract: The UCR time series archive, introduced in 2002, has become an important resource in the time series data mining community, with at least one thousand published papers making use of at least one data set from the archive. The original incarnation of the archive had sixteen data sets, but since that time it has gone through periodic expansions. The last expansion took place in the summer of 2015, when the archive grew from 45 to 85 data sets. This paper introduces and will focus on the new data expansion from 85 to 128 data sets. Beyond expanding this valuable resource, this paper offers pragmatic advice to anyone who may wish to evaluate a new algorithm on the archive. Finally, this paper makes a novel and yet actionable claim: of the hundreds of papers that show an improvement over the standard baseline (1-nearest neighbor classification), a fraction might be mis-attributing the reasons for their improvement. Moreover, the improvements claimed by these papers might have been achievable with a much simpler modification, requiring just a few lines of code.

327 citations
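
The standard baseline the paper refers to, 1-nearest-neighbour classification, really is only a few lines. A minimal sketch with Euclidean distance (the archive also reports 1-NN with DTW) over assumed 2-D arrays of equal-length series:

```python
import numpy as np

def one_nn_predict(X_train, y_train, X_test):
    preds = []
    for x in X_test:
        # Euclidean distance from the query series to every training series.
        dists = np.linalg.norm(X_train - x, axis=1)
        preds.append(y_train[np.argmin(dists)])
    return np.array(preds)
```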

Journal ArticleDOI
TL;DR: The experimental results show that TempCNNs are more accurate than the current state of the art for SITS classification, and some general guidelines on the network architecture, common regularization mechanisms, and hyper-parameter values such as batch size are provided.
Abstract: The latest remote sensing sensors are capable of acquiring high spatial and spectral resolution Satellite Image Time Series (SITS) of the world. These image series are a key component of classification systems that aim at obtaining up-to-date and accurate land cover maps of the Earth’s surfaces. More specifically, current SITS combine high temporal, spectral and spatial resolutions, which makes it possible to closely monitor vegetation dynamics. Although traditional classification algorithms, such as Random Forest (RF), have been successfully applied to create land cover maps from SITS, these algorithms do not make the most of the temporal domain. This paper proposes a comprehensive study of Temporal Convolutional Neural Networks (TempCNNs), a deep learning approach which applies convolutions in the temporal dimension in order to automatically learn temporal (and spectral) features. The goal of this paper is to quantitatively and qualitatively evaluate the contribution of TempCNNs for SITS classification, as compared to RF and Recurrent Neural Networks (RNNs), a standard deep learning approach that is particularly suited to temporal data. We carry out experiments on a Formosat-2 scene with 46 images and one million labelled time series. The experimental results show that TempCNNs are more accurate than the current state of the art for SITS classification. We provide some general guidelines on the network architecture, common regularization mechanisms, and hyper-parameter values such as batch size; we also draw out some differences with standard results in computer vision (e.g., about pooling layers). Finally, we assess the visual quality of the land cover maps produced by TempCNNs.

310 citations


Cites background or methods from "The great time series classificatio..."

  • ...The state-of-the-art approaches are now led by more complex algorithms [62], which we describe hereafter....


  • ...In machine learning, the input data are generally z-normalized by subtracting the mean and dividing by the standard deviation for each time series [62].... (A minimal sketch of this step follows the quoted excerpts.)

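A minimal sketch of the per-series z-normalization step quoted above; `eps` is a hypothetical guard against constant series.

```python
import numpy as np

def z_normalize(x, eps=1e-8):
    # Zero mean, unit variance, computed per individual time series.
    x = np.asarray(x, dtype=float)
    return (x - x.mean()) / (x.std() + eps)
```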

References
More filters
Journal ArticleDOI
TL;DR: This paper provides an introduction to the WEKA workbench, reviews the history of the project, and, in light of the recent 3.6 stable release, briefly discusses what has been added since the last stable version (Weka 3.4) released in 2003.
Abstract: More than twelve years have elapsed since the first public release of WEKA. In that time, the software has been rewritten entirely from scratch, evolved substantially and now accompanies a text on data mining [35]. These days, WEKA enjoys widespread acceptance in both academia and business, has an active community, and has been downloaded more than 1.4 million times since being placed on SourceForge in April 2000. This paper provides an introduction to the WEKA workbench, reviews the history of the project, and, in light of the recent 3.6 stable release, briefly discusses what has been added since the last stable version (Weka 3.4) released in 2003.

19,603 citations

Journal Article
TL;DR: A set of simple, yet safe and robust non-parametric tests for statistical comparisons of classifiers is recommended: the Wilcoxon signed ranks test for comparison of two classifiers and the Friedman test with the corresponding post-hoc tests for comparisons of more classifiers over multiple data sets.
Abstract: While methods for comparing two learning algorithms on a single data set have been scrutinized for quite some time already, the issue of statistical tests for comparisons of more algorithms on multiple data sets, which is even more essential to typical machine learning studies, has been all but ignored. This article reviews the current practice and then theoretically and empirically examines several suitable tests. Based on that, we recommend a set of simple, yet safe and robust non-parametric tests for statistical comparisons of classifiers: the Wilcoxon signed ranks test for comparison of two classifiers and the Friedman test with the corresponding post-hoc tests for comparison of more classifiers over multiple data sets. Results of the latter can also be neatly presented with the newly introduced CD (critical difference) diagrams.

10,306 citations
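
Both recommended tests are available in SciPy, so the procedure is short to sketch; the accuracy vectors below are hypothetical placeholders, one entry per dataset for each classifier.

```python
import numpy as np
from scipy.stats import friedmanchisquare, wilcoxon

# Hypothetical per-dataset accuracies of three classifiers on six datasets.
acc_a = np.array([0.81, 0.75, 0.90, 0.62, 0.88, 0.70])
acc_b = np.array([0.79, 0.74, 0.91, 0.60, 0.85, 0.68])
acc_c = np.array([0.70, 0.71, 0.84, 0.55, 0.80, 0.66])

# Friedman test: do the classifiers' ranks differ across datasets?
stat, p = friedmanchisquare(acc_a, acc_b, acc_c)
if p < 0.05:
    # Follow up with post-hoc pairwise comparisons, e.g. the Wilcoxon
    # signed-ranks test on one pair of classifiers.
    stat_ab, p_ab = wilcoxon(acc_a, acc_b)
```

The critical difference (CD) diagrams the paper introduces then visualise the post-hoc results by plotting average ranks and joining classifiers whose ranks are not significantly different.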

Book ChapterDOI
TL;DR: In this article, a new representation learning approach for domain adaptation is proposed, in which data at training and test time come from similar but different distributions; effective transfer is achieved by basing predictions on features that cannot discriminate between the training (source) and test (target) domains while remaining discriminative for the main learning task on the source domain.
Abstract: We introduce a new representation learning approach for domain adaptation, in which data at training and test time come from similar but different distributions. Our approach is directly inspired by the theory on domain adaptation suggesting that, for effective domain transfer to be achieved, predictions must be made based on features that cannot discriminate between the training (source) and test (target) domains. The approach implements this idea in the context of neural network architectures that are trained on labeled data from the source domain and unlabeled data from the target domain (no labeled target-domain data is necessary). As the training progresses, the approach promotes the emergence of features that are (i) discriminative for the main learning task on the source domain and (ii) indiscriminate with respect to the shift between the domains. We show that this adaptation behaviour can be achieved in almost any feed-forward model by augmenting it with a few standard layers and a new gradient reversal layer. The resulting augmented architecture can be trained using standard backpropagation and stochastic gradient descent, and can thus be implemented with little effort using any of the deep learning packages. We demonstrate the success of our approach for two distinct classification problems (document sentiment analysis and image classification), where state-of-the-art domain adaptation performance on standard benchmarks is achieved. We also validate the approach on a descriptor learning task in the context of a person re-identification application.

4,862 citations
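
The gradient reversal layer the abstract mentions is a one-screen idea; here is a hypothetical PyTorch sketch (a translation of the idea, not the authors' code):

```python
import torch

class GradReverse(torch.autograd.Function):
    """Identity in the forward pass; negated, scaled gradient backward."""

    @staticmethod
    def forward(ctx, x, lambd):
        ctx.lambd = lambd
        return x.view_as(x)  # identity

    @staticmethod
    def backward(ctx, grad_output):
        # Reverse the gradient flowing into the feature extractor, so it
        # learns features the domain classifier cannot separate.
        return -ctx.lambd * grad_output, None

def grad_reverse(x, lambd=1.0):
    return GradReverse.apply(x, lambd)
```

In use, the domain classifier's input passes through `grad_reverse`, while the label classifier's path is unchanged.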

Journal ArticleDOI
TL;DR: This work demonstrates the utility of a newly formed symbolic representation of time series that allows dimensionality/numerosity reduction and also allows distance measures to be defined on the symbolic approach that lower bound corresponding distance measures defined on the original series.
Abstract: Many high level representations of time series have been proposed for data mining, including Fourier transforms, wavelets, eigenwaves, piecewise polynomial models, etc. Many researchers have also considered symbolic representations of time series, noting that such representations would potentially allow researchers to avail themselves of the wealth of data structures and algorithms from the text processing and bioinformatics communities. While many symbolic representations of time series have been introduced over the past decades, they all suffer from two fatal flaws. First, the dimensionality of the symbolic representation is the same as the original data, and virtually all data mining algorithms scale poorly with dimensionality. Second, although distance measures can be defined on the symbolic approaches, these distance measures have little correlation with distance measures defined on the original time series. In this work we formulate a new symbolic representation of time series. Our representation is unique in that it allows dimensionality/numerosity reduction, and it also allows distance measures to be defined on the symbolic approach that lower bound corresponding distance measures defined on the original series. As we shall demonstrate, this latter feature is particularly exciting because it allows one to run certain data mining algorithms on the efficiently manipulated symbolic representation, while producing identical results to the algorithms that operate on the original data. In particular, we will demonstrate the utility of our representation on various data mining tasks of clustering, classification, query by content, anomaly detection, motif discovery, and visualization.

1,452 citations
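
A compressed, hypothetical sketch of the representation described above (not the authors' reference implementation): z-normalise, reduce dimensionality with piecewise aggregate approximation (PAA), then map segment means to symbols using breakpoints that partition the standard normal into equiprobable regions.

```python
import numpy as np
from scipy.stats import norm

def sax(x, n_segments=8, alphabet_size=4):
    x = (x - x.mean()) / (x.std() + 1e-8)         # z-normalise
    segments = np.array_split(x, n_segments)      # PAA: split the series...
    paa = np.array([s.mean() for s in segments])  # ...and average each piece
    # Breakpoints chosen so each symbol is equally likely under N(0, 1),
    # e.g. roughly [-0.67, 0.0, 0.67] for a 4-letter alphabet.
    breakpoints = norm.ppf(np.linspace(0, 1, alphabet_size + 1)[1:-1])
    return np.searchsorted(breakpoints, paa)      # one symbol id per segment
```

The lower-bounding distance the abstract highlights is then defined between two such symbol strings via a breakpoint lookup table.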


"The great time series classificatio..." refers background or methods in this paper

  • ...Instead of a full enumerative search at each node, the fast shapelets algorithm discretises and approximates the shapelets using a symbolic aggregate approximation (SAX) (Lin et al. 2007)....


  • ...BOP is a dictionary classifier built on SAX (Lin et al. 2007)....


Journal ArticleDOI
01 Aug 2008
TL;DR: An extensive set of time series experiments is conducted, re-implementing 8 different representation methods and 9 similarity measures (and their variants) and testing their effectiveness on 38 time series data sets from a wide variety of application domains, providing a unified validation of some of the existing achievements.
Abstract: The last decade has witnessed a tremendous growth of interest in applications that deal with querying and mining of time series data. Numerous representation methods for dimensionality reduction and similarity measures geared towards time series have been introduced. Each individual work introducing a particular method has made specific claims and, aside from the occasional theoretical justifications, provided quantitative experimental observations. However, for the most part, the comparative aspects of these experiments were too narrowly focused on demonstrating the benefits of the proposed methods over some of the previously introduced ones. In order to provide a comprehensive validation, we conducted an extensive set of time series experiments re-implementing 8 different representation methods and 9 similarity measures and their variants, and testing their effectiveness on 38 time series data sets from a wide variety of application domains. In this paper, we give an overview of these different techniques and present our comparative experimental findings regarding their effectiveness. Our experiments have provided both a unified validation of some of the existing achievements, and in some cases, suggested that certain claims in the literature may be unduly optimistic.

1,387 citations
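
Since DTW is the reference point for the comparisons quoted below, a minimal dynamic-programming sketch of the unconstrained DTW distance between two 1-D series may help; adding a warping-window constraint (set by cross-validation, as one citing context notes) restricts `j` to a band around `i`.

```python
import numpy as np

def dtw(a, b):
    # D[i, j] = cost of the best alignment of a[:i] with b[:j].
    n, m = len(a), len(b)
    D = np.full((n + 1, m + 1), np.inf)
    D[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = (a[i - 1] - b[j - 1]) ** 2
            D[i, j] = cost + min(D[i - 1, j],      # insertion
                                 D[i, j - 1],      # deletion
                                 D[i - 1, j - 1])  # match
    return np.sqrt(D[n, m])
```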


"The great time series classificatio..." refers background or methods in this paper

  • ...However, Ding et al. (2008) and Wang et al. (2013) evaluated eight different distance measures on 38 data sets and found none significantly better than DTW....


  • ...For consistency with the published algorithm, the window size for DTW is set using cross-validation of DTW distance (rather than CID)....
