Experimental comparison of representation methods and distance measures for time series data

doi:10.1007/S10618-012-0250-5

Open AccessJournal ArticleDOI

Experimental comparison of representation methods and distance measures for time series data

Xiaoyue Wang, +5 more

- 01 Mar 2013 -

Data Mining and Knowledge Discovery

- Vol. 26, Iss: 2, pp 275-309

Chats0

TLDR

An extensive experimental study re-implementing eight different time series representations and nine similarity measures and their variants and testing their effectiveness on 38 time series data sets from a wide variety of application domains gives an overview of these different techniques and presents comparative experimental findings regarding their effectiveness.

Abstract:

The previous decade has brought a remarkable increase of the interest in applications that deal with querying and mining of time series data. Many of the research efforts in this context have focused on introducing new representation methods for dimensionality reduction or novel similarity measures for the underlying data. In the vast majority of cases, each individual work introducing a particular method has made specific claims and, aside from the occasional theoretical justifications, provided quantitative experimental observations. However, for the most part, the comparative aspects of these experiments were too narrowly focused on demonstrating the benefits of the proposed methods over some of the previously introduced ones. In order to provide a comprehensive validation, we conducted an extensive experimental study re-implementing eight different time series representations and nine similarity measures and their variants, and testing their effectiveness on 38 time series data sets from a wide variety of application domains. In this article, we give an overview of these different techniques and present our comparative experimental findings regarding their effectiveness. In addition to providing a unified validation of some of the existing achievements, our experiments also indicate that, in some cases, certain claims in the literature may be unduly optimistic.

Experimental comparison of representation methods and distance measures for time series data

Citations

Time-series clustering - A decade review

The great time series classification bake off: a review and experimental evaluation of recent algorithmic advances

k-Shape: Efficient and Accurate Clustering of Time Series

Fast Shapelets: A Scalable Algorithm for Discovering Time Series Shapelets.

Using dynamic time warping distances as features for improved time series classification

References

Data Mining: Concepts and Techniques

Pattern Classification and Scene Analysis.

Pattern classification and scene analysis

A study of cross-validation and bootstrap for accuracy estimation and model selection

Introduction to Data Mining

Related Papers (5)

Dynamic programming algorithm optimization for spoken word recognition

Exact indexing of dynamic time warping

Using dynamic time warping to find patterns in time series

Fast subsequence matching in time-series databases

Experiencing SAX: a novel symbolic representation of time series