Showing papers on "Dynamic time warping" published in 2008

Open Access

Dynamic Time Warping

Meinard Müller

01 Jan 2008

544 citations

Journal Article

Chroma Binary Similarity and Local Alignment Applied to Cover Song Identification

Joan Serrà¹, Emilia Gómez¹, Perfecto Herrera¹, Xavier Serra¹•Institutions (1)

Pompeu Fabra University¹

01 Aug 2008-IEEE Transactions on Audio, Speech, and Language Processing

TL;DR: A new technique for audio signal comparison based on tonal subsequence alignment and its application to detect cover versions (i.e., different performances of the same underlying musical piece) is presented.

Abstract: We present a new technique for audio signal comparison based on tonal subsequence alignment and its application to detect cover versions (i.e., different performances of the same underlying musical piece). Cover song identification is a task whose popularity has increased in the music information retrieval (MIR) community along in the past, as it provides a direct and objective way to evaluate music similarity algorithms. This paper first presents a series of experiments carried out with two state-of-the-art methods for cover song identification. We have studied several components of these (such as chroma resolution and similarity, transposition, beat tracking or dynamic time warping constraints), in order to discover which characteristics would be desirable for a competitive cover song identifier. After analyzing many cross-validated results, the importance of these characteristics is discussed, and the best performing ones are finally applied to the newly proposed method. Multiple evaluations of this one confirm a large increase in identification accuracy when comparing it with alternative state-of-the-art approaches.
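
The idea of aligning subsequences over a binary frame-similarity matrix can be illustrated with a generic Smith-Waterman-style local alignment. This is only a sketch under stated assumptions, not the measure proposed in the paper: the function names, the integer "pitch class" labels standing in for chroma frames, and the match, mismatch and gap scores are all hypothetical choices made for illustration.

```python
def binary_similarity(frames_a, frames_b):
    """Return a 0/1 matrix: 1 where frame i of A equals frame j of B.

    Frames are plain labels here (an assumption); real chroma frames
    would need a proper similarity test instead of equality.
    """
    return [[1 if a == b else 0 for b in frames_b] for a in frames_a]


def local_alignment_score(sim, match=1.0, mismatch=-1.0, gap=-0.5):
    """Best local (subsequence) alignment score over a binary matrix.

    Generic Smith-Waterman recursion; the score values are assumptions.
    """
    rows = len(sim)
    cols = len(sim[0]) if rows else 0
    # h[i][j] = best score of an alignment ending at frames (i-1, j-1)
    h = [[0.0] * (cols + 1) for _ in range(rows + 1)]
    best = 0.0
    for i in range(1, rows + 1):
        for j in range(1, cols + 1):
            step = match if sim[i - 1][j - 1] else mismatch
            h[i][j] = max(
                0.0,                       # start a new alignment here
                h[i - 1][j - 1] + step,    # extend diagonally
                h[i - 1][j] + gap,         # skip a frame in A
                h[i][j - 1] + gap,         # skip a frame in B
            )
            best = max(best, h[i][j])
    return best


song_a = [9, 0, 4, 7, 2]   # hypothetical frame labels
song_b = [0, 4, 7]         # shares the run 0, 4, 7 with song_a
score = local_alignment_score(binary_similarity(song_a, song_b))
```

Because the score is reset to zero whenever it would turn negative, only the best-matching subsequence contributes, which is what distinguishes local alignment from a global method such as plain DTW.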

274 citations

Proceedings Article

Aligned Cluster Analysis for temporal segmentation of human motion

Feng Zhou¹, Fernando De la Torre¹, Jessica K. Hodgins¹•Institutions (1)

Carnegie Mellon University¹

01 Sep 2008

TL;DR: In this paper, the authors propose aligned cluster analysis (ACA), a robust method to temporally segment streams of motion capture data into actions, which extends standard kernel k-means clustering in two ways: the cluster means contain a variable number of features, and a dynamic time warping (DTW) kernel is used to achieve temporal invariance.

Abstract: Temporal segmentation of human motion into actions is a crucial step for understanding and building computational models of human motion. Several issues contribute to the challenge of this task. These include the large variability in the temporal scale and periodicity of human actions, as well as the exponential nature of all possible movement combinations. We formulate the temporal segmentation problem as an extension of standard clustering algorithms. In particular, this paper proposes aligned cluster analysis (ACA), a robust method to temporally segment streams of motion capture data into actions. ACA extends standard kernel k-means clustering in two ways: (1) the cluster means contain a variable number of features, and (2) a dynamic time warping (DTW) kernel is used to achieve temporal invariance. Experimental results, reported on synthetic data and the Carnegie Mellon Motion Capture database, demonstrate its effectiveness.

197 citations

Journal Article

Sign Language Recognition by Combining Statistical DTW and Independent Classification

J.F. Lichtenauer¹, Emile A. Hendriks¹, Marcel J. T. Reinders¹•Institutions (1)

Delft University of Technology¹

01 Nov 2008-IEEE Transactions on Pattern Analysis and Machine Intelligence

TL;DR: It is found that combining likelihoods of multiple models in a second classification stage degrades performance of the proposed classifiers, while improving performance with HMM and SDTW, and that combining DFFM mappings of multiple SDTW models with SDTW likelihoods can provide significant improvement over SDTW.

Abstract: To recognize speech, handwriting, or sign language, many hybrid approaches have been proposed that combine dynamic time warping (DTW) or hidden Markov models (HMMs) with discriminative classifiers. However, all methods rely directly on the likelihood models of DTW/HMM. We hypothesize that time warping and classification should be separated because of conflicting likelihood modeling demands. To overcome these restrictions, we propose using statistical DTW (SDTW) only for time warping, while classifying the warped features with a different method. Two novel statistical classifiers are proposed - combined discriminative feature detectors (CDFDs) and quadratic classification on DF Fisher mapping (Q-DFFM) - both using a selection of discriminative features (DFs), and are shown to outperform HMM and SDTW. However, we have found that combining likelihoods of multiple models in a second classification stage degrades performance of the proposed classifiers, while improving performance with HMM and SDTW. A proof-of-concept experiment, combining DFFM mappings of multiple SDTW models with SDTW likelihoods, shows that, also for model-combining, hybrid classification can provide significant improvement over SDTW. Although recognition is mainly based on 3D hand motion features, these results can be expected to generalize to recognition with more detailed measurements such as hand/body pose and facial expression.

178 citations

Book Chapter

Improving Shape Retrieval by Learning Graph Transduction

Xingwei Yang¹, Xiang Bai²,³, Longin Jan Latecki¹, Zhuowen Tu³•Institutions (3)

Temple University¹, Huazhong University of Science and Technology², University of California³

12 Oct 2008

TL;DR: This paper considers the existing shapes as a group and studies their similarity measures to the query shape in a graph structure; it learns a better metric through graph transduction by propagating the model through existing shapes, in a way similar to computing geodesics in shape manifold.

Abstract: Shape retrieval/matching is a very important topic in computer vision. The recent progress in this domain has been mostly driven by designing smart features for providing better similarity measure between pairs of shapes. In this paper, we provide a new perspective to this problem by considering the existing shapes as a group, and study their similarity measures to the query shape in a graph structure. Our method is general and can be built on top of any existing shape matching algorithms. It learns a better metric through graph transduction by propagating the model through existing shapes, in a way similar to computing geodesics in shape manifold. However, the proposed method does not require learning the shape manifold explicitly and it does not require knowing any class labels of existing shapes. The presented experimental results demonstrate that the proposed approach yields significant improvements over the state-of-art shape matching algorithms. We obtained a retrieval rate of 91% on the MPEG-7 data set, which is the highest ever reported in the literature.

151 citations

Journal Article

Pairwise curve synchronization for functional data

Rong Tang¹, Hans-Georg Müller²•Institutions (2)

Center for Devices and Radiological Health¹, University of California, Davis²

01 Dec 2008-Biometrika

TL;DR: In this article, a curve-synchronization method is presented that uses every trajectory in the sample as a reference to obtain pairwise warping functions in the first step. These initial pairwise warping functions are then used to create improved estimators of the underlying individual warping functions in the second step.

Abstract: Data collected by scientists are increasingly in the form of trajectories or curves. Often these can be viewed as realizations of a composite process driven by both amplitude and time variation. We consider the situation in which functional variation is dominated by time variation, and develop a curve-synchronization method that uses every trajectory in the sample as a reference to obtain pairwise warping functions in the first step. These initial pairwise warping functions are then used to create improved estimators of the underlying individual warping functions in the second step. A truncated averaging process is used to obtain robust estimation of individual warping functions. The method compares well with other available time-synchronization approaches and is illustrated with Berkeley growth data and gene expression data for multiple sclerosis.

149 citations

Journal Article

Combining Registration and Fitting for Functional Models

Alois Kneip, James O. Ramsay

01 Sep 2008-Journal of the American Statistical Association

TL;DR: In this article, the authors define a new type of registration process, in which the warping functions optimize the fit of a principal components decomposition to the aligned curves; the principal components are effectively the features that this process aligns.

Abstract: A registration method can be defined as a process of aligning features of a sample of curves by monotone transformations of their domain. The aligned curves exhibit only amplitude variation, and the domain transformations, called warping functions, capture the phase variation in the original curves. In this article we precisely define a new type of registration process, in which the warping functions optimize the fit of a principal components decomposition to the aligned curves. The principal components are effectively the features that this process aligns. We discuss the relationship of registration to closure of a function space under convex operations, and define consistency for registration methods. We define an explicit decomposition of functional variation into amplitude and phase partitions, and develop an algorithm for combining registration with principal components analysis, and apply it to simulated and real data.

144 citations

Proceedings Article

Support vector machines and dynamic time warping for time series

Steinn Gudmundsson¹, Thomas Philip Runarsson¹, Sven Sigurdsson¹•Institutions (1)

University of Iceland¹

01 Jun 2008

TL;DR: The feasibility of the similarity based approach for DTW is investigated by applying the method to a large set of time-series classification problems.

Abstract: Effective use of support vector machines (SVMs) in classification necessitates the appropriate choice of a kernel. Designing problem specific kernels involves the definition of a similarity measure, with the condition that kernels are positive semi-definite (PSD). An alternative approach which places no such restrictions on the similarity measure is to construct a set of inputs and let each example be represented by its similarity to all the examples in this set and then apply a conventional SVM to this transformed data. Dynamic time warping (DTW) is a well established distance measure for time series but has been of limited use in SVMs since it is not obvious how it can be used to derive a PSD kernel. The feasibility of the similarity based approach for DTW is investigated by applying the method to a large set of time-series classification problems.
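
The similarity-based representation described in this abstract can be sketched as follows: each time series is replaced by the vector of its DTW distances to every series in a chosen reference set, and those vectors become the input to a conventional SVM. This is a minimal illustration, not the authors' implementation; the function names, the absolute-difference local cost and the toy data are assumptions, and the SVM training step itself is left out.

```python
def dtw_distance(a, b):
    """Accumulated DTW cost between two 1-D sequences.

    Uses absolute difference as the local cost (an assumption) and
    keeps only two rows of the cost table to save memory.
    """
    inf = float("inf")
    prev = [0.0] + [inf] * len(b)
    for x in a:
        curr = [inf]
        for j, y in enumerate(b, start=1):
            # cheapest way to reach cell (x, y): up, left or diagonal
            curr.append(abs(x - y) + min(prev[j], curr[j - 1], prev[j - 1]))
        prev = curr
    return prev[-1]


def similarity_features(series, reference_set):
    """Represent each series by its DTW distance to every reference series."""
    return [[dtw_distance(s, r) for r in reference_set] for s in series]


# Toy data (hypothetical): the training set doubles as the reference set.
train = [[0, 0, 1], [0, 1, 1], [5, 5, 6]]
features = similarity_features(train, train)
# Each row of `features` is a fixed-length vector that a standard SVM
# with an ordinary kernel could be trained on.
```

The point of the transformation is that the DTW values are used as plain features, so no positive semi-definite DTW kernel has to be derived.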

106 citations

Journal Article

Automatic Identification of Defect Patterns in Semiconductor Wafer Maps Using Spatial Correlogram and Dynamic Time Warping

Young-Seon Jeong¹, Seong-Jun Kim, Myong K. Jeong¹•Institutions (1)

Rutgers University¹

05 Nov 2008-IEEE Transactions on Semiconductor Manufacturing

TL;DR: A new methodology is proposed in which spatial correlogram is used for the detection of the presence of spatial autocorrelations and for the classification of defect patterns on the wafer map; the method is shown to be robust to random noise and to perform robustly regardless of defect location and size.

Abstract: A wafer map is a graphical illustration of the locations of defective chips on a wafer. Defective chips are likely to exhibit a spatial dependence across the wafer map, which contains useful information on the process of integrated circuit (IC) fabrication. An analysis of wafer map data helps to better understand ongoing process problems. This paper proposes a new methodology in which spatial correlogram is used for the detection of the presence of spatial autocorrelations and for the classification of defect patterns on the wafer map. After the detection of spatial autocorrelation based on our proposed spatial randomness test using spatial correlogram, the dynamic time warping algorithm which provides nonlinear alignments between two sequences to find optimal warping path is adopted for the automatic classification of spatial patterns based on spatial correlogram. We also develop generalized join-count (JC)-based statistics and then propose a procedure to determine the optimal weights of JC-based statistics. The proposed method is illustrated using real-life examples and simulated data sets. The experimental results show that our method is robust to random noise and has a robust performance regardless of defect location and size.

105 citations

Book Chapter

Dynamic time warping

Rubita Sudirman, Khairul Nadiah Khalid

01 Jan 2008

TL;DR: DTW is considered an effective method in speech pattern recognition; however, its drawback is that it requires a long processing time and large storage capacity, especially for real-time recognition.

Abstract: Template matching is an alternative to perform speech recognition. However, the template matching encountered problems due to speaking rate variability, in which there exist timing differences between the two utterances. Speech has a constantly changing signal, thus it is almost impossible to get the same signal for two same utterances. The problem of time differences can be solved through DTW algorithm: warping the template with the test utterance based on their similarities. So, DTW algorithm actually is a procedure, which combines both warping and distance measurement. DTW is considered as one effective method in speech pattern recognition, however the bad side of this method is that it requires a long processing time plus large storage capacity, especially for real time recognitions.
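
The combination of warping and distance measurement mentioned in this abstract can be made concrete with a textbook DTW sketch. It is an illustration only: the function name, the scalar "feature" values and the absolute-difference local cost are assumptions, whereas a speech system would compare feature vectors frame by frame.

```python
def dtw(template, test):
    """Align two 1-D sequences; return (total cost, warping path).

    The path is a list of (template_index, test_index) pairs.
    """
    n, m = len(template), len(test)
    inf = float("inf")
    # cost[i][j] = minimal accumulated cost of aligning
    # template[:i] with test[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(template[i - 1] - test[j - 1])  # local distance
            cost[i][j] = d + min(cost[i - 1][j],      # repeat test frame
                                 cost[i][j - 1],      # repeat template frame
                                 cost[i - 1][j - 1])  # advance both
    # Backtrack from the end to recover the warping path.
    path = []
    i, j = n, m
    while i > 0 and j > 0:
        path.append((i - 1, j - 1))
        candidates = [
            (cost[i - 1][j - 1], i - 1, j - 1),
            (cost[i - 1][j], i - 1, j),
            (cost[i][j - 1], i, j - 1),
        ]
        _, i, j = min(candidates)
    path.reverse()
    return cost[n][m], path


# The test utterance holds the middle value one frame longer.
total, path = dtw([1, 2, 3], [1, 2, 2, 3])
```

The full table has one cell per pair of frames, so time and memory both grow with the product of the two sequence lengths, which is the processing and storage burden the abstract refers to.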

102 citations

Book Chapter

Text-Dependent Speaker Recognition

Matthieu Hébert

01 Jan 2008

TL;DR: The chapter demonstrates the intrinsic dependence of accuracy on the lexical content of the password phrase, and presents and analyzes several research results to show key techniques used in text-dependent speaker recognition systems from different sites.

Abstract: Text-dependent speaker recognition characterizes a speaker recognition task, such as verification or identification, in which the set of words (or lexicon) used during the testing phase is a subset of the ones present during the enrollment phase. The restricted lexicon enables very short enrollment (or registration) and testing sessions to deliver an accurate solution but, at the same time, represents scientific and technical challenges. Because of the short enrollment and testing sessions, text-dependent speaker recognition technology is particularly well suited for deployment in large-scale commercial applications. These are the bases for presenting an overview of the state of the art in text-dependent speaker recognition as well as emerging research avenues. In this chapter, we will demonstrate the intrinsic dependence that the lexical content of the password phrase has on the accuracy. Several research results will be presented and analyzed to show key techniques used in text-dependent speaker recognition systems from different sites. Among these, we mention multichannel speaker model synthesis and continuous adaptation of speaker models with threshold tracking. Since text-dependent speaker recognition is the most widely used voice biometric in commercial deployments, several

Journal Article

Using dynamic time warping for online temporal fusion in multisensor systems

Ming Hsiao Ko¹, Geoff West¹, Svetha Venkatesh¹, Mohan Kumar²•Institutions (2)

Curtin University¹, University of Texas at Arlington²

01 Jul 2008-Information Fusion

TL;DR: A robust and efficient framework that uses dynamic time warping (DTW) as the core recognizer to perform online temporal fusion on either the raw data or the features is proposed and performance results are compared with a Hidden Markov Model (HMM) based system.
