Topic

Dynamic time warping

About: Dynamic time warping is a research topic. Over its lifetime, 6,013 publications have been published within this topic, receiving 133,130 citations.

Papers published on a yearly basis (chart; year axis runs from 1962 to 2023)

Papers

Journal Article

Discriminative feature selection for on-line signature verification

Xinghua Xia¹, Xiaoyu Song¹, Fangun Luan¹, Jungang Zheng¹, Zhili Chen¹, Xiaofu Ma

Shenyang Jianzhu University¹

01 Feb 2018 - Pattern Recognition

TL;DR: Two methods, which are based on full factorial experiment design and optimal orthogonal experiment design, are proposed for selecting discriminative features among candidates to improve the robustness of on-line handwritten signatures.

44 citations

Posted Content

Discriminative Acoustic Word Embeddings: Recurrent Neural Network-Based Approaches

Shane Settle, Karen Livescu

08 Nov 2016 - arXiv: Computation and Language

TL;DR: This paper presents discriminative acoustic word embedding models based on recurrent neural networks (RNNs). Both classifier-based and Siamese RNN embeddings improve over previously reported results on a word discrimination task, with Siamese RNNs outperforming classification models.

Abstract: Acoustic word embeddings (fixed-dimensional vector representations of variable-length spoken word segments) have begun to be considered for tasks such as speech recognition and query-by-example search. Such embeddings can be learned discriminatively so that they are similar for speech segments corresponding to the same word, while being dissimilar for segments corresponding to different words. Recent work has found that acoustic word embeddings can outperform dynamic time warping on query-by-example search and related word discrimination tasks. However, the space of embedding models and training approaches is still relatively unexplored. In this paper we present new discriminative embedding models based on recurrent neural networks (RNNs). We consider training losses that have been successful in prior work, in particular a cross-entropy loss for word classification and a contrastive loss that explicitly aims to separate same-word and different-word pairs in a "Siamese network" training setting. We find that both classifier-based and Siamese RNN embeddings improve over previously reported results on a word discrimination task, with Siamese RNNs outperforming classification models. In addition, we present analyses of the learned embeddings and the effects of variables such as dimensionality and network structure.

44 citations
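
The abstract above mentions a contrastive loss that separates same-word from different-word pairs. The sketch below shows one common margin-based form of such a loss over already-computed embedding vectors. It is an illustration under assumptions, not the authors' exact formulation: the function names, the use of cosine distance, and the margin value of 0.5 are all hypothetical choices, and the RNN that would produce the embeddings is omitted.

```python
import math


def cosine_distance(u, v):
    """Return 1 - cosine similarity of two non-zero vectors."""
    dot = sum(x * y for x, y in zip(u, v))
    norm_u = math.sqrt(sum(x * x for x in u))
    norm_v = math.sqrt(sum(y * y for y in v))
    return 1.0 - dot / (norm_u * norm_v)


def contrastive_hinge_loss(anchor, same_word, different_word, margin=0.5):
    """Hinge loss on one (anchor, same-word, different-word) triple.

    The loss is zero once the different-word embedding is farther from
    the anchor than the same-word embedding by at least `margin`
    (an assumed hyperparameter).
    """
    d_same = cosine_distance(anchor, same_word)
    d_diff = cosine_distance(anchor, different_word)
    return max(0.0, margin + d_same - d_diff)


# Toy 2-D "embeddings" (made-up values, for illustration only).
well_separated = contrastive_hinge_loss([1.0, 0.0], [1.0, 0.0], [0.0, 1.0])
badly_separated = contrastive_hinge_loss([1.0, 0.0], [0.0, 1.0], [1.0, 0.0])
```

In a training loop, a loss of this kind would be averaged over many triples and minimized with respect to the embedding network's parameters.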

Journal Article

Accurate automatic visible speech synthesis of arbitrary 3D models based on concatenation of diviseme motion capture data

Jiyong Ma¹, Ronald A. Cole, Bryan L. Pellom, Wayne H. Ward, Barbara Wise

University of Colorado Boulder¹

01 Dec 2004 - Computer Animation and Virtual Worlds

TL;DR: Time warping and motion vector blending at the juncture of two divisemes, together with an algorithm that searches for the optimal concatenated visible speech, are developed to produce the final concatenative motion sequence.

Abstract: We present a technique for accurate automatic visible speech synthesis from textual input. When provided with a speech waveform and the text of a spoken sentence, the system produces accurate visible speech synchronized with the audio signal. To develop the system, we collected motion capture data from a speaker's face during production of a set of words containing all diviseme sequences in English. The motion capture points from the speaker's face are retargeted to the vertices of the polygons of a 3D face model. When synthesizing a new utterance, the system locates the required sequence of divisemes, shrinks or expands each diviseme based on the desired phoneme segment durations in the target utterance, then moves the polygons in the regions of the lips and lower face to correspond to the spatial coordinates of the motion capture data. The motion mapping is realized by a key-shape mapping function learned by a set of viseme examples in the source and target faces. A well-posed numerical algorithm estimates the shape blending coefficients. Time warping and motion vector blending at the juncture of two divisemes and the algorithm to search the optimal concatenated visible speech are also developed to provide the final concatenative motion sequence. Copyright © 2004 John Wiley & Sons, Ltd.

44 citations

Journal Article

Is the DTW “distance” really a metric? An algorithm reducing the number of DTW comparisons in isolated word recognition

Enrique Vidal Ruiz¹, Francisco Casacuberta Nolla¹, Héctor Rulot Segovia¹

University of Valencia¹

01 Dec 1985 - Speech Communication

TL;DR: Empirical evidence is presented that, on real speech, the DTW “distance” loosely satisfies the metric properties, which allows a “loose metric space” structure to be assumed for the set of parametric representations of words in a given vocabulary.

44 citations
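
The question in the title above can be made concrete with a small example. The sketch below implements a textbook dynamic time warping distance (absolute difference as local cost, unconstrained steps, no path-length normalization) and evaluates it on three toy sequences for which the triangle inequality fails. The DTW variant, the function name, and the sequences are assumptions chosen for illustration; the paper's own formulation and speech data are not reproduced here.

```python
from math import inf


def dtw_distance(a, b):
    """Textbook DTW between two non-empty numeric sequences.

    D[i][j] holds the cheapest cumulative cost of aligning a[:i] with b[:j].
    """
    n, m = len(a), len(b)
    D = [[inf] * (m + 1) for _ in range(n + 1)]
    D[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = abs(a[i - 1] - b[j - 1])
            # Extend the cheapest of: insertion, deletion, or match.
            D[i][j] = cost + min(D[i - 1][j], D[i][j - 1], D[i - 1][j - 1])
    return D[n][m]


x, y, z = [0], [1, 2], [2, 2, 2]
d_xy = dtw_distance(x, y)  # 3.0
d_yz = dtw_distance(y, z)  # 1.0
d_xz = dtw_distance(x, z)  # 6.0

# A true metric would require d_xz <= d_xy + d_yz; here 6.0 > 4.0.
violates_triangle_inequality = d_xz > d_xy + d_yz
```

Because such violations exist, pruning rules that rely on the triangle inequality are only approximately valid for DTW, which is consistent with the "loose metric space" wording in the summary above.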

Book Chapter

Efficient search in document image collections

Anand Kumar¹, C. V. Jawahar¹, R. Manmatha²

International Institute of Information Technology, Hyderabad¹, University of Massachusetts Amherst²

18 Nov 2007

TL;DR: This paper presents an efficient indexing and retrieval scheme for searching in document image databases that achieves high precision and recall, using a large image corpus consisting of seven books by Kalidasa in the Telugu language.

Abstract: This paper presents an efficient indexing and retrieval scheme for searching in document image databases. In many non-European languages, optical character recognizers are not very accurate. Word spotting - word image matching - may instead be used to retrieve word images in response to a word image query. The approaches used for word spotting so far, dynamic time warping and/or nearest neighbor search, tend to be slow. Here indexing is done using locality sensitive hashing (LSH) - a technique which computes multiple hashes - using word image features computed at word level. Efficiency and scalability are achieved by content-sensitive hashing implemented through approximate nearest neighbor computation. We demonstrate that the technique achieves high precision and recall (in the 90% range), using a large image corpus consisting of seven books by Kalidasa (a well-known Indian poet of antiquity) in the Telugu language. The accuracy is comparable to using dynamic time warping and nearest neighbor search while the speed is orders of magnitude better - 20000 word images can be searched in milliseconds.

44 citations
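
The abstract above describes indexing word-image features with locality sensitive hashing, which computes multiple hashes per item. The sketch below illustrates the general idea with random-hyperplane hashing over several tables. This is a swapped-in, generic LSH family used for illustration: the abstract does not specify the hash functions or features, and the class name `HyperplaneLSH`, its parameters, and the toy vectors are hypothetical.

```python
import random
from collections import defaultdict


class HyperplaneLSH:
    """Multi-table LSH using the signs of random projections."""

    def __init__(self, dim, n_tables=4, n_bits=8, seed=0):
        rng = random.Random(seed)
        # One independent set of random hyperplanes per hash table.
        self.planes = [
            [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(n_bits)]
            for _ in range(n_tables)
        ]
        self.tables = [defaultdict(list) for _ in range(n_tables)]

    def _key(self, table_index, vec):
        # Each bit records which side of a hyperplane the vector falls on.
        return tuple(
            sum(p * x for p, x in zip(plane, vec)) >= 0.0
            for plane in self.planes[table_index]
        )

    def add(self, item_id, vec):
        for t, table in enumerate(self.tables):
            table[self._key(t, vec)].append(item_id)

    def candidates(self, vec):
        """Return ids sharing a bucket with `vec` in at least one table."""
        found = set()
        for t, table in enumerate(self.tables):
            found.update(table.get(self._key(t, vec), []))
        return found


index = HyperplaneLSH(dim=3)
index.add("word_image_1", [1.0, 0.2, 0.0])   # made-up feature vectors
index.add("word_image_2", [-0.5, 0.9, 0.3])
hits = index.candidates([1.0, 0.2, 0.0])
```

Only the returned candidates would then need an exact comparison (for example with dynamic time warping), which is where the speed-up over exhaustive matching comes from.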

Network Information

Performance Metrics

Papers: 6,722

Citations: 154,377

No. of papers in the topic in previous years
Year	Papers
2023	236
2022	471
2021	341
2020	416
2019	420
2018	377
