scispace - formally typeset
Proceedings ArticleDOI

Cross-words reference template for DTW-based speech recognition systems

Waleed H. Abdulla, +2 more
- Vol. 4, pp 1576-1579
TLDR
A simple novel technique for preparing reliable reference templates to improve the recognition rate score and produces templates called crosswords reference templates (CWRTs), which can be adapted to any DTW-based speech recognition systems to improve its performance.
Abstract
One of the main problems in dynamic time-warping (DTW) based speech recognition systems are the preparation of reliable reference templates for the set of words to be recognised. This paper presents a simple novel technique for preparing reliable reference templates to improve the recognition rate score. The developed technique produces templates called crosswords reference templates (CWRTs). It extracts the reference template from a set of examples rather than one example. This technique can be adapted to any DTW-based speech recognition systems to improve its performance. The speaker-dependent recognition rate, as tested on the English digits, is improved from 85.3%, using the traditional technique to 99%, using the developed technique.

read more

Citations
More filters
Journal ArticleDOI

Toward accurate dynamic time warping in linear time and space

TL;DR: This paper introduces FastDTW, an approximation of DTW that has a linear time and space complexity and shows a large improvement in accuracy over existing methods.
Journal ArticleDOI

Time-series clustering - A decade review

TL;DR: This review will expose four main components of time-series clustering and is aimed to represent an updated investigation on the trend of improvements in efficiency, quality and complexity of clustering time- series approaches during the last decade and enlighten new paths for future works.

FastDTW: Toward Accurate Dynamic Time Warping in Linear Time and Space

TL;DR: This paper introduces FastDTW, an approximation of DTW that has a linear time and space complexity that uses a multilevel approach that recursively projects a solution from a coarse resolution and refines the projected solution.
Proceedings ArticleDOI

On Clustering Multimedia Time Series Data Using K-Means and Dynamic Time Warping

TL;DR: It is demonstrated that unfortunately, k-means clustering will sometimes fail to give correct results, and a suggestion of a method to potentially find the shape-based time series average that satisfies the required properties is suggested.
Proceedings ArticleDOI

Time-series clustering by approximate prototypes

TL;DR: This work defines an optimal prototype as an optimization problem and proposes a local search solution to it and finds out that the proposed prototype with agglomerative clustering followed by k-means algorithm provides best clustering accuracy.
References
More filters
Book

Fundamentals of speech recognition

TL;DR: This book presents a meta-modelling framework for speech recognition that automates the very labor-intensive and therefore time-heavy and therefore expensive and expensive process of manually modeling speech.
Journal ArticleDOI

Dynamic programming algorithm optimization for spoken word recognition

TL;DR: This paper reports on an optimum dynamic progxamming (DP) based time-normalization algorithm for spoken word recognition, in which the warping function slope is restricted so as to improve discrimination between words in different categories.

Cepstrum analysis technique for automatic speaker verification

S. Furui
TL;DR: New techniques for automatic speaker verification using telephone speech based on a set of functions of time obtained from acoustic analysis of a fixed, sentence-long utterance using a new time warping method using a dynamic programming technique.
Journal ArticleDOI

Cepstral analysis technique for automatic speaker verification

TL;DR: In this paper, a set of functions of time obtained from acoustic analysis of a fixed, sentence-long utterance are extracted by means of LPC analysis successively throughout an utterance to form time functions, and frequency response distortions introduced by transmission systems are removed.
Journal ArticleDOI

Speaker-independent isolated word recognition using dynamic features of speech spectrum

TL;DR: This paper proposes a new isolated word recognition technique based on a combination of instantaneous and dynamic features of the speech spectrum that is shown to be highly effective in speaker-independent speech recognition.