Experiments with fast Fourier transform, linear predictive and cepstral coefficients in dysarthric speech recognition algorithms using hidden Markov model

doi:10.1109/TNSRE.2005.856074

P.D. Polur, +1 more

Vol. 13, Iss. 4, pp. 558-561

TLDR

A hidden Markov model was constructed for a dysarthric speech (isolated word) recognition system, and the conditions that would improve its performance were investigated. A Mel cepstrum based model outperformed fast Fourier transform and linear prediction based models.

Abstract:

In this study, a hidden Markov model was constructed and conditions were investigated that would provide improved performance for a dysarthric speech (isolated word) recognition system. The speaker-dependent system was intended to act as an assistive/control tool. A small vocabulary spoken by three subjects with cerebral palsy was chosen. Fast Fourier transform, linear predictive, and Mel frequency cepstral coefficients extracted from the data provided training input to several whole-word hidden Markov model configurations. The effects of model structure, number of states, and frame rate were also investigated. A 10-state ergodic model using 15 ms frames performed better than the other configurations. Furthermore, a Mel cepstrum based model outperformed the fast Fourier transform and linear prediction based models. The system offers effective and robust application as a rehabilitation and/or control tool to assist dysarthric, motor-impaired individuals.
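The pipeline described in the abstract (framing, whole-word models, choosing the best-scoring word) can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes non-overlapping 15 ms frames, discrete (vector-quantized) observation symbols, and uniform initial transition values, none of which the abstract specifies. The function names `frame_signal`, `ergodic_transitions`, `left_to_right_transitions`, `log_likelihood`, and `recognize` are hypothetical.

```python
import math

def frame_signal(samples, sample_rate, frame_ms=15.0):
    """Split samples into non-overlapping frames (no overlap is an assumption)."""
    frame_len = int(round(sample_rate * frame_ms / 1000.0))
    n_frames = len(samples) // frame_len
    return [samples[i * frame_len:(i + 1) * frame_len] for i in range(n_frames)]

def ergodic_transitions(n_states=10):
    """Ergodic structure: every state can reach every other state.
    Uniform values are only a starting point before training."""
    return [[1.0 / n_states] * n_states for _ in range(n_states)]

def left_to_right_transitions(n_states=10):
    """Left-to-right structure for comparison: stay, or advance by one state."""
    A = [[0.0] * n_states for _ in range(n_states)]
    for i in range(n_states - 1):
        A[i][i] = 0.5
        A[i][i + 1] = 0.5
    A[-1][-1] = 1.0
    return A

def log_likelihood(obs, pi, A, B):
    """Scaled forward algorithm for a discrete-observation HMM.
    pi: initial state probabilities, A: transitions, B[state][symbol]: emissions."""
    n = len(pi)
    alpha = [pi[i] * B[i][obs[0]] for i in range(n)]
    log_p = 0.0
    for t in range(len(obs)):
        if t > 0:
            alpha = [sum(alpha[i] * A[i][j] for i in range(n)) * B[j][obs[t]]
                     for j in range(n)]
        scale = sum(alpha)
        if scale == 0.0:
            return float("-inf")  # the model cannot produce this sequence
        log_p += math.log(scale)
        alpha = [a / scale for a in alpha]  # rescale to avoid underflow
    return log_p

def recognize(obs, models):
    """Return the word whose whole-word HMM scores the observations highest."""
    return max(models, key=lambda word: log_likelihood(obs, *models[word]))
```

In a whole-word setup like this, one model per vocabulary word is trained, and an utterance is assigned to the word whose model gives the highest likelihood. The feature extraction step (FFT, linear predictive, or Mel cepstral coefficients) and the quantization of those features into symbols are omitted here.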

Citations

Journal ArticleDOI

A speech-controlled environmental control system for people with severe dysarthria.

Mark S. Hawley, +11 more

- 01 Jun 2007 -

Medical Engineering & Physics

TL;DR: It is concluded that a speech-controlled ECS is a viable alternative to switch-scanning systems for some people with severe dysarthria and would lead, in many cases, to more efficient control of the home.

Journal ArticleDOI

Silent Speech Recognition as an Alternative Communication Device for Persons With Laryngectomy

Geoffrey S. Meltzner, +5 more

- 01 Dec 2017 -

IEEE Transactions on Audio, Speech, and ...

TL;DR: This study provides a compelling proof-of-concept for sEMG-based alaryngeal speech recognition, with the strong potential to further improve recognition performance.

Proceedings ArticleDOI

Statistical Pattern Recognition and Built-in Reliability Test for Feature Extraction and Health Monitoring of Electronics under Shock Loads

Pradeep Lall, +4 more

TL;DR: In this paper, a new approach has been developed to monitor product-level damage during shock and vibration using the dynamic response of the electronic equipment, which is applicable at the system level for identification of impending failures to trigger repair or replacement significantly prior to failure.

Journal ArticleDOI

Artificial neural networks as speech recognisers for dysarthric speech: Identifying the best-performing set of MFCC parameters and studying a speaker-independent approach

Seyed Reza Shahamiri, +1 more

- 01 Jan 2014 -

Advanced Engineering Informatics

TL;DR: This paper studies the application of ANNs as a fixed-length isolated-word SI ASR for individuals who suffer from dysarthria and identifies the best-performing set of MFCC parameters, which can represent dysarthric acoustic features to be used in Artificial Neural Network (ANN)-based ASR.

Journal ArticleDOI

An SVD audio watermarking approach using chaotic encrypted images

Waleed Al-Nuaimy, +10 more

- 01 Dec 2011 -

Digital Signal Processing

TL;DR: Experimental results show that the proposed audio watermarking approach maintains the high quality of the audio signal and that the watermark extraction and decryption are possible even in the presence of attacks.

References

Journal ArticleDOI

A tutorial on hidden Markov models and selected applications in speech recognition

Lawrence R. Rabiner

TL;DR: In this paper, the authors provide an overview of the basic theory of hidden Markov models (HMMs) as originated by L.E. Baum and T. Petrie (1966) and give practical details on methods of implementation of the theory along with a description of selected applications of HMMs to distinct problems in speech recognition.

Book

Statistical methods for speech recognition

Frederick Jelinek

TL;DR: Topics covered include the speech recognition problem, hidden Markov models, the acoustic model, basic language modelling, the Viterbi search, hypothesis search on a tree and the fast match, and elements of information theory.

Proceedings ArticleDOI

The Nemours database of dysarthric speech

Xavier Menendez-Pidal, +4 more

TL;DR: The database structure and techniques adopted to improve the performance of a Discrete Hidden Markov Model (DHMM) labeler used to assign initial phoneme labels to the elements of the Nemours database are described.

Book

Design of Microcomputer-Based Medical Instrumentation

Willis J. Tompkins, +1 more

Proceedings Article

Automatic speech recognition with sparse training data for dysarthric speakers.

Phil D. Green, +5 more

TL;DR: A battery of measures of consistency and confusability, based on forced alignment, which can be used to predict recogniser performance, is presented, along with how these measures are shown to the clinicians who are the users of the system.
