scispace - formally typeset
Journal ArticleDOI

Experiments with fast Fourier transform, linear predictive and cepstral coefficients in dysarthric speech recognition algorithms using hidden Markov model

P.D. Polur, +1 more
- Vol. 13, Iss: 4, pp 558-561
Reads0
Chats0
TLDR
The hidden Markov Model constructed and conditions investigated that would provide improved performance for a dysarthric speech (isolated word) recognition system found that a Mel cepstrum based model outperformed a fast Fourier transform and linear prediction based model.
Abstract
In this study, a hidden Markov Model was constructed and conditions were investigated that would provide improved performance for a dysarthric speech (isolated word) recognition system. The speaker dependant system was intended to act as an assistive/control tool. A small size vocabulary spoken by three cerebral palsy subjects was chosen. Fast Fourier transform, linear predictive, and Mel frequency cepstral coefficients extracted from data provided training input to several whole-word hidden Markov model configurations. The effect of model structure, number of states, and frame rates were also investigated. It was noted that a 10-state ergodic model using 15 msec frames was better than other configurations. Furthermore, it was found that a Mel cepstrum based model outperformed a fast Fourier transform and linear prediction based model. The system offers effective and robust application as a rehabilitation and/or control tool to assist dysarthric motor impaired individuals.

read more

Citations
More filters
Journal ArticleDOI

A speech-controlled environmental control system for people with severe dysarthria.

TL;DR: It is concluded that a speech-controlled ECS is a viable alternative to switch-scanning systems for some people with severe dysarthria and would lead, in many cases, to more efficient control of the home.
Journal ArticleDOI

Silent Speech Recognition as an Alternative Communication Device for Persons With Laryngectomy

TL;DR: This study provides a compelling proof-of-concept for sEMG-based alaryngeal speech recognition, with the strong potential to further improve recognition performance.
Proceedings ArticleDOI

Statistical Pattern Recognition and Built-in Reliability Test for Feature Extraction and Health Monitoring of Electronics under Shock Loads

TL;DR: In this paper, a new approach has been developed to monitor product-level damage during shock and vibration using the dynamic response of the electronic equipment, which is applicable at the system level for identification of impending failures to trigger repair or replacement significantly prior to failure.
Journal ArticleDOI

Artificial neural networks as speech recognisers for dysarthric speech: Identifying the best-performing set of MFCC parameters and studying a speaker-independent approach

TL;DR: This paper studies the application of ANNs as a fixed-length isolated-word SI ASR for individuals who suffer from dysarthria and identifies the best-performing set of MFCC parameters, which can represent dysarthric acoustic features to be used in Artificial Neural Network (ANN)-based ASR.
Journal ArticleDOI

An SVD audio watermarking approach using chaotic encrypted images

TL;DR: Experimental results show that the proposed audio watermarking approach maintains the high quality of the audio signal and that the watermark extraction and decryption are possible even in the presence of attacks.
References
More filters
Journal ArticleDOI

A tutorial on hidden Markov models and selected applications in speech recognition

TL;DR: In this paper, the authors provide an overview of the basic theory of hidden Markov models (HMMs) as originated by L.E. Baum and T. Petrie (1966) and give practical details on methods of implementation of the theory along with a description of selected applications of HMMs to distinct problems in speech recognition.
Book

Statistical methods for speech recognition

TL;DR: The speech recognition problem hidden Markov models the acoustic model basic language modelling the Viterbi search hypothesis search on a tree and the fast match elements of information theory.
Proceedings ArticleDOI

The Nemours database of dysarthric speech

TL;DR: The database structure and techniques adopted to improve the performance of a Discrete Hidden Markov Model (DHMM) labeler used to assign initial phoneme labels to the elements of the Nemours database are described.
Book

Design of Microcomputer-Based Medical Instrumentation

TL;DR: A new book enPDFd design of microcomputer based medical instrumentation that can be a new way to explore the knowledge and one thing to always remember in every reading time, even step by step is shown.
Proceedings Article

Automatic speech recognition with sparse training data for dysarthric speakers.

TL;DR: A battery of measures of consistency and confusability, based on forced-alignment, which can be used to predict recogniser performance are presented and how these measures perform are shown to the clinicians who are the users of the system.