scispace - formally typeset
Open AccessProceedings ArticleDOI

A robust algorithm for detecting speech segments using an entropic contrast

Reads0
Chats0
TLDR
An entropy based contrast function between the speech segments and the background noise is proposed, which exhibits better-behaved characteristics as compared to the energy-based methods.
Abstract
This paper addresses the issue of automatic word/sentence boundary detection in both quiet and noisy environments. We propose to use an entropy based contrast function between the speech segments and the background noise. A simplified data based scheme of computing the entropy of the speech data is presented. The entropy-based contrast exhibits better-behaved characteristics as compared to the energy-based methods. An adaptive threshold is used to determine the candidate speech segments, which are subjected to word/sentence constraints. Experimental. results show that this algorithm outperforms energy-based algorithms. The improved detection accuracy of speech segments results in at least 25% improvement of recognition performance for isolated speech and more than 16% for connected speech. For continuous speech, a preprocessing stage comprising of the proposed speech segment detection makes the overall HMM based scheme more computationally efficient by rejection of silence periods.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Neural networks used for speech recognition

TL;DR: This investigation on the speech recognition classification performance is performed using two standard neural networks structures as the classifier using Feed-forward Neural Network with back propagation algorithm and a Radial Basis Functions Neural Networks.
Journal ArticleDOI

Automatic Speech Recognition for Under-Resourced Languages: Application to Vietnamese Language

TL;DR: Experimental results on Vietnamese showed that with only a few hours of target language speech data, crosslingual context independent modeling worked better than crosslingUAL context dependent modeling, however, it was outperformed by the latter one, when more speech data were available, and it was concluded that in both cases,Crosslingual systems are better than monolingual baseline systems.
Journal ArticleDOI

A Biosignal-Based Human Interface Controlling a Power-Wheelchair for People with Motor Disabilities

TL;DR: It is found that the proposed interface can be considered a potential alternative for the interaction of the severely disabled with electronic systems.
Proceedings ArticleDOI

Wearable Sensing Framework for Human Activity Monitoring

TL;DR: This work proposes a generic framework to continuously monitor users' daily activities and proposes light computation tasks on the wearable device to reduce the amount of data communicated between the wearable, and its host.
Journal ArticleDOI

Speech Recognition System: A Review

TL;DR: This paper presents the advances made as well as highlights the pressing problems for a speech recognition system and classifies the system into Front End and Back End for better understanding and representation of speech Recognition system in each part.
References
More filters
Book

Fundamentals of speech recognition

TL;DR: This book presents a meta-modelling framework for speech recognition that automates the very labor-intensive and therefore time-heavy and therefore expensive and expensive process of manually modeling speech.
Book

Discrete-Time Processing of Speech Signals

TL;DR: The preface to the IEEE Edition explains the background to speech production, coding, and quality assessment and introduces the Hidden Markov Model, the Artificial Neural Network, and Speech Enhancement.
Journal ArticleDOI

An improved endpoint detector for isolated word recognition

TL;DR: A hybrid end-point detector is proposed which gives a rejection rate of less than 0.5 percent, while providing recognition accuracy close to that obtained from hand-edited endpoints.
Proceedings Article

Robust entropy-based endpoint detection for speech recognition in noisy environments.

TL;DR: This paper presents an entropy-based algorithm for accurate and robust endpoint detection for speech recognition under noisy environments that uses the spectral entropy to identify the speech segments accurately.
Journal ArticleDOI

A robust algorithm for word boundary detection in the presence of noise

TL;DR: This new algorithm identifies islands of reliability (essentially the portion of speech contained between the first and the last vowel) using time and frequency-based features and then applies a noise adaptive procedure to refine the boundaries.