A robust algorithm for detecting speech segments using an entropic contrast

doi:10.1109/MWSCAS.2002.1187039

Open AccessProceedings ArticleDOI

A robust algorithm for detecting speech segments using an entropic contrast

Khurram Waheed, +2 more

- Vol. 3

Chats0

TLDR

An entropy based contrast function between the speech segments and the background noise is proposed, which exhibits better-behaved characteristics as compared to the energy-based methods.

Abstract:

This paper addresses the issue of automatic word/sentence boundary detection in both quiet and noisy environments. We propose to use an entropy based contrast function between the speech segments and the background noise. A simplified data based scheme of computing the entropy of the speech data is presented. The entropy-based contrast exhibits better-behaved characteristics as compared to the energy-based methods. An adaptive threshold is used to determine the candidate speech segments, which are subjected to word/sentence constraints. Experimental. results show that this algorithm outperforms energy-based algorithms. The improved detection accuracy of speech segments results in at least 25% improvement of recognition performance for isolated speech and more than 16% for connected speech. For continuous speech, a preprocessing stage comprising of the proposed speech segment detection makes the overall HMM based scheme more computationally efficient by rejection of silence periods.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Neural networks used for speech recognition

Wouter Gevaert, +2 more

- 01 Jan 2010 -

Journal of Automatic Control

TL;DR: This investigation on the speech recognition classification performance is performed using two standard neural networks structures as the classifier using Feed-forward Neural Network with back propagation algorithm and a Radial Basis Functions Neural Networks.

...read moreread less

Journal ArticleDOI

Automatic Speech Recognition for Under-Resourced Languages: Application to Vietnamese Language

Viet Bac Le, +1 more

- 01 Nov 2009 -

IEEE Transactions on Audio, Speech, and ...

TL;DR: Experimental results on Vietnamese showed that with only a few hours of target language speech data, crosslingual context independent modeling worked better than crosslingUAL context dependent modeling, however, it was outperformed by the latter one, when more speech data were available, and it was concluded that in both cases,Crosslingual systems are better than monolingual baseline systems.

...read moreread less

Journal ArticleDOI

A Biosignal-Based Human Interface Controlling a Power-Wheelchair for People with Motor Disabilities

Ki-Hong Kim, +4 more

- 08 Feb 2006 -

Etri Journal

TL;DR: It is found that the proposed interface can be considered a potential alternative for the interaction of the severely disabled with electronic systems.

...read moreread less

Proceedings ArticleDOI

Wearable Sensing Framework for Human Activity Monitoring

Mostafa Uddin, +3 more

TL;DR: This work proposes a generic framework to continuously monitor users' daily activities and proposes light computation tasks on the wearable device to reduce the amount of data communicated between the wearable, and its host.

...read moreread less

Journal ArticleDOI

Speech Recognition System: A Review

Nitin Washani, +1 more

- 22 Apr 2015 -

International Journal of Computer Applic...

TL;DR: This paper presents the advances made as well as highlights the pressing problems for a speech recognition system and classifies the system into Front End and Back End for better understanding and representation of speech Recognition system in each part.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Book

Fundamentals of speech recognition

Lawrence R. Rabiner, +1 more

TL;DR: This book presents a meta-modelling framework for speech recognition that automates the very labor-intensive and therefore time-heavy and therefore expensive and expensive process of manually modeling speech.

...read moreread less

Book

Discrete-Time Processing of Speech Signals

J. R. Deller, +2 more

TL;DR: The preface to the IEEE Edition explains the background to speech production, coding, and quality assessment and introduces the Hidden Markov Model, the Artificial Neural Network, and Speech Enhancement.

...read moreread less

Journal ArticleDOI

An improved endpoint detector for isolated word recognition

Lori Lamel, +3 more

- 01 Aug 1981 -

IEEE Transactions on Acoustics, Speech, ...

TL;DR: A hybrid end-point detector is proposed which gives a rejection rate of less than 0.5 percent, while providing recognition accuracy close to that obtained from hand-edited endpoints.

...read moreread less

Proceedings Article

Robust entropy-based endpoint detection for speech recognition in noisy environments.

Jia-Lin Shen, +2 more

TL;DR: This paper presents an entropy-based algorithm for accurate and robust endpoint detection for speech recognition under noisy environments that uses the spectral entropy to identify the speech segments accurately.

...read moreread less

Journal ArticleDOI

A robust algorithm for word boundary detection in the presence of noise

J.-C. Junqua, +2 more

- 01 Jul 1994 -

IEEE Transactions on Speech and Audio Pr...

TL;DR: This new algorithm identifies islands of reliability (essentially the portion of speech contained between the first and the last vowel) using time and frequency-based features and then applies a noise adaptive procedure to refine the boundaries.

...read moreread less

A robust algorithm for detecting speech segments using an entropic contrast

Citations

Neural networks used for speech recognition

Automatic Speech Recognition for Under-Resourced Languages: Application to Vietnamese Language

A Biosignal-Based Human Interface Controlling a Power-Wheelchair for People with Motor Disabilities

Wearable Sensing Framework for Human Activity Monitoring

Speech Recognition System: A Review

References

Fundamentals of speech recognition

Discrete-Time Processing of Speech Signals

An improved endpoint detector for isolated word recognition

Robust entropy-based endpoint detection for speech recognition in noisy environments.

A robust algorithm for word boundary detection in the presence of noise

Related Papers (5)

Automatic speech segmentation to improve speech synthesis performance

A noise robust speech activity detection algorithm

Robust entropy-based endpoint detection for speech recognition in noisy environments.

Noisy speech recognition based on robust end-point detection and model adaptation

Using Approximate Entropy as a speech quality measure for a speaker recognition system