Journal ArticleDOI
Robustness of group-delay-based method for extraction of significant instants of excitation from speech signals
Reads0
Chats0
TLDR
A measure for the strength of the excitation based on Frobenius norm of the differenced signal is proposed and illustrated for speech under different types of degradations and for speech from different speakers.Abstract:
We study the robustness of a group-delay-based method for determining the instants of significant excitation in speech signals. These instants correspond to the instants of glottal closure for voiced speech. The method uses the properties of the global phase characteristics of minimum phase signals. Robustness of the method against noise and distortion is due to the fact that the average phase characteristics of a signal is determined mainly by the strength of the excitation impulse. The strength of excitation is determined by the energy of the residual error signal around the instant of excitation. We propose a measure for the strength of the excitation based on Frobenius norm of the differenced signal. The robustness of the group-delay-based method is illustrated for speech under different types of degradations and for speech from different speakers.read more
Citations
More filters
Journal ArticleDOI
Epoch Extraction From Speech Signals
K.S.R. Murty,B. Yegnanarayana +1 more
TL;DR: The interesting part of the results is that the epoch extraction by the proposed method seems to be robust against degradations like white noise, babble, high-frequency channel, and vehicle noise.
Journal ArticleDOI
Estimation of Glottal Closure Instants in Voiced Speech Using the DYPSA Algorithm
TL;DR: The Dynamic Programming Projected Phase-Slope Algorithm (DYPSA) is automatic and operates using the speech signal alone without the need for an EGG signal for automatic estimation of glottal closure instants (GCIs) in voiced speech.
Journal ArticleDOI
Prosody modification using instants of significant excitation
K.S. Rao,B. Yegnanarayana +1 more
TL;DR: The proposed method for prosody (pitch and duration) modification using the instants of significant excitation of the vocal tract system during the production of speech is compared with linear prediction pitch synchronous overlap and add (LP-PSOLA) method.
Journal ArticleDOI
Extraction of vocal-tract system characteristics from speech signals
TL;DR: It is shown that the selection of appropriate analysis segments is crucial in these methods, and it is proposed a selection based on estimated instants of significant excitation, obtained by a method based on the average group-delay property of minimum-phase signals.
Journal ArticleDOI
Short-time phase spectrum in speech processing: A review and some experimental results
Leigh Alsteris,Kuldip K. Paliwal +1 more
TL;DR: It is suggested that a short-time phase spectrum feature set may ultimately be derived from a concatenation of information from both the GDF and IFD representations, and that these features perform worse than the standard MFCC features.
References
More filters
Journal ArticleDOI
Fundamentals of statistical signal processing: estimation theory
TL;DR: The Fundamentals of Statistical Signal Processing: Estimation Theory as mentioned in this paper is a seminal work in the field of statistical signal processing, and it has been used extensively in many applications.
Journal ArticleDOI
Linear prediction: A tutorial review
TL;DR: This paper gives an exposition of linear prediction in the analysis of discrete signals as a linear combination of its past values and present and past values of a hypothetical input to a system whose output is the given signal.
Book
Linear Algebra With Applications
TL;DR: In this article, the authors present a series of exercises for the problem of least square problems in the MATLAB programming language MATLAB, including a chapter test for each of the following problems: