scispace - formally typeset
Journal ArticleDOI

Determination of instants of significant excitation in speech using group delay function

TLDR
A new method based on the global phase characteristics of minimum phase signals for determining the instants of significant excitation in speech signals is proposed, which works well for all types of voiced speech in male as well as female speech but, in all cases, under noise-free conditions only.
Abstract
A new method for determining the instants of significant excitation in speech signals is proposed. In the paper, significant excitation refers primarily to the instant of glottal closure within a pitch period in voiced speech. The method is based on the global phase characteristics of minimum phase signals. The average slope of the unwrapped phase of the short-time Fourier transform of linear prediction residual is calculated as a function of time. Instants where the phase slope function makes a positive zero-crossing are identified as significant excitations. The method is discussed in a source-filter context of speech production. The method is not sensitive to the characteristics of the filter. The influence of the type, length, and position of the analysis window is discussed. The method works well for all types of voiced speech in male as well as female speech but, in all cases, under noise-free conditions only. >

read more

Citations
More filters
Journal ArticleDOI

Epoch Extraction From Speech Signals

TL;DR: The interesting part of the results is that the epoch extraction by the proposed method seems to be robust against degradations like white noise, babble, high-frequency channel, and vehicle noise.
Journal ArticleDOI

Estimation of Glottal Closure Instants in Voiced Speech Using the DYPSA Algorithm

TL;DR: The Dynamic Programming Projected Phase-Slope Algorithm (DYPSA) is automatic and operates using the speech signal alone without the need for an EGG signal for automatic estimation of glottal closure instants (GCIs) in voiced speech.
Book ChapterDOI

Features for Content-Based Audio Retrieval

TL;DR: The goal of this chapter is to review latest research in the context of audio feature extraction and to give an application-independent overview of the most important existing techniques, and to propose a novel taxonomy for the organization of audio features.
Journal ArticleDOI

Prosody modification using instants of significant excitation

TL;DR: The proposed method for prosody (pitch and duration) modification using the instants of significant excitation of the vocal tract system during the production of speech is compared with linear prediction pitch synchronous overlap and add (LP-PSOLA) method.
Journal ArticleDOI

Glottal inverse filtering analysis of human voice production — A review of estimation and parameterization methods of the glottal excitation and their applications

TL;DR: An era spanning five decades during which this topic has been under development is examined, including the estimation methods of the glottal source, the parameterization techniques that have been developed to express the estimatedglottal excitations in numerical forms, and the application areas of GIF.
References
More filters
Book

Linear Prediction of Speech

John E. Markel, +1 more
TL;DR: Speech Analysis and Synthesis Models: Basic Physical Principles, Speech Synthesis Structures, and Considerations in Choice of Analysis.
Journal ArticleDOI

Linear prediction of speech

TL;DR: The book that the authors will offer right here is the soft file concept, which make you can easily find and get this linear prediction of speech by reading this site.
Journal ArticleDOI

Least squares glottal inverse filtering from the acoustic speech waveform

TL;DR: Based on a linear model of speech production, it is shown that both the moment of glottal closure and opening can be determined from the normalized total squared error with proper choices of analysis window length and filter order.
Journal ArticleDOI

Epoch extraction from linear prediction residual for identification of closed glottis interval

TL;DR: In this paper, an interpretation of LP residual by considering the effect of the shape of glottal pulses, inaccurate estimation of formants and bandwidths, phase angles of formant at the instants of excitation, and zeros in the vocal tract system is presented.
Journal ArticleDOI

Significance of group delay functions in signal reconstruction from spectral magnitude or phase

TL;DR: This study shows that the relative importance of spectral magnitude and phase depends on the nature of signals, and explains the convergence behavior of the existing iterative algorithms for signal reconstruction.
Related Papers (5)