Journal ArticleDOI
Speech analysis-synthesis system based on homomorphic filtering.
TLDR
A digital speech analysis‐synthesis system based on a recently proposed approach to the deconvolution of speech is presented and either a zero‐phase or minimum‐phase characteristic can be obtained by simple weighting of the cepstrum before transformation.Abstract:
A digital speech analysis‐synthesis system based on a recently proposed approach to the deconvolution of speech is presented. The analyzer is based on a computation of the cepstrum considered as the inverse Fourier transform of the log magnitude of the Fourier transform. The transmitted parameters represent pitch and voiced unvoiced information and the low‐time portion of the cepstrum representing an approximation to the cepstrum of the vocal‐tract impulse response. In the synthesis, the low‐time cepstral information is transformed to an impulse response function, which is then convolved with a train of impulses during voiced portions or a noise waveform during unvoiced portions to reconstruct the speech. Since no phase information is retained in the analysis, phase must be regenerated during synthesis. Either a zero‐phase or minimum‐phase characteristic can be obtained by simple weighting of the cepstrum before transformation.read more
Citations
More filters
Proceedings ArticleDOI
Query by humming: musical information retrieval in an audio database
TL;DR: A system for querying an audio database by humming is described along with a scheme for representing the melodic information in a song as relative pitch changes, and the performance results of system indicating its effectiveness are presented.
Journal ArticleDOI
The cepstrum: A guide to processing
TL;DR: The power, complex, and phase cepstra are shown to be easily related to one another, and the interpretation and processing of data in such areas as speech, seismology, and hydroacoustics is discussed.
Journal ArticleDOI
Multiband excitation vocoder
Daniel W. Griffin,Jae Lim +1 more
TL;DR: A speech model, referred to as the multiband excitation model, is presented where the band around each harmonic of the fundamental frequency is declared voiced or unvoiced and methods to synthesize speech from the model parameters are described.
Journal ArticleDOI
Speech coding: a tutorial review
TL;DR: The objective of this paper is to provide a tutorial overview of speech coding methodologies with emphasis on those algorithms that are part of the recent low-rate standards for cellular communications.
Book
An Introduction to Digital Speech Processing
TL;DR: A comprehensive overview of digital speech processing that ranges from the basic nature of the speech signal, through a variety of methods of representing speech in digital form, to applications in voice communication and automatic synthesis and recognition of speech.