scispace - formally typeset
Journal ArticleDOI

Speech analysis-synthesis system based on homomorphic filtering.

Alan V. Oppenheim
- 01 Feb 1969 - 
- Vol. 45, Iss: 2, pp 458-465
TLDR
A digital speech analysis‐synthesis system based on a recently proposed approach to the deconvolution of speech is presented and either a zero‐phase or minimum‐phase characteristic can be obtained by simple weighting of the cepstrum before transformation.
Abstract
A digital speech analysis‐synthesis system based on a recently proposed approach to the deconvolution of speech is presented. The analyzer is based on a computation of the cepstrum considered as the inverse Fourier transform of the log magnitude of the Fourier transform. The transmitted parameters represent pitch and voiced unvoiced information and the low‐time portion of the cepstrum representing an approximation to the cepstrum of the vocal‐tract impulse response. In the synthesis, the low‐time cepstral information is transformed to an impulse response function, which is then convolved with a train of impulses during voiced portions or a noise waveform during unvoiced portions to reconstruct the speech. Since no phase information is retained in the analysis, phase must be regenerated during synthesis. Either a zero‐phase or minimum‐phase characteristic can be obtained by simple weighting of the cepstrum before transformation.

read more

Citations
More filters
Proceedings ArticleDOI

Query by humming: musical information retrieval in an audio database

TL;DR: A system for querying an audio database by humming is described along with a scheme for representing the melodic information in a song as relative pitch changes, and the performance results of system indicating its effectiveness are presented.
Journal ArticleDOI

The cepstrum: A guide to processing

TL;DR: The power, complex, and phase cepstra are shown to be easily related to one another, and the interpretation and processing of data in such areas as speech, seismology, and hydroacoustics is discussed.
Journal ArticleDOI

Multiband excitation vocoder

TL;DR: A speech model, referred to as the multiband excitation model, is presented where the band around each harmonic of the fundamental frequency is declared voiced or unvoiced and methods to synthesize speech from the model parameters are described.
Journal ArticleDOI

Speech coding: a tutorial review

TL;DR: The objective of this paper is to provide a tutorial overview of speech coding methodologies with emphasis on those algorithms that are part of the recent low-rate standards for cellular communications.
Book

An Introduction to Digital Speech Processing

TL;DR: A comprehensive overview of digital speech processing that ranges from the basic nature of the speech signal, through a variety of methods of representing speech in digital form, to applications in voice communication and automatic synthesis and recognition of speech.