Separation of speech from interfering speech by means of harmonic selection

doi:10.1121/1.381172

Journal ArticleDOI

Separation of speech from interfering speech by means of harmonic selection

Thomas W. Parsons

- 01 Oct 1976 -

Journal of the Acoustical Society of Ame...

- Vol. 60, Iss: 4, pp 911-918

Chats0

TLDR

In this paper, the harmonics of the desired voice in the Fourier transform of the input were selected to distinguish between two different voices. But the authors focus on the principal subproblem, the separation of vocalic speech.

Abstract:

A common type of interference in speech transmission is that caused by the speech of a competing talker. Although the brain is adept at clarifying such speech, it relies heavily on binaural data. When voices interfere over a single channel, separation is much more difficult and intelligibility suffers. Clarifying such speech is a complex and varied problem whose nature changes with the moment‐to‐moment variation in the types of sound which interfere. This paper describes an attack on the principal subproblem, the separation of vocalic speech. Separation is done by selecting the harmonics of the desired voice in the Fourier transform of the input. In implementing this process, techniques have been developed for resolving overlapping spectrum components, for determining pitches of both talkers, and for assuring consistent separation. These techniques are described, their performance on test utterances is summarized, and the possibility of using this process as a basis for the solution of the general two‐tal...

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Multiple fundamental frequency estimation based on harmonicity and spectral smoothness

Anssi Klapuri

- 01 Nov 2003 -

IEEE Transactions on Speech and Audio Pr...

TL;DR: The spectral smoothness principle is proposed as an efficient new mechanism in estimating the spectral envelopes of detected sounds and works robustly in noise, and is able to handle sounds that exhibit inharmonicities.

...read moreread less

Journal ArticleDOI

A real-time music-scene-description system: predominant-F0 estimation for detecting melody and bass lines in real-world audio signals

Masataka Goto

- 01 Sep 2004 -

Speech Communication

TL;DR: A predominant-F0 estimation method called PreFEst is proposed that does not rely on the unreliable fundamental component and obtains the most predominant F0 supported by harmonics within an intentionally limited frequency range.

...read moreread less

Journal ArticleDOI

Separation of speech from interfering sounds based on oscillatory correlation

DeLiang Wang, +1 more

- 01 May 1999 -

IEEE Transactions on Neural Networks

TL;DR: A multistage neural model is proposed for an auditory scene analysis task--segregating speech from interfering sound sources, a two-layer oscillator network that performs stream segregation on the basis of oscillatory correlation.

...read moreread less

Reference BookDOI

Speech processing : a dynamic and optimization-oriented approach

Li Deng, +1 more

TL;DR: Analysis of discrete-time speech signals probability and random processes linear model and dynamic system model optimization methods and estimation theory statistical pattern recognition helps clarify speech technology in selected areas.

...read moreread less

Proceedings Article

Speech enhancement

Jae Lim

TL;DR: An overview of various techniques that have been proposed for enhancement of speech is provided to suggest some directions for future research in the speech enhancement problem.

...read moreread less

Collapse

Separation of speech from interfering speech by means of harmonic selection

Citations

Multiple fundamental frequency estimation based on harmonicity and spectral smoothness

A real-time music-scene-description system: predominant-F0 estimation for detecting melody and bass lines in real-world audio signals

Separation of speech from interfering sounds based on oscillatory correlation

Speech processing : a dynamic and optimization-oriented approach

Speech enhancement

Related Papers (5)

Computational auditory scene analysis

Some Experiments on the Recognition of Speech, with One and with Two Ears

Auditory Scene Analysis

Computational Auditory Scene Analysis: Principles, Algorithms, and Applications

Auditory Scene Analysis: The Perceptual Organization of Sound