Separation of speech from interfering speech by means of harmonic selection

doi:10.1121/1.381172

Journal ArticleDOI

Separation of speech from interfering speech by means of harmonic selection

Thomas W. Parsons

- 01 Oct 1976 -

Journal of the Acoustical Society of Ame...

- Vol. 60, Iss: 4, pp 911-918

Chats0

TLDR

In this paper, the harmonics of the desired voice in the Fourier transform of the input were selected to distinguish between two different voices. But the authors focus on the principal subproblem, the separation of vocalic speech.

Abstract:

A common type of interference in speech transmission is that caused by the speech of a competing talker. Although the brain is adept at clarifying such speech, it relies heavily on binaural data. When voices interfere over a single channel, separation is much more difficult and intelligibility suffers. Clarifying such speech is a complex and varied problem whose nature changes with the moment‐to‐moment variation in the types of sound which interfere. This paper describes an attack on the principal subproblem, the separation of vocalic speech. Separation is done by selecting the harmonics of the desired voice in the Fourier transform of the input. In implementing this process, techniques have been developed for resolving overlapping spectrum components, for determining pitches of both talkers, and for assuring consistent separation. These techniques are described, their performance on test utterances is summarized, and the possibility of using this process as a basis for the solution of the general two‐tal...

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Auditory stream segregation and the control of dissonance in polyphonic music

James K. Wright, +1 more

- 01 Jan 1987 -

Contemporary Music Review

TL;DR: In this article, the authors introduce music theorists to some of the ideas and research that have evolved into the theory of auditory stream segregation, and discuss some ways that the auditory stream-forming process can influence our perception of the linear and harmonic dimensions of polyphonic music.

...read moreread less

A Real-time Music Scene Description System: Detecting Melody and Bass Lines in Audio Signals

Masataka Goto, +1 more

TL;DR: A predominant-pitch estimation method that enables a realtime system detecting melody and bass lines as a subsystem of the authors' music scene description system and is robust enough to estimate the predominant F0s of the melody andbass lines in real-world audio signals.

...read moreread less

Journal ArticleDOI

Primitive Auditory Segregation Based on Oscillatory Correlation

DeLiang Wang

- 01 Jul 1996 -

Cognitive Science

TL;DR: In this paper, a laterally coupled two-dimensional network of relaxation oscillators with a global inhibitor is proposed for primitive auditory scene analysis, which can group auditory features into a stream by phase synchrony and segregate different streams by desynchronization.

...read moreread less

Journal ArticleDOI

Epoch-based analysis of speech signals

B. Yegnanarayana, +1 more

- 22 Nov 2011 -

Sadhana-academy Proceedings in Engineeri...

TL;DR: In this paper, the importance of epochs for speech analysis is discussed, and methods to extract the epoch information are reviewed, and applications of epoch extraction for some speech applications are demonstrated.

...read moreread less

Journal ArticleDOI

On noise masking for automatic missing data speech recognition: A survey and discussion

Christophe Cerisara, +2 more

- 01 Jul 2007 -

Computer Speech & Language

TL;DR: The objective of this study is to identify the mask estimation methods that have been proposed so far, and to open this domain up to other related research, which could be adapted to overcome this difficult challenge.

...read moreread less

Collapse

Separation of speech from interfering speech by means of harmonic selection

Citations

Auditory stream segregation and the control of dissonance in polyphonic music

A Real-time Music Scene Description System: Detecting Melody and Bass Lines in Audio Signals

Primitive Auditory Segregation Based on Oscillatory Correlation

Epoch-based analysis of speech signals

On noise masking for automatic missing data speech recognition: A survey and discussion

Related Papers (5)

Computational auditory scene analysis

Some Experiments on the Recognition of Speech, with One and with Two Ears

Auditory Scene Analysis

Computational Auditory Scene Analysis: Principles, Algorithms, and Applications

Auditory Scene Analysis: The Perceptual Organization of Sound