scispace - formally typeset
Journal ArticleDOI

Separation of speech from interfering speech by means of harmonic selection

Thomas W. Parsons
- 01 Oct 1976 - 
- Vol. 60, Iss: 4, pp 911-918
Reads0
Chats0
TLDR
In this paper, the harmonics of the desired voice in the Fourier transform of the input were selected to distinguish between two different voices. But the authors focus on the principal subproblem, the separation of vocalic speech.
Abstract
A common type of interference in speech transmission is that caused by the speech of a competing talker. Although the brain is adept at clarifying such speech, it relies heavily on binaural data. When voices interfere over a single channel, separation is much more difficult and intelligibility suffers. Clarifying such speech is a complex and varied problem whose nature changes with the moment‐to‐moment variation in the types of sound which interfere. This paper describes an attack on the principal subproblem, the separation of vocalic speech. Separation is done by selecting the harmonics of the desired voice in the Fourier transform of the input. In implementing this process, techniques have been developed for resolving overlapping spectrum components, for determining pitches of both talkers, and for assuring consistent separation. These techniques are described, their performance on test utterances is summarized, and the possibility of using this process as a basis for the solution of the general two‐tal...

read more

Citations
More filters
Journal ArticleDOI

Application of combined temporal and spectral processing methods for speaker recognition under noisy, reverberant or multi-speaker environments

TL;DR: The experimental results show that the combined TSP methods give relatively higher recognition performance compared to either temporal or spectral processing alone.
Journal ArticleDOI

Group Delay Based Methods for Speaker Segregation and its Application in Multimedia Information Retrieval

TL;DR: A novel method of single channel speaker segregation using the group delay cross correlation function is proposed in this paper and a cell phone based multimedia information retrieval system (MIRS) for multi-source meeting environments are developed.
Proceedings ArticleDOI

A speech separation system that is robust to reverberation

TL;DR: A speech separation system has been built up that can successfully extract one speech from mixed speech even when there are more than two overlapping speech signals.

A new score function for joint evaluation of multiplef0 hypotheses

TL;DR: In this article, the fundamental frequencies of the quasiharmonic sources in polyphonic signals for the case that the number of sources is known are estimated based on three physical principles: harmonicity, spectral smoothness and synchronous amplitude evolution within a single source.