On the use of explicit speech modeling in microphone array applications

doi:10.1109/ICASSP.1998.679662

Proceedings ArticleDOI

On the use of explicit speech modeling in microphone array applications

- Vol. 6, pp 3613-3616

TLDR

This paper addresses the limitations of current approaches to distant-talker speech acquisition and advocates the development of techniques which explicitly incorporate the nature of the speech signal into a multi-channel context.

Abstract:

This paper addresses the limitations of current approaches to distant-talker speech acquisition and advocates the development of techniques which explicitly incorporate the nature of the speech signal (e.g. statistical non-stationarity, method of production, pitch, voicing, formant structure, and source radiator model) into a multi-channel context. The goal is to combine the advantages of spatial filtering achieved through beamforming with knowledge of the desired time-series attributes. The potential utility of such an approach is demonstrated through the application of a multi-channel version of the dual excitation speech model.

Citations

PDF

Open Access

More filters

DOI

Single- and multi-microphone speech dereverberation using spectral enhancement

Eap Emanuël Habets

TL;DR: Novel single- and multimicrophone speech dereverberation algorithms are developed that aim at the suppression of late reverberation, i.e., signal processing techniques to reduce the detrimental effects of reflections.

...read moreread less

Proceedings ArticleDOI

Speech dereverberation via maximum-kurtosis subband adaptive filtering

B.W. Gillespie, +2 more

TL;DR: An efficient algorithm for high-quality speech capture in applications such as hands-free teleconferencing or voice recording by personal computers using a modulated complex lapped transform (MCLT), in which the subband filters are adapted to maximize the kurtosis of the linear prediction residual of the reconstructed speech.

...read moreread less

PatentDOI

Method and apparatus for removing noise from feature vectors

Brendan J. Frey, +2 more

- 20 Mar 2001 -

Journal of the Acoustical Society of Ame...

TL;DR: In this paper, a method and computer-readable medium are provided for identifying clean signal feature vectors from noisy signal feature vector, which is based on variational inference techniques, using an iterative approach to identify the clean signal vector.

...read moreread less

Proceedings Article

Speech Denoising and Dereverberation Using Probabilistic Models

Hagai Attias, +3 more

TL;DR: A unified probabilistic framework for denoising and dereverberation of speech signals that is to use a strong speech model that is pre-trained on a large data set of clean speech to get results substantially better than standard methods.

...read moreread less

Journal ArticleDOI

A Supervised Learning Approach to Monaural Segregation of Reverberant Speech

Zhaozhang Jin, +1 more

- 01 May 2009 -

IEEE Transactions on Audio, Speech, and ...

TL;DR: A supervised learning approach to monaural segregation of reverberant voiced speech is proposed, which learns to map from a set of pitch-based auditory features to a grouping cue encoding the posterior probability of a time-frequency (T-F) unit being target dominant given observed features.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

Image method for efficiently simulating small‐room acoustics

Jont B. Allen, +1 more

- 01 Nov 1976 -

Journal of the Acoustical Society of Ame...

TL;DR: The theoretical and practical use of image techniques for simulating the impulse response between two points in a small rectangular room, when convolved with any desired input signal, simulates room reverberation of the input signal.

...read moreread less

Book

Discrete-Time Processing of Speech Signals

J. R. Deller, +2 more

TL;DR: The preface to the IEEE Edition explains the background to speech production, coding, and quality assessment and introduces the Hidden Markov Model, the Artificial Neural Network, and Speech Enhancement.

...read moreread less

Book

Array Signal Processing: Concepts and Techniques

Don H. Johnson, +1 more

TL;DR: This chapter discusses how signals in Space and Time and apertures and Arrays affect Array Processing and the role that symbols play in this processing.

...read moreread less

Journal ArticleDOI

Speech analysis/Synthesis based on a sinusoidal representation

R.J. McAulay, +1 more

- 01 Aug 1986 -

IEEE Transactions on Acoustics, Speech, ...

TL;DR: A sinusoidal model for the speech waveform is used to develop a new analysis/synthesis technique that is characterized by the amplitudes, frequencies, and phases of the component sine waves, which forms the basis for new approaches to the problems of speech transformations including time-scale and pitch-scale modification, and midrate speech coding.

...read moreread less

Book

Array Signal Processing

S. Unnikrishna Pillai

TL;DR: The author explains the development of the Wiener Solution and some of the techniques used in its implementation, including Optimum Processing: Steady State Performance and theWiener Solution, which simplifies the implementation of the Covariance Matrix.

...read moreread less

On the use of explicit speech modeling in microphone array applications

Citations

Single- and multi-microphone speech dereverberation using spectral enhancement

Speech dereverberation via maximum-kurtosis subband adaptive filtering

Method and apparatus for removing noise from feature vectors

Speech Denoising and Dereverberation Using Probabilistic Models

A Supervised Learning Approach to Monaural Segregation of Reverberant Speech

References

Image method for efficiently simulating small‐room acoustics

Discrete-Time Processing of Speech Signals

Array Signal Processing: Concepts and Techniques

Speech analysis/Synthesis based on a sinusoidal representation

Array Signal Processing

Related Papers (5)

Image method for efficiently simulating small‐room acoustics

Suppression of acoustic noise in speech using spectral subtraction

Independent Component Analysis

All-pole modeling of degraded speech

Statistical-model-based speech enhancement systems