Relative Transfer Function Identification Using Convolutive Transfer Function Approximation

doi:10.1109/TASL.2008.2009576

Journal ArticleDOI

Relative Transfer Function Identification Using Convolutive Transfer Function Approximation

Ronen Talmon, +2 more

- 01 May 2009 -

IEEE Transactions on Audio, Speech, and ...

- Vol. 17, Iss: 4, pp 546-555

Chats0

TLDR

An unbiased RTF estimator is developed that exploits the nonstationarity and presence probability of the speech signal and derive an analytic expression for the estimator variance.

Abstract:

In this paper, we present a relative transfer function (RTF) identification method for speech sources in reverberant environments. The proposed method is based on the convolutive transfer function (CTF) approximation, which enables to represent a linear convolution in the time domain as a linear convolution in the short-time Fourier transform (STFT) domain. Unlike the restrictive and commonly used multiplicative transfer function (MTF) approximation, which becomes more accurate when the length of a time frame increases relative to the length of the impulse response, the CTF approximation enables representation of long impulse responses using short time frames. We develop an unbiased RTF estimator that exploits the nonstationarity and presence probability of the speech signal and derive an analytic expression for the estimator variance. Experimental results show that the proposed method is advantageous compared to common RTF identification methods in various acoustic environments, especially when identifying long RTFs typical to real rooms.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Machine learning in acoustics: theory and applications

Michael J. Bianco, +6 more

- 11 May 2019 -

arXiv: Signal Processing

TL;DR: In this paper, the authors survey the recent advances and transformative potential of machine learning (ML) including deep learning, in the field of acoustics and highlight ML developments in four acoustICS research areas: source localization in speech processing, source localization from ocean acoustic, bioacoustics, and environmental sounds in everyday scenes.

...read moreread less

Book ChapterDOI

Acoustic Beamforming for Hearing Aid Applications

Simon Doclo, +3 more

TL;DR: This chapter contains sections titled: Introduction Overview of noise reduction techniques Monaural beamforming Binaural beamforms Conclusion References.

...read moreread less

Journal ArticleDOI

Multi-channel linear prediction-based speech dereverberation with sparse priors

Ante Jukic, +3 more

- 01 Sep 2015 -

IEEE Transactions on Audio, Speech, and ...

TL;DR: This paper proposes to model the desired speech signal using a general sparse prior that can be represented in a convex form as a maximization over scaled complex Gaussian distributions, which can be interpreted as a generalization of the commonly used time-varying Gaussian model.

...read moreread less

Journal ArticleDOI

The binaural LCMV beamformer and its performance analysis

Elior Hadad, +2 more

- 01 Mar 2016 -

IEEE Transactions on Audio, Speech, and ...

TL;DR: A theoretical analysis of the BLCMV beamformer is presented and several decompositions are introduced that reveal its capabilities in terms of interference and noise reduction, while controlling the binaural cues of the desired and the interfering sources.

...read moreread less

Journal ArticleDOI

Theoretical analysis of binaural transfer function MVDR beamformers with interference cue preservation constraints

Elior Hadad, +3 more

- 01 Dec 2015 -

IEEE Transactions on Audio, Speech, and ...

TL;DR: Among all beamformers which are distortionless with respect to the desired source and preserve the binaural cues of the interfering source, the newly proposed BMVDR-RTF beamformer is optimal in terms of SINR.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Book

Fundamentals Of Statistical Signal Processing

Steven Kay

Journal ArticleDOI

Image method for efficiently simulating small‐room acoustics

Jont B. Allen, +1 more

- 01 Nov 1976 -

Journal of the Acoustical Society of Ame...

TL;DR: The theoretical and practical use of image techniques for simulating the impulse response between two points in a small rectangular room, when convolved with any desired input signal, simulates room reverberation of the input signal.

...read moreread less

Darpa Timit Acoustic-Phonetic Continuous Speech Corpus CD-ROM {TIMIT} | NIST

John S. Garofolo, +5 more

Dataset

TIMIT Acoustic-Phonetic Continuous Speech Corpus

John S. Garofolo, +6 more

TL;DR: The TIMIT corpus as mentioned in this paper contains broadband recordings of 630 speakers of eight major dialects of American English, each reading ten phonetically rich sentences, including time-aligned orthographic, phonetic and word transcriptions as well as a 16-bit, 16kHz speech waveform file for each utterance.

...read moreread less

Journal ArticleDOI

Noise spectrum estimation in adverse environments: improved minima controlled recursive averaging

Israel Cohen

- 26 Aug 2003 -

IEEE Transactions on Speech and Audio Pr...

TL;DR: In this article, an improved minima controlled recursive averaging (IMCRA) approach is proposed for noise estimation in adverse environments involving nonstationary noise, weak speech components, and low input signal-to-noise ratio (SNR).

...read moreread less