Statistical analysis of the autoregressive modeling of reverberant speech

doi:10.1121/1.2356840

Journal Article

Nikolay D. Gaubitch, +2 more

- 06 Dec 2006 -

Journal of the Acoustical Society of Ame...

- Vol. 120, Iss: 6, pp 4031-4039

TLDR

Analytical results from statistical room acoustics are utilized to analyze the AR modeling of speech under reverberant conditions, and it is demonstrated that at each individual source-microphone position (without spatial expectation), the M-channel AR coefficients provide the best approximation to the clean speech coefficients when microphones are closely spaced.

Abstract:

Hands-free speech input is required in many modern telecommunication applications that employ autoregressive (AR) techniques such as linear predictive coding. When the hands-free input is obtained in enclosed reverberant spaces such as typical office rooms, the speech signal is distorted by the room transfer function. This paper utilizes theoretical results from statistical room acoustics to analyze the AR modeling of speech under these reverberant conditions. Three cases are considered: (i) AR coefficients calculated from a single observation; (ii) AR coefficients calculated jointly from an M-channel observation (M > 1); and (iii) AR coefficients calculated from the output of a delay-and-sum beamformer. The statistical analysis, with supporting simulations, shows that the spatial expectations of the AR coefficients for cases (i) and (ii) are approximately equal to the coefficients of the original speech, while for case (iii) there is a discrepancy, due to spatial correlation between the microphones, which can be significant. It is subsequently demonstrated that at each individual source-microphone position (without spatial expectation), the M-channel AR coefficients from case (ii) provide the best approximation to the clean speech coefficients when microphones are closely spaced (< 0.3 m).
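
The three cases in the abstract can be illustrated with a minimal Python sketch. It is not the paper's implementation. It assumes the autocorrelation (Yule-Walker) method of linear prediction, assumes that the joint M-channel estimate of case (ii) averages the per-channel autocorrelations before solving the normal equations, and assumes integer-sample steering delays for the beamformer of case (iii). All function names (`synth_ar`, `ar_single`, `ar_multichannel`, `ar_delay_and_sum`) and the toy impulse responses are hypothetical.

```python
import numpy as np


def synth_ar(coeffs, n, rng):
    """Generate n samples of x[t] = sum_k coeffs[k-1] * x[t-k] + e[t] (white Gaussian e)."""
    coeffs = np.asarray(coeffs, dtype=float)
    p = len(coeffs)
    e = rng.standard_normal(n + p)
    x = np.zeros(n + p)
    for t in range(p, n + p):
        # x[t-p:t][::-1] is x[t-1], x[t-2], ..., x[t-p]
        x[t] = np.dot(coeffs, x[t - p:t][::-1]) + e[t]
    return x[p:]


def autocorr(x, order):
    """Biased autocorrelation estimates r[0..order] of a 1-D signal."""
    x = np.asarray(x, dtype=float)
    n = len(x)
    return np.array([np.dot(x[:n - k], x[k:]) / n for k in range(order + 1)])


def ar_from_autocorr(r):
    """Solve the Yule-Walker equations R a = r[1:] for the prediction coefficients a."""
    order = len(r) - 1
    R = np.array([[r[abs(i - j)] for j in range(order)] for i in range(order)])
    return np.linalg.solve(R, r[1:])


def ar_single(x, order):
    """Case (i): AR coefficients from a single observation."""
    return ar_from_autocorr(autocorr(x, order))


def ar_multichannel(X, order):
    """Case (ii), as assumed here: one coefficient set from channel-averaged autocorrelations."""
    r = np.mean([autocorr(x, order) for x in X], axis=0)
    return ar_from_autocorr(r)


def ar_delay_and_sum(X, order, delays=None):
    """Case (iii): AR coefficients from a delay-and-sum beamformer output.

    delays are integer sample delays per channel; np.roll wraps around at the
    edges, which is a simplification acceptable only for long signals.
    """
    X = np.asarray(X, dtype=float)
    if delays is None:
        delays = [0] * X.shape[0]
    aligned = [np.roll(x, -d) for x, d in zip(X, delays)]
    return ar_single(np.mean(aligned, axis=0), order)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    clean = synth_ar([1.2, -0.5], 20000, rng)  # toy stand-in for speech
    # Hypothetical "room responses": direct path plus exponentially decaying noise.
    M, L = 4, 200
    decay = np.exp(-np.arange(L) / 40.0)
    X = []
    for _ in range(M):
        h = 0.3 * rng.standard_normal(L) * decay
        h[0] = 1.0
        X.append(np.convolve(clean, h)[:len(clean)])
    X = np.stack(X)
    print("clean      :", ar_single(clean, 2))
    print("case (i)   :", ar_single(X[0], 2))
    print("case (ii)  :", ar_multichannel(X, 2))
    print("case (iii) :", ar_delay_and_sum(X, 2))
```

The toy impulse responses are independent random sequences, so the demo does not reproduce the spatial correlation between closely spaced microphones that the paper analyzes. It only shows how the three estimators differ in where the averaging takes place: over autocorrelations in case (ii) and over waveforms in case (iii).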

Citations

Single- and multi-microphone speech dereverberation using spectral enhancement

Temporal Dynamics for Blind Measurement of Room Acoustical Parameters

Regularization for Partial Multichannel Equalization for Speech Dereverberation

Spatiotemporal Averaging Method for Enhancement of Reverberant Speech

Speech enhancement for robust automatic speech recognition: Evaluation using a baseline system and instrumental measures

References

Linear prediction: A tutorial review

Beamforming: a versatile approach to spatial filtering

Image method for efficiently simulating small‐room acoustics

Kendall's advanced theory of statistics

Discrete-Time Processing of Speech Signals
