scispace - formally typeset
Open AccessJournal ArticleDOI

Microphone array speaker localizers using spatial-temporal information

Reads0
Chats0
TLDR
This study shows that in common TDOA-based localization scenarios—where the microphone array has small interelement spread relative to the source position—the elevation and azimuth angles can be accurately estimated, whereas the Cartesian coordinates as well as the range are poorly estimated.
Abstract
A dual-step approach for speaker localization based on a microphone array is addressed in this paper. In the first stage, which is not the main concern of this paper, the time difference between arrivals of the speech signal at each pair of microphones is estimated. These readings are combined in the second stage to obtain the source location. In this paper, we focus on the second stage of the localization task. In this contribution, we propose to exploit the speaker's smooth trajectory for improving the current position estimate. Three localization schemes, which use the temporal information, are presented. The first is a recursive form of the Gauss method. The other two are extensions of the Kalman filter to the nonlinear problem at hand, namely, the extended Kalman filter and the unscented Kalman filter. These methods are compared with other algorithms, which do not make use of the temporal information. An extensive experimental study demonstrates the advantage of using the spatial-temporal methods. To gain some insight on the obtainable performance of the localization algorithm, an approximate analytical evaluation, verified by an experimental study, is conducted. This study shows that in common TDOA-based localization scenarios--where the microphone array has small interelement spread relative to the source position--the elevation and azimuth angles can be accurately estimated, whereas the Cartesian coordinates as well as the range are poorly estimated.

read more

Citations
More filters
Journal ArticleDOI

Speaker Tracking Using Recursive EM Algorithms

TL;DR: This work forms the localization task as a maximum likelihood (ML) parameter estimation problem, and solves it by utilizing the expectation-maximization (EM) procedure, and proposes to adapt two recursive EM (REM) variants based on Titterington's scheme.
Journal ArticleDOI

Particle filter with integrated voice activity detection for acoustic source tracking

TL;DR: The experimental results demonstrate the improved robustness of the method described in this work when tracking sources emitting real-world speech signals, which typically involve significant silence gaps between utterances.
Journal ArticleDOI

Parametrization of Linear Systems Using Diffusion Kernels

TL;DR: This work proposes a supervised algorithm to recover the controlling parameters of natural and artificial linear systems and claims that each system can be viewed as a black box controlled by several independent parameters.
Journal ArticleDOI

Multiple-Hypothesis Extended Particle Filter for Acoustic Source Localization in Reverberant Environments

TL;DR: The EPF scheme is adapted to the multiple-hypothesis model to track a single acoustic source in reverberant environments and it is shown that splitting the array into several sub-arrays improves the robustness of the estimated source location.
Journal ArticleDOI

A Wrapped Kalman Filter for Azimuthal Speaker Tracking

TL;DR: This work presents the wrapped Kalman filter (WKF) for tracking the azimuth of a speaker with a compact, 3-channel microphone array with a wrapped Gaussian distribution and shows that this achieves a lower mean squared error than 2-D methods.
References
More filters
Book

Numerical Recipes in C: The Art of Scientific Computing

TL;DR: Numerical Recipes: The Art of Scientific Computing as discussed by the authors is a complete text and reference book on scientific computing with over 100 new routines (now well over 300 in all), plus upgraded versions of many of the original routines, with many new topics presented at the same accessible level.
Journal ArticleDOI

Unscented filtering and nonlinear estimation

TL;DR: The motivation, development, use, and implications of the UT are reviewed, which show it to be more accurate, easier to implement, and uses the same order of calculations as linearization.

Numerical Recipes in FORTRAN - The Art of Scientific Computing - Second Edition

TL;DR: This paper presents a list of recommended recipes for making CDRom decks and some examples of how these recipes can be modified to suit theommelier's needs.
Journal ArticleDOI

The generalized correlation method for estimation of time delay

TL;DR: In this paper, a maximum likelihood estimator is developed for determining time delay between signals received at two spatially separated sensors in the presence of uncorrelated noise, where the role of the prefilters is to accentuate the signal passed to the correlator at frequencies for which the signal-to-noise (S/N) ratio is highest and suppress the noise power.
Related Papers (5)