scispace - formally typeset
Open AccessJournal ArticleDOI

Stable dereverberation using microphone arrays for speaker verification

A. C. Surendran, +1 more
- 01 Nov 1994 - 
- Vol. 96, Iss: 5, pp 3261-3262
Reads0
Chats0
TLDR
Miyoshi and Kaneda as mentioned in this paper used row action projection (RAP) to solve a system of linear equations, starting from an initial guess, the solution is repeatedly projected onto each hyperplane of the equation system until it converges.
Abstract
The impulse response of a reverberant environment, in general, is a nonminimum phase and cannot be inverted. But an exact inverse of the environment can be obtained by modeling the room as a multiple input–output (MINT) system [M. Miyoshi and Y. Kaneda ICASSP (1986)]. In this report, this model is applied to a microphone array and is used as a front‐end processor for a speaker verification system. The G matrix is inverted using row action projection (RAP), an iterative approach to solving a system of linear equations. Starting from an initial guess, the solution is repeatedly projected onto each hyperplane of the equation system until it converges. The method is stable, robust to noise, and converges to the pseudo‐inverse solution. In computer‐simulated experiments, the signal‐to‐reverberant‐noise ratio is found to improve with the number of microphones in the array. A speaker verification system using the array is evaluated at various signal‐to‐competing‐noise ratios (SCNR). Results suggest that verification performance can be substantially elevated in adverse acoustic environments.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Distant-talking speaker identification by generalized spectral subtraction-based dereverberation and its efficient computation

TL;DR: This paper addresses the problem of training speaker models using dereverberant speech obtained by suppressing reverberation from arbitrary artificial reverberant speech by training an efficient computational method for a combination of the likelihood of dereverBERant speech using multiple compensation parameter sets.
Proceedings ArticleDOI

Denoising autoencoder and environment adaptation for distant-talking speech recognition with asynchronous speech recording

TL;DR: A robust distant-talking speech recognition system with asynchronous speech recording is proposed by combining denoising autoencoder-based cepstral-domain dereverberation, automatic asynchronous speech (microphone or mobile terminal) selection and environment adaptation.
Proceedings ArticleDOI

Speech selection and environmental adaptation for asynchronous speech recognition

TL;DR: A robust distant-talking speech recognition system with asynchronous speech recording is proposed by combining automatic asynchronous speech selection and environmental adaptation with deep neural network based framework.
Related Papers (5)