A Multichannel Kalman-Based Wiener Filter Approach for Speaker Interference Reduction in Meetings

doi:10.1109/ICASSP40776.2020.9054482

Proceedings Article

Patrick Meyer, +2 more

- pp 451-455

TLDR

This work extends an existing approach by integrating methods from acoustic echo cancellation to improve the estimation of the interferer (noise) components of the filter, which leads to an improved signal-to-interferer ratio by up to 2.1 dB absolute at constant speech component quality.

Abstract:

Recording a meeting and obtaining clean speech signals of each speaker is a challenging task. Even with a multichannel recording, in which all speakers are equipped with a close-talk microphone, speech of an active speaker still couples not only into his dedicated microphone, but also into all other microphone channels at a certain level. This is denoted as crosstalk and requires a multichannel speaker interference reduction to enhance the microphone channels for further processing. To solve this issue, we use a Wiener filter which is based on all individual microphone channels. We extend an existing approach by integrating methods from acoustic echo cancellation to improve the estimation of the interferer (noise) components of the filter, which leads to an improved signal-to-interferer ratio by up to 2.1 dB absolute at constant speech component quality.
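
As a rough illustration of the Wiener-filter idea described in the abstract, the following sketch computes a per-bin spectral gain from a microphone power spectral density (PSD) and an estimated interferer PSD. It is a generic textbook-style gain rule, not the authors' Kalman-based estimator; the function names, the gain floor, and the toy PSD values are assumptions made purely for illustration.

```python
import math


def wiener_gain(mic_psd, interferer_psd, gain_floor=0.1):
    """Wiener-style gain for one time-frequency bin (hypothetical helper).

    mic_psd: PSD of the microphone signal (speech plus interferer).
    interferer_psd: estimated PSD of the crosstalk (interferer) component.
    gain_floor: assumed lower limit that avoids negative or zero gains.
    """
    if mic_psd <= 0.0:
        return gain_floor
    gain = 1.0 - interferer_psd / mic_psd
    return max(gain, gain_floor)


def sir_db(speech_power, interferer_power):
    """Signal-to-interferer ratio in dB for two positive powers."""
    return 10.0 * math.log10(speech_power / interferer_power)


# Toy PSD values for four frequency bins of one microphone channel.
mic_psd = [1.0, 0.8, 0.5, 0.2]
interferer_psd = [0.1, 0.4, 0.45, 0.3]

# Bins dominated by the interferer are pushed down to the gain floor.
gains = [wiener_gain(m, i) for m, i in zip(mic_psd, interferer_psd)]
```

For scale, an improvement of 2.1 dB corresponds to the signal-to-interferer power ratio growing by a factor of roughly 1.62, since 10^(2.1/10) is about 1.62.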

Citations

Journal Article

Distributed Combined Acoustic Echo Cancellation and Noise Reduction in Wireless Acoustic Sensor and Actuator Networks

Santiago Ruiz, +2 more

- 01 Jan 2022 -

IEEE/ACM transactions on audio, speech, ...

TL;DR: The paper presents distributed algorithms for combined acoustic echo cancellation (AEC) and noise reduction (NR) in a wireless acoustic sensor and actuator network (WASAN) where each node may have multiple microphones and multiple loudspeakers, and where the desired signal is a speech signal.

Posted Content

From a Fourier-Domain Perspective on Adversarial Examples to a Wiener Filter Defense for Semantic Segmentation

Nikhil Kapoor, +6 more

- 02 Dec 2020 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: An adversarial defense method based on the well-known Wiener filters that captures and suppresses adversarial frequencies in a data-driven manner is proposed that not only generalizes across unseen attacks but also beats five existing state-of-the-art methods across two models in a variety of attack settings.

Proceedings Article

From a Fourier-Domain Perspective on Adversarial Examples to a Wiener Filter Defense for Semantic Segmentation

Nikhil Kapoor, +6 more

TL;DR: In this article, the authors study the adversarial problem from a frequency domain perspective and propose an adversarial defense method based on the well-known Wiener filters that captures and suppresses adversarial frequencies in a data-driven manner.

Journal Article

Multichannel speaker interference reduction using frequency domain adaptive filtering

Patrick Meyer, +2 more

- 01 Dec 2020 -

Eurasip Journal on Audio, Speech, and Mu...

TL;DR: An adaptive filter method is integrated, which was originally proposed for acoustic echo cancellation (AEC), in order to obtain a well-performing interferer (noise) component estimation and results in an improved speech-to-interferer ratio by up to 2.7 dB at constant or even better speech component quality.

Proceedings Article

Machine learning based noise suppression in narrow-band speech communication systems

TL;DR: In this article, a machine learning based noise suppression approach that uses a neuro-fuzzy logic-based neural network for noise estimation and reduction is proposed, which is shown to give significant improvements in noise suppression compared to a non-adaptive approach.

References

Posted Content

Rethinking Atrous Convolution for Semantic Image Segmentation

Liang-Chieh Chen, +3 more

- 17 Jun 2017 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: The proposed `DeepLabv3' system significantly improves over the previous DeepLab versions without DenseCRF post-processing and attains comparable performance with other state-of-art models on the PASCAL VOC 2012 semantic image segmentation benchmark.

Proceedings Article

Deep clustering: Discriminative embeddings for segmentation and separation

John R. Hershey, +3 more

TL;DR: In this paper, a deep network is trained to assign contrastive embedding vectors to each time-frequency region of the spectrogram in order to implicitly predict the segmentation labels of the target spectrogram from the input mixtures.

Proceedings Article

The ICSI Meeting Corpus

Adam Janin, +10 more

TL;DR: A corpus of data from natural meetings that occurred at the International Computer Science Institute in Berkeley, California over the last three years is collected, which supports work in automatic speech recognition, noise robustness, dialog modeling, prosody, rich transcription, information retrieval, and more.

Journal Article

Multitalker Speech Separation With Utterance-Level Permutation Invariant Training of Deep Recurrent Neural Networks

Morten Kolbæk, +3 more

- 01 Oct 2017 -

IEEE Transactions on Audio, Speech, and ...

TL;DR: In this article, the utterance-level permutation invariant training (uPIT) technique was proposed for speaker independent multitalker speech separation, where RNNs, trained with uPIT, can separate multitalker mixed speech without any prior knowledge of signal duration, number of speakers, speaker identity, or gender.

Posted Content

Deep clustering: Discriminative embeddings for segmentation and separation

John R. Hershey, +3 more

- 18 Aug 2015 -

arXiv: Neural and Evolutionary Computing

TL;DR: Preliminary experiments on single-channel mixtures from multiple speakers show that a speaker-independent model trained on two-speaker mixtures can improve signal quality for mixtures of held-out speakers by an average of 6dB, and the same model does surprisingly well with three-speakers mixtures.

Related Papers (5)

Adaptive microphone array free of the desired speaker cancellation combined with postfilter

Slobodan Jovicic, +1 more

- 09 May 2008 -

Journal of the Acoustical Society of Ame...

Dynamic signal combining for distributed microphone systems in car environments

Speech enhancement using square microphone array for mobile devices

A minimum speech distortion multichannel algorithm for noise reduction

An online algorithm for echo cancellation, dereverberation and noise reduction based on a Kalman-EM Method