Open Access
Recursive Closed-Form Optimization of Spectral Audio Power Allocation for Near End Listening Enhancement
Bastian Sauert, Peter Vary
- pp 1-4
TLDR
In this contribution, the calculation rules of the Speech Intelligibility Index (SII) are analysed and a recursive closed-form solution is developed which maximizes the SII under the constraint of an unchanged average power of the audio signal.
Abstract:
In mobile telephony, near end listening enhancement is desired by the near end listener, who perceives not only the clean far end speech but also ambient background noise. A typical scenario is mobile telephony in acoustical background noise such as traffic or babble noise. In such a situation, it is often not acceptable or not possible to increase the audio power. In this contribution we analyse the calculation rules of the Speech Intelligibility Index (SII) and develop a recursive closed-form solution which maximizes the SII under the constraint of an unchanged average power of the audio signal. This solution has very low complexity compared to a previous approach of the authors and is thus suitable for real-time processing.
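The power constraint named in the abstract can be illustrated with a minimal sketch. It is not the authors' recursive closed-form SII maximization, which this page does not describe. It only shows how a set of per-band power gains might be rescaled so that the total audio power stays unchanged. The function name `renormalize_gains`, the use of linear power gains, and the example band values are all assumptions made for illustration.

```python
def renormalize_gains(band_powers, gains):
    """Rescale per-band power gains so total output power equals total input power.

    band_powers: speech power per frequency band (linear scale, hypothetical values)
    gains:       candidate power gains per band (linear scale, not dB)
    """
    if len(band_powers) != len(gains):
        raise ValueError("band_powers and gains must have the same length")

    total_in = sum(band_powers)
    total_out = sum(g * p for g, p in zip(gains, band_powers))
    if total_out <= 0:
        raise ValueError("gains must leave some positive output power")

    # A single common factor restores the original total power while
    # preserving the relative shape of the gain curve across bands.
    scale = total_in / total_out
    return [g * scale for g in gains]


# Hypothetical three-band example: boost the first band, attenuate the last.
band_powers = [1.0, 2.0, 1.0]
candidate_gains = [2.0, 1.0, 0.5]
constrained_gains = renormalize_gains(band_powers, candidate_gains)
output_power = sum(g * p for g, p in zip(constrained_gains, band_powers))
```

After rescaling, `output_power` matches `sum(band_powers)` up to floating-point rounding. In this sketch the relative distribution of power across bands is the only thing the gains change.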
Citations
Journal ArticleDOI
Evaluating the intelligibility benefit of speech modifications in known noise conditions
Martin Cooke, Catherine Mayo, Cassia Valentini-Botinhao, Yannis Stylianou, Bastian Sauert, Yan Tang
TL;DR: The current study compares the benefits of speech modification algorithms in a large-scale speech intelligibility evaluation and quantifies the equivalent intensity change, defined as the amount in decibels that unmodified speech would need to be adjusted by in order to achieve the same intelligibility as modified speech.
Proceedings ArticleDOI
Speech-in-noise intelligibility improvement based on spectral shaping and dynamic range compression.
TL;DR: Experiments with speech-shaped (SSN) and competing-speaker types of noise at various low SNR values show that the suggested approach outperforms state-of-the-art methods in terms of the Speech Intelligibility Index (SII).
Journal ArticleDOI
On Optimal Linear Filtering of Speech for Near-End Listening Enhancement
TL;DR: Experiments show large intelligibility improvements with the proposed method over the unprocessed noisy speech and better performance than one state-of-the-art method.
Proceedings ArticleDOI
Intelligibility-enhancing speech modifications: the Hurricane Challenge
TL;DR: Surprisingly, for most conditions the largest gains were observed for noise-independent algorithms, suggesting that performance in this task can be further improved by exploiting information in the masking signal.
Journal ArticleDOI
Speech energy redistribution for intelligibility improvement in noise based on a perceptual distortion measure
TL;DR: A speech pre-processing algorithm is presented that improves the speech intelligibility in noise for the near-end listener by optimally redistributing the speech energy over time and frequency according to a perceptual distortion measure, which is based on a spectro-temporal auditory model.
References
Journal ArticleDOI
The American National Standards Institute
TL;DR: This document describes the organization and operations of the American National Standards Institute and describes the research and development activities of the institute.
Proceedings ArticleDOI
Near End Listening Enhancement: Speech Intelligibility Improvement in Noisy Environments
Bastian Sauert, Peter Vary
TL;DR: A digital signal processing algorithm to improve intelligibility of clean far end speech for the near end listener who is located in an environment with background noise is presented.
Proceedings Article
Near end listening enhancement optimized with respect to speech intelligibility index and audio power limitations
Bastian Sauert, Peter Vary
TL;DR: This contribution uses a theoretical analysis of the Speech Intelligibility Index (SII) to develop an algorithm which numerically maximizes the SII under the constraint of an unchanged average power of the audio signal.
Journal ArticleDOI
Uniform and warped low delay filter-banks for speech enhancement
Heinrich W. Löllmann, Peter Vary
TL;DR: A versatile filter-bank concept for adaptive subband filtering is proposed, which achieves a significantly lower algorithmic signal delay than commonly used analysis-synthesis filter-banks and performs time-domain filtering with coefficients adapted in the uniform or non-uniform frequency-domain.
Near end listening enhancement with strict loudspeaker output power constraining
TL;DR: In this paper, the authors investigated the opportunities of listening enhancement under the constraint that the processed loudspeaker signal power is strictly equal to the power of the received signal, and compared two reasonable processing strategies: a previous one that aims at amplifying speech at noisy frequencies and a new one that cuts down the speech power at noisy frequencies.
Related Papers (5)
On Optimal Linear Filtering of Speech for Near-End Listening Enhancement
The enhancement of speech intelligibility in high noise levels by high-pass filtering followed by rapid amplitude compression
R. Niederjohn, J. Grotelueschen
Near End Listening Enhancement: Speech Intelligibility Improvement in Noisy Environments
Bastian Sauert, Peter Vary