Open Access

Recursive Closed-Form Optimization of Spectral Audio Power Allocation for Near End Listening Enhancement

TLDR
In this contribution, the calculation rules of the Speech Intelligibility Index (SII) are analysed and a recursive closed-form solution is developed which maximizes the SII under the constraint of an unchanged average power of the audio signal.
Abstract
In mobile telephony, near end listening enhancement is desired by the near end listener, who perceives not only the clean far end speech but also ambient background noise. A typical scenario is a call made in acoustical background noise such as traffic or babble noise. In such a situation, it is often not acceptable or not possible to increase the audio power. In this contribution we analyse the calculation rules of the Speech Intelligibility Index (SII) and develop a recursive closed-form solution which maximizes the SII under the constraint of an unchanged average power of the audio signal. This solution has very low complexity compared to a previous approach of the authors and is thus suitable for real-time processing.
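
The power constraint described in the abstract can be illustrated with a minimal sketch. This is not the authors' recursive closed-form solution, which is not given on this page. It only assumes a simplified SII (importance-weighted band audibility, with audibility taken as the band SNR in dB mapped linearly from [-15, +15] dB to [0, 1], ignoring level distortion and spread of masking) and shows how a reallocation of band powers can be rescaled so that the total power stays unchanged. The function names, band powers, importance values, and weights are all hypothetical.

```python
import math

def simplified_sii(speech_power, noise_power, importance):
    """Simplified SII: importance-weighted sum of band audibilities.

    Assumption: audibility = clip((SNR_dB + 15) / 30, 0, 1) per band.
    Level distortion and spread of masking are ignored.
    """
    total = 0.0
    for s, n, w in zip(speech_power, noise_power, importance):
        snr_db = 10.0 * math.log10(s / n)
        audibility = min(1.0, max(0.0, (snr_db + 15.0) / 30.0))
        total += w * audibility
    return total

def reallocate_with_fixed_power(speech_power, weights):
    """Scale each band by a (hypothetical) weight, then renormalize so the
    total power equals the total power of the input."""
    target = [p * w for p, w in zip(speech_power, weights)]
    scale = sum(speech_power) / sum(target)
    return [t * scale for t in target]

# Hypothetical linear band powers; none of these values come from the paper.
speech_power = [4.0, 2.0, 1.0, 0.5]
noise_power = [8.0, 1.0, 0.5, 0.5]
importance = [0.2, 0.3, 0.3, 0.2]   # band importance, sums to 1
weights = [0.5, 1.0, 2.0, 2.0]      # arbitrary illustrative reallocation

new_power = reallocate_with_fixed_power(speech_power, weights)
sii_before = simplified_sii(speech_power, noise_power, importance)
sii_after = simplified_sii(new_power, noise_power, importance)
print(sum(speech_power), sum(new_power), sii_before, sii_after)
```

The weights here are arbitrary; an optimizer such as the one the abstract describes would choose the allocation that maximizes the index, whereas this sketch only demonstrates the constraint and the objective being evaluated.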

Citations
Journal Article

Evaluating the intelligibility benefit of speech modifications in known noise conditions

TL;DR: The current study compares the benefits of speech modification algorithms in a large-scale speech intelligibility evaluation and quantifies the equivalent intensity change, defined as the amount in decibels that unmodified speech would need to be adjusted by in order to achieve the same intelligibility as modified speech.
Proceedings Article

Speech-in-noise intelligibility improvement based on spectral shaping and dynamic range compression.

TL;DR: Experiments with speech-shaped noise (SSN) and competing-speaker noise at various low SNR values show that the suggested approach outperforms state-of-the-art methods in terms of the Speech Intelligibility Index (SII).
Journal Article

On Optimal Linear Filtering of Speech for Near-End Listening Enhancement

TL;DR: Experiments show large intelligibility improvements with the proposed method over the unprocessed noisy speech and better performance than one state-of-the-art method.
Proceedings Article

Intelligibility-enhancing speech modifications: the Hurricane Challenge

TL;DR: Surprisingly, for most conditions the largest gains were observed for noise-independent algorithms, suggesting that performance in this task can be further improved by exploiting information in the masking signal.
Journal Article

Speech energy redistribution for intelligibility improvement in noise based on a perceptual distortion measure

TL;DR: A speech pre-processing algorithm is presented that improves the speech intelligibility in noise for the near-end listener by optimally redistributing the speech energy over time and frequency according to a perceptual distortion measure, which is based on a spectro-temporal auditory model.
References
Journal Article

The American National Standards Institute

TL;DR: This document describes the organization and operations of the American National Standards Institute and describes the research and development activities of the institute.
Proceedings Article

Near End Listening Enhancement: Speech Intelligibility Improvement in Noisy Environments

TL;DR: A digital signal processing algorithm to improve intelligibility of clean far end speech for the near end listener who is located in an environment with background noise is presented.
Proceedings Article

Near end listening enhancement optimized with respect to speech intelligibility index and audio power limitations

TL;DR: This contribution uses a theoretical analysis of the Speech Intelligibility Index (SII) to develop an algorithm which numerically maximizes the SII under the constraint of an unchanged average power of the audio signal.
Journal Article

Uniform and warped low delay filter-banks for speech enhancement

TL;DR: A versatile filter-bank concept for adaptive subband filtering is proposed, which achieves a significantly lower algorithmic signal delay than commonly used analysis-synthesis filter-banks and performs time-domain filtering with coefficients adapted in the uniform or non-uniform frequency-domain.

Near end listening enhancement with strict loudspeaker output power constraining

TL;DR: In this paper, the authors investigate the opportunities of listening enhancement under the constraint that the processed loudspeaker signal power is strictly equal to the power of the received signal, and compare two processing strategies: a previous one that amplifies the speech at noisy frequencies and a new one that cuts down the speech power at noisy frequencies.