A voice activity detection algorithm for communication systems with dynamically varying background acoustic noise

doi:10.1109/VETEC.1998.686432

Proceedings ArticleDOI

A voice activity detection algorithm for communication systems with dynamically varying background acoustic noise

- Vol. 2, pp 1214-1218

TLDR

In this paper, a simple, efficient, and robust voice activity detection (VAD) algorithm was developed to work in a mobile or portable environment exhibiting dynamically varying background noise, using probabilistic distances based on the energy content, the periodicity, the stationarity and the spectral distribution within the low frequency band to decide if the presented speech frame is speech or silence.

Abstract:

Speech can be modeled as short bursts of vocal energy separated by silence gaps. During typical conversation talk-spurts comprise only 31.5% of each party's speech and the remaining 68.5% is silence. Communication systems can achieve significant gains in spectral efficiency and energy efficiency by disconnecting the users from the spectral resource during the silence periods. This paper develops a simple, efficient, and robust voice activity detection (VAD) algorithm to work in a mobile or portable environment exhibiting dynamically varying background noise. The VAD uses probabilistic distances based on the energy content, the periodicity, the stationarity and the spectral distribution within the low frequency band to decide if the presented speech frame is speech or silence.

Citations

PDF

Open Access

More filters

Patent

Method and apparatus for comfort noise generation in speech communication systems

Edgardo M. Cruz-Zeno, +1 more

TL;DR: In this paper, a method that may be used in variety of electronic devices for generating comfort noise includes receiving a plurality of information frames indicative of speech plus background noise, estimating one or more background noise characteristics based on the plurality of Information frames, and generating a comfort noise signal based on one or multiple background noises characteristics.

...read moreread less

Patent

Noise generation in audio codecs

Panji Setiawan, +3 more

TL;DR: The spectral domain is efficiently used in order to parameterize the background noise thereby yielding a background noise synthesis which is more realistic and thus leads to a more transparent active to inactive phase switching as mentioned in this paper.

...read moreread less

Journal ArticleDOI

Robust Voice Activity Detection Using the Spectral Peaks of Vowel Sounds

In-Chul Yoo, +1 more

- 05 Aug 2009 -

Etri Journal

TL;DR: In this paper, a method of detecting the spectral peaks of vowel sounds in corrupted signals was proposed to detect voice activity even in low signal-to-noise ratio (SNR) conditions.

...read moreread less

Patent

Apparatus and method for processing a decoded audio signal in a spectral domain

Guillaume Fuchs, +4 more

TL;DR: An apparatus for processing a decoded audio signal (100) comprising a filter (102), a time-spectral converter stage (106), and a subtracter (112) for performing a subband-wise subtraction between the weighted filtered audio signal and the spectral representation of the decoded signal as discussed by the authors.

...read moreread less

Patent

Apparatus and method for error concealment in low-delay unified speech and audio coding (usac)

Jeremie Lecomte, +3 more

TL;DR: In this paper, an apparatus for generating spectral replacement values for an audio signal is provided, consisting of a buffer unit (110) for storing previous spectral values relating to a previously received error-free audio frame.

...read moreread less

D.C. Cox, +2 more

- 01 Mar 1991 -

IEEE Communications Magazine

TL;DR: The recent growth in the use of digital radio is reviewed, and the technology used to implement low-power digital radio in the local exchange loop plant is discussed and the integration of digitalRadio subscriber loops with network intelligence is explored.

...read moreread less

A voice activity detection algorithm for communication systems with dynamically varying background acoustic noise

Citations

Method and apparatus for comfort noise generation in speech communication systems

Noise generation in audio codecs

Robust Voice Activity Detection Using the Spectral Peaks of Vowel Sounds

Apparatus and method for processing a decoded audio signal in a spectral domain

Apparatus and method for error concealment in low-delay unified speech and audio coding (usac)

References

Control Methods Used in a Study of the Vowels

A pattern recognition approach to voiced-unvoiced-silence classification with applications to speech recognition

New methods of pitch extraction

Real-time digital hardware pitch detector

Low-power digital radio as a ubiquitous subscriber loop

Related Papers (5)

Methods for generating comfort noise during discontinuous transmission

Spectral Subtraction Based on Minimum Statistics

A wideband speech and audio codec at 16/24/32 kbit/s using hybrid ACELP/TCX techniques

Methods and devices for low-frequency emphasis during audio compression based on acelp/tcx

Perceptual linear predictive (PLP) analysis of speech