Proceedings ArticleDOI

A voice activity detection algorithm for communication systems with dynamically varying background acoustic noise

I.D. Lee, +2 more
- Vol. 2, pp 1214-1218
TLDR
In this paper, a simple, efficient, and robust voice activity detection (VAD) algorithm was developed to work in a mobile or portable environment exhibiting dynamically varying background noise. It uses probabilistic distances based on the energy content, the periodicity, the stationarity, and the spectral distribution within the low-frequency band to decide whether the presented frame is speech or silence.
Abstract
Speech can be modeled as short bursts of vocal energy separated by silence gaps. During a typical conversation, talk-spurts comprise only 31.5% of each party's speech and the remaining 68.5% is silence. Communication systems can achieve significant gains in spectral efficiency and energy efficiency by disconnecting the users from the spectral resource during the silence periods. This paper develops a simple, efficient, and robust voice activity detection (VAD) algorithm to work in a mobile or portable environment exhibiting dynamically varying background noise. The VAD uses probabilistic distances based on the energy content, the periodicity, the stationarity, and the spectral distribution within the low-frequency band to decide whether the presented frame is speech or silence.
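The abstract names four per-frame cues (energy, periodicity, stationarity, and low-band spectral distribution) but does not give their definitions or the probabilistic distance used to combine them. The following is only a minimal sketch of how such cues might be computed for one frame. Every function name, the 8 kHz sampling rate, the 1 kHz low-band cutoff, the pitch range, and all thresholds are assumptions made for illustration, and the simple vote in `is_speech` is a stand-in for the paper's probabilistic-distance decision, not a reproduction of it.

```python
import math

def frame_energy_db(frame):
    """Mean-square energy of the frame in dB (floored to avoid log(0))."""
    power = sum(x * x for x in frame) / len(frame)
    return 10.0 * math.log10(max(power, 1e-12))

def periodicity(frame, fs=8000, f_lo=60.0, f_hi=400.0):
    """Peak of the normalized autocorrelation over assumed pitch lags."""
    r0 = sum(x * x for x in frame)
    if r0 <= 0.0:
        return 0.0
    lag_min = int(fs / f_hi)
    lag_max = min(int(fs / f_lo), len(frame) - 1)
    best = 0.0
    for lag in range(lag_min, lag_max + 1):
        r = sum(frame[n] * frame[n + lag] for n in range(len(frame) - lag))
        best = max(best, r / r0)
    return best

def low_band_ratio(frame, fs=8000, cutoff_hz=1000.0):
    """Fraction of spectral energy below cutoff_hz, using a direct DFT."""
    n = len(frame)
    low = total = 0.0
    for k in range(1, n // 2):  # skip DC, stop below Nyquist
        re = sum(frame[t] * math.cos(2 * math.pi * k * t / n) for t in range(n))
        im = sum(frame[t] * math.sin(2 * math.pi * k * t / n) for t in range(n))
        p = re * re + im * im
        total += p
        if k * fs / n < cutoff_hz:
            low += p
    return low / total if total > 0.0 else 0.0

def vad_features(frame, prev_energy_db, fs=8000):
    """Collect the four cues; non-stationarity is the frame-to-frame
    change in log energy (an assumed definition)."""
    e = frame_energy_db(frame)
    return {
        "energy_db": e,
        "periodicity": periodicity(frame, fs),
        "nonstationarity_db": abs(e - prev_energy_db),
        "low_band_ratio": low_band_ratio(frame, fs),
    }

def is_speech(features, noise_floor_db, min_votes=2):
    """Placeholder decision: count how many cues exceed assumed thresholds."""
    votes = sum([
        features["energy_db"] > noise_floor_db + 6.0,
        features["periodicity"] > 0.5,
        features["nonstationarity_db"] > 3.0,
        features["low_band_ratio"] > 0.6,
    ])
    return votes >= min_votes
```

In a setting with dynamically varying background noise, the fixed `noise_floor_db` argument would presumably be replaced by a running estimate updated during frames classified as silence; that adaptation is left out of this sketch.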


Citations
Patent

Method and apparatus for comfort noise generation in speech communication systems

TL;DR: A method that may be used in a variety of electronic devices for generating comfort noise includes receiving a plurality of information frames indicative of speech plus background noise, estimating one or more background noise characteristics based on the plurality of information frames, and generating a comfort noise signal based on the one or more background noise characteristics.
Patent

Noise generation in audio codecs

TL;DR: The spectral domain is used efficiently to parameterize the background noise, yielding a more realistic background noise synthesis and thus a more transparent switch from the active to the inactive phase.
Journal ArticleDOI

Robust Voice Activity Detection Using the Spectral Peaks of Vowel Sounds

TL;DR: In this paper, a method of detecting the spectral peaks of vowel sounds in corrupted signals was proposed to detect voice activity even in low signal-to-noise ratio (SNR) conditions.
Patent

Apparatus and method for processing a decoded audio signal in a spectral domain

TL;DR: An apparatus for processing a decoded audio signal (100) comprises a filter (102), a time-spectral converter stage (106), and a subtracter (112) for performing a subband-wise subtraction between the weighted filtered audio signal and the spectral representation of the decoded signal.
Patent

Apparatus and method for error concealment in low-delay unified speech and audio coding (USAC)

TL;DR: In this paper, an apparatus for generating spectral replacement values for an audio signal is provided, consisting of a buffer unit (110) for storing previous spectral values relating to a previously received error-free audio frame.
References
Journal ArticleDOI

Control Methods Used in a Study of the Vowels

TL;DR: Control methods used in the evaluation of effects of language and dialectal backgrounds and vocal and auditory characteristics of the individuals concerned in a vowel study program at Bell Telephone Laboratories are discussed.
Journal ArticleDOI

A pattern recognition approach to voiced-unvoiced-silence classification with applications to speech recognition

TL;DR: A pattern recognition approach for deciding whether a given segment of a speech signal should be classified as voiced speech, unvoiced speech, or silence, based on measurements made on the signal, which has been found to provide reliable classification with speech segments as short as 10 ms.
Journal ArticleDOI

New methods of pitch extraction

TL;DR: Three new methods will be described for the extraction of the fundamental pitch from a speech signal, which can tolerate a considerable amount of high-pass filtering and additive noise with little degradation in performance.
Journal ArticleDOI

Real-time digital hardware pitch detector

TL;DR: Computation of the autocorrelation function of the clipped speech is easily implemented in digital hardware using simple combinatorial logic, i.e., an up-down counter can be used to compute each correlation point.
Journal ArticleDOI

Low-power digital radio as a ubiquitous subscriber loop

TL;DR: The recent growth in the use of digital radio is reviewed, the technology used to implement low-power digital radio in the local exchange loop plant is discussed, and the integration of digital radio subscriber loops with network intelligence is explored.