System for detecting voice activity

Patent

System for detecting voice activity

TLDR

In this paper, a nonlinear two-filter voice detection algorithm was proposed, in which one filter has a low time constant (the fast filter) and one filter had a high time constant(the slow filter).

Abstract:

A system for detection of voice activity in a communications signal, employing a nonlinear two filter voice detection algorithm, in which one filter has a low time constant (the fast filter) and one filter has a high time constant (the slow filter). The slow filter serves to provide a noise floor estimate for the incoming signal, and the fast filter serves to more closely represent the total energy in the signal. The absolute value of incoming data is presented to both filters, and the difference in filter outputs is integrated over each of a series of successive frames, thereby giving an indication of the energy level above the noise floor in each frame of the incoming signal. Voice activity is detected if the measured energy level for a frame exceeds a specified threshold level. Silence (e.g., leaving only noise) is detected if the measured energy level for each of a specified number of successive frames does not exceed a specified threshold level. The system enables voice activity to be distinguished from common noise such as pops, clicks and low level cross-talk.

Citations

PDF

Open Access

More filters

Patent

Instantaneous user initiation voice quality feedback

Muneyb Minhazuddin, +4 more

TL;DR: A system for providing a high communications quality is provided in this paper, where the system comprises an input operable to receive a message from at least one of first and second network nodes 200 and 204, the message indicating a service problem with the session and a statistic collection agent 248 operable in response to the message, in order to cause reconfiguration of one or more attributes or resources in the network, variation of a sampling frequency of session-related performance attributes associated with the network; alteration of the types of session related performance attributes being collected regarding the network.

...read moreread less

Patent

Voice modulation recognition in a radio-to-SIP adapter

Douglas Hall, +1 more

TL;DR: In this paper, a radio-to-SIP adapter is shown to include a voice detection algorithm processor as well as other circuitry to provide an interface between a radio and SIP adapter to accommodate a transition from half duplex to full duplex.

...read moreread less

Patent

Voip endpoint call admission

Neil Hepworth, +2 more

TL;DR: In this paper, an intelligent endpoint or communication device that can collect available bandwidth-related information metrics and/or perform call admission control functions is presented, and the present invention is further directed to an architecture comprising a switch or media server in communication with a plurality of subscriber communication devices.

...read moreread less

Patent

Packet prioritization and associated bandwidth and buffer management techniques for audio over IP

Christopher R. Gentle, +1 more

TL;DR: In this paper, an acoustic prioritization agent assigns a priority value to the packets based on factors such as whether the packet contains voice activity and the degree of acoustic similarity between this packet and adjacent packets in the sequence.

...read moreread less

Patent

Time-sensitive-packet jitter and latency minimization on a shared data link

Shmuel Shaffer, +2 more

TL;DR: In this paper, a packet arrival prediction mechanism predicts when a time-critical packet is expected to arrive, and when transmission of a waiting lower-priority packet might cause a substantial delay in the expected timecritical packet's transmission, the lower priority packet is parked until it can be transmitted without interfering with a time critical packet.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Book

Digital Processing of Speech Signals

Lawrence R. Rabiner, +1 more

TL;DR: This paper presents a meta-modelling framework for digital Speech Processing for Man-Machine Communication by Voice that automates the very labor-intensive and therefore time-heavy and expensive process of encoding and decoding speech.

...read moreread less

PatentDOI

Speech recognition system for an automotive vehicle

Kazunori Noso, +2 more

- 20 Oct 1982 -

Journal of the Acoustical Society of Ame...

TL;DR: A speech recognition system includes speech presence detection which uses a first level threshold of ambient noise/silence above which speech start is decided for a signal as discussed by the authors, unless a predetermined time interval of speech is exceeded after start, causing a corrected second threshold to be calculated.

...read moreread less

PatentDOI

Reduction of background noise for speech enhancement

Brant Martin Helf, +1 more

- 06 Jun 1994 -

Journal of the Acoustical Society of Ame...

TL;DR: In this paper, human audio perception is used to perform spectral and time masking to reduce perceived loudness of noise added to speech signals, where a signal is divided into blocks and passed through notch filters to remove noise components and then appended to part of the previous block.

...read moreread less

Patent

Method and apparatus for reducing residual far-end echo in voice communication networks

Patrick Michael Velardo, +1 more

TL;DR: In this article, a method and apparatus for reducing the energy content that is attributable to echoes of signals transmitted into the local network (NEAR-IN signals) is described, in part, by generating a time-varying TEMPLATE signal which represents the smoothed energy content of NEARIN signals delayed according to the echo path and attenuated by an estimated echo transmission loss.

...read moreread less

PatentDOI

Speech analysis/synthesis system with silence suppression

George R. Doddington

- 13 Oct 1983 -

Journal of the Acoustical Society of Ame...

TL;DR: In this paper, a segment is classified as "speech" if the energy of the signal is greater than an adaptively adjusted threshold defined as the maximum of scaled values of two separate envelope parameters, which both track the variation in energy over the sequence of frames of speech data.

...read moreread less

System for detecting voice activity

Citations

Instantaneous user initiation voice quality feedback

Voice modulation recognition in a radio-to-SIP adapter

Voip endpoint call admission

Packet prioritization and associated bandwidth and buffer management techniques for audio over IP

Time-sensitive-packet jitter and latency minimization on a shared data link

References

Digital Processing of Speech Signals

Speech recognition system for an automotive vehicle

Reduction of background noise for speech enhancement

Method and apparatus for reducing residual far-end echo in voice communication networks

Speech analysis/synthesis system with silence suppression

Related Papers (5)

Hands-free, voice-operated remote control transmitter

Audio activity detection circuit to increase battery life in portable computers

Systems and methods for hands-free voice-activated devices

Methods and apparatus for detecting a voice command

Background speech recognition assistant