scispace - formally typeset
Search or ask a question
Topic

Audio signal processing

About: Audio signal processing is a research topic. Over the lifetime, 21463 publications have been published within this topic receiving 319597 citations. The topic is also known as: audio processing & Acoustic signal processing.


Papers
More filters
Journal ArticleDOI
TL;DR: Two approaches for sparse decomposition of polyphonic music are considered: a time-domain approach based on a shift-invariant model, and a frequency-domain approaches based on phase- Invariant power spectra.

65 citations

Patent
01 Dec 1994
TL;DR: In this paper, the authors propose a system and method for enhancing interactive communication between video conferencing devices of the type in which a delay is inserted into the audio transmission path to provide lip synchronization of the image and speech of the respective users thereof.
Abstract: A system and method for enhancing interactive communication between video conferencing devices of the type in which a delay is inserted into the audio transmission path to provide lip synchronization of the image and speech of the respective users thereof. Each video conferencing device includes a display device for displaying images of at least one communicating party and a speech communicating system for communicating with the communicating party. In accordance with one embodiment of the invention, a speech detecting circuit detects an utterance by a first user of a first video conferencing apparatus. An audible or visual indication is provided to at least a second user of a second video conferencing apparatus before the utterance is reproduced. As a result, the potential for simultaneous speaking by two or more users is substantially reduced. In an alternate embodiment, the amount of delay introduced into the audio signal transmission path is adjusted in accordance with the mode of operation of the video conferencing devices. An audio signal processing system detects, over predetermined intervals, whether or not an interactive conversation between two or more users is in progress. If an interactive conversation is not detected, lip synchronization proceeds in a conventional manner by introducing a predetermined delay into the audio path. If an interactive conversation is detected, the amount of audio delay inserted is minimized until there is a return to the lecture mode of operation.

65 citations

Patent
10 Sep 2003
TL;DR: In this article, a perceptual mask is estimated for an audio stream, based on the perceptual threshold of the human auditory system, and a hidden sub-channel is dynamically allocated substantially below the estimated perceptual mask, in which additional payload is transmitted.
Abstract: Methods and apparatus are provided for communicating an audio stream. A perceptual mask is estimated for an audio stream, based on the perceptual threshold of the human auditory system. A hidden sub-channel is dynamically allocated substantially below the estimated perceptual mask based on the characteristics of the audio stream, in which additional payload is transmitted. The additional payload can be related to components of the audio stream that would not otherwise be transmitted in a narrowband signal, or to concurrent services that can be accessed while the audio stream is being transmitted. A suitable receiver can recover the additional payload, whereas the audio stream will be virtually unaffected from a human auditory standpoint when received by a traditional receiver. A coding scheme is also provided in which a portion of a codec is used to code an upper-band portion of an audio stream, while the narrowband portion is left uncoded.

65 citations

Patent
15 Dec 1994
TL;DR: A home videoconferencing system as discussed by the authors uses a standard television receiver and a camcorder 14 to convert an outgoing analog video signal into a compressed digital video signal; an audio controller 116 converts an outgoing audio signal into compressed digital audio signal; the system controller 120 multiplexes, synchronizes and error corrects outgoing and incoming digital system signals.
Abstract: A home videoconferencing system 100 uses a standard television receiver 16 and a camcorder 14. The video controller 112 converts an outgoing analog video signal into a compressed digital video signal; an audio controller 116 converts an outgoing analog audio signal into a compressed digital audio signal. The system controller 120 multiplexes, synchronizes and error corrects outgoing and incoming digital system signals. The digital system signals are coupled to a modem 122 for transmission and reception over analog phone lines 124.

65 citations

Patent
14 Mar 2008
TL;DR: In this paper, an electronic stethoscope sensor is contained within a housing for transducing body sounds to electronic signals, and is operatively connected to the electronic processor, and one or more secondary audio signal sources operatively connects to the processor.
Abstract: A medical diagnostic and communications apparatus with audio output comprises an electronic processor for processing stethoscope signals and secondary audio signals. An electronic stethoscope sensor is contained within a housing for transducing body sounds to electronic signals, and is operatively connected to the electronic processor. One or more secondary audio signal sources operatively connects to the electronic processor. A common audio output is connected to electronic processor to convert electronic stethoscope signals or secondary audio signals to acoustic output. These sounds may be produced separately or mixed.

65 citations


Network Information
Related Topics (5)
Feature extraction
111.8K papers, 2.1M citations
81% related
Feature (computer vision)
128.2K papers, 1.7M citations
79% related
Robustness (computer science)
94.7K papers, 1.6M citations
78% related
Noise
110.4K papers, 1.3M citations
77% related
Image segmentation
79.6K papers, 1.8M citations
77% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
202319
202263
2021217
2020525
2019659
2018597