scispace - formally typeset
Search or ask a question
Topic

Audio signal processing

About: Audio signal processing is a research topic. Over the lifetime, 21463 publications have been published within this topic receiving 319597 citations. The topic is also known as: audio processing & Acoustic signal processing.


Papers
More filters
Patent
14 Oct 2008
TL;DR: In this article, a practical speaker connection is identified using a device having a sound channel of a 5.1 channel or 7.1 channels, and a device is provided that can easily reproduce the optimum multiple channels.
Abstract: Practical speaker connection is identified using a device having a sound channel of a 5.1 channel or 7.1 channel, and a device is provided that can easily reproduce the optimum multiple channels. Actual speaker arrangement can be identified by, for example, measuring the impedance of a terminal at the side of an audio amplifier. If incorrect connection is found, a warning is issued. This information is transmitted to a signal source with an EDID and a signal with the optimum a number of sound channel is sent. The EDID is also used for the connection with a display unit and the speaker connection with which the display unit is provided uniquely. For example, a sound through the 7.1 channel is easily reproduced using the speaker of the display unit in the channel of the front speaker.

125 citations

Patent
19 Nov 1996
TL;DR: In this article, audio data is processed from a packetized data stream carrying digital television information in a succession of fixed length transport packets, and some of the packets contain a presentation time stamp (PTS) indicative of a time for commencing the output of associated audio data.
Abstract: Audio data is processed from a packetized data stream carrying digital television information in a succession of fixed length transport packets. Some of the packets contain a presentation time stamp (PTS) indicative of a time for commencing the output of associated audio data. After the audio data stream has been acquired, the detected audio packets are monitored to locate subsequent PTS's for adjusting the timing at which audio data is output, thereby providing proper lip synchronization with associated video. Errors in the audio data are processed in a manner which attempts to maintain synchronization of the audio data stream while masking the errors. In the event that the synchronization condition cannot be maintained, for example in the presence of errors over more than one audio frame, the audio data stream is reacquired while the audio output is concealed. An error condition is signaled to the audio decoder by altering the audio synchronization word associated with the audio frame in which the error has occurred.

125 citations

Journal ArticleDOI
TL;DR: A probabilistic multimodal generation model is introduced and used to derive an information theoretic measure of cross-modal correspondence and nonparametric statistical density modeling techniques can characterize the mutual information between signals from different domains.
Abstract: Audio and visual signals arriving from a common source are detected using a signal-level fusion technique. A probabilistic multimodal generation model is introduced and used to derive an information theoretic measure of cross-modal correspondence. Nonparametric statistical density modeling techniques can characterize the mutual information between signals from different domains. By comparing the mutual information between different pairs of signals, it is possible to identify which person is speaking a given utterance and discount errant motion or audio from other utterances or nonspeech events.

125 citations

Patent
15 Sep 1988
TL;DR: In this paper, a method for obtaining audience preference market survey data, such as a radio and/or television listening audience survey, from a plurality of diverse locations for accumulative processing by a remote data processor, involves recording (22, 30, 40, 42, 44, 56, 54, 52) audio signals (46, 48, 50) at each of the diverse locations which corresponds to the ambient radio and or television audio sound at predetermined synchronized discrete sampling times (42, 60, 64, 66, 62) or windows which are synchronized to a master recording (110)
Abstract: A method for obtaining audience preference market survey data, such as a radio and/or television listening audience survey and/or supplemental data, such as bar coded data (156), from a plurality of diverse locations for accumulative processing by a remote data processor, involves recording (22, 30, 40, 42, 44, 56, 54, 52) a plurality of audio signals (46, 48, 50) at each of the diverse locations which corresponds to the ambient radio and/or television audio sound at predetermined synchronized discrete sampling times (42, 60, 64, 66, 62) or windows which are synchronized to a master recording (110) of the programs being surveyed. The sampling windows are of short duration with respect to the measurement interval. The master recording (110) audio signals frequency intervals are matched against the frequency of the diverse location audio samples to provide an indication of audience preference and tested for a correct match in a configurable filter array (120, 122, 124). Respondents at the diverse locations may be provided with portable tape recorders (30) which are automatically activated at synchronized clock times to obtain the audio samples. Bar code scanning information (150, 24) may also be provided in the form of audio signals by using the scanning signal (152) to drive a voltage controlled audio oscillator (160).

125 citations

Patent
31 Dec 2012
TL;DR: In this article, an exemplary system consisting of an equalizer module that analyzes sound characteristics of individual digital audio samples including a discrete signal, a selector module that applies a selection heuristic to select the discrete signal from the individual audio samples based on the sound characteristics, and an audio module that supplies to an output an insert signal generated according to the selected signal selected by the heuristic.
Abstract: An exemplary system comprises a device including a memory with an audio injection application installed thereon. The application comprises an equalizer module that analyzes sound characteristics of individual digital audio samples including a discrete signal, a selector module that applies a selection heuristic to select the discrete signal from the individual digital audio samples based on the sound characteristics, and an audio module that supplies to an output an insert signal generated according to the discrete signal selected by the selection heuristic.

125 citations


Network Information
Related Topics (5)
Feature extraction
111.8K papers, 2.1M citations
81% related
Feature (computer vision)
128.2K papers, 1.7M citations
79% related
Robustness (computer science)
94.7K papers, 1.6M citations
78% related
Noise
110.4K papers, 1.3M citations
77% related
Image segmentation
79.6K papers, 1.8M citations
77% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
202319
202263
2021217
2020525
2019659
2018597