Topic

Audio signal processing

About: Audio signal processing is a research topic. Over its lifetime, 21,463 publications have been published within this topic, receiving 319,597 citations. The topic is also known as audio processing and acoustic signal processing.


Papers
Patent
Zhanyong Wang1
29 Aug 2008
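TL;DR: A navigation device comprising a GNSS receiver, a Geographic Information System (GIS), a control module, an audio processing module, and a speaker is presented; the control module adapts the number of words in each voice guidance sentence to the device's position, velocity, and acceleration.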
Abstract: The invention provides a navigation device capable of playing voice guidance. In one embodiment, the navigation device comprises a GNSS receiver, a Geographic Information System (GIS), a control module, an audio processing module, and a speaker. The GNSS receiver provides a position, a velocity, and an acceleration of the navigation device. The GIS determines a route according to a map data and determines a decision point in the route. The control module dynamically determines a playing policy corresponding to the decision point according to the position, velocity, and acceleration, and generates a guiding sentence corresponding to the decision point according to the playing policy, wherein the playing policy determines a number of words in the guiding sentence. The audio processing module then generates a guiding voice signal corresponding to the guiding sentence. The speaker then plays the guiding voice signal.
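
The playing policy described in the abstract above can be illustrated with a minimal sketch. It assumes the policy is driven by the estimated time to reach the decision point under constant acceleration; the function names, the candidate sentences, and the speaking rate `words_per_sec` are hypothetical and do not come from the patent.

```python
import math


def time_to_point(distance_m, speed_mps, accel_mps2):
    """Seconds until the decision point, assuming constant acceleration.

    Returns math.inf when the point is never reached under this model.
    """
    if distance_m <= 0:
        return 0.0
    if abs(accel_mps2) < 1e-9:
        return distance_m / speed_mps if speed_mps > 0 else math.inf
    # Solve distance = v*t + 0.5*a*t^2 for the earliest non-negative t.
    disc = speed_mps ** 2 + 2 * accel_mps2 * distance_m
    if disc < 0:
        return math.inf  # the device stops before the decision point
    t = (-speed_mps + math.sqrt(disc)) / accel_mps2
    return t if t >= 0 else math.inf


def guiding_sentence(action, street, distance_m, speed_mps, accel_mps2,
                     words_per_sec=2.5):
    """Pick the longest candidate sentence that fits the time budget.

    words_per_sec is an assumed speaking rate, not a value from the source.
    """
    budget = time_to_point(distance_m, speed_mps, accel_mps2) * words_per_sec
    candidates = [
        f"In {round(distance_m)} meters, {action} onto {street}",
        f"{action.capitalize()} onto {street}",
        action.capitalize(),
    ]
    for sentence in candidates:
        if len(sentence.split()) <= budget:
            return sentence
    return candidates[-1]  # fall back to the shortest prompt
```

With these assumptions, a device far from the turn gets the full sentence, while a fast-moving device close to the turn gets only the short prompt.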

99 citations

Patent
04 Nov 1994
TL;DR: A real-time visual communication system is presented that improves the correspondence between received video and audio signals: an AV signal is separated into a video signal and an audio signal, and the output state of each is controlled by the characteristics of the other.
Abstract: A real time visual communication system capable of improving a correspondence between a received video signal and a received audio signal in real time and improving reality. An AV signal is separated into a video signal and an audio signal, and the output state of the video or audio signal is controlled by the characteristics of the audio or video signal. For example, the sound field, reverberation, and the like are controlled in accordance with the characteristics of the video signal. A suitable image pickup unit is selected in accordance with the characteristics of the audio signal to make the sights of conversation participants coincide with each other. It is possible to reproduce sounds of audio signals well matching video signals and to provide visual communication having good reality because of the combination of matched audio and video signals.

99 citations

PatentDOI
TL;DR: A method for generating digital audio filters for equalizing a loudspeaker is presented: filters are generated iteratively until the compensated response curve falls within a tolerance range around a target response curve of sound level versus frequency, or until a limit on the number of filters is reached.
Abstract: A method for generating digital filters for equalizing a loudspeaker. First digital data is provided, for a tolerance range for a target response curve of sound level versus frequency for the loudspeaker. Second digital data is generated, for an actual response curve of sound level versus frequency for the loudspeaker (1010). The first digital data is compared with the second digital data and it is determined whether the actual response curve is within the tolerance range (1020). If the actual response curve is not within the tolerance range, digital audio filters are iteratively generated, and the digital audio filters are applied to the second digital data to generate third digital data for a compensated response curve (1050, 1060, 1070). The frequency, amplitude and bandwidth of the digital audio filters are automatically optimized until the compensated response curve is within the tolerance range or a predetermined limit on the number of digital audio filters has been reached, whichever occurs first (1080).
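
The iterative loop described in the abstract above can be sketched as follows. This is a simplified illustration, not the patented procedure: it assumes a greedy choice of one filter per iteration at the frequency of worst deviation, a fixed bandwidth, and a bell-shaped gain curve that is Gaussian in log-frequency. The patent instead optimizes frequency, amplitude, and bandwidth; all names below are hypothetical.

```python
import math


def bell_gain_db(freq_hz, center_hz, gain_db, bandwidth_oct):
    """Assumed filter model: bell-shaped gain in dB, Gaussian in log-frequency."""
    octaves = math.log2(freq_hz / center_hz)
    return gain_db * math.exp(-0.5 * (octaves / (bandwidth_oct / 2)) ** 2)


def equalize(freqs, measured_db, target_db, tol_db,
             max_filters=8, bandwidth_oct=1.0):
    """Add filters until the response is within tolerance or the limit is hit.

    Returns (filters, compensated_db, within_tolerance), where each filter is
    a (center_hz, gain_db, bandwidth_oct) tuple.
    """
    compensated = list(measured_db)
    filters = []
    while len(filters) < max_filters:
        errors = [t - c for t, c in zip(target_db, compensated)]
        worst = max(range(len(freqs)), key=lambda i: abs(errors[i]))
        if abs(errors[worst]) <= tol_db:
            break  # compensated curve is inside the tolerance range
        new_filter = (freqs[worst], errors[worst], bandwidth_oct)
        filters.append(new_filter)
        # Apply the new filter to obtain the next compensated response curve.
        compensated = [c + bell_gain_db(f, *new_filter)
                       for f, c in zip(freqs, compensated)]
    within = all(abs(t - c) <= tol_db
                 for t, c in zip(target_db, compensated))
    return filters, compensated, within
```

The loop stops on whichever condition occurs first, mirroring the two termination criteria named in the abstract: the compensated curve entering the tolerance range, or the filter count reaching its limit.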

99 citations

Journal ArticleDOI
TL;DR: A content-based movie parsing and indexing approach is presented; it analyzes both audio and visual sources and accounts for their interrelations to extract meaningful movie events and assign them semantic labels for content indexing.
Abstract: A content-based movie parsing and indexing approach is presented; it analyzes both audio and visual sources and accounts for their interrelations to extract high-level semantic cues. Specifically, the goal of this work is to extract meaningful movie events and assign them semantic labels for the purpose of content indexing. Three types of key events, namely, 2-speaker dialogs, multiple-speaker dialogs, and hybrid events, are considered. Moreover, speakers present in the detected movie dialogs are further identified based on the audio source parsing. The obtained audio and visual cues are then integrated to index the movie content. Our experiments have shown that an effective integration of the audio and visual sources can lead to a higher level of video content understanding, abstraction and indexing.

99 citations

Patent
04 Sep 1998
TL;DR: An audio enhancement apparatus and method are presented that spectrally shape harmonics of the low-frequency information in a pair of audio signals so that, when reproduced by a loudspeaker, a listener perceives the loudspeaker as having more acoustic bandwidth than it actually provides.
Abstract: The present invention provides an audio enhancement apparatus and method which spectrally shapes harmonics of the low-frequency information in a pair of audio signals so that when reproduced by a loudspeaker, a listener perceives the loudspeaker as having more acoustic bandwidth than is actually provided by the loudspeaker. The perception of extra bandwidth is particularly pronounced at low frequencies, especially frequencies at which the loudspeaker system produces less acoustic output energy. In one embodiment, the invention also shifts signal from one audio signal to the other audio signal in order to obtain more bandwidth for the available loudspeaker to reduce clipping. In one embodiment, the invention also provides a combined signal path for spectral shaping of the desired harmonics and a feedforward signal path for each pair of audio signals.

99 citations


Network Information
Related Topics (5)
Feature extraction: 111.8K papers, 2.1M citations, 81% related
Feature (computer vision): 128.2K papers, 1.7M citations, 79% related
Robustness (computer science): 94.7K papers, 1.6M citations, 78% related
Noise: 110.4K papers, 1.3M citations, 77% related
Image segmentation: 79.6K papers, 1.8M citations, 77% related
Performance
Metrics
No. of papers in the topic in previous years
Year  Papers
2023  19
2022  63
2021  217
2020  525
2019  659
2018  597