Topic

Audio signal processing

About: Audio signal processing is a research topic. Over its lifetime, 21,463 publications have been published within this topic, receiving 319,597 citations. The topic is also known as audio processing and acoustic signal processing.

Papers

Patent

Digital audio workstation providing digital storage and display of video information

Peter J Fasciano, Curt A Rawley, Thomas R Hegg, Mackenzie Leathurby, Jeffrey L Bedell, Jr James A Ravan

09 Apr 1993

TL;DR: A digital audio workstation for the audio portions of video programs, combining audio editing capability with the ability to immediately display video images associated with the audio program.

Abstract: The invention disclosed herein is a digital audio workstation for the audio portions of video programs. It combines audio editing capability with the ability to immediately display video images associated with the audio program. The invention detects an operator's indication of a point or segment of audio information and uses it to retrieve and display the video images that correspond to the indicated audio programming. Another aspect of the invention is a labeling and notation system for recorded digitized audio or video information. The system provides a means of storing in association with a particular point of the audio or video information a digitized voice or textual message for later reference regarding that information.

226 citations

Proceedings Article

Quantile based noise estimation for spectral subtraction and Wiener filtering

Volker Stahl¹, Alexander Fischer¹, Rolf Bippus¹

Philips¹

05 Jun 2000

TL;DR: For the case where only a single microphone recording of the noisy signal is available, this paper proposes a noise estimation method based on temporal quantiles in the power spectral domain and compares it with pause detection and recursive averaging.

Abstract: Elimination of additive noise from a speech signal is a fundamental problem in audio signal processing. In this paper we restrict our considerations to the case where only a single microphone recording of the noisy signal is available. The algorithms which we investigate proceed in two steps. First, the noise power spectrum is estimated. A method based on temporal quantiles in the power spectral domain is proposed and compared with pause detection and recursive averaging. The second step is to eliminate the estimated noise from the observed signal by spectral subtraction or Wiener filtering. The database used in the experiments comprises 6034 utterances of German digits and digit strings by 770 speakers in 10 different cars. Without noise reduction, we obtain an error rate of 11.7%. Quantile based noise estimation and Wiener filtering reduce the error rate to 8.6%. Similar improvements are achieved in an experiment with artificial, non-stationary noise.
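The two steps described in this abstract can be sketched roughly as follows. This is a minimal illustration in Python with NumPy, not the authors' implementation: the function names, the frame length, the hop size, the quantile q = 0.5, and the gain floor are all assumptions chosen for the example. The noise power in each frequency bin is taken as a temporal quantile of the observed power in that bin, and the estimate is then turned into a per-bin gain for either Wiener filtering or power spectral subtraction.

```python
import numpy as np


def stft_power(signal, frame_len=256, hop=128):
    """Split a 1-D signal into windowed frames; return spectrum and power.

    frame_len and hop are illustrative defaults, not values from the paper.
    """
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    frames = np.stack(
        [signal[i * hop : i * hop + frame_len] * window for i in range(n_frames)]
    )
    spectrum = np.fft.rfft(frames, axis=1)  # shape: (n_frames, frame_len // 2 + 1)
    return spectrum, np.abs(spectrum) ** 2


def quantile_noise_estimate(power, q=0.5):
    """Step 1: per-bin noise power as the q-th temporal quantile.

    q = 0.5 (the median) is an assumed value for this sketch.
    """
    return np.quantile(power, q, axis=0)


def suppression_gain(power, noise_power, method="wiener", floor=0.01):
    """Step 2: per-bin gain to apply to the noisy spectrum.

    'wiener' uses max(1 - N/P, floor); 'spectral_subtraction' subtracts
    the noise in the power domain, i.e. a magnitude gain of
    sqrt(max(1 - N/P, floor)). The floor is a hypothetical safeguard
    against negative or zero gains.
    """
    ratio = 1.0 - noise_power / np.maximum(power, 1e-12)
    ratio = np.maximum(ratio, floor)
    if method == "wiener":
        return ratio
    if method == "spectral_subtraction":
        return np.sqrt(ratio)
    raise ValueError(f"unknown method: {method}")
```

To denoise, one would multiply the noisy spectrum by the gain and resynthesize with an inverse transform and overlap-add, which is omitted here. The quantile estimate relies on the speech occupying each frequency bin for only part of the time, so that a low or middle quantile of the bin's power reflects the noise level without needing an explicit pause detector.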

226 citations

Patent

Digital audio file search method and apparatus using text-to-speech processing

Luis Stohr, Satoshi Tanimoto

05 Jan 2006

TL;DR: A digital audio file search method and apparatus that lets a user navigate audio files by generating speech sounds related to the information of the audio files, facilitating searching and playback.

Abstract: A digital audio file search method and apparatus for digital audio files is provided that allows a user to navigate the audio files by generating speech sounds related to the information of the audio files to facilitate searching and playback. The digital audio file search method and apparatus searches for audio files in a portable digital audio player in combination with an automobile audio system through speech sounds by utilizing text-to-speech processing and by prompting response from a user in response to the generated speech sounds. The text-to-speech technology is utilized to generate the speech sound based on tag-data of the audio files. When hearing the speech sounds, the user gives instruction for searching the files without being distracted from driving the automobile.

226 citations

Proceedings Article

Story segmentation and detection of commercials in broadcast news video

Alexander G. Hauptmann¹, Michael Witbrock

Carnegie Mellon University¹

22 Apr 1998

TL;DR: This paper explains how the Informedia system takes advantage of the closed captioning frequently broadcast with the news, how it extracts timing information by aligning the closed-captions with the result of the speech recognition, and how the system integrates closed-caption cues with the results of image and audio processing.

Abstract: The Informedia Digital Library Project allows full content indexing and retrieval of text, audio and video material. Segmentation is an integral process in the Informedia digital video library. The success of the Informedia project hinges on two critical assumptions: that we can extract sufficiently accurate speech recognition transcripts from the broadcast audio and that we can segment the broadcast into video paragraphs, or stories, that are useful for information retrieval. In previous papers we have shown that speech recognition is sufficient for information retrieval of pre-segmented video news stories. We now address the issue of segmentation and demonstrate that a fully automatic system can extract story boundaries using available audio, video and closed-captioning cues. The story segmentation step for the Informedia Digital Video Library splits full-length news broadcasts into individual news stories. During this phase the system also labels commercials as separate "stories". We explain how the Informedia system takes advantage of the closed captioning frequently broadcast with the news, how it extracts timing information by aligning the closed-captions with the result of the speech recognition, and how the system integrates closed-caption cues with the results of image and audio processing.

224 citations

Patent

Apparatus and methods for music and lyrics broadcasting

Roy J. Mankovitz

20 Mar 1995

TL;DR: A system for broadcasting audio music together with lyrics that are displayed and highlighted substantially simultaneously with their occurrence in the accompanying audio music. The system includes an audio music source that provides a data output and an analog audio signal output.

Abstract: A system for broadcasting audio music and broadcasting lyrics for display and highlighting substantially simultaneously with the occurrence of the lyrics in the accompanying audio music is provided. The system includes an audio music source that provides a data output and an analog audio signal output. A computer receives the data output by the music source and generates lyric text data and lyric timing commands. A subcarrier generator generates a subcarrier signal carrying the lyric text data and lyric timing commands. An FM transmitter broadcasts a composite signal that combines the analog output of the music source with the subcarrier signal. A lyric display unit receives the composite signal, separates and decodes the subcarrier signal and displays and highlights lyrics according to the lyric text data and lyric timing commands decoded from the subcarrier signal.

224 citations

Performance

Metrics

Papers: 21,541
Citations: 328,867

No. of papers in the topic in previous years
Year	Papers
2023	19
2022	63
2021	217
2020	525
2019	659
2018	597
