scispace - formally typeset
Search or ask a question
Topic

Speech coding

About: Speech coding is a research topic. Over the lifetime, 14245 publications have been published within this topic receiving 271964 citations.


Papers
More filters
Patent
05 Jan 2006
TL;DR: In this paper, a digital audio file search method and apparatus for digital audio files is provided that allows a user to navigate the audio files by generating speech sounds related to the information of the audio file to facilitate searching and playback.
Abstract: A digital audio file search method and apparatus for digital audio files is provided that allows a user to navigate the audio files by generating speech sounds related to the information of the audio files to facilitate searching and playback. The digital audio file search method and apparatus searches for audio files in a portable digital audio player in combination with an automobile audio system through speech sounds by utilizing text-to-speech processing and by prompting response from a user in response to the generated speech sounds. The text-to-speech technology is utilized to generate the speech sound based on tag-data of the audio files. When hearing the speech sounds, the user gives instruction for searching the files without being distracted from driving the automobile.

226 citations

PatentDOI
TL;DR: In this article, a method and system for synthesizing speech utilizing a periodic waveform decomposition and relocation coding scheme was proposed, where signals of voiced sound interval among original speech are decomposed into wavelets, each of which corresponds to a speech waveform for one period made by each glottal pulse.
Abstract: The present invention relates to a method and system for synthesizing speech utilizing a periodic waveform decomposition and relocation coding scheme. According to the scheme, signals of voiced sound interval among original speech are decomposed into wavelets, each of which corresponds to a speech waveform for one period made by each glottal pulse. These wavelets are respectively coded and stored. The wavelets nearest to the positions where the wavelets are to be located are selected from stored wavelets and decoded. The decoded wavelets are superposed to each other such that original sound quality can be maintained and duration and pitch frequency of speech segment can be controlled arbitrarily.

224 citations

Patent
18 Nov 1996
TL;DR: In this paper, a home entertainment and information system is provided which assigns and transmits audio programming to audio output devices, such that when a program is selected using the remote control device, the audio portion of the program is transmitted to the assigned audio output device.
Abstract: A home entertainment and information system is provided which assigns and transmits audio programming to audio output devices. Digital and analog signals from a variety of program sources are received by the home entertainment and information system. The system assigns and transmits to an audio output device a program that is distinct from programs assigned and transmitted to other audio output devices within the same system, and thus where two users are viewing different programs visually displayed on the same or different monitors, they hear the audio portion of the respective program they are viewing through individual audio output devices. An audio output device is also assignable to a remote control device such that when a program is selected using the remote control device, the audio portion of the program is transmitted to the assigned audio output device.

223 citations

Patent
Mark F. Davis1
28 Feb 2005
TL;DR: In this article, the authors proposed an improved decorrelation of multiple audio channels derived from a monophonic audio channel or from multiple channels of audio along with related auxiliary information from which multiple channels can be reconstructed.
Abstract: Multiple channels of audio are combined either to a monophonic composite signal or to multiple channels of audio along with related auxiliary information from which multiple channels of audio are reconstructed, including improved downmixing of multiple audio channels to a monophonic audio signal or to multiple audio channels and improved decorrelation of multiple audio channels derived from a monophonic audio channel or from multiple audio channels. Aspects of the disclosed invention are usable in audio encoders, decoders, encode/decode systems, downmixers, upmixers, and decorrelators.

221 citations


Network Information
Related Topics (5)
Signal processing
73.4K papers, 983.5K citations
86% related
Decoding methods
65.7K papers, 900K citations
84% related
Fading
55.4K papers, 1M citations
80% related
Feature vector
48.8K papers, 954.4K citations
80% related
Feature extraction
111.8K papers, 2.1M citations
80% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
202338
202284
202170
202062
201977
2018108