scispace - formally typeset
Search or ask a question

Showing papers on "Linear predictive coding published in 1968"


Patent
Bishnu S. Atal1
19 Aug 1968
TL;DR: In this article, an adaptive predictor is employed which is readjusted periodically to match the time-varying characteristics of a speech signal, which is used to reduce the channel capacity required to transmit a signal with specified fidelity.
Abstract: Predictive coding of signals, i.e., the reduction or redundancy in a signal by subtracting from it that part which can be predicted from its past, is a well-known technique for reducing the channel capacity required to transmit a signal with specified fidelity. It has been widely applied to signals, such as television signals which have regularly repeating intervals of information, but has not been satisfactorily applied to signals, such as speech, which exhibit characteristics that vary from speaker to speaker and from time to time for one speaker. According to this invention, an adaptive predictor is employed which is readjusted periodically to match the time-varying characteristics of a speech signal.

181 citations


Journal ArticleDOI
TL;DR: Only 15 minutes of programmed training in listening to this processed speech was found to be significantly effective for improving perception in all four conditions, and the effect of a short training period on the perception of the speech signals so processed.
Abstract: The purpose of this study was 1) to measure the intelligibility of speech signals processed with a frequency converter designed to effect a spectral compression by a sampling-synthesizing technique, and 2) to investigate the effect of a short training period on the perception of the speech signals so processed. The input signals were composed of 71 CVC words recorded by an American female speaker. The highest center frequency for the input of the 22 channels was 9500 Hz. The four compression ratios of the output to the fixed input frequency spectrum were 1.0, 0.7 0.5, and 0.4 for all 22 channels used. Eighty listeners were randomly selected and assigned to one of the four conditions of spectral compression. The intelligibility score of each condition of spectral compression was obtained from a multiple choice test of six possible choices for each stimulus. The average intelligibility score for all English phonemes was obtained for each condition of compression. The intelligibility of each vowel and consonant was also calculated for all conditions. Only 15 minutes of programmed training in listening to this processed speech was found to be significantly effective for improving perception in all four conditions.

7 citations


Journal ArticleDOI
Q. Wilson1
TL;DR: A speech recognizer model is presented for illustration purposes; parameters and experimental evidence are included to support the contention that pattern recognition can be reduced to a signal detection problem.
Abstract: The purpose of this paper is to call attention to some of the properties of optical signal detection which may be useful for overcoming processing logistics problems in speech recognizers of useful vocabulary. Analog cross correlation by optical means, as described by Vander Lugt [5] permits simultaneous comparison of the speaker's utterance with a stored vocabulary tailored to the particular speaker. Storage might take the form of a microfilm aperture card. A speech recognizer model is presented for illustration purposes; parameters and experimental evidence are included to support the contention that pattern recognition can be reduced to a signal detection problem.

2 citations


Patent
Ernst Rothauser1, Kurt Bandat1
30 Jul 1968
TL;DR: In this article, the authors proposed a method of channel vocoder speech synthesis for representing the speech signals for several independent speech lines, where digital speech data stored in data processors are used.
Abstract: The invention relates to a method of channel vocoder speech synthesis. For representing the speech signals for several independent speech lines, digital speech data stored in data processors are used. In addition, the method provides for the storage of the time-sampled digital description of the transient behaviors of n spectrum channel band-pass filters. Only one such description is needed for synthesizing the speech signals for m speech lines. For a given speech line the transient responses of the band-pass filters are modulated by the frequency function for the given line. The modulated transient values are added for corresponding time samples and stored in a delay line for the given speech line. The stored value of a speech line is released at points in time defined by the excitation function, thus releasing a digital description of the transient response of the set of band-pass filters as if they were excited by a unit pulse and modulated by the frequency function of the given speech line. The digital description is demodulated to an analogue form by conventional means.

2 citations


Journal ArticleDOI
TL;DR: A pattern recognition algorithm has been used to compare the usefulness of two types of spectrum analyzers for speech recognition.
Abstract: A pattern recognition algorithm has been used to compare the usefulness of two types of spectrum analyzers for speech recognition.

1 citations