Speaker adaptation through vector quantization

doi:10.1109/ICASSP.1986.1168676

Proceedings ArticleDOI

Speaker adaptation through vector quantization

Kiyohiro Shikano, +2 more

- Vol. 11, pp 2643-2646

Chats0

TLDR

Vector quantization (VQ) is a technique that reduces the computation amount and memory size drastically and is proposed in order to improve speaker-independent recognition.

Abstract:

Vector quantization (VQ) is a technique that reduces the computation amount and memory size drastically. In this paper, speaker adaptation algorithms through VQ are proposed in order to improve speaker-independent recognition. The speaker adaptation algorithms use VQ codebooks of a reference speaker and an input speaker. Speaker adaptation is performed by substituting vectors in the codebook of a reference speaker for vectors of the input speaker's codebook, or vice versa. To confirm the effectiveness of these algorithms, word recognition experiments are carried out using the IBM office correspondence task uttered by 11 speakers. The total number of words is 1174 for each speaker, and the number of different words is 422. The average word recognition rate using different speaker's reference through speaker adaptation is 80.9%, and the rate within the second choice is 92.0%.

Citations

PDF

Open Access

More filters

Patent

Intelligent Automated Assistant

Thomas R. Gruber, +7 more

TL;DR: In this article, an intelligent automated assistant system engages with the user in an integrated, conversational manner using natural language dialog, and invokes external services when appropriate to obtain information or perform various actions.

...read moreread less

Journal ArticleDOI

Continuous probabilistic transform for voice conversion

Yannis Stylianou, +2 more

- 01 Mar 1998 -

IEEE Transactions on Speech and Audio Pr...

TL;DR: The design of a new methodology for representing the relationship between two sets of spectral envelopes and the proposed transform greatly improves the quality and naturalness of the converted speech signals compared with previous proposed conversion methods.

...read moreread less

Journal ArticleDOI

Speaker-independent phone recognition using hidden Markov models

Kai-Fu Lee, +1 more

- 01 Nov 1989 -

IEEE Transactions on Acoustics, Speech, ...

TL;DR: The authors introduce the co-occurrence smoothing algorithm, which enables accurate recognition even with very limited training data, and can be used as benchmarks to evaluate future systems.

...read moreread less

Patent

Automated Response to and Sensing of User Activity in Portable Devices

Brian Q. Huppi, +3 more

TL;DR: In this paper, various methods and devices described herein relate to devices which, in at least certain embodiments, may include one or more sensors for providing data relating to user activity and at least one processor for causing the device to respond based on the user activity which was determined, at least in part, through the sensors.

...read moreread less

Patent

Using context information to facilitate processing of commands in a virtual assistant

Thomas R. Gruber, +4 more

TL;DR: In this article, a virtual assistant uses context information to supplement natural language or gestural input from a user, which helps to clarify the user's intent and reduce the number of candidate interpretations of user's input, and reduces the need for the user to provide excessive clarification input.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

An Algorithm for Vector Quantizer Design

Y. Linde, +2 more

- 01 Jan 1980 -

IEEE Transactions on Communications

TL;DR: An efficient and intuitive algorithm is presented for the design of vector quantizers based either on a known probabilistic model or on a long training sequence of data.

...read moreread less

Journal ArticleDOI

On the application of vector quantization and hidden Markov models to speaker-independent, isolated word recognition

Lawrence R. Rabiner, +2 more

- 01 Apr 1983 -

Bell System Technical Journal

TL;DR: This paper presents an approach to speaker-independent, isolated word recognition in which the well-known techniques of vector quantization and hidden Markov modeling are combined with a linear predictive coding analysis front end in the framework of a standard statistical pattern recognition model.

...read moreread less

Journal ArticleDOI

Discrete utterance speech recognition without time alignment

J. Shore, +1 more

- 01 Jul 1983 -

IEEE Transactions on Information Theory

TL;DR: The results of a new method based on rate-distortion speech coding (speech coding by vector quantization), minimum cross-entropy pattern classification, and information-theoretic spectral distortion measures for discrete utterance speech recognition are presented.

...read moreread less

Proceedings ArticleDOI

Isolated word recognition using phoneme-like templates

N. Sugamura, +2 more

TL;DR: New technique for use in a word recognition system where word templates are represented as sequences of descrete phoneme-like (pseudo-phoneme) templates which are automatically determined from a training set of word utterances by a clustering technique.

...read moreread less

Proceedings ArticleDOI

A real-time, isolated-word, speech recognition system for dictation transcription

Frederick Jelinek

TL;DR: The architecture of an experimental, real-time, isolated-word, speech recognition system with a 5,000-word vocabulary which can be used for dictating office correspondence is described and some recent experimental results obtained are given.

...read moreread less

Speaker adaptation through vector quantization

Citations

Intelligent Automated Assistant

Continuous probabilistic transform for voice conversion

Speaker-independent phone recognition using hidden Markov models

Automated Response to and Sensing of User Activity in Portable Devices

Using context information to facilitate processing of commands in a virtual assistant

References

An Algorithm for Vector Quantizer Design

On the application of vector quantization and hidden Markov models to speaker-independent, isolated word recognition

Discrete utterance speech recognition without time alignment

Isolated word recognition using phoneme-like templates

A real-time, isolated-word, speech recognition system for dictation transcription

Related Papers (5)

An Algorithm for Vector Quantizer Design

A Maximum Likelihood Approach to Continuous Speech Recognition

Continuous speech recognition by statistical methods

Large-vocabulary speaker-independent continuous speech recognition: the sphinx system

Speaker-independent isolated word recognition using dynamic features of speech spectrum