Vector-quantization-based speech recognition and speaker recognition techniques

doi:10.1109/ACSSC.1991.186588

Open AccessProceedings ArticleDOI

Vector-quantization-based speech recognition and speaker recognition techniques

S. Furui

- pp 954-958

Chats0

TLDR

It is concluded that not only has the VQ technique reduced the amount of computation and storage, but it has also created new ideas for solving various problems in speech/speaker recognition.

Abstract:

The author reviews major methods of applying the vector quantization (VQ) technique to speech and speaker recognition. These include speech recognition based on the combination of VQ and the DTW/HMM (dynamic time warping/hidden Markov model) technique. VQ-distortion-based recognition, learning VQ algorithms, speaker adaptation by VQ-codebook mapping, and VQ-distortion-based speaker recognition. It is concluded that not only has the VQ technique reduced the amount of computation and storage, but it has also created new ideas for solving various problems in speech/speaker recognition. >

Citations

PDF

Open Access

More filters

Proceedings ArticleDOI

Real-time speaker identification.

Pasi Fränti, +2 more

TL;DR: The number of test vectors is reduced by pre-quantizing the test sequence prior to matching, and the number of speakers are reduced by pruning out unlikely speakers during the identification process by optimizing vector quantization (VQ) based speaker identification.

...read moreread less

DOI

Quranic Verse Recitation Feature Extraction using Mel-Frequency Cepstral Coefficient (MFCC)

Noor Jamaliah Ibrahim, +4 more

TL;DR: This paper explores the viability of Mel-Frequency Cepstral Coefficient (MFCC) technique to extract features from Quranic verse recitation, one of the most popular feature extraction techniques used in speech recognition.

...read moreread less

DOI

Toward Constructing A Multilingual Speech Corpus for Taiwanese (Min-nan), Hakka, and Mandarin

Ren-Yuan Lyu, +2 more

TL;DR: The Formosa speech database (ForSDat) is a multilingual speech corpus collected at Chang Gung University and sponsored by the National Science Council of Taiwan and the first version of this corpus containing speech of 600 speakers of Taiwanese and Mandarin was finished and is ready to be released.

...read moreread less

Multiband Approach to Robust Text-Independent Speaker Identification

Wan-Chen Chen, +2 more

TL;DR: Experimental results show that both proposed methods achieve better performance than GMM using full-band LPCCs and mel-frequency cepstral coefficients (MFCCs) when the speaker identification is evaluated in the presence of clean and noisy environments.

...read moreread less

Book ChapterDOI

Learning Intrinsic Video Content Using Levenshtein Distance in Graph Partitioning

Jeffrey Ng, +1 more

TL;DR: The graph partitioning method is extended and in particular, the Normalised Cut model originally introduced for static image segmentation is extended to unsupervised clustering of temporal trajectories withfully automated model order selection.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Book

Self Organization And Associative Memory

Teuvo Kohonen

TL;DR: The purpose and nature of Biological Memory, as well as some of the aspects of Memory Aspects, are explained.

...read moreread less

Journal ArticleDOI

Hidden Markov models for speech recognition

Biing-Hwang Juang, +1 more

- 01 Aug 1991 -

Technometrics

TL;DR: The role of statistical methods in this powerful technology as applied to speech recognition is addressed and a range of theoretical and practical issues that are as yet unsolved in terms of their importance and their effect on performance for different system implementations are discussed.

...read moreread less

Book

Hidden Markov Models for Speech Recognition

Xuedong Huang, +2 more

TL;DR: In this article, the authors unified theory with semi-continuous models using hidden Markov models for speech recognition experimental examples, using vector quantization and mixture densities hidden markov models.

...read moreread less

BookDOI

Automatic Speech Recognition

Kai-Fu Lee

Proceedings ArticleDOI

Statistical pattern recognition with neural networks: benchmarking studies

Barna, +1 more

TL;DR: Three basic types of neural-like networks, backpropagation network, Boltzmann machine, and learning vector quantization, were applied to two representative artificial statistical pattern recognition tasks, each with varying dimensionality.

...read moreread less