Fast approach to speaker identification for large population using MLLR and sufficient statistics

doi:10.1109/NCC.2010.5430206

Proceedings ArticleDOI

Fast approach to speaker identification for large population using MLLR and sufficient statistics

Achintya Kumar Sarkar, +2 more

- pp 1-5

Chats0

TLDR

A Maximum Likelihood Linear Regression (MLLR) based fast method to calculate the likelihood from the speaker model using the MLLR matrix that performs faster than GMM-UBM based system with some degradation in system accuracy.

Abstract:

In speaker identification, most of the computational processing time is required to calculate the likelihood of the test utterance of the unknown speaker with respect to the speaker models in the database. When number of speakers in the database is in the order of 10,000 or more, then computational complexity becomes very high. In this paper, we propose a Maximum Likelihood Linear Regression (MLLR) based fast method to calculate the likelihood from the speaker model using the MLLR matrix. The proposed technique will help to quickly find the best N speakers during identification. After that final speaker identification task can be done within the N selected speakers using any conventional method of speaker identification. The comparative study of the proposed method is done in terms of processing time with the state-of-the-art GMM-UBM based system on NIST 2004 SRE. The proposed technique performs faster than GMM-UBM based system with some degradation in system accuracy.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Fuzzy-Clustering-Based Decision Tree Approach for Large Population Speaker Identification

Yakun Hu, +2 more

- 01 Apr 2013 -

IEEE Transactions on Audio, Speech, and ...

TL;DR: The key idea of the approach is to use a decision tree to hierarchically partition the whole population into groups of small size, and determine which speaker group at the leaf node a speaker under test belongs to, and apply MFCC+GMM to the selected speaker group for speaker identification.

...read moreread less

Patent

Methods for creating and searching a database of speakers

Woojay Jeon, +3 more

TL;DR: In this paper, a method of performing a search of a database of speakers, including deriving a query utterance from the query speech sample, extracting query utterances statistics from the utterance, and performing Kernelized Locality-Sensitive Hashing (KLSH) using a kernel function, is presented.

...read moreread less

Proceedings ArticleDOI

A fast two-level Speaker Identification method employing sparse representation and GMM-based methods

Hossein Zeinali, +3 more

TL;DR: This paper proposes a two-step method that utilizes two different identification methods that use Nearest Neighbor method to decrease the search space and GMM-based SI methods to specify the target speaker.

...read moreread less

Journal ArticleDOI

Speaker Identification using a Novel Prosody with Fuzzy based Hierarchical Decision Tree Approach

K. Manikandan, +1 more

- 30 Nov 2016 -

Indian journal of science and technology

TL;DR: The proposed speaker identification using a novel prosody with fuzzy based hierarchical decision tree approach and is used to modifying the limitations of existing traditional methods improves the performance of speaker identification in given population under noisy environments.

...read moreread less

Journal ArticleDOI

Speaker identification using fuzzy i-vector tree

Jakub Galka, +1 more

- 01 Jan 2019 -

Journal of Intelligent and Fuzzy Systems

References

PDF

Open Access

More filters

Journal ArticleDOI

Maximum likelihood from incomplete data via the EM algorithm

Arthur P. Dempster, +2 more

- 01 Sep 1977 -

Journal of the royal statistical society...

Journal ArticleDOI

Speaker Verification Using Adapted Gaussian Mixture Models

Douglas A. Reynolds, +2 more

- 01 Jan 2000 -

Digital Signal Processing

TL;DR: The major elements of MIT Lincoln Laboratory's Gaussian mixture model (GMM)-based speaker verification system used successfully in several NIST Speaker Recognition Evaluations (SREs) are described.

...read moreread less

Journal Article

Maximum likelihood estimation from incomplete data via the EM algorithm

A. Dempster

- 01 Jan 1977 -

Journal of the Royal Statistical Society

Journal ArticleDOI

Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models

C. J. Leggetter, +1 more

- 01 Apr 1995 -

Computer Speech & Language

TL;DR: An important feature of the method is that arbitrary adaptation data can be used—no special enrolment sentences are needed and that as more data is used the adaptation performance improves.

...read moreread less

The HTK book

Steve Young, +4 more

TL;DR: The Fundamentals of HTK: General Principles of HMMs, Recognition and Viterbi Decoding, and Continuous Speech Recognition.

...read moreread less

Related Papers (5)

Comparison of Speaker Adaptation Methods as Feature Extraction for SVM-Based Speaker Recognition

M. Ferras, +3 more

- 01 Aug 2010 -

IEEE Transactions on Audio, Speech, and ...

Fast approach to speaker identification for large population using MLLR and sufficient statistics

Citations

Fuzzy-Clustering-Based Decision Tree Approach for Large Population Speaker Identification

Methods for creating and searching a database of speakers

A fast two-level Speaker Identification method employing sparse representation and GMM-based methods

Speaker Identification using a Novel Prosody with Fuzzy based Hierarchical Decision Tree Approach

Speaker identification using fuzzy i-vector tree

References

Maximum likelihood from incomplete data via the EM algorithm

Speaker Verification Using Adapted Gaussian Mixture Models

Maximum likelihood estimation from incomplete data via the EM algorithm

Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models

The HTK book

Related Papers (5)

Comparison of Speaker Adaptation Methods as Feature Extraction for SVM-Based Speaker Recognition

Efficient speaker identification using distributional speaker model clustering

Unsupervised speaker adaptation based on sufficient HMM statistics of selected speakers

A Comparison of Various Adaptation Methods for Speaker Verification With Limited Enrollment Data

Speaker Model Clustering for Efficient Speaker Identification in Large Population Applications