Method and apparatus for speaker identification using mixture discriminant analysis to develop speaker models

doi:10.1121/1.1500916

PatentDOI

Method and apparatus for speaker identification using mixture discriminant analysis to develop speaker models

Sarangarajan Parthasarathy, +1 more

- 16 Mar 2001 -

Journal of the Acoustical Society of Ame...

- Vol. 112, Iss: 1, pp 21

TLDR

In this paper, a speaker identification system is provided that constructs speaker models using a discriminant analysis technique where the data in each class is modeled by Gaussian mixtures, and the likelihood scores of the second set of feature vectors are computed using speaker models trained using mixture discriminant analyses.

Abstract:

A speaker identification system is provided that constructs speaker models using a discriminant analysis technique where the data in each class is modeled by Gaussian mixtures. The speaker identification method and apparatus determines the identity of a speaker, as one of a small group, based on a sentence-length password utterance. A speaker's utterance is received and a sequence of a first set of feature vectors are computed based on the received utterance. The first set of feature vectors are then transformed into a second set of feature vectors using transformations specific to a particular segmentation unit, and likelihood scores of the second set of feature vectors are computed using speaker models trained using mixture discriminant analysis. The likelihood scores are then combined to determine an utterance score and the speaker's identity is validated based on the utterance score. The speaker identification method and apparatus also includes training and enrollment phases. In the enrollment phase the speaker's password utterance is received multiple times. A transcription of the password utterance as a sequence of phones is obtained, and the phone string is stored in a database containing phone strings of other speakers in the group. In the training phase, the first set of feature vectors are extracted from each password utterance and the phone boundaries for each phone in the password transcription are obtained using a speaker independent phone recognizer. A mixture model is developed for each phone of a given speaker's password. Then, using the feature vectors from the password utterances of all of the speakers in the group, transformation parameters and transformed models are generated for each phone and speaker, using mixture discriminant analysis.

Method and apparatus for speaker identification using mixture discriminant analysis to develop speaker models

Citations

Method and system for considering information about an expected response when performing speech recognition

Systems and methods for dynamically improving user intelligibility of synthesized speech in a work environment

Methods and systems for identifying errors in a speech recognition system

Method and system for mitigating delay in receiving audio stream during production of sound from audio stream

Method and apparatus for searching for music based on speech recognition

References

Discriminant Analysis by Gaussian Mixtures

Flexible Discriminant Analysis by Optimal Scoring

Sequential, nonparametric speech recognition and speaker identification

Signal pattern recognition apparatus comprising parameter training controller for training feature conversion parameters and discriminant functions

Sub-word unit talker verification using hidden Markov models

Related Papers (5)

Speech recognition with attempted speaker recognition for speaker model prefetching or alternative speech modeling

Speaker recognition device

Topic discriminator using posterior probability or confidence scores

Speech recognition using thresholded speaker class model selection or model adaptation

Method and system for controlling an external machine by a voice command