Open Access
MSR Identity Toolbox v1.0: A MATLAB Toolbox for Speaker Recognition Research
Reads0
Chats0
TLDR
The MSR Identity Toolbox is released, which contains a collection of MATLAB tools and routines that can be used for research and development in speaker recognition, and provides many of the functionalities available in other open-source speaker recognition toolkits.Abstract:
We are happy to announce the release of the MSR Identity Toolbox: A MATLAB toolbox for speaker-recognition research. This toolbox contains a collection of MATLAB tools and routines that can be used for research and development in speaker recognition. It provides researchers with a test bed for developing new front-end and back-end techniques, allowing replicable evaluation of new advancements. It will also help newcomers in the field by lowering the "barrier to entry," enabling them to quickly build baseline systems for their experiments. Although the focus of this toolbox is on speaker recognition, it can also be used for other speech related applications such as language, dialect, and accent identification. Additionally, it provides many of the functionalities available in other open-source speaker recognition toolkits (e.g., ALIZEread more
Citations
More filters
Journal ArticleDOI
Adversarially Learned Total Variability Embedding for Speaker Recognition with Random Digit Strings.
Woo Hyun Kang,Nam Soo Kim +1 more
TL;DR: A novel technique for extracting an i-vector-like feature based on an adversarially learned inference (ALI) model which summarizes the variability within the Gaussian mixture model (GMM) distribution through a nonlinear process is proposed.
Proceedings ArticleDOI
Incorporating Local Acoustic Variability Information into Short Duration Speaker Verification.
TL;DR: Gaussian Probabilistic Linear Discriminant Analysis of the supervector space, with a block diagonal covariance assumption, is used in conjunction with the traditional total variability model to incorporate component-wise local acoustic variability information into the speaker verification framework.
The effect of speaking style on the performance of a forensic voice comparison system
TL;DR: The effect of mismatches in different normal-voice speaking styles on the performance of an automatic forensic voice comparison system when different datasets are used for training and testing the system.
Journal ArticleDOI
Dialect Identification using Chroma-Spectral Shape Features with Ensemble Technique
TL;DR: In this paper, chroma features familiar with music-related systems are employed for identification of dialects, and eight significant spectral shape related features from short term spectra are computed and combined along with Chroma features and named as chroma-spectral shape features.
Dissertation
Robust text independent closed set speaker identification systems and their evaluation
TL;DR: As recommendations from the study, mean fusion is found to yield overall best performance in terms of the SIA with noisy speech, whereas linear weighted sum fusion is overall best for original database recordings.
References
More filters
Book
Introduction to Statistical Pattern Recognition
TL;DR: This completely revised second edition presents an introduction to statistical pattern recognition, which is appropriate as a text for introductory courses in pattern recognition and as a reference book for workers in the field.
Journal ArticleDOI
Speaker Verification Using Adapted Gaussian Mixture Models
TL;DR: The major elements of MIT Lincoln Laboratory's Gaussian mixture model (GMM)-based speaker verification system used successfully in several NIST Speaker Recognition Evaluations (SREs) are described.
Journal ArticleDOI
Front-End Factor Analysis for Speaker Verification
TL;DR: An extension of the previous work which proposes a new speaker representation for speaker verification, a new low-dimensional speaker- and channel-dependent space is defined using a simple factor analysis, named the total variability space because it models both speaker and channel variabilities.
Proceedings ArticleDOI
Probabilistic Linear Discriminant Analysis for Inferences About Identity
TL;DR: This paper describes face data as resulting from a generative model which incorporates both within- individual and between-individual variation, and calculates the likelihood that the differences between face images are entirely due to within-individual variability.