MSR Identity Toolbox v1.0: A MATLAB Toolbox for Speaker Recognition Research

Open Access

MSR Identity Toolbox v1.0: A MATLAB Toolbox for Speaker Recognition Research

Chats0

TLDR

The MSR Identity Toolbox is released, which contains a collection of MATLAB tools and routines that can be used for research and development in speaker recognition, and provides many of the functionalities available in other open-source speaker recognition toolkits.

Abstract:

We are happy to announce the release of the MSR Identity Toolbox: A MATLAB toolbox for speaker-recognition research. This toolbox contains a collection of MATLAB tools and routines that can be used for research and development in speaker recognition. It provides researchers with a test bed for developing new front-end and back-end techniques, allowing replicable evaluation of new advancements. It will also help newcomers in the field by lowering the "barrier to entry," enabling them to quickly build baseline systems for their experiments. Although the focus of this toolbox is on speaker recognition, it can also be used for other speech related applications such as language, dialect, and accent identification. Additionally, it provides many of the functionalities available in other open-source speaker recognition toolkits (e.g., ALIZE

Citations

PDF

Open Access

More filters

Posted Content

SoK: The Faults in our ASRs: An Overview of Attacks against Automatic Speech Recognition and Speaker Identification Systems

Hadi Abdullah, +4 more

- 13 Jul 2020 -

arXiv: Cryptography and Security

TL;DR: It is argued that substantial additional work is required to provide adequate mitigations in the speech and speaker recognition space and systematizing existing research and providing a taxonomy through which the community can evaluate future work is needed.

...read moreread less

Proceedings ArticleDOI

An extensible speaker identification sidekit in Python

Anthony Larcher, +2 more

TL;DR: SIDEKIT is a new open-source Python toolkit that includes a large panel of state-of-the-art components and allow a rapid prototyping of an end-to-end speaker recognition system.

...read moreread less

Journal ArticleDOI

Enhanced Forensic Speaker Verification Using a Combination of DWT and MFCC Feature Warping in the Presence of Noise and Reverberation Conditions

Ahmed Kamil Hasan Al-Ali, +4 more

- 19 Jul 2017 -

IEEE Access

TL;DR: In this article, the authors investigated the effectiveness of combining features, mel frequency cepstral coefficients (MFCCs), and MFCC extracted from the discrete wavelet transform (DWT) of the speech, with and without feature warping for improving modern identityvector (i-vector)-based speaker verification performance in the presence of noise and reverberation.

...read moreread less

Proceedings ArticleDOI

Model Fusion for Multimodal Depression Classification and Level Detection

Mohammed Senoussaoui, +3 more

TL;DR: It is shown that an i-vector based representation for short term audio features contains useful information for depression classification and prediction, and employed a classification step prior to regression to allow having different regression models depending on the presence or absence of depression.

...read moreread less

Journal ArticleDOI

Towards the identification of Idiopathic Parkinson’s Disease from the speech. New articulatory kinetic biomarkers

Juan Ignacio Godino-Llorente, +4 more

- 14 Dec 2017 -

PLOS ONE

TL;DR: The goal of this work is to identify and interpret new reliable and complementary articulatory biomarkers that could be applied to predict/evaluate Parkinson’s Disease from a diadochokinetic test, contributing to the possibility of a further multidimensional analysis of the speech of parkinsonian patients.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Book

Introduction to Statistical Pattern Recognition

Keinosuke Fukunaga

TL;DR: This completely revised second edition presents an introduction to statistical pattern recognition, which is appropriate as a text for introductory courses in pattern recognition and as a reference book for workers in the field.

...read moreread less

Journal ArticleDOI

Speaker Verification Using Adapted Gaussian Mixture Models

Douglas A. Reynolds, +2 more

- 01 Jan 2000 -

Digital Signal Processing

TL;DR: The major elements of MIT Lincoln Laboratory's Gaussian mixture model (GMM)-based speaker verification system used successfully in several NIST Speaker Recognition Evaluations (SREs) are described.

...read moreread less

Book

Introduction to statistical pattern recognition (2nd ed.)

Keinosuke Fukunaga

Journal ArticleDOI

Front-End Factor Analysis for Speaker Verification

Najim Dehak, +4 more

- 01 May 2011 -

IEEE Transactions on Audio, Speech, and ...

TL;DR: An extension of the previous work which proposes a new speaker representation for speaker verification, a new low-dimensional speaker- and channel-dependent space is defined using a simple factor analysis, named the total variability space because it models both speaker and channel variabilities.

...read moreread less

Proceedings ArticleDOI

Probabilistic Linear Discriminant Analysis for Inferences About Identity

Simon J. D. Prince, +1 more

TL;DR: This paper describes face data as resulting from a generative model which incorporates both within- individual and between-individual variation, and calculates the likelihood that the differences between face images are entirely due to within-individual variability.

...read moreread less