scispace - formally typeset
M

Mitchell McLaren

Researcher at SRI International

Publications -  115
Citations -  3442

Mitchell McLaren is an academic researcher from SRI International. The author has contributed to research in topics: Speaker recognition & Speaker diarisation. The author has an hindex of 30, co-authored 109 publications receiving 3025 citations. Previous affiliations of Mitchell McLaren include Queensland University of Technology & Radboud University Nijmegen.

Papers
More filters
Proceedings ArticleDOI

A novel scheme for speaker recognition using a phonetically-aware deep neural network

TL;DR: A novel framework for speaker recognition in which extraction of sufficient statistics for the state-of-the-art i-vector model is driven by a deep neural network (DNN) trained for automatic speech recognition (ASR) to produce frame alignments.
Proceedings ArticleDOI

The Speakers in the Wild (SITW) Speaker Recognition Database.

TL;DR: The Speakers in the Wild (SITW) speaker recognition database contains hand-annotated speech samples from open-source media for the purpose of benchmarking text-independent speaker recognition technology on single and multi-speaker audio acquired across unconstrained or “wild” conditions.
Proceedings ArticleDOI

Advances in deep neural network approaches to speaker recognition

TL;DR: This work considers two approaches to DNN-based SID: one that uses the DNN to extract features, and another that uses a DNN during feature modeling, and several methods of DNN feature processing are applied to bring significantly greater robustness to microphone speech.
Journal ArticleDOI

Source-Normalized LDA for Robust Speaker Recognition Using i-Vectors From Multiple Speech Sources

TL;DR: This study provides a thorough analysis of how SN-LDA transforms the i-vector space to reduce source variation and its robustness to varying evaluation and LDA training conditions.
Journal ArticleDOI

Study of senone-based deep neural network approaches for spoken language recognition

TL;DR: This paper compares different approaches for using deep neural networks (DNNs) trained to predict senone posteriors for the task of spoken language recognition (SLR) and concludes that the approach based on bottleneck features followed by i-vector modeling outperform the other two approaches.