Petr Schwarz

Researcher at Brno University of Technology

Publications - 42

Citations - 8385

Petr Schwarz is an academic researcher from Brno University of Technology. The author has contributed to research in topics including NIST and speaker recognition. The author has an h-index of 22 and has co-authored 42 publications receiving 7632 citations. Previous affiliations of Petr Schwarz include Oregon Health & Science University.

Papers

Proceedings Article

The Kaldi Speech Recognition Toolkit

Daniel Povey, +12 more

TL;DR: Describes the design of Kaldi, a free, open-source toolkit for speech recognition research that provides a speech recognition system based on finite-state automata, together with detailed documentation and a comprehensive set of scripts for building complete recognition systems.

Journal Article

The subspace Gaussian mixture model-A structured model for speech recognition

Daniel Povey, +12 more

- 01 Apr 2011 -

Computer Speech & Language

TL;DR: A new approach to speech recognition, in which all Hidden Markov Model states share the same Gaussian Mixture Model (GMM) structure, with the same number of Gaussians in each state, appears to give better results than a conventional model.

Journal Article

Fusion of Heterogeneous Speaker Recognition Systems in the STBU Submission for the NIST Speaker Recognition Evaluation 2006

Niko Brümmer, +9 more

- 01 Sep 2007 -

IEEE Transactions on Audio, Speech, and ...

TL;DR: The STBU speaker recognition system was a combination of three main kinds of subsystems and performed well in the NIST Speaker Recognition Evaluation (SRE) 2006.

Proceedings Article

Hierarchical Structures of Neural Networks for Phoneme Recognition

Petr Schwarz, +2 more

TL;DR: This paper deals with phoneme recognition based on neural networks (NN). It focuses on temporal patterns (TRAPs) and novel split temporal context (STC) phoneme recognizers, and investigates tandem NN architectures.

Proceedings Article

Subspace Gaussian Mixture Models for speech recognition

Daniel Povey, +12 more

TL;DR: An acoustic modeling approach in which all phonetic states share a common Gaussian Mixture Model structure, with the means and mixture weights varying in a subspace of the total parameter space. This style of acoustic model allows for a much more compact representation.
