Julius --- An Open Source Real-Time Large Vocabulary Recognition Engine

Open AccessProceedings Article

Julius --- An Open Source Real-Time Large Vocabulary Recognition Engine

Akinobu Lee, +2 more

- Vol. 3, pp 1691-1694

Chats0

TLDR

EUROSPEECH2001: the 7th European Conference on Speech Communication and Technology, September 3-7, 2001, Aalborg, Denmark.

Abstract:

EUROSPEECH2001: the 7th European Conference on Speech Communication and Technology, September 3-7, 2001, Aalborg, Denmark.

Citations

PDF

Open Access

More filters

Proceedings Article

The Kaldi Speech Recognition Toolkit

Daniel Povey, +12 more

TL;DR: The design of Kaldi is described, a free, open-source toolkit for speech recognition research that provides a speech recognition system based on finite-state automata together with detailed documentation and a comprehensive set of scripts for building complete recognition systems.

...read moreread less

Proceedings ArticleDOI

ESPNet: End-to-end speech processing toolkit

Shinji Watanabe, +11 more

TL;DR: In this article, a new open source platform for end-to-end speech processing named ESPnet is introduced, which mainly focuses on automatic speech recognition (ASR), and adopts widely used dynamic neural network toolkits, Chainer and PyTorch, as a main deep learning engine.

...read moreread less

Journal ArticleDOI

Hacking smart machines with smarter ones: How to extract meaningful data from machine learning classifiers

Giuseppe Ateniese, +5 more

- 01 Sep 2015 -

International Journal of Security and Ne...

TL;DR: It is shown that it is possible to infer unexpected but useful information from ML classifiers and that this kind of information leakage can be exploited by a vendor to build more effective classifiers or to simply acquire trade secrets from a competitor's apparatus, potentially violating its intellectual property rights.

...read moreread less

Proceedings Article

Recent Development of Open-Source Speech Recognition Engine Julius

Akinobu Lee, +1 more

TL;DR: An overview of Julius, major features and specifications are described, and the developments conducted in the recent years are summarized.

...read moreread less

Journal ArticleDOI

ModDrop: Adaptive Multi-Modal Gesture Recognition

Natalia Neverova, +3 more

- 01 Aug 2016 -

IEEE Transactions on Pattern Analysis an...

TL;DR: The proposed ModDrop training technique ensures robustness of the classifier to missing signals in one or several channels to produce meaningful predictions from any number of available modalities, and demonstrates the applicability of the proposed fusion scheme to modalities of arbitrary nature by experiments on the same dataset augmented with audio.

...read moreread less

Collapse

References

PDF

Open Access

More filters

The HTK book

Steve Young, +4 more

TL;DR: The Fundamentals of HTK: General Principles of HMMs, Recognition and Viterbi Decoding, and Continuous Speech Recognition.

...read moreread less

Proceedings Article

Statistical Language Modeling using the CMU-Cambridge Toolkit

Philip Clarkson, +1 more

TL;DR: The CMU Statistical Language Modeling toolkit was re leased in in order to facilitate the construction and testing of bigram and trigram language models and the technology as implemented in the toolkit is outlined.

...read moreread less

Proceedings ArticleDOI

A new phonetic tied-mixture model for efficient decoding

Akinobu Lee, +3 more

TL;DR: A phonetic tied-mixture model for efficient large vocabulary continuous speech recognition that enables the decoder to perform efficient Gaussian pruning and it is found out that computing only two out of 64 components does not cause any loss of accuracy.

...read moreread less

Proceedings ArticleDOI

Gaussian mixture selection using context-independent HMM

Akinobu Lee, +2 more

TL;DR: The proposed method achieves comparable performance as the standard Gaussian selection, and performs much better under aggressive pruning condition, and acoustic matching cost is reduced to almost 14% with little loss of accuracy.

...read moreread less