scispace - formally typeset
Open AccessProceedings Article

Julius --- An Open Source Real-Time Large Vocabulary Recognition Engine

Reads0
Chats0
TLDR
EUROSPEECH2001: the 7th European Conference on Speech Communication and Technology, September 3-7, 2001, Aalborg, Denmark.
Abstract
EUROSPEECH2001: the 7th European Conference on Speech Communication and Technology, September 3-7, 2001, Aalborg, Denmark.

read more

Content maybe subject to copyright    Report

Citations
More filters
Proceedings Article

The Kaldi Speech Recognition Toolkit

TL;DR: The design of Kaldi is described, a free, open-source toolkit for speech recognition research that provides a speech recognition system based on finite-state automata together with detailed documentation and a comprehensive set of scripts for building complete recognition systems.
Proceedings ArticleDOI

ESPNet: End-to-end speech processing toolkit

TL;DR: In this article, a new open source platform for end-to-end speech processing named ESPnet is introduced, which mainly focuses on automatic speech recognition (ASR), and adopts widely used dynamic neural network toolkits, Chainer and PyTorch, as a main deep learning engine.
Journal ArticleDOI

Hacking smart machines with smarter ones: How to extract meaningful data from machine learning classifiers

TL;DR: It is shown that it is possible to infer unexpected but useful information from ML classifiers and that this kind of information leakage can be exploited by a vendor to build more effective classifiers or to simply acquire trade secrets from a competitor's apparatus, potentially violating its intellectual property rights.
Proceedings Article

Recent Development of Open-Source Speech Recognition Engine Julius

TL;DR: An overview of Julius, major features and specifications are described, and the developments conducted in the recent years are summarized.
Journal ArticleDOI

ModDrop: Adaptive Multi-Modal Gesture Recognition

TL;DR: The proposed ModDrop training technique ensures robustness of the classifier to missing signals in one or several channels to produce meaningful predictions from any number of available modalities, and demonstrates the applicability of the proposed fusion scheme to modalities of arbitrary nature by experiments on the same dataset augmented with audio.
References
More filters

The HTK book

TL;DR: The Fundamentals of HTK: General Principles of HMMs, Recognition and Viterbi Decoding, and Continuous Speech Recognition.
Proceedings Article

Statistical Language Modeling using the CMU-Cambridge Toolkit

TL;DR: The CMU Statistical Language Modeling toolkit was re leased in in order to facilitate the construction and testing of bigram and trigram language models and the technology as implemented in the toolkit is outlined.
Proceedings ArticleDOI

A new phonetic tied-mixture model for efficient decoding

TL;DR: A phonetic tied-mixture model for efficient large vocabulary continuous speech recognition that enables the decoder to perform efficient Gaussian pruning and it is found out that computing only two out of 64 components does not cause any loss of accuracy.
Proceedings ArticleDOI

Gaussian mixture selection using context-independent HMM

TL;DR: The proposed method achieves comparable performance as the standard Gaussian selection, and performs much better under aggressive pruning condition, and acoustic matching cost is reduced to almost 14% with little loss of accuracy.
Related Papers (5)