scispace - formally typeset
Proceedings ArticleDOI

GlobalPhone: A multilingual text & speech database in 20 languages

TLDR
The advances in the multilingual text and speech database GlobalPhone, a multilingual database of high-quality read speech with corresponding transcriptions and pronunciation dictionaries in 20 languages, are described.
Abstract
This paper describes the advances in the multilingual text and speech database GlobalPhone, a multilingual database of high-quality read speech with corresponding transcriptions and pronunciation dictionaries in 20 languages. GlobalPhone was designed to be uniform across languages with respect to the amount of data, speech quality, the collection scenario, the transcription and phone set conventions. With more than 400 hours of transcribed audio data from more than 2000 native speakers GlobalPhone supplies an excellent basis for research in the areas of multilingual speech recognition, rapid deployment of speech processing systems to yet unsupported languages, language identification tasks, speaker recognition in multiple languages, multilingual speech synthesis, as well as monolingual speech recognition in a large variety of languages.

read more

Content maybe subject to copyright    Report

Citations
More filters
Proceedings ArticleDOI

Montreal Forced Aligner: Trainable Text-Speech Alignment Using Kaldi.

TL;DR: The Montreal Forced Aligner (MFA) is an update to the Prosodylab-Aligner, and maintains its key functionality of trainability on new data, as well as incorporating improved architecture (triphone acoustic models and speaker adaptation), and other features.
Journal ArticleDOI

Automatic speech recognition for under-resourced languages: A survey

TL;DR: This paper proposes, in this paper, a survey that focuses on automatic speech recognition (ASR) for under-resourced languages, and a literature review of the recent contributions made.
Proceedings ArticleDOI

Multilingual deep neural network based acoustic modeling for rapid language adaptation

TL;DR: The studies reveal that crosslingual acoustic model transfer through multilingual DNNs is superior to unsupervised RBM pre-training and greedy layer-wise supervised training and that KL-HMM based decoding consistently outperforms conventional hybrid decoding, especially in low-resource scenarios.
Journal ArticleDOI

A Review on Automatic Speech Recognition Architecture and Approaches

TL;DR: A detailed study on automatic speech recognition is carried out and presented in this paper that covers the architecture, speech parameterization, methodologies, characteristics, issues, databases, tools and applications.
References
More filters
Proceedings Article

SRILM – An Extensible Language Modeling Toolkit

TL;DR: The functionality of the SRILM toolkit is summarized and its design and implementation is discussed, highlighting ease of rapid prototyping, reusability, and combinability of tools.
Journal ArticleDOI

Language-independent and language-adaptive acoustic modeling for speech recognition

TL;DR: Different methods for multilingual acoustic model combination and a polyphone decision tree specialization procedure are introduced for estimating acoustic models for a new target language using speech data from varied source languages, but only limited data from the target language.
Proceedings Article

GlobalPhone: A Multilingual Speech and Text Database developed at Karlsruhe University

Tanja Schultz
TL;DR: The design, collection, and current status of the multilingual database GlobalPhone is described, an ongoing project since 1995 at Karlsruhe University, which is suitable for the development of large vocabulary speech recognition systems in many languages.
Related Papers (5)