GlobalPhone: A multilingual text & speech database in 20 languages

doi:10.1109/ICASSP.2013.6639248

Proceedings ArticleDOI

GlobalPhone: A multilingual text & speech database in 20 languages

- pp 8126-8130

TLDR

The advances in the multilingual text and speech database GlobalPhone, a multilingual database of high-quality read speech with corresponding transcriptions and pronunciation dictionaries in 20 languages, are described.

Abstract:

This paper describes the advances in the multilingual text and speech database GlobalPhone, a multilingual database of high-quality read speech with corresponding transcriptions and pronunciation dictionaries in 20 languages. GlobalPhone was designed to be uniform across languages with respect to the amount of data, speech quality, the collection scenario, the transcription and phone set conventions. With more than 400 hours of transcribed audio data from more than 2000 native speakers GlobalPhone supplies an excellent basis for research in the areas of multilingual speech recognition, rapid deployment of speech processing systems to yet unsupported languages, language identification tasks, speaker recognition in multiple languages, multilingual speech synthesis, as well as monolingual speech recognition in a large variety of languages.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Ethnologue: Languages of the World

Sarah L. Nesbeitt

- 01 Nov 1999 -

Electronic Resources Review

Proceedings ArticleDOI

Montreal Forced Aligner: Trainable Text-Speech Alignment Using Kaldi.

Michael McAuliffe, +4 more

TL;DR: The Montreal Forced Aligner (MFA) is an update to the Prosodylab-Aligner, and maintains its key functionality of trainability on new data, as well as incorporating improved architecture (triphone acoustic models and speaker adaptation), and other features.

...read moreread less

Journal ArticleDOI

Automatic speech recognition for under-resourced languages: A survey

Laurent Besacier, +3 more

- 01 Jan 2014 -

Speech Communication

TL;DR: This paper proposes, in this paper, a survey that focuses on automatic speech recognition (ASR) for under-resourced languages, and a literature review of the recent contributions made.

...read moreread less

Proceedings ArticleDOI

Multilingual deep neural network based acoustic modeling for rapid language adaptation

Ngoc Thang Vu, +5 more

TL;DR: The studies reveal that crosslingual acoustic model transfer through multilingual DNNs is superior to unsupervised RBM pre-training and greedy layer-wise supervised training and that KL-HMM based decoding consistently outperforms conventional hybrid decoding, especially in low-resource scenarios.

...read moreread less

Journal ArticleDOI

A Review on Automatic Speech Recognition Architecture and Approaches

S. Karpagavalli, +1 more

- 30 Apr 2016 -

International Journal of Signal Processi...

TL;DR: A detailed study on automatic speech recognition is carried out and presented in this paper that covers the architecture, speech parameterization, methodologies, characteristics, issues, databases, tools and applications.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings Article

SRILM – An Extensible Language Modeling Toolkit

Andreas Stolcke

TL;DR: The functionality of the SRILM toolkit is summarized and its design and implementation is discussed, highlighting ease of rapid prototyping, reusability, and combinability of tools.

...read moreread less

Book

Ethnologue : languages of the world

Paul M. Lewis

Journal ArticleDOI

Ethnologue: Languages of the World

Sarah L. Nesbeitt

- 01 Nov 1999 -

Electronic Resources Review

Journal ArticleDOI

Language-independent and language-adaptive acoustic modeling for speech recognition

Tanja Schultz, +3 more

- 01 Aug 2001 -

Speech Communication

TL;DR: Different methods for multilingual acoustic model combination and a polyphone decision tree specialization procedure are introduced for estimating acoustic models for a new target language using speech data from varied source languages, but only limited data from the target language.

...read moreread less

Proceedings Article

GlobalPhone: A Multilingual Speech and Text Database developed at Karlsruhe University

Tanja Schultz

TL;DR: The design, collection, and current status of the multilingual database GlobalPhone is described, an ongoing project since 1995 at Karlsruhe University, which is suitable for the development of large vocabulary speech recognition systems in many languages.

...read moreread less