Book Chapter DOI

A Unified Parser for Developing Indian Language Text to Speech Synthesizers

TL;DR: The design of a language-independent parser for text-to-speech synthesis in Indian languages is described; TTS results show that the phoneme sequences generated by the proposed parser are more accurate than those produced by language-specific parsers.
Abstract: This paper describes the design of a language-independent parser for text-to-speech synthesis in Indian languages. Indian languages come from 5–6 different language families of the world. Most Indian languages have their own scripts. This makes parsing for text-to-speech systems for Indian languages a difficult task. In spite of the number of different families, which leads to divergence, there is a convergence owing to borrowings across language families. Most importantly, Indian languages are more or less phonetic and can be considered to consist broadly of about 35–38 consonants and 15–18 vowels. In this paper, an attempt is made to unify the languages based on this broad list of phones. A common label set is defined to represent the various phones in Indian languages. A uniform parser is designed across all the languages, capitalising on the syllable structure of Indian languages. The proposed parser converts UTF-8 text to the common label set, applies letter-to-sound rules and generates the corresponding phoneme sequences. The parser is tested against custom-built parsers for multiple Indian languages. The TTS results show that the phoneme sequences generated by the proposed parser are more accurate than those generated by language-specific parsers.
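The pipeline described here (UTF-8 text → common label set → letter-to-sound rules → phoneme sequence) can be illustrated with a minimal sketch. The grapheme table, label names and the inherent-vowel rule below are simplified assumptions for one script, not the paper's actual common label set or rule inventory:

```python
# Minimal sketch of the UTF-8 -> common-label-set (CLS) parsing idea.
# The table and label names are illustrative assumptions only.

# Tiny illustrative grapheme table for Devanagari (label names assumed).
CONSONANTS = {"क": "k", "म": "m", "ल": "l"}
VOWEL_SIGNS = {"ा": "aa", "ि": "i"}
INDEPENDENT_VOWELS = {"अ": "a", "इ": "i"}
VIRAMA = "्"  # suppresses the inherent vowel of the preceding consonant

def word_to_phones(word: str) -> list[str]:
    """Convert one UTF-8 word to a CLS phone sequence.

    Rule of thumb for Indic abugidas: a consonant letter carries an
    inherent /a/ unless followed by a vowel sign or a virama.
    """
    phones = []
    chars = list(word)
    for i, ch in enumerate(chars):
        if ch in CONSONANTS:
            phones.append(CONSONANTS[ch])
            nxt = chars[i + 1] if i + 1 < len(chars) else None
            if nxt not in VOWEL_SIGNS and nxt != VIRAMA:
                phones.append("a")  # inherent vowel
        elif ch in VOWEL_SIGNS:
            phones.append(VOWEL_SIGNS[ch])
        elif ch in INDEPENDENT_VOWELS:
            phones.append(INDEPENDENT_VOWELS[ch])
        # the virama itself emits nothing
    return phones

print(word_to_phones("कमल"))  # ['k', 'a', 'm', 'a', 'l', 'a'] (schwa deletion not modelled)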
Citations
Proceedings Article DOI
29 Aug 2018
TL;DR: A low-resource automatic speech recognition challenge for Indian languages was organized as part of Interspeech 2018; 109 submissions from 18 research groups were received and evaluated in terms of Word Error Rate on a blind test set.
Abstract: India has more than 1500 languages, with 30 of them spoken by more than one million native speakers. Most of them are low-resource and could greatly benefit from speech and language technologies. Building speech recognition support for these low-resource languages requires innovation in handling constraints on data size, while also exploiting the unique properties of and similarities among Indian languages. With this goal, we organized a low-resource Automatic Speech Recognition challenge for Indian languages as part of Interspeech 2018. We released 50 hours of speech data with transcriptions each for Tamil, Telugu and Gujarati, amounting to a total of 150 hours. Participants were required to use only the data we released for the challenge, to preserve the low-resource setting; however, they were not restricted to working on any particular aspect of the speech recognizer. We received 109 submissions from 18 research groups and evaluated the systems in terms of Word Error Rate on a blind test set. In this paper, we summarize the data, approaches and results of the challenge.
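Word Error Rate, the ranking metric mentioned above, is the word-level Levenshtein distance (substitutions, insertions and deletions) normalised by the reference length. A minimal sketch of the standard definition, not code released by the challenge:

```python
# Word Error Rate via dynamic-programming edit distance over words.

def wer(reference: str, hypothesis: str) -> float:
    ref, hyp = reference.split(), hypothesis.split()
    # dp[i][j] = edit distance between ref[:i] and hyp[:j]
    dp = [[0] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
    for i in range(len(ref) + 1):
        dp[i][0] = i
    for j in range(len(hyp) + 1):
        dp[0][j] = j
    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            sub = dp[i - 1][j - 1] + (ref[i - 1] != hyp[j - 1])
            dp[i][j] = min(sub, dp[i - 1][j] + 1, dp[i][j - 1] + 1)
    return dp[len(ref)][len(hyp)] / max(len(ref), 1)

print(wer("a b c d", "a x c"))  # 0.5: one substitution + one deletion over 4 words
```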

38 citations


Cites methods from "A Unified Parser for Developing Ind..."

  • ...For each language, we released two pronunciation lexicons created using the Festvox Indic frontend [9], which used a phoneset similar to SAMPA and IIT Madras’ Common Label Set [10] which used an IPA-based phoneset....


Proceedings Article DOI
20 Sep 2019
TL;DR: Subjective evaluations indicate that reasonably good-quality Indic TTSes can be developed using both approaches, emphasising the need to incorporate multilingual text processing in the end-to-end framework.
Abstract: Building text-to-speech (TTS) synthesisers is a difficult task, especially for low-resource languages. Language-specific modules need to be developed for system building. End-to-end speech synthesis has become a popular paradigm, as a TTS can be trained using only ⟨text, audio⟩ pairs. However, end-to-end speech synthesis is not scalable in a multi-language scenario, as the vocabulary increases with the number of different scripts. In this paper, TTSes are trained for Indian languages using two text representations: character-based and phone-based. For the character-based approach, a multi-language character map (MLCM) is proposed to easily train Indic speech synthesisers. The phone-based approach uses the common label set (CLS) representation for Indian languages. Both approaches leverage the similarities that exist among the languages. The advantage is a compact representation across multiple languages. Experiments are conducted by building TTSes using monolingual data and by pooling data across two languages. The ability to synthesise code-mixed text using the phone-based approach is also assessed. Subjective evaluations indicate that reasonably good-quality Indic TTSes can be developed using both approaches. This emphasises the need to incorporate multilingual text processing in the end-to-end framework.
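The "compact representation across multiple languages" can be sketched by exploiting the parallel layout of the Indic Unicode blocks (inherited from ISCII): a character can be replaced by its offset within its script block, so one shared symbol table serves all scripts. The block ranges below are real Unicode values, but the mapping itself is an illustrative assumption, not the paper's MLCM:

```python
# Sketch: collapse characters from parallel Indic Unicode blocks onto
# one shared ID space (character offset within its script block).

BLOCK_STARTS = {
    "devanagari": 0x0900,
    "bengali":    0x0980,
    "tamil":      0x0B80,
    "telugu":     0x0C00,
}

def to_shared_symbol(ch: str):
    """Map a character to its offset within its Indic block (shared ID)."""
    cp = ord(ch)
    for start in BLOCK_STARTS.values():
        if start <= cp < start + 0x80:  # each block spans 128 code points
            return cp - start
    return None  # not an Indic character (space, punctuation, ...)

# आ (Devanagari), ஆ (Tamil) and আ (Bengali) collapse to the same ID:
print([to_shared_symbol(c) for c in "आஆআ"])  # [6, 6, 6]
```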

25 citations


Cites methods from "A Unified Parser for Developing Ind..."

  • ...To obtain the phone-based representation from the input text, the unified parser for Indian languages is used [15]....


  • ...A unified parser is used to convert words in Indian languages to CLS representation [15]....


Proceedings Article DOI
06 Jun 2021
TL;DR: In this article, the authors explore the benefits of representing similar target subword units (e.g., Byte Pair Encoded (BPE) units) through a Common Label Set (CLS).
Abstract: In many Indian languages, written characters are organized on sound phonetic principles, and the ordering of characters is the same across many of them. However, while training conventional end-to-end (E2E) multilingual speech recognition systems, we treat characters or target subword units from different languages as separate entities. Since the visual rendering of these characters is different, in this paper, we explore the benefits of representing such similar target subword units (e.g., Byte Pair Encoded (BPE) units) through a Common Label Set (CLS). The CLS can be created very easily using automatic methods, since the ordering of characters is the same in many Indian languages. E2E models are trained using a transformer-based encoder-decoder architecture. During testing, given the Mel-filterbank features as input, the system outputs a sequence of BPE units in the CLS representation. Depending on the language, we then map the recognized CLS units back to the language-specific grapheme representation. Results show that models trained using CLS improve over the monolingual baseline and over a multilingual framework with separate symbols for each language. Similar experiments on a subset of the Voxforge dataset also confirm the benefits of CLS. An extension of this idea is to decode an unseen language (zero-resource) using a CLS-trained model.
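The final step described above, mapping recognised CLS units back to language-specific graphemes, might look as follows under an offset-based view of the parallel Indic Unicode blocks (an illustration; the paper's automatically created CLS may differ in detail):

```python
# Sketch: render decoded CLS units (block offsets) in a target script.

BLOCK_STARTS = {"devanagari": 0x0900, "tamil": 0x0B80, "telugu": 0x0C00}

def cls_to_grapheme(offset: int, language: str) -> str:
    """Render one CLS unit (block offset) in the requested script."""
    return chr(BLOCK_STARTS[language] + offset)

# The same decoded CLS sequence rendered in two scripts:
decoded = [0x15, 0x2E]  # hypothetical CLS offsets (here: ka, ma)
for lang in ("devanagari", "tamil"):
    print(lang, "".join(cls_to_grapheme(o, lang) for o in decoded))
# devanagari कम
# tamil கம
```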

16 citations

Proceedings Article DOI
02 Sep 2018
TL;DR: Sub-space Gaussian mixture models and recurrent neural networks trained with the connectionist temporal classification (CTC) objective function are explored for training joint acoustic models; the joint acoustic model trained with RNN-CTC performed better than monolingual models, owing to efficient data sharing across the languages.
Abstract: India being a multilingual society, a multilingual automatic speech recognition (ASR) system is widely appreciated. Despite different orthographies, Indian languages share the same phonetic space. To exploit this property, a joint acoustic model has been trained for developing a multilingual ASR system using a common phone-set. Three Indian languages, namely Telugu, Tamil and Gujarati, are considered for the study. This work studies the amenability of two different acoustic modeling approaches for training a joint acoustic model using the common phone-set. Sub-space Gaussian mixture models (SGMM) and recurrent neural networks (RNN) trained with the connectionist temporal classification (CTC) objective function are explored for training joint acoustic models. From the experimental results, it can be observed that the joint acoustic models trained with RNN-CTC performed better than the SGMM system, even on 120 hours of data (approximately 40 hours per language). The joint acoustic model trained with RNN-CTC also performed better than monolingual models, owing to efficient data sharing across the languages. Conditioning the joint model on language identity had a minimal advantage. Sub-sampling the features by a factor of 2 while training RNN-CTC models reduced training times and improved performance.
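The data sharing that a common phone-set enables can be sketched as lexicon pooling: identical (word, phone-sequence) entries from different languages collapse into one, and a single CTC output layer is sized from the union of phones. The lexicon entries below are invented placeholders, not the paper's data:

```python
# Sketch: pool per-language lexicons over a common phone-set and size
# a single CTC output layer from the union of phones.

lexicons = {
    "telugu":   {"amma": ["a", "m", "m", "a"]},
    "tamil":    {"amma": ["a", "m", "m", "a"]},
    "gujarati": {"ghar": ["gh", "a", "r"]},
}

# Identical (word, phone-sequence) entries collapse into one, which is
# exactly how data is shared across the languages.
joint_lexicon = {}
for lang, lex in lexicons.items():
    for word, phones in lex.items():
        joint_lexicon.setdefault((word, tuple(phones)), []).append(lang)

phone_set = sorted({p for (_, phones) in joint_lexicon for p in phones})
print(len(phone_set) + 1)  # CTC output dimension: phones + blank
print(joint_lexicon)       # 'amma' is shared by Telugu and Tamil
```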

15 citations


Cites background or methods from "A Unified Parser for Developing Ind..."

  • ...The transcriptions from training utterances in IT3-format have been used to train a trigram language model....


  • ...A parser to convert utf8 to IT3 [29] has been used to convert the text to the IT3-format [7]....


  • ...As Indian languages are syllabic in nature, the pronunciation models could be generated from a simple rule-based parser [6, 7, 8, 9]....


  • ...The pronunciation model contains unique words from all the three languages in IT3-format and the corresponding phone sequences....


  • ...IT3-format or any other language-independent mapping which could map the words in different languages with the same phone sequence as a single entity would be more beneficial in training a multilingual ASR....

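The excerpts above mention training a trigram language model on IT3-format transcriptions. A minimal count-based sketch of that step (real systems would add smoothing, e.g. Kneser-Ney via a toolkit such as SRILM or KenLM); the romanised transcriptions are hypothetical:

```python
# Sketch: collect trigram counts from transcriptions, the raw material
# for a count-based trigram language model.

from collections import Counter

def trigram_counts(transcriptions: list[str]) -> Counter:
    counts = Counter()
    for line in transcriptions:
        words = ["<s>", "<s>"] + line.split() + ["</s>"]
        for i in range(len(words) - 2):
            counts[tuple(words[i:i + 3])] += 1
    return counts

# Hypothetical IT3-like romanised transcriptions, for illustration only.
counts = trigram_counts(["naan viittukku pookireen", "naan viittukku vandheen"])
print(counts[("<s>", "naan", "viittukku")])  # 2
```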

Proceedings Article DOI
20 Aug 2017
TL;DR: This paper capitalises on the ability of robust acoustic modeling techniques such as deep neural networks (DNN) and convolutional deep neural networks (CNN) for acoustic modeling, and uses signal processing cues to correct the segment boundaries obtained using DNN-HMM/CNN-HMM segmentation.
Abstract: Automatic detection of phoneme boundaries is an important sub-task in building speech processing applications, especially text-to-speech synthesis (TTS) systems. The main drawback of the Gaussian mixture model hidden Markov model (GMM-HMM) based forced-alignment is that the phoneme boundaries are not explicitly modeled. In an earlier work, we had proposed the use of signal processing cues in tandem with GMM-HMM based forced alignment for boundary correction for building Indian language TTS systems. In this paper, we capitalise on the ability of robust acoustic modeling techniques such as deep neural networks (DNN) and convolutional deep neural networks (CNN) for acoustic modeling. The GMM-HMM based forced alignment is replaced by DNN-HMM/CNN-HMM based forced alignment. Signal processing cues are used to correct the segment boundaries obtained using DNN-HMM/CNN-HMM segmentation. TTS systems built using these boundaries show a relative improvement in synthesis quality.
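The boundary-correction idea, moving a forced-alignment boundary to the strongest spectral-change point in a small window around it, can be sketched as follows. Frame-to-frame spectral flux is used here as one plausible cue; the paper employs its own signal processing cues:

```python
# Sketch: snap a forced-alignment boundary to the nearest spectral-flux
# peak within a small search window.

import numpy as np

def correct_boundary(spectrogram: np.ndarray, frame: int, window: int = 5) -> int:
    """spectrogram: (num_frames, num_bins). Returns the corrected frame index."""
    # Half-wave-rectified frame-to-frame spectral flux.
    flux = np.sum(np.maximum(np.diff(spectrogram, axis=0), 0.0) ** 2, axis=1)
    lo = max(frame - window, 0)
    hi = min(frame + window, len(flux))
    return lo + int(np.argmax(flux[lo:hi]))

# Usage: refine each HMM boundary with the local spectral-flux peak.
spec = np.abs(np.random.randn(100, 40))  # stand-in for a real spectrogram
print(correct_boundary(spec, frame=50))
```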

14 citations


Cites methods from "A Unified Parser for Developing Ind..."

  • ...For grapheme to phoneme conversion of the native text, a unified parser for Indian languages is used [17]....


References
Journal Article DOI
TL;DR: This paper summarizes an approach to synthesizing decision trees that has been used in a variety of systems and describes one such system, ID3, in detail; a reported shortcoming of the basic algorithm is also discussed.
Abstract: The technology for building knowledge-based systems by inductive inference from examples has been demonstrated successfully in several practical applications. This paper summarizes an approach to synthesizing decision trees that has been used in a variety of systems, and it describes one such system, ID3, in detail. Results from recent studies show ways in which the methodology can be modified to deal with information that is noisy and/or incomplete. A reported shortcoming of the basic algorithm is discussed and two means of overcoming it are compared. The paper concludes with illustrations of current research directions.
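The core of ID3 as summarised above is the choice, at each node, of the attribute with maximal information gain (entropy reduction). A compact, self-contained sketch of that criterion:

```python
# Sketch: entropy and information gain, the attribute-selection
# criterion at the heart of ID3.

import math
from collections import Counter

def entropy(labels):
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())

def information_gain(rows, labels, attr):
    """rows: list of dicts; attr: key to split on."""
    total = entropy(labels)
    for value in {r[attr] for r in rows}:
        subset = [l for r, l in zip(rows, labels) if r[attr] == value]
        total -= len(subset) / len(labels) * entropy(subset)
    return total

rows = [{"outlook": "sunny"}, {"outlook": "sunny"}, {"outlook": "rain"}]
labels = ["no", "no", "yes"]
print(information_gain(rows, labels, "outlook"))  # ~0.918: a perfect split here
```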

17,177 citations

Proceedings Article DOI
05 Jun 2000
TL;DR: A speech parameter generation algorithm for HMM-based speech synthesis, in which the speech parameter sequence is generated from HMMs whose observation vector consists of a spectral parameter vector and its dynamic feature vectors, is derived.
Abstract: This paper derives a speech parameter generation algorithm for HMM-based speech synthesis, in which the speech parameter sequence is generated from HMMs whose observation vector consists of a spectral parameter vector and its dynamic feature vectors. In the algorithm, we assume that the state sequence (state and mixture sequence for the multi-mixture case) or a part of the state sequence is unobservable (i.e., hidden or latent). As a result, the algorithm iterates the forward-backward algorithm and the parameter generation algorithm for the case where the state sequence is given. Experimental results show that by using the algorithm, we can reproduce clear formant structure from multi-mixture HMMs as compared with that produced from single-mixture HMMs.
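For the case where the state sequence is given, the generated trajectory is the solution of a weighted least-squares problem: the static parameters c satisfy (WᵀU⁻¹W)c = WᵀU⁻¹μ, where W stacks the identity and the delta window and U is the diagonal covariance of the static and dynamic features. A one-dimensional numpy sketch of this step (the hidden-state case adds the forward-backward iteration described above); the delta window used here is one common choice:

```python
# Sketch: speech parameter generation given per-frame static/delta
# means and variances (1-D features for brevity).

import numpy as np

def mlpg(mu_static, mu_delta, var_static, var_delta):
    T = len(mu_static)
    I = np.eye(T)
    D = np.zeros((T, T))  # delta window: delta[t] = 0.5*(c[t+1] - c[t-1])
    for t in range(T):
        if t > 0:
            D[t, t - 1] = -0.5
        if t < T - 1:
            D[t, t + 1] = 0.5
    W = np.vstack([I, D])                                    # (2T, T)
    U_inv = np.diag(np.concatenate([1 / var_static, 1 / var_delta]))
    mu = np.concatenate([mu_static, mu_delta])
    # Solve (W' U^-1 W) c = W' U^-1 mu for the static trajectory c.
    return np.linalg.solve(W.T @ U_inv @ W, W.T @ U_inv @ mu)

# Two "states": the delta constraints pull the trajectory into a smooth transition.
c = mlpg(np.array([0., 0., 1., 1.]), np.zeros(4), np.full(4, 1.0), np.full(4, 0.1))
print(c.round(3))
```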

1,071 citations


"A Unified Parser for Developing Ind..." refers methods in this paper

  • ...Hidden Markov model (HMM), a statistical parametric based approach which is found effective in synthesizing speech, is employed here [12]....


Proceedings Article
01 May 2000
TL;DR: An outline of the LinGO English grammar and LKB system is given, and the ways in which they are currently being used are discussed; the technology supports collaborative development on many levels.
Abstract: The LinGO (Linguistic Grammars Online) project’s English Resource Grammar and the LKB grammar development environment are language resources which are freely available for download for any purpose, including commercial use (see http://lingo.stanford.edu). Executable programs and source code are both included. In this paper, we give an outline of the LinGO English grammar and LKB system, and discuss the ways in which they are currently being used. The grammar and processing system can be used independently or combined to give a central component which can be exploited in a variety of ways. Our intention in writing this paper is to encourage more people to use the technology, which supports collaborative development on many levels.

307 citations


"A Unified Parser for Developing Ind..." refers background in this paper

  • ...Parsers that work for more than one language focus on structurally related languages such as English and French or English and German [1]....


Book
01 Oct 1992

285 citations


"A Unified Parser for Developing Ind..." refers methods in this paper

  • ...Lex and Yacc [4] stand in good stead for building rule-based language parsers, as these employ a rule-based method for token matching....


01 Jan 2013
TL;DR: A uniform HMM framework for building speech synthesisers is proposed; the common phoneset and common question set are used to build HTS-based systems for six Indian languages, namely Hindi, Marathi, Bengali, Tamil, Telugu and Malayalam.
Abstract: State-of-the-art approaches to speech synthesis are unit selection based concatenative speech synthesis (USS) and hidden Markov model based text-to-speech synthesis (HTS). The former is based on waveform concatenation of subword units, while the latter is based on generation of an optimal parameter sequence from subword HMMs. The quality of an HMM-based synthesiser in the HTS framework crucially depends on an accurate description of the phoneset and an accurate description of the question set for clustering of the phones. Given the number of Indian languages, building an HTS system for every language is time consuming. Exploiting the properties of Indian languages, a uniform HMM framework for building speech synthesisers is proposed. Apart from the speech and text data used, the tasks involved in building a synthesis system can be made language-independent. A language-independent common phone set is first derived; sounds that are similar share similar articulatory descriptions. The common phoneset and common question set are used to build HTS-based systems for six Indian languages, namely Hindi, Marathi, Bengali, Tamil, Telugu and Malayalam. Mean opinion score (MOS) is used to evaluate the systems. An average MOS of 3.0 for naturalness and 3.4 for intelligibility is obtained across all languages.
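The language-independent question set mentioned above can, in principle, be generated mechanically from articulatory classes of the common phone set. A sketch in the spirit of HTS question files; the class names, members and pattern templates are illustrative assumptions, not the paper's actual inventory:

```python
# Sketch: emit decision-tree questions from articulatory classes of a
# common phone set (HTS-style QS lines; patterns simplified).

CLASSES = {
    "nasal":      ["m", "n", "ng"],
    "retroflex":  ["tx", "dx", "nx"],
    "long_vowel": ["aa", "ii", "uu"],
}

def make_questions(classes: dict) -> list:
    """One question per (context position, articulatory class)."""
    questions = []
    for name, phones in classes.items():
        for pos, pattern in (("L", "%s-*"), ("C", "*-%s+*"), ("R", "*+%s")):
            alts = ",".join(pattern % p for p in phones)
            questions.append('QS "%s_%s" {%s}' % (pos, name, alts))
    return questions

for q in make_questions(CLASSES)[:3]:
    print(q)
```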

74 citations


"A Unified Parser for Developing Ind..." refers background in this paper

  • ...The acoustic similarity among the same set of phones of different languages suggests the possibility of a compact and common set of labels [10,11]....


  • ...The notations of labels and rules for mapping are detailed in [10]....
