scispace - formally typeset
Book ChapterDOI

A Unified Parser for Developing Indian Language Text to Speech Synthesizers

Reads0
Chats0
TLDR
The design of a language independent parser for text-to-speech synthesis in Indian languages is described and the accuracy of the phoneme sequences generated by the proposed parser is more accurate than that of language specific parsers.
Abstract
This paper describes the design of a language independent parser for text-to-speech synthesis in Indian languages. Indian languages come from 5–6 different language families of the world. Most Indian languages have their own scripts. This makes parsing for text to speech systems for Indian languages a difficult task. In spite of the number of different families which leads to divergence, there is a convergence owing to borrowings across language families. Most importantly Indian languages are more or less phonetic and can be considered to consist broadly of about 35–38 consonants and 15–18 vowels. In this paper, an attempt is made to unify the languages based on this broad list of phones. A common label set is defined to represent the various phones in Indian languages. A uniform parser is designed across all the languages capitalising on the syllable structure of Indian languages. The proposed parser converts UTF-8 text to common label set, applies letter-to-sound rules and generates the corresponding phoneme sequences. The parser is tested against the custom-built parsers for multiple Indian languages. The TTS results show that the accuracy of the phoneme sequences generated by the proposed parser is more accurate than that of language specific parsers.

read more

Citations
More filters
Proceedings ArticleDOI

Interspeech 2018 Low Resource Automatic Speech Recognition Challenge for Indian Languages.

TL;DR: A low-resource Automatic Speech Recognition challenge for Indian languages as part of Interspeech 2018, which received 109 submissions from 18 research groups and evaluated the systems in terms of Word Error Rate on a blind test set.
Proceedings ArticleDOI

Building Multilingual End-to-End Speech Synthesisers for Indian Languages

TL;DR: Subjective evaluations indicate that reasonably good quality Indic TTSes can be developed using both approaches, which emphasises the need to incorporate multilingual text processing in the end-to-end framework.
Proceedings ArticleDOI

Exploring the use of Common Label Set to Improve Speech Recognition of Low Resource Indian Languages

TL;DR: In this article, the authors explore the benefits of representing similar target subword units (e.g., Byte Pair Encoded(BPE) units) through a Common Label Set (CLS).
Proceedings ArticleDOI

An Exploration towards Joint Acoustic Modeling for Indian Languages: IIIT-H Submission for Low Resource Speech Recognition Challenge for Indian Languages, INTERSPEECH 2018.

TL;DR: The joint acoustic model trained with RNN-CTC has performed better than monolingual models, due to an efficient data sharing across the languages, andSub-space Gaussian mixture models, and recurrent neural networks trained with connectionst temporal classification (CTC) objective function are explored for training joint acoustic models.
Proceedings ArticleDOI

Deep learning techniques in tandem with signal processing cues for phonetic segmentation for text to speech synthesis in Indian languages

TL;DR: This paper capitalise on the ability of robust acoustic modeling techniques such as deep neural networks (DNN) and convolutional deep neural Networks (CNN) for acoustic modeling to correct the segment boundaries obtained using DNN-HMM/CNN- HMM segmentation.
References
More filters
Journal ArticleDOI

Induction of Decision Trees

J. R. Quinlan
- 25 Mar 1986 - 
TL;DR: In this paper, an approach to synthesizing decision trees that has been used in a variety of systems, and it describes one such system, ID3, in detail, is described, and a reported shortcoming of the basic algorithm is discussed.
Proceedings ArticleDOI

Speech parameter generation algorithms for HMM-based speech synthesis

TL;DR: A speech parameter generation algorithm for HMM-based speech synthesis, in which the speech parameter sequence is generated from HMMs whose observation vector consists of a spectral parameter vector and its dynamic feature vectors, is derived.
Proceedings Article

An Open Source Grammar Development Environment and Broad-coverage English Grammar Using HPSG

TL;DR: An outline of the LinGO English grammar and LKB system is given, and the ways in which they are currently being used are discussed, which supports collaborative development on many levels.
Book

lex & yacc

Tony Mason, +1 more

A common attribute based unified HTS framework for speech synthesis in Indian languages.

TL;DR: The common phoneset and common question set are used to build HTS based systems for six Indian languages, namely, Hindi, Marathi, Bengali, Tamil, Telugu and Malayalam, and a uniform HMM framework for building speech synthesisers is proposed.
Related Papers (5)