Book ChapterDOI
A Unified Parser for Developing Indian Language Text to Speech Synthesizers
Arun Baby,N L Nishanthi,Anju Leela Thomas,Hema A. Murthy +3 more
- pp 514-521
Reads0
Chats0
TLDR
The design of a language independent parser for text-to-speech synthesis in Indian languages is described and the accuracy of the phoneme sequences generated by the proposed parser is more accurate than that of language specific parsers.Abstract:
This paper describes the design of a language independent parser for text-to-speech synthesis in Indian languages. Indian languages come from 5–6 different language families of the world. Most Indian languages have their own scripts. This makes parsing for text to speech systems for Indian languages a difficult task. In spite of the number of different families which leads to divergence, there is a convergence owing to borrowings across language families. Most importantly Indian languages are more or less phonetic and can be considered to consist broadly of about 35–38 consonants and 15–18 vowels. In this paper, an attempt is made to unify the languages based on this broad list of phones. A common label set is defined to represent the various phones in Indian languages. A uniform parser is designed across all the languages capitalising on the syllable structure of Indian languages. The proposed parser converts UTF-8 text to common label set, applies letter-to-sound rules and generates the corresponding phoneme sequences. The parser is tested against the custom-built parsers for multiple Indian languages. The TTS results show that the accuracy of the phoneme sequences generated by the proposed parser is more accurate than that of language specific parsers.read more
Citations
More filters
Proceedings ArticleDOI
Interspeech 2018 Low Resource Automatic Speech Recognition Challenge for Indian Languages.
Brij Mohan Lal Srivastava,Sunayana Sitaram,Rupesh Kumar Mehta,Krishna Doss Mohan,Pallavi Matani,Sandeepkumar Satpal,Kalika Bali,Radhakrishnan Srikanth,Niranjan S. Nayak +8 more
TL;DR: A low-resource Automatic Speech Recognition challenge for Indian languages as part of Interspeech 2018, which received 109 submissions from 18 research groups and evaluated the systems in terms of Word Error Rate on a blind test set.
Proceedings ArticleDOI
Building Multilingual End-to-End Speech Synthesisers for Indian Languages
TL;DR: Subjective evaluations indicate that reasonably good quality Indic TTSes can be developed using both approaches, which emphasises the need to incorporate multilingual text processing in the end-to-end framework.
Proceedings ArticleDOI
Exploring the use of Common Label Set to Improve Speech Recognition of Low Resource Indian Languages
TL;DR: In this article, the authors explore the benefits of representing similar target subword units (e.g., Byte Pair Encoded(BPE) units) through a Common Label Set (CLS).
Proceedings ArticleDOI
An Exploration towards Joint Acoustic Modeling for Indian Languages: IIIT-H Submission for Low Resource Speech Recognition Challenge for Indian Languages, INTERSPEECH 2018.
TL;DR: The joint acoustic model trained with RNN-CTC has performed better than monolingual models, due to an efficient data sharing across the languages, andSub-space Gaussian mixture models, and recurrent neural networks trained with connectionst temporal classification (CTC) objective function are explored for training joint acoustic models.
Proceedings ArticleDOI
Deep learning techniques in tandem with signal processing cues for phonetic segmentation for text to speech synthesis in Indian languages
TL;DR: This paper capitalise on the ability of robust acoustic modeling techniques such as deep neural networks (DNN) and convolutional deep neural Networks (CNN) for acoustic modeling to correct the segment boundaries obtained using DNN-HMM/CNN- HMM segmentation.
References
More filters
Journal ArticleDOI
Induction of Decision Trees
TL;DR: In this paper, an approach to synthesizing decision trees that has been used in a variety of systems, and it describes one such system, ID3, in detail, is described, and a reported shortcoming of the basic algorithm is discussed.
Proceedings ArticleDOI
Speech parameter generation algorithms for HMM-based speech synthesis
TL;DR: A speech parameter generation algorithm for HMM-based speech synthesis, in which the speech parameter sequence is generated from HMMs whose observation vector consists of a spectral parameter vector and its dynamic feature vectors, is derived.
Proceedings Article
An Open Source Grammar Development Environment and Broad-coverage English Grammar Using HPSG
Ann Copestake,Dan Flickinger +1 more
TL;DR: An outline of the LinGO English grammar and LKB system is given, and the ways in which they are currently being used are discussed, which supports collaborative development on many levels.
A common attribute based unified HTS framework for speech synthesis in Indian languages.
B. Ramani,S. Lilly Christina,Rachel G. Anushiya,V. Sherlin Solomi,Mahesh Kumar Nandwana,Anusha Prakash,S. Aswin Shanmugam,Raghava Krishnan,S. Kishore Prahalad,K. Samudravijaya,P. Vijayalakshmi,T. Nagarajan,Hema A. Murthy +12 more
TL;DR: The common phoneset and common question set are used to build HTS based systems for six Indian languages, namely, Hindi, Marathi, Bengali, Tamil, Telugu and Malayalam, and a uniform HMM framework for building speech synthesisers is proposed.