Journal Article

Prosody modeling for syllable based text-to-speech synthesis using feedforward neural networks

TLDR
Features related to linguistic and production constraints are proposed for modeling prosodic parameters such as the duration, intonation, and intensity of syllables for a text-to-speech (TTS) synthesis system.
About
This article was published in Neurocomputing on 2016-01-01. It has received 19 citations to date. The article focuses on the topics: Prosody & Feedforward neural network.


Citations
Journal Article

An efficient adaptive artificial neural network based text to speech synthesizer for Hindi language

TL;DR: In this article, a text-to-speech synthesizer for the Hindi language is proposed, which relies on Mel-frequency cepstral coefficient (MFCC) features and on the production and linguistic constraints proposed for modeling parameters such as intonation, duration, and syllable intensity.
Journal Article

The Effects of Modulating Fundamental Frequency and Speech Rate on the Intelligibility, Communication Efficiency, and Perceived Naturalness of Synthetic Speech.

TL;DR: Sentence-level f0 variation increased naturalness ratings of synthesized speech, whether the variation was prosodically natural or not, which may impact future speech synthesis designs.
Proceedings Article

Mongolian prosodic phrase prediction using suffix segmentation

TL;DR: The experimental results show that the proposed method significantly improves the performance of the Mongolian prosodic phrase prediction system compared with the conventional method, which treats each Mongolian word directly as a token.
Journal Article

Phone-Level Prosody Modelling With GMM-Based MDN for Diverse and Controllable Speech Synthesis

TL;DR: This work proposes a novel approach that models phone-level prosodies with a GMM-based mixture density network (MDN), then extends it to multi-speaker TTS using speaker adaptation transforms of the Gaussian means and variances. It also shows that the prosodies of a reference utterance can be cloned by sampling from the Gaussian components that produce the reference prosodies.
References
Journal Article

LIBSVM: A library for support vector machines

TL;DR: Issues such as solving SVM optimization problems, theoretical convergence, multiclass classification, probability estimates, and parameter selection are discussed in detail.
Book

Neural Networks: A Comprehensive Foundation

Simon Haykin
TL;DR: Thorough, well-organized, and completely up to date, this book examines all the important aspects of this emerging technology, including the learning process, back-propagation learning, radial-basis function networks, self-organizing systems, modular networks, temporal processing and neurodynamics, and VLSI implementation of neural networks.
Journal Article

Extreme Learning Machine for Regression and Multiclass Classification

TL;DR: ELM provides a unified learning platform with a wide range of feature mappings and can be applied directly in regression and multiclass classification applications; in theory, ELM can approximate any target continuous function and classify any disjoint regions.
Book

Artificial Neural Networks

