Prosody modeling for syllable based text-to-speech synthesis using feedforward neural networks

doi:10.1016/J.NEUCOM.2015.07.053

Journal ArticleDOI

Prosody modeling for syllable based text-to-speech synthesis using feedforward neural networks

V. Ramu Reddy, +1 more

- 01 Jan 2016 -

Neurocomputing

- Vol. 171, pp 1323-1334

Chats0

TLDR

Feature related to the linguistic and the production constraints are proposed for modeling the prosodic parameters such as duration, intonation and intensities of the syllables for text-to-speech synthesis (TTS) system.

About:

This article is published in Neurocomputing.The article was published on 2016-01-01. It has received 19 citations till now. The article focuses on the topics: Prosody & Feedforward neural network.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Handbook of the International Phonetic Association: A Guide to the Use of the International Phonetic Alphabet

Edward J. Vajda

- 01 Dec 2000 -

Language

Journal ArticleDOI

An efficient adaptive artificial neural network based text to speech synthesizer for Hindi language

Ruchika Kumari, +2 more

- 09 Apr 2021 -

Multimedia Tools and Applications

TL;DR: In this article, a text to speech synthesizer for the Hindi language is proposed, which relies on the coefficients of Mel-frequency cepstral (MFCC) features extracted to the production and linguistic constraints proposed for modeling the parameters such as intonation, duration, and syllable intensities.

...read moreread less

Journal ArticleDOI

The Effects of Modulating Fundamental Frequency and Speech Rate on the Intelligibility, Communication Efficiency, and Perceived Naturalness of Synthetic Speech.

Jennifer M. Vojtech, +3 more

- 15 Jul 2019 -

American Journal of Speech-language Path...

TL;DR: Sentence-level f0 variation increased naturalness ratings of synthesized speech, whether the variation was prosodically natural or not, which may impact future speech synthesis designs.

...read moreread less

Proceedings ArticleDOI

Mongolian prosodic phrase prediction using suffix segmentation

Rui Liu, +3 more

TL;DR: The experimental results show that the proposed method has significantly enhanced the performance of the Mongolian prosodic phrase prediction system through comparing with the conventional method that treats Mongolian word as token directly.

...read moreread less

Journal ArticleDOI

Phone-Level Prosody Modelling With GMM-Based MDN for Diverse and Controllable Speech Synthesis

Chenpeng Du, +1 more

- 01 Jan 2022 -

IEEE/ACM transactions on audio, speech, ...

TL;DR: This work proposes a novel approach that models phone-level prosodies with a GMM-based mixture density network(MDN) and then extends it for multi-speaker TTS using speaker adaptation transforms of Gaussian means and variances and shows that it can clone the prosodies from a reference speech by sampling prosody from the Gaussian components that produce the reference prosodies.

...read moreread less

References

PDF

Open Access

More filters

Journal ArticleDOI

LIBSVM: A library for support vector machines

Chih-Chung Chang, +1 more

- 06 May 2011 -

ACM Transactions on Intelligent Systems ...

TL;DR: Issues such as solving SVM optimization problems theoretical convergence multiclass classification probability estimates and parameter selection are discussed in detail.

...read moreread less

Book

Neural Networks: A Comprehensive Foundation

Simon Haykin

TL;DR: Thorough, well-organized, and completely up to date, this book examines all the important aspects of this emerging technology, including the learning process, back-propagation learning, radial-basis function networks, self-organizing systems, modular networks, temporal processing and neurodynamics, and VLSI implementation of neural networks.

...read moreread less

Journal ArticleDOI

Extreme Learning Machine for Regression and Multiclass Classification

Guang-Bin Huang, +3 more

TL;DR: ELM provides a unified learning platform with a widespread type of feature mappings and can be applied in regression and multiclass classification applications directly and in theory, ELM can approximate any target continuous function and classify any disjoint regions.

...read moreread less

Book

Artificial Neural Networks

B. Yegnanarayana

TL;DR: artificial neural networks, artificial neural networks , مرکز فناوری اطلاعات و اصاع رسانی, کδاوρزی

...read moreread less

New Delhi, India

perbosc

TL;DR: In this article, the authors present a survey of Indian cities in terms of Latitude (°) +N/-SLongitude (−) +E/-W New Delhi India C V Raman Science Club 28.616 77.195

...read moreread less