How to convert normal voice to robotic voice?

Papers (9)	Insight
Proceedings Article. Phonetic posteriorgrams for many-to-one voice conversion without parallel data training. Lifa Sun, Kun Li, Hao Wang, Shiyin Kang, Helen Meng. 11 Jul 2016. 296 citations.	This paper proposes a novel approach to voice conversion with non-parallel training data.
Open access, Proceedings Article. Voice conversion based on topological feature maps and time-variant filtering. A. Rinscheid. 03 Oct 1996. 7 citations.	Presents a new voice conversion algorithm.
Proceedings Article. On the Study of Generative Adversarial Networks for Cross-Lingual Voice Conversion. Berrak Sisman, Mingyang Zhang, Minghui Dong, Haizhou Li. 01 Dec 2019. 32 citations.	In the experiments, we achieve high-quality converted voice, that performs equally well or better than mono-lingual voice conversion.
Proceedings Article. Voice conversion based on Genetic Algorithms. Chen Zhi, Zhang Linghua. 01 Nov 2010. 4 citations.	This capability is well suited to voice conversion.
Proceedings Article. Artificial Empathy in Social Robots: An analysis of Emotions in Speech. Jesin James, Catherine Watson, Bruce A. MacDonald. 01 Aug 2018. 32 citations.	The results show that humans are able to perceive empathy and emotions in robot speech, and prefer it over the standard robotic voice.
Book Chapter. Pitch synchronous transform warping in voice conversion. Robert Vích, Martin Vondra. 21 Feb 2011. 6 citations.	The proposed voice conversion procedure results in speech with high naturalness.
Proceedings Article. Emotional Voice Conversion Using Neural Networks with Different Temporal Scales of F0 based on Wavelet Transform. Zhaojie Luo, Tetsuya Takiguchi, Yasuo Ariki. 13 Sep 2016. 12 citations.	By utilizing these approaches, the proposed method can change the spectrum and the prosody for an emotional voice at the same time, and was able to outperform other state-of-the-art methods for emotional voice conversion.
Open access, Journal Article. Non-Parallel Sequence-to-Sequence Voice Conversion With Disentangled Linguistic and Speaker Representations. Jing-Xuan Zhang, Zhen-Hua Ling, Li-Rong Dai. 01 Jan 2020, IEEE Transactions on Audio, Speech, and Language Processing. 87 citations.	Experimental results showed that our method obtained higher similarity and naturalness than the best non-parallel voice conversion method in Voice Conversion Challenge 2018.
Open access, Book Chapter. Voice transformation by mapping the features at syllable level. K. Sreenivasa Rao, Rabul Hussain Laskar, Shashidhar G. Koolagudi. 18 Dec 2007. 16 citations.	The results of the listening tests indicate that the proposed voice transformation provides better mapping of the voice characteristics compared to the earlier method proposed by the author.

How can TTS be made less robotic?
5 answers
Recent research proposes several ways to make text-to-speech (TTS) sound less robotic. One is to use non-autoregressive models such as Parallel Tacotron, which adds a variational autoencoder-based residual encoder to improve naturalness while keeping synthesis parallelizable. Another is to replace the attention mechanism with an explicit duration predictor, as in Non-Attentive Tacotron, which gives better control over duration and makes the system more robust. A third is to use syntactic parse trees to organize inter-phrase and inter-word information, which has been shown to improve pronunciation clarity, prosody, and perceived naturalness in synthesized speech, as demonstrated in a study by Guo et al. Combining techniques like these can yield higher-quality, more human-like speech output.
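The explicit-duration idea in the answer above can be illustrated with a small sketch. It assumes the simplest possible scheme, hard repetition of each token's encoder vector for its predicted number of frames; the function names `length_regulate` and `scale_durations` and the toy values are hypothetical, and published systems may use smoother upsampling than this.

```python
from typing import List, Sequence


def scale_durations(durations: Sequence[int], stretch: float) -> List[int]:
    """Scale per-token frame counts; stretch > 1 slows speech, < 1 speeds it up."""
    if stretch < 0:
        raise ValueError("stretch must be non-negative")
    return [max(0, round(d * stretch)) for d in durations]


def length_regulate(
    token_vectors: Sequence[Sequence[float]],
    durations: Sequence[int],
) -> List[List[float]]:
    """Repeat each token vector durations[i] times to get a frame-level sequence.

    This stands in for attention: the alignment between tokens and output
    frames is fixed by the durations instead of being learned step by step.
    """
    if len(token_vectors) != len(durations):
        raise ValueError("need exactly one duration per token")
    frames: List[List[float]] = []
    for vector, count in zip(token_vectors, durations):
        if count < 0:
            raise ValueError("durations must be non-negative")
        frames.extend(list(vector) for _ in range(count))
    return frames


# Toy example: three tokens with made-up 2-dimensional encoder outputs.
tokens = [[0.1, 0.2], [0.3, 0.4], [0.5, 0.6]]
durations = [2, 0, 3]  # assumed output of a duration predictor

frames = length_regulate(tokens, durations)
slow_frames = length_regulate(tokens, scale_durations(durations, 2.0))
```

Because the durations are plain numbers, speaking rate can be changed by scaling them before upsampling, which is one sense in which this design offers direct control over duration.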
