F
Fei Ren
Publications - 5
Citations - 1203
Fei Ren is an academic researcher. The author has contributed to research in topics: Encoder & Speech synthesis. The author has an hindex of 4, co-authored 5 publications receiving 993 citations.
Papers
More filters
Posted Content
Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis
Yuxuan Wang,Daisy Stanton,Yu Zhang,RJ Skerry-Ryan,Eric Battenberg,Joel Shor,Ying Xiao,Fei Ren,Ye Jia,Rif A. Saurous +9 more
TL;DR: "global style tokens" (GSTs), a bank of embeddings that are jointly trained within Tacotron, a state-of-the-art end-to-end speech synthesis system, learn to factorize noise and speaker identity, providing a path towards highly scalable but robust speech synthesis.
Proceedings Article
Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis
Yuxuan Wang,Daisy Stanton,Yu Zhang,RJ Skerry-Ryan,Eric Battenberg,Joel Shor,Ying Xiao,Fei Ren,Ye Jia,Rif A. Saurous +9 more
TL;DR: In this article, a bank of embeddings that are jointly trained within Tacotron, a state-of-the-art end-to-end speech synthesis system, is proposed.
Proceedings Article
Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis
Ye Jia,Yu Zhang,Ron Weiss,Quan Wang,Jonathan Shen,Fei Ren,Zhifeng Chen,Patrick Nguyen,Ruoming Pang,Ignacio Lopez Moreno,Yonghui Wu +10 more
TL;DR: In this article, a neural network-based system for text-to-speech (TTS) synthesis that is able to generate speech audio in the voice of many different speakers, including those unseen during training is presented.
Posted Content
Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis
Ye Jia,Yu Zhang,Ron Weiss,Quan Wang,Jonathan Shen,Fei Ren,Zhifeng Chen,Patrick Nguyen,Ruoming Pang,Ignacio Lopez Moreno,Yonghui Wu +10 more
TL;DR: In this paper, a neural network-based system for text-to-speech (TTS) synthesis that is able to generate speech audio in the voice of many different speakers, including those unseen during training is presented.