scispace - formally typeset
Journal ArticleDOI

An overview of voice conversion systems

Reads0
Chats0
TLDR
An overview of real-world applications of VC systems, extensively study existing systems proposed in the literature, and discuss remaining challenges are provided.
About
This article is published in Speech Communication.The article was published on 2017-04-01. It has received 232 citations till now. The article focuses on the topics: Voice analysis & Voice activity detection.

read more

Citations
More filters
Proceedings ArticleDOI

The Voice Conversion Challenge 2018: Promoting Development of Parallel and Nonparallel Methods

TL;DR: A brief summary of the state-of-the-art techniques for VC is presented, followed by a detailed explanation of the challenge tasks and the results that were obtained.
Journal ArticleDOI

An Overview of Voice Conversion and Its Challenges: From Statistical Modeling to Deep Learning

TL;DR: This article provides a comprehensive overview of the state-of-the-art of voice conversion techniques and their performance evaluation methods from the statistical approaches to deep learning, and discusses their promise and limitations.
Proceedings ArticleDOI

L2-ARCTIC: A Non-Native English Speech Corpus

TL;DR: L2-ARCTIC is introduced, a speech corpus of non-native English that is intended for research in voice conversion, accent conversion, and mispronunciation detection, and is publicly accessible at https://psi.tamu.edu/l2-arctic-corpus/.
Journal ArticleDOI

Non-Parallel Sequence-to-Sequence Voice Conversion With Disentangled Linguistic and Speaker Representations

TL;DR: In this paper, a sequence-to-sequence (seq2seq) voice conversion method using non-parallel training data is proposed, which preserves the linguistic representations of source utterances while replacing the speaker representations with the target ones.
Posted Content

High-quality nonparallel voice conversion based on cycle-consistent adversarial network

TL;DR: In this article, a cycle-consistent adversarial network (CycleGAN) was proposed for non-parallel data-based voice conversion using unpaired image-to-image translation.
References
More filters
Journal Article

Dropout: a simple way to prevent neural networks from overfitting

TL;DR: It is shown that dropout improves the performance of neural networks on supervised learning tasks in vision, speech recognition, document classification and computational biology, obtaining state-of-the-art results on many benchmark data sets.
Journal ArticleDOI

Multilayer feedforward networks are universal approximators

TL;DR: It is rigorously established that standard multilayer feedforward networks with as few as one hidden layer using arbitrary squashing functions are capable of approximating any Borel measurable function from one finite dimensional space to another to any desired degree of accuracy, provided sufficiently many hidden units are available.
Proceedings Article

Understanding the difficulty of training deep feedforward neural networks

TL;DR: The objective here is to understand better why standard gradient descent from random initialization is doing so poorly with deep neural networks, to better understand these recent relative successes and help design better algorithms in the future.
Proceedings Article

Deep Sparse Rectifier Neural Networks

TL;DR: This paper shows that rectifying neurons are an even better model of biological neurons and yield equal or better performance than hyperbolic tangent networks in spite of the hard non-linearity and non-dierentiabil ity.
Related Papers (5)