An overview of voice conversion systems

doi:10.1016/J.SPECOM.2017.01.008

Journal Article

Seyed Hamidreza Mohammadi, +1 more

- 01 Apr 2017 -

Speech Communication

- Vol. 88, Iss: 88, pp 65-82

TLDR

This article provides an overview of real-world applications of voice conversion (VC) systems, extensively studies existing systems proposed in the literature, and discusses remaining challenges.

About:

This article is published in Speech Communication. The article was published on 2017-04-01. It has received 232 citations to date. The article focuses on the topics: Voice analysis & Voice activity detection.

Citations

Proceedings Article

The Voice Conversion Challenge 2018: Promoting Development of Parallel and Nonparallel Methods

Jaime Lorenzo-Trueba, +6 more

TL;DR: A brief summary of the state-of-the-art techniques for VC is presented, followed by a detailed explanation of the challenge tasks and the results that were obtained.

Journal Article

An Overview of Voice Conversion and Its Challenges: From Statistical Modeling to Deep Learning

Berrak Sisman, +3 more

- 01 Jan 2021 -

IEEE Transactions on Audio, Speech, and ...

TL;DR: This article provides a comprehensive overview of the state-of-the-art of voice conversion techniques and their performance evaluation methods from the statistical approaches to deep learning, and discusses their promise and limitations.

Proceedings Article

L2-ARCTIC: A Non-Native English Speech Corpus

Guanlong Zhao, +6 more

TL;DR: L2-ARCTIC is introduced, a speech corpus of non-native English that is intended for research in voice conversion, accent conversion, and mispronunciation detection, and is publicly accessible at https://psi.tamu.edu/l2-arctic-corpus/.

Journal Article

Non-Parallel Sequence-to-Sequence Voice Conversion With Disentangled Linguistic and Speaker Representations

Jing-Xuan Zhang, +2 more

- 01 Jan 2020 -

IEEE Transactions on Audio, Speech, and ...

TL;DR: In this paper, a sequence-to-sequence (seq2seq) voice conversion method using non-parallel training data is proposed, which preserves the linguistic representations of source utterances while replacing the speaker representations with the target ones.

Posted Content

High-quality nonparallel voice conversion based on cycle-consistent adversarial network

Fuming Fang, +3 more

- 02 Apr 2018 -

arXiv: Audio and Speech Processing

TL;DR: In this article, a cycle-consistent adversarial network (CycleGAN), a model originally developed for unpaired image-to-image translation, is proposed for voice conversion with non-parallel training data.

References

Journal Article

Dropout: a simple way to prevent neural networks from overfitting

Nitish Srivastava, +4 more

- 01 Jan 2014 -

Journal of Machine Learning Research

TL;DR: It is shown that dropout improves the performance of neural networks on supervised learning tasks in vision, speech recognition, document classification and computational biology, obtaining state-of-the-art results on many benchmark data sets.

Journal Article

Multilayer feedforward networks are universal approximators

Kurt Hornik, +2 more

- 01 Jul 1989 -

Neural Networks

TL;DR: It is rigorously established that standard multilayer feedforward networks with as few as one hidden layer using arbitrary squashing functions are capable of approximating any Borel measurable function from one finite dimensional space to another to any desired degree of accuracy, provided sufficiently many hidden units are available.

Proceedings Article

Understanding the difficulty of training deep feedforward neural networks

Xavier Glorot, +1 more

TL;DR: The objective here is to understand better why standard gradient descent from random initialization is doing so poorly with deep neural networks, to better understand these recent relative successes and help design better algorithms in the future.

Proceedings Article

Deep Sparse Rectifier Neural Networks

Xavier Glorot, +2 more

TL;DR: This paper shows that rectifying neurons are an even better model of biological neurons and yield equal or better performance than hyperbolic tangent networks in spite of the hard non-linearity and non-differentiability.