Journal ArticleDOI
Voice transformation using PSOLA technique
H. Valbret,Eric Moulines,J. P. Tubach +2 more
- Vol. 11, Iss: 2, pp 175-187
Reads0
Chats0
TLDR
A new system for voice conversion is described that combines a PSOLA (Pitch Synchronous Overlap and Add)-derived synthesizer and a module for spectral transformation, which produces a satisfyingly natural “transformed” voice.Abstract:
In this contribution, a new system for voice conversion is described. The proposed architecture combines a PSOLA (Pitch Synchronous Overlap and Add)-derived synthesizer and a module for spectral transformation. The synthesizer based on the classical source-filter decomposition allows prosodic and spectral transformations to be performed independently. Prosodic modifications are applied on the excitation signal using the TD-PSOLA scheme; converted speech is then synthesized using the transformed spectral parameters. Two different approaches to derive spectral transformations, borrowed from the speech-recognition domain, are compared: Linear Multivariate Regression (LMR) and Dynamic Frequency Warping (DFW). Vector-quantization is carried out as a preliminary stage to render the spectral transformations dependent of the acoustical realization of sounds. A formal listening test shows that the synthesizer produces a satisfyingly natural “transformed” voice. LMR proves yet to allow a slightly better conversion than DFW. Still there is room for improvement in the spectral transformation stage.read more
Citations
More filters
Journal ArticleDOI
Japanese lexical accent recognition for a CALL system by deriving classification equations with perceptual experiments
TL;DR: This work carries out listening tests making use of experiments using resynthesized speech to construct a method that performs comparably to the inter-labeler agreement rate and outperformed SVM-based methods for non-native speech.
Proceedings ArticleDOI
Speech compression by vector quantization of epochs
P. Veprek,A.B. Bradley +1 more
TL;DR: A speech compression method based on vector quantization of epochs and a technique for epoch extrapolation are described, which is evaluated and briefly compared to other waveform coders.
Journal ArticleDOI
STRAIGHT-Based Emotion Conversion Using Quadratic Multivariate Polynomial
TL;DR: Quality of emotional speech conversion can be improved by estimating nonlinear relationship between the neutral and emotional speech feature vectors, and quadratic multivariate polynomial (QMP) has been explored for transforming neutral speech to emotional target speech.
Proceedings ArticleDOI
Performance of Voice Conversion Systems Based on GMM and Applied to Arabic Language
TL;DR: This article is studying the different performances of the standard Arabic speech conversion and in particular that of the vocal tract using a regularized discrete cepstrum in order to estimate the spectrum of the speech signal and the parameters of a GMM model.
Journal ArticleDOI
Restricted Boltzmann Machine-Based Voice Conversion for Nonparallel Corpus
TL;DR: This letter presents a new voice conversion method that needs no parallel speech corpus, and adopts a restricted Boltzmann machine (RBM) to represent the distribution of the spectral features derived from a target speaker.
References
More filters
Journal ArticleDOI
An Algorithm for Vector Quantizer Design
Y. Linde,A. Buzo,Robert M. Gray +2 more
TL;DR: An efficient and intuitive algorithm is presented for the design of vector quantizers based either on a known probabilistic model or on a long training sequence of data.
Book
Linear Prediction of Speech
John E. Markel,A. Gray +1 more
TL;DR: Speech Analysis and Synthesis Models: Basic Physical Principles, Speech Synthesis Structures, and Considerations in Choice of Analysis.
Journal ArticleDOI
Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones
TL;DR: In a common framework several algorithms that have been proposed recently, in order to improve the voice quality of a text-to-speech synthesis based on acoustical units concatenation based on pitch-synchronous overlap-add approach are reviewed.
Proceedings ArticleDOI
Voice conversion through vector quantization
TL;DR: The authors propose a new voice conversion technique through vector quantization and spectrum mapping which makes it possible to precisely control voice individuality.