Voice transformation using PSOLA technique

doi:10.1016/0167-6393(92)90012-V

Journal ArticleDOI

Voice transformation using PSOLA technique

H. Valbret, +2 more

- Vol. 11, Iss: 2, pp 175-187

Chats0

TLDR

A new system for voice conversion is described that combines a PSOLA (Pitch Synchronous Overlap and Add)-derived synthesizer and a module for spectral transformation, which produces a satisfyingly natural “transformed” voice.

Abstract:

In this contribution, a new system for voice conversion is described. The proposed architecture combines a PSOLA (Pitch Synchronous Overlap and Add)-derived synthesizer and a module for spectral transformation. The synthesizer based on the classical source-filter decomposition allows prosodic and spectral transformations to be performed independently. Prosodic modifications are applied on the excitation signal using the TD-PSOLA scheme; converted speech is then synthesized using the transformed spectral parameters. Two different approaches to derive spectral transformations, borrowed from the speech-recognition domain, are compared: Linear Multivariate Regression (LMR) and Dynamic Frequency Warping (DFW). Vector-quantization is carried out as a preliminary stage to render the spectral transformations dependent of the acoustical realization of sounds. A formal listening test shows that the synthesizer produces a satisfyingly natural “transformed” voice. LMR proves yet to allow a slightly better conversion than DFW. Still there is room for improvement in the spectral transformation stage.

Citations

PDF

Open Access

More filters

Book ChapterDOI

On the implementation of gentle phone's function based on PSOLA algorithm

Jongkuk Kim, +1 more

TL;DR: It is affective to change a blunt society to bright and calmed better telephonic mannered society, so the caller's voice sounds soft and generous as if the voice tone is not over the specific limit.

...read moreread less

Proceedings ArticleDOI

Significance of Prosody Modification in Privacy Preservation on speaker verification

Ayush Agarwal, +3 more

TL;DR: In this work, privacy is provided to the speaker identity information present in speech signals while performing automatic speaker verification (ASV) tasks through a prosody modification based approach.

...read moreread less

Journal ArticleDOI

Vowels and Prosody Contribution in Neural Network Based Voice Conversion Algorithm with Noisy Training Data

Olaide Ayodeji Agbolade

- 10 Mar 2020 -

arXiv: Audio and Speech Processing

TL;DR: The authors used a 2-layer feed-forward neural network to map the linear prediction analysis coefficients of a source speaker to the acoustic vector space of the target speaker with a view to objectively determine the contributions of the voiced, unvoiced and supra-segmental components of sounds to the voice conversion model.

...read moreread less

Journal ArticleDOI

Interface for Dynamic Modification of the Transformation Parameters of the PSOLA Algorithm

Demri Lyes, +2 more

- 23 Oct 2014 -

International Journal of Applied Mathema...

TL;DR: A graphical interface for the modification of the prosodic features of the speech signal (the melodic curve - fundamental frequency and temporal organization of the syllables - and the formantic trajectories) using the PSOLA algorithm is proposed.

...read moreread less

Proceedings ArticleDOI

Applying Spectral Normalisation and Efficient Envelope Estimation and Statistical Transformation for the Voice Conversion Challenge 2016

Fernando Villavicencio, +3 more

TL;DR: Comunicacio presentada a l'Interspeech 2016, celebrat els dies 8 a 12 de setembre de 2016 a San Francisco, California.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

An Algorithm for Vector Quantizer Design

Y. Linde, +2 more

- 01 Jan 1980 -

IEEE Transactions on Communications

TL;DR: An efficient and intuitive algorithm is presented for the design of vector quantizers based either on a known probabilistic model or on a long training sequence of data.

...read moreread less

Book

Linear Prediction of Speech

John E. Markel, +1 more

TL;DR: Speech Analysis and Synthesis Models: Basic Physical Principles, Speech Synthesis Structures, and Considerations in Choice of Analysis.

...read moreread less

Journal ArticleDOI

Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones

Eric Moulines, +1 more

- 01 Dec 1990 -

Speech Communication

TL;DR: In a common framework several algorithms that have been proposed recently, in order to improve the voice quality of a text-to-speech synthesis based on acoustical units concatenation based on pitch-synchronous overlap-add approach are reviewed.

...read moreread less

Proceedings ArticleDOI

Voice conversion through vector quantization

Masanobu Abe, +3 more

TL;DR: The authors propose a new voice conversion technique through vector quantization and spectrum mapping which makes it possible to precisely control voice individuality.

...read moreread less

Journal Article