scispace - formally typeset
Journal ArticleDOI

Spectral characterization of jitter, shimmer, and additive noise in synthetically generated voice signals.

Peter J. Murphy
- 28 Jan 2000 - 
- Vol. 107, Iss: 2, pp 978-988
TLDR
Definite spectral differences are found for each aperiodic component; based on these findings differential quantitative spectral measurements are suggested for future studies involving human voice signals.
Abstract
Alteration of the harmonic structure in voice source spectra, taken over at least two periods of the waveform, may occur due to the presence of fundamental frequency (f0) perturbation, amplitude perturbation, additive noise, or changes within the glottal source signal itself. In order to make accurate inferences regarding glottal-flow dynamics or perceptual evaluations based on spectral measurements taken from the acoustic speech waveform, investigation of the spectral features of each aperiodic component is required. Based on a heuristic development involving a consideration of the partial sum of the Fourier series taken for two periods of a jittered, shimmered, and (additive, random) noise-contaminated signal, the corresponding spectral characteristics are hypothesized. Subsequent to this, the Fourier series coefficients are calculated for the two periods in order to test the hypotheses. Definite spectral differences are found for each aperiodic component; based on these findings differential quantitative spectral measurements are suggested. Further supportive evidence is obtained through use of Fourier transform and periodogram-averaged calculations. The analysis is carried out on synthetically generated glottal-pulse waveforms and on radiated speech waveforms. A discussion of the results is given in terms of voice aperiodicity in general and in terms of their implication for future studies involving human voice signals.

read more

Citations
More filters
Journal ArticleDOI

Cepstral peak prominence: A comprehensive analysis

TL;DR: The meaning of cepstral peak prominence (CPP) is very similar to that of the first rahmonic and some insights are provided on its dependence with fundamental frequency and vocal tract resonances.
Journal ArticleDOI

Voice Stress Analysis: A New Framework for Voice and Effort in Human Performance

TL;DR: The “Model for Voice and Effort” (MoVE) is presented that comprises the integration of ongoing top-down and bottom-up activity under different types of load and combined patterns of voice output and a key-role is given to the anterior cingulate cortex.
Journal ArticleDOI

Voice pathology detection based eon short-term jitter estimations in running speech.

TL;DR: Estimated short-term jittervalues from reading text were found to confirm the studies showing a decrease of jitter with increasing fundamental frequencies, and the more frequent presence of high jitter values in the case of pathological voices as time increases.
Journal ArticleDOI

Spectral models of additive and modulation noise in speech and phonatory excitation signals.

TL;DR: In this article, a spectral model of additive and modulation noise in speech is presented, assuming that the Fourier series of the noise and of the clean monocycle-periodic excitation are known.
Journal ArticleDOI

Noise estimation in voice signals using short-term cepstral analysis

TL;DR: A theoretical description of cepstral processing of voiced speech containing aspiration noise and a new harmonics-to-noise (HNR) estimation technique, which is shown to provide accurate HNR estimates when tested on synthetically generated voice signals.
References
More filters
Journal ArticleDOI

On the use of windows for harmonic analysis with the discrete Fourier transform

F.J. Harris
TL;DR: A comprehensive catalog of data windows along with their significant performance parameters from which the different windows can be compared is included, and an example demonstrates the use and value of windows to resolve closely spaced harmonic signals characterized by large differences in amplitude.
Journal ArticleDOI

Effect of glottal pulse shape on the quality of natural vowels.

TL;DR: In this article, a male speaker recorded monosyllabic words and a continuous sentence and a pitch-synchronous analysis was carried out by a digital computer on the vowel portions of these samples, for every pitch period, the analysis provided: formant frequencies, waveform of the glottal excitation function, and an accurate pitch-period measurement.
Journal ArticleDOI

Glottal characteristics of female speakers: Acoustic correlates

TL;DR: A set of acoustic parameters of the voicing source that reflect individual differences in the voice qualities of female speakers are formulated to contribute to the description of normal variations of voicing characteristics across speakers and to a continuing effort to improve the analysis and synthesis of female speech.
Journal ArticleDOI

Comparisons Among Aerodynamic, Electroglottographic, and Acoustic Spectral Measures of Female Voice

TL;DR: Examination of measures of the glottal airflow waveform, the electroglottographic signal, amplitude differences between peaks in the acoustic spectrum, and observations of the spectral energy content of the third formant, in terms of how they relate to one another, suggested that these signals can be used to complement each other.
Journal ArticleDOI

Speech waveform perturbation analysis: a perceptual-acoustical comparison of seven measures.

TL;DR: The standard deviation of the distribution of the relative frequency differences was suggested as the most useful acoustic measure of waveform perturbations for clinical applications.
Related Papers (5)