Timing patterns in fluent and disfluent spontaneous speech

doi:10.1109/ICASSP.1995.479669

Proceedings ArticleDOI

Timing patterns in fluent and disfluent spontaneous speech

- Vol. 1, pp 600-603

TLDR

This work examines and model global speaking rate, how it varies for both fluent and disfluent spontaneous speech, in terms of the linguistic content of the utterances, and finds application in automatic speech synthesis and recognition.

Abstract:

Most previous acoustic analysis of speech has examined data from speakers who carefully pronounce their speech, usually by reading prepared texts. Natural spontaneous or conversational speech differs from careful or read speech, especially concerning hesitation phenomena and variable speaking rates. We examine and model global speaking rate, how it varies for both fluent and disfluent spontaneous speech, in terms of the linguistic content of the utterances. Speakers tend to maintain a fixed speaking rate during most utterances, but often adopt a faster or slower rate, depending on the cognitive load (i.e., slowing down when having to make unanticipated choices, or accelerating when repeating some words). Such a model can find application in automatic speech synthesis and recognition, because most synthesizers maintain a constant (and unnatural) speaking rate and most recognizers are not capable of adapting their templates or probabilistic models to reflect global changes in speaking rate.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

A review of large-vocabulary continuous-speech

Steve Young

- 01 Sep 1996 -

IEEE Signal Processing Magazine

TL;DR: The principles and architecture of current LVR systems are discussed and the key issues affecting their future deployment are identified; to illustrate the various points raised, the Cambridge University HTK system is described.

...read moreread less

Journal ArticleDOI

Robust Speech Rate Estimation for Spontaneous Speech

Dagen Wang, +1 more

- 01 Nov 2007 -

IEEE Transactions on Audio, Speech, and ...

TL;DR: This paper compares various spectral and temporal signal analysis and smoothing strategies to better characterize the underlying syllable structure to derive speech rate and describes an automated approach for learning algorithm parameters from data, and finds the optimal settings through Monte Carlo simulations and parameter sensitivity analysis.

...read moreread less

Journal ArticleDOI

The Delta-Phase Spectrum With Application to Voice Activity Detection and Speaker Recognition

Iain McCowan, +4 more

- 01 Sep 2011 -

IEEE Transactions on Audio, Speech, and ...

TL;DR: Experiments show that mel-frequency cepstral coefficients features derived from the delta-phase spectrum can produce broadly similar performance to equivalent magnitude domain features for both voice activity detection and speaker recognition tasks.

...read moreread less

Book ChapterDOI

The significance of empty speech pauses: cognitive and algorithmic issues

Anna Esposito, +3 more

TL;DR: The high consistency, among subjects, in the distribution of speech pauses suggests that, at least in the Italian context, the speaker in narration makes use of an intrinsic timing behavior, probably a general pattern of rules, to control speech flow for discourse organization.

...read moreread less

Book ChapterDOI

On the Significance of Speech Pauses in Depressive Disorders: Results on Read and Spontaneous Narratives

Anna Esposito, +4 more

TL;DR: The results suggest that depressive disorders affect speech quality and speech production through pause and clause durations, as well as, clause quantities, suggest a strong general effect of depressive symptoms on cognitive and psychomotor functions.

...read moreread less

Noriko Umeda

- 01 Mar 1977 -

Journal of the Acoustical Society of Ame...

TL;DR: In this paper, the temporal behavior of all measurable consonants, detailed in all possible conditions, in an extensive reading by one speaker, was discussed and a strong parallelism in duration distributions among similar kinds of consonants was found.

...read moreread less

Timing patterns in fluent and disfluent spontaneous speech

Citations

A review of large-vocabulary continuous-speech

Robust Speech Rate Estimation for Spontaneous Speech

The Delta-Phase Spectrum With Application to Voice Activity Detection and Speaker Recognition

The significance of empty speech pauses: cognitive and algorithmic issues

On the Significance of Speech Pauses in Depressive Disorders: Results on Read and Spontaneous Narratives

References

Linguistic uses of segmental duration in English: Acoustic and perceptual evidence

Speaking Clearly for the Hard of Hearing II: Acoustic Characteristics of Clear and Conversational Speech.

Effects of noise on speech production: Acoustic and perceptual analyses

Articulation Rate and Its Variability in Spontaneous Speech: A Reanalysis and Some Implications

Consonant duration in American English

Related Papers (5)

Local and global models for spontaneous speech segment detection and characterization

Pauses in oral and written narratives

Paying attention to speaking rate

Improvements in children's speech recognition performance

Disfluent Speech Analysis and Synthesis: a preliminary approach.