scispace - formally typeset
Open AccessJournal ArticleDOI

Pauses, gaps and overlaps in conversations

Mattias Heldner, +1 more
- 01 Oct 2010 - 
- Vol. 38, Iss: 4, pp 555-568
TLDR
This paper explores durational aspects of pauses gaps and overlaps in three different conversational corpora with a view to challenge claims about precision timing in turn-taking Distributions of p distributions.
About
This article is published in Journal of Phonetics.The article was published on 2010-10-01 and is currently open access. It has received 390 citations till now. The article focuses on the topics: Conversation.

read more

Citations
More filters
Journal ArticleDOI

Turn-taking in Human Communication – Origins and Implications for Language Processing

TL;DR: Appearing earlier in ontogeny than linguistic competence, the turn-taking system is also found across all the major primate clades, which suggests a possible phylogenetic continuity, which may provide key insights into language evolution.
Journal ArticleDOI

Timing in turn-taking and its implications for processing models of language

TL;DR: Turn-taking has been studied extensively in the literature as discussed by the authors and it has been shown to have the systematic properties originally noted by Sacks, Schegloff and Jefferson (1974; hereafter SSJ).
Journal ArticleDOI

Predicting while comprehending language: a theory and review

TL;DR: It is argued that prediction-by-production is an optional mechanism, which is augmented by mechanisms based on association, and support for this proposal comes from many areas of research.

Timing in turn-taking and its implications for processing models of language

TL;DR: This paper reviews the extensive literature about this system, adding new statistical analyses of behavioral data where they have been missing, demonstrating that turn-taking has the systematic properties originally noted by Sacks et al. (1974); and sketches some first model of the mental processes involved for the participant preparing to speak next.
References
More filters
Journal ArticleDOI

A simplest systematics for the organization of turn-taking for conversation

TL;DR: Turn-taking is used for the ordering of moves in games, for allocating political office, for regulating traffic at intersections, for the servicing of customers at business establishments, and for talking in interviews, meetings, debates, ceremonies, conversations.
Journal ArticleDOI

The HCRC Map Task Corpus

TL;DR: A corpus of unscripted, task-oriented dialogues which has been designed, digitally recorded, and transcribed to support the study of spontaneous speech on many levels is described.
Journal ArticleDOI

Some Signals and Rules for Taking Speaking Turns in Conversations

TL;DR: Turn-taking mechanism in face-to-face interaction was studied in this article, where participants manage the smooth and appropriate exchange of speaking turns in face to face interaction in 2 videotapes showing a therapist-patient interview and a discussion between 2 therapists.
Related Papers (5)
Frequently Asked Questions (11)
Q1. What have the authors contributed in "Pauses, gaps and overlaps in conversations" ?

This paper explores durational aspects of pauses, gaps and overlaps in three different conversational corpora with a view to challenge claims about precision timing in turn-taking. Distributions of pause, gap and overlap durations in conversations are presented, and methodological issues regarding the statistical treatment of such distributions are discussed. These results are discussed in the light of their implications for models of timing in turn-taking, and for interaction control models in speech technology. In particular, it is argued that the proportion of speaker changes that could potentially be triggered by information immediately preceding the speaker change is large enough for reactive interaction controls models to be viable in speech technology. 

Furthermore, as more than 40 % of all between-speaker intervals are long enough for the next speaker to react to information immediately before the silence given minimal response times for spoken utterances, the authors also conclude that reaction is a plausible explanation in a significant proportion of all speaker changes. 

The most common between-speaker interval in all three examined corpora, as indicated by the modes of the distribution functions, is a gap of about 200 ms. 

Assuming instead that we, as highly trained speakers, succeed more often than the authors fail at turntaking, slight gaps is a more plausible goal for between-speaker intervals. 

By relating distributions of between-speaker intervals to minimal response times for spoken utterances, the authors can quantify the proportion of speaker changes where the gap is long enough for the next speaker to react the to the offset of speech, to silence or to some prosodic information immediately before the silence. 

As a general recommendation, the authors suggest that whenever gap as well as overlap durations are available, they should be treated as one distribution, and that no transformation should be applied. 

Once the pauses, gaps and overlaps were identified and classified, their durations were extracted by subtracting the time of the onset of an interval from the time of its offset. 

an examination of the proportion of pauses and gaps with durations of more than 500 ms, a common silence threshold in end-of-utterance detectors, showed that such a threshold captured 51.1% and 47.5% of all gaps, but also 59.6% and 56.0% of all pauses in the Swedish Map Task Corpus and the HCRC Map Task Corpus, respectively. 

While the dataset allows for analyses of differences between, for example, eye contact vs. no eyecontact conditions or gender differences, the authors chose not to subdivide the dataset to make such comparisons. 

There is also the possibility of speaker changes involving overlaps or no-gap–no-overlaps, which were the terms used by Sacks et al. (1974). 

The number of states in such an interaction FSA may be augmented to model other subclassifications, or to model sojourn times, without loss of generality; here, the authors limit ourselves to an FSA of 10 states, and specifically to the 4 phenomena mentioned, as it is most directly relevant to their ongoing work in conversational spoken dialogue systems.