Pauses, gaps and overlaps in conversations

Question

Q1. What have the authors contributed in "Pauses, gaps and overlaps in conversations"?

Q2. What have the authors stated for future work in "Pauses, gaps and overlaps in conversations"?

Q3. What is the most common between-speaker interval in all three examined corpora?

Q4. What is the plausible goal for between-speaker intervals?

Q5. How can the authors quantify the proportion of speaker changes where the gap is long enough for the next speaker to react?

Q6. What is the general recommendation for the analysis of gap and overlap durations?

Q7. How did the authors determine the duration of the pauses, gaps and overlaps?

Q8. How many pauses and gaps were detected in the Swedish Map Task Corpus?

Q9. Why did the authors choose not to subdivide the dataset?

Q10. What was the definition of speaker changes?

Q11. How many states can be augmented to model other subclassifications?

Accepted Answer

This paper explores durational aspects of pauses, gaps and overlaps in three different conversational corpora with a view to challenging claims about precision timing in turn-taking. Distributions of pause, gap and overlap durations in conversations are presented, and methodological issues regarding the statistical treatment of such distributions are discussed. These results are discussed in the light of their implications for models of timing in turn-taking, and for interaction control models in speech technology. In particular, it is argued that the proportion of speaker changes that could potentially be triggered by information immediately preceding the speaker change is large enough for reactive interaction control models to be viable in speech technology.

Accepted Answer

Furthermore, as more than 40% of all between-speaker intervals are long enough for the next speaker to react to information immediately before the silence given minimal response times for spoken utterances, the authors also conclude that reaction is a plausible explanation in a significant proportion of all speaker changes.

Accepted Answer

The most common between-speaker interval in all three examined corpora, as indicated by the modes of the distribution functions, is a gap of about 200 ms.

Accepted Answer

Assuming instead that we, as highly trained speakers, succeed more often than we fail at turn-taking, a slight gap is a more plausible goal for between-speaker intervals.

Accepted Answer

By relating distributions of between-speaker intervals to minimal response times for spoken utterances, the authors can quantify the proportion of speaker changes where the gap is long enough for the next speaker to react to the offset of speech, to silence or to some prosodic information immediately before the silence.
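As a rough illustration of this kind of quantification, the sketch below computes the share of between-speaker intervals that are at least as long as a given minimal response time. The function name, the sample data and the 200 ms threshold are assumptions made for the example; they are not values or code taken from the paper.

```python
def proportion_at_least(intervals_ms, min_response_ms):
    """Fraction of between-speaker intervals >= min_response_ms.

    Gaps are positive and overlaps negative, so overlaps never count
    as long enough to react to.
    """
    if not intervals_ms:
        raise ValueError("need at least one interval")
    long_enough = sum(1 for v in intervals_ms if v >= min_response_ms)
    return long_enough / len(intervals_ms)


# Hypothetical between-speaker intervals in milliseconds (invented data).
sample_ms = [-300, -80, 150, 220, 240, 260, 610]

# 200 ms is only an illustrative placeholder for a minimal response time.
share = proportion_at_least(sample_ms, min_response_ms=200)
print(f"{share:.1%} of intervals are long enough to allow a reaction")
```

The same function can be reused with any other threshold, for instance a silence threshold used by an end-of-utterance detector.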

Accepted Answer

As a general recommendation, the authors suggest that whenever gap as well as overlap durations are available, they should be treated as one distribution, and that no transformation should be applied.
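One way to picture this recommendation is to place gaps and overlaps on a single signed scale, with gaps as positive and overlaps as negative durations, and to summarise the merged values directly. The sketch below does this with invented data and a simple histogram mode; the helper names and the 100 ms bin width are assumptions for illustration. Note that a logarithmic transform would not be defined for the negative (overlap) values on such a scale.

```python
import math
from collections import Counter


def between_speaker_intervals(gaps_ms, overlaps_ms):
    """Merge gaps (positive) and overlaps (negative) into one signed list."""
    return [float(g) for g in gaps_ms] + [-float(o) for o in overlaps_ms]


def histogram_mode(values, bin_width_ms=100.0):
    """Return the centre of the most populated fixed-width bin."""
    counts = Counter(math.floor(v / bin_width_ms) for v in values)
    best_bin = max(counts, key=counts.get)
    return (best_bin + 0.5) * bin_width_ms


# Invented durations in milliseconds, not data from the corpora.
gaps = [150, 220, 240, 260, 610]
overlaps = [80, 300]

bsi = between_speaker_intervals(gaps, overlaps)
mode_ms = histogram_mode(bsi)
print(f"modal between-speaker interval: about {mode_ms:.0f} ms")
```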

Accepted Answer

Once the pauses, gaps and overlaps were identified and classified, their durations were extracted by subtracting the time of the onset of an interval from the time of its offset.
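The extraction step described here amounts to a single subtraction per interval. A minimal sketch, assuming each interval has already been labelled and carries onset and offset times in seconds (the `Interval` type and the example values are hypothetical):

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Interval:
    kind: str      # "pause", "gap" or "overlap"
    onset: float   # start time in seconds
    offset: float  # end time in seconds


def duration(interval: Interval) -> float:
    """Duration is the offset time minus the onset time."""
    return interval.offset - interval.onset


# Invented example intervals.
intervals = [
    Interval("pause", 1.20, 1.55),
    Interval("gap", 3.10, 3.30),
    Interval("overlap", 5.00, 5.12),
]

# Round to milliseconds to hide floating-point noise.
durations = {iv.kind: round(duration(iv), 3) for iv in intervals}
print(durations)
```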

Accepted Answer

An examination of the proportion of pauses and gaps with durations of more than 500 ms, a common silence threshold in end-of-utterance detectors, showed that such a threshold captured 51.1% and 47.5% of all gaps, but also 59.6% and 56.0% of all pauses in the Swedish Map Task Corpus and the HCRC Map Task Corpus, respectively.

Accepted Answer

While the dataset allows for analyses of differences between, for example, eye contact vs. no eye contact conditions or gender differences, the authors chose not to subdivide the dataset to make such comparisons.

Accepted Answer

There is also the possibility of speaker changes involving overlaps or no-gap–no-overlaps, which were the terms used by Sacks et al. (1974).

Accepted Answer

The number of states in such an interaction FSA may be augmented to model other subclassifications, or to model sojourn times, without loss of generality; here, the authors limit themselves to an FSA of 10 states, and specifically to the 4 phenomena mentioned, as it is most directly relevant to their ongoing work in conversational spoken dialogue systems.

Citations

語用論(Pragmatics)を考える

Turn-taking in Human Communication – Origins and Implications for Language Processing

Timing in turn-taking and its implications for processing models of language

Predicting while comprehending language: a theory and review

References

A simplest systematics for the organization of turn-taking for conversation

語用論(Pragmatics)を考える

Some functions of gaze direction in social interaction

The HCRC Map Task Corpus

Some Signals and Rules for Taking Speaking Turns in Conversations
