Neural coding of continuous speech in auditory cortex during monaural and dichotic listening

doi:10.1152/JN.00297.2011

Open AccessJournal ArticleDOI

Neural coding of continuous speech in auditory cortex during monaural and dichotic listening

Nai Ding, +1 more

- 01 Jan 2012 -

Journal of Neurophysiology

- Vol. 107, Iss: 1, pp 78-89

TLDR

These findings characterize how the spectrotemporal features of speech are encoded in human auditory cortex and establish a single-trial-based paradigm to study the neural basis underlying the cocktail party phenomenon.

Abstract:

The cortical representation of the acoustic features of continuous speech is the foundation of speech perception. In this study, noninvasive magnetoencephalography (MEG) recordings are obtained from human subjects actively listening to spoken narratives, in both simple and cocktail party-like auditory scenes. By modeling how acoustic features of speech are encoded in ongoing MEG activity as a spectrotemporal response function, we demonstrate that the slow temporal modulations of speech in a broad spectral region are represented bilaterally in auditory cortex by a phase-locked temporal code. For speech presented monaurally to either ear, this phase-locked response is always more faithful in the right hemisphere, but with a shorter latency in the hemisphere contralateral to the stimulated ear. When different spoken narratives are presented to each ear simultaneously (dichotic listening), the resulting cortical neural activity precisely encodes the acoustic features of both of the spoken narratives, but slightly weakened and delayed compared with the monaural response. Critically, the early sensory response to the attended speech is considerably stronger than that to the unattended speech, demonstrating top-down attentional gain control. This attentional gain is substantial even during the subjects' very first exposure to the speech mixture and therefore largely independent of knowledge of the speech content. Together, these findings characterize how the spectrotemporal features of speech are encoded in human auditory cortex and establish a single-trial-based paradigm to study the neural basis underlying the cocktail party phenomenon.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Right-hemisphere coherence to speech at pre-reading stages predicts reading performance one year later

Paula Ríos-López, +3 more

- 04 Oct 2021 -

Journal of cognitive psychology

Journal ArticleDOI

The effects of data quantity on performance of temporal response function analyses of natural speech processing

Juraj Mesík, +1 more

- 13 Dec 2022 -

Frontiers in neuroscience

TL;DR: This work uses a dual-talker continuous speech paradigm to demonstrate how a key parameter of experimental design, the quantity of acquired data, influences TRF analyses fit to either individual data (subject-specific analyses), or group data (generic analyses).

...read moreread less

Journal ArticleDOI

Unattended processing of hierarchical pitch variations in spoken sentences

Xiaoqing Li, +1 more

- 01 Aug 2018 -

Brain and Language

TL;DR: The results suggest that, in an unattentive state, the human brain can functionally disentangle hierarchically different levels of pitch variation, and the brain responses to these pitch variations are time‐locked to the presence of the acoustic cues.

...read moreread less

Posted ContentDOI

Contributions of local speech encoding and functional connectivity to audio-visual speech integration

Bruno L. Giordano, +5 more

- 30 Dec 2016 -

bioRxiv

TL;DR: A role of auditory-motor interactions in visual speech representations is demonstrated and functional connectivity along the ventral pathway facilitates speech comprehension in multisensory environments and is suggested to enhance functional connectivity between temporal and inferior frontal cortex.

...read moreread less

Posted ContentDOI

Neural tracking of the fundamental frequency of the voice: male voices preferred

Jana Van Canneyt, +2 more

- 27 Aug 2020 -

bioRxiv

TL;DR: Results indicated that response strength is inversely related to f0 frequency and rate of f0 change throughout the story, and response strength greatly improves for voices with strong higher harmonics, which is particularly useful to boost the small responses evoked by voices with high f0.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Emergence of neural encoding of auditory objects while listening to competing speakers

Nai Ding, +1 more

- 17 Jul 2012 -

Proceedings of the National Academy of S...

Attentional Selection in a Cocktail Party Environment Can Be Decoded from Single-Trial EEG

James O’Sullivan, +8 more

- 01 Jul 2015 -

Cerebral Cortex

Selective cortical representation of attended speaker in multi-talker speech perception

Nima Mesgarani, +1 more

- 10 May 2012 -

Nature

Low-Frequency Cortical Entrainment to Speech Reflects Phoneme-Level Processing

Giovanni M. Di Liberto, +2 more

- 05 Oct 2015 -

Current Biology

Neural coding of continuous speech in auditory cortex during monaural and dichotic listening

Citations

Right-hemisphere coherence to speech at pre-reading stages predicts reading performance one year later

The effects of data quantity on performance of temporal response function analyses of natural speech processing

Unattended processing of hierarchical pitch variations in spoken sentences

Contributions of local speech encoding and functional connectivity to audio-visual speech integration

Neural tracking of the fundamental frequency of the voice: male voices preferred

References

Elements of information theory

The cortical organization of speech processing

Some Experiments on the Recognition of Speech, with One and with Two Ears

Speech recognition with primarily temporal cues.

Electrical Signs of Selective Attention in the Human Brain

Related Papers (5)

Emergence of neural encoding of auditory objects while listening to competing speakers

Attentional Selection in a Cocktail Party Environment Can Be Decoded from Single-Trial EEG

Mechanisms Underlying Selective Neuronal Tracking of Attended Speech at a “Cocktail Party”

Selective cortical representation of attended speaker in multi-talker speech perception

Low-Frequency Cortical Entrainment to Speech Reflects Phoneme-Level Processing