Neural coding of continuous speech in auditory cortex during monaural and dichotic listening
Nai Ding, Jonathan Z. Simon
TL;DR: These findings characterize how the spectrotemporal features of speech are encoded in human auditory cortex and establish a single-trial-based paradigm to study the neural basis underlying the cocktail party phenomenon.
Abstract: The cortical representation of the acoustic features of continuous speech is the foundation of speech perception. In this study, noninvasive magnetoencephalography (MEG) recordings are obtained from human subjects actively listening to spoken narratives, in both simple and cocktail party-like auditory scenes. By modeling how acoustic features of speech are encoded in ongoing MEG activity as a spectrotemporal response function, we demonstrate that the slow temporal modulations of speech in a broad spectral region are represented bilaterally in auditory cortex by a phase-locked temporal code. For speech presented monaurally to either ear, this phase-locked response is always more faithful in the right hemisphere, but with a shorter latency in the hemisphere contralateral to the stimulated ear. When different spoken narratives are presented to each ear simultaneously (dichotic listening), the resulting cortical neural activity precisely encodes the acoustic features of both of the spoken narratives, but slightly weakened and delayed compared with the monaural response. Critically, the early sensory response to the attended speech is considerably stronger than that to the unattended speech, demonstrating top-down attentional gain control. This attentional gain is substantial even during the subjects' very first exposure to the speech mixture and therefore largely independent of knowledge of the speech content. Together, these findings characterize how the spectrotemporal features of speech are encoded in human auditory cortex and establish a single-trial-based paradigm to study the neural basis underlying the cocktail party phenomenon.
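The spectrotemporal response function described in the abstract is, at heart, a regularized linear filter mapping stimulus features to the recorded MEG signal. As a minimal sketch, assuming a single stimulus feature (the speech envelope) and plain ridge regression in place of the boosting estimator typically used in this line of work, the estimation might look like:

```python
import numpy as np

def estimate_trf(stim, resp, n_lags, reg=1e-2):
    """Estimate a linear temporal response function by ridge regression.

    stim   : 1-D stimulus feature (e.g. the speech envelope), shape (T,)
    resp   : 1-D neural response (e.g. one MEG channel), shape (T,)
    n_lags : number of time lags spanned by the filter
    reg    : ridge regularization strength
    """
    T = len(stim)
    # Lagged design matrix: X[t, k] = stim[t - k] (zero before stimulus onset)
    X = np.zeros((T, n_lags))
    for k in range(n_lags):
        X[k:, k] = stim[:T - k]
    # Ridge solution to the normal equations: w = (X'X + reg*I)^-1 X'y
    return np.linalg.solve(X.T @ X + reg * np.eye(n_lags), X.T @ resp)

# Toy check: a response generated by convolving with a known kernel
# (plus a little noise) should yield that kernel back.
rng = np.random.default_rng(0)
stim = rng.standard_normal(2000)
true_kernel = np.array([0.0, 0.5, 1.0, 0.5, 0.0])
resp = np.convolve(stim, true_kernel)[:len(stim)]
resp += 0.01 * rng.standard_normal(len(stim))
w = estimate_trf(stim, resp, n_lags=5)
```

With a full spectrogram instead of a single envelope, the lagged design matrix gains one block of columns per frequency band and the estimated filter becomes spectrotemporal; function and parameter names here are illustrative, not the authors' code.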
Citations
Cortical tracking of hierarchical linguistic structures in connected speech
Nai Ding, Lucia Melloni, Hang Zhang, Xing Tian, David Poeppel
TL;DR: It is found that, while listening to connected speech, cortical activity at different timescales concurrently tracks the time courses of abstract linguistic structures at different hierarchical levels, such as words, phrases, and sentences.
Mechanisms Underlying Selective Neuronal Tracking of Attended Speech at a “Cocktail Party”
Elana Zion Golumbic, Nai Ding, Stephan Bickel, Peter Lakatos, Catherine A. Schevon, Guy M. McKhann, Robert R. Goodman, Ronald G. Emerson, Ashesh D. Mehta, Jonathan Z. Simon, David Poeppel, Charles E. Schroeder
TL;DR: It is found that brain activity dynamically tracks speech streams using both low-frequency phase and high-frequency amplitude fluctuations and that optimal encoding likely combines the two.
Emergence of neural encoding of auditory objects while listening to competing speakers
Nai Ding, Jonathan Z. Simon
TL;DR: Recording from subjects selectively listening to one of two competing speakers using magnetoencephalography indicates that concurrent auditory objects, even if spectrotemporally overlapping and not resolvable at the auditory periphery, are neurally encoded individually in auditory cortex and emerge as fundamental representational units for top-down attentional modulation and bottom-up neural adaptation.
Attentional Selection in a Cocktail Party Environment Can Be Decoded from Single-Trial EEG
James O’Sullivan, Alan J. Power, Nima Mesgarani, Siddharth Rajaram, John J. Foxe, Barbara G. Shinn-Cunningham, Malcolm Slaney, Shihab A. Shamma, Edmund C. Lalor
TL;DR: It is shown that single-trial (unaveraged) EEG data can be decoded to determine attentional selection in a naturalistic multispeaker environment, and that the EEG-based measure of attention correlates significantly with performance on a high-level attention task.
Speech rhythms and multiplexed oscillatory sensory coding in the human brain.
Joachim Gross, Nienke Hoogenboom, Gregor Thut, Philippe G. Schyns, Stefano Panzeri, Pascal Belin, Simon Garrod
TL;DR: A neuroimaging study reveals how coupled brain oscillations at different frequencies align with quasi-rhythmic features of continuous speech such as prosody, syllables, and phonemes.
References
Gabor analysis of auditory midbrain receptive fields: spectro-temporal and binaural composition.
TL;DR: The properties of monaural STRFs and the relationship between ipsi- and contralateral inputs to neurons of the central nucleus of the inferior colliculus (ICC) of cats are reported, and it is shown that most interaural STRF parameters are highly correlated bilaterally.
Determination of activation areas in the human auditory cortex by means of synthetic aperture magnetometry
Anthony T. Herdman, Andreas Wollbrink, Wilkin Chau, Ryouhei Ishii, Bernhard Ross, Christo Pantev
TL;DR: Investigating active cortical areas associated with magnetically recorded transient and steady-state auditory evoked responses suggests that SAM is a useful technique for imaging cortical structures involved in processing perceptual information.
Attention-driven auditory cortex short-term plasticity helps segregate relevant sounds from noise.
Jyrki Ahveninen, Matti Hämäläinen, Iiro P. Jääskeläinen, Seppo P. Ahlfors, Samantha Huang, Fa-Hsuan Lin, Tommi Raij, Mikko Sams, Christos E. Vasios, John W. Belliveau
TL;DR: A simple gain model alone cannot explain auditory selective attention; attention-driven short-term plasticity retunes neurons to segregate relevant sounds from noise.
Neuromagnetic responses to frequency-tagged sounds: A new method to follow inputs from each ear to the human auditory cortex during binaural hearing
TL;DR: A novel method is introduced that, for the first time, allows the inputs from each ear to be selectively followed up to the cortex in humans during binaural hearing, using neuromagnetic cortical responses to amplitude-modulated continuous tones with a different modulation frequency at each ear.
The neural processing of masked speech: evidence for different mechanisms in the left and right temporal lobes.
TL;DR: Functional imaging results reveal that masking speech with speech leads to bilateral superior temporal gyrus (STG) activation relative to a speech-in-noise baseline; comparisons with two additional maskers derived from the original speech show that masking effects can arise through two parallel neural systems, in the left and right temporal lobes.