Neural coding of continuous speech in auditory cortex during monaural and dichotic listening

doi:10.1152/JN.00297.2011

Open Access, Journal Article

Nai Ding, +1 more

- 01 Jan 2012 -

Journal of Neurophysiology

- Vol. 107, Iss: 1, pp 78-89

Abstract:

The cortical representation of the acoustic features of continuous speech is the foundation of speech perception. In this study, noninvasive magnetoencephalography (MEG) recordings are obtained from human subjects actively listening to spoken narratives, in both simple and cocktail party-like auditory scenes. By modeling how acoustic features of speech are encoded in ongoing MEG activity as a spectrotemporal response function, we demonstrate that the slow temporal modulations of speech in a broad spectral region are represented bilaterally in auditory cortex by a phase-locked temporal code. For speech presented monaurally to either ear, this phase-locked response is always more faithful in the right hemisphere, but with a shorter latency in the hemisphere contralateral to the stimulated ear. When different spoken narratives are presented to each ear simultaneously (dichotic listening), the resulting cortical neural activity precisely encodes the acoustic features of both of the spoken narratives, but slightly weakened and delayed compared with the monaural response. Critically, the early sensory response to the attended speech is considerably stronger than that to the unattended speech, demonstrating top-down attentional gain control. This attentional gain is substantial even during the subjects' very first exposure to the speech mixture and therefore largely independent of knowledge of the speech content. Together, these findings characterize how the spectrotemporal features of speech are encoded in human auditory cortex and establish a single-trial-based paradigm to study the neural basis underlying the cocktail party phenomenon.
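
The spectrotemporal response function (STRF) mentioned in the abstract can be read as a linear filter that maps a time-frequency representation of the stimulus onto the recorded neural response. The sketch below illustrates that idea on synthetic data only. It assumes ridge regression as the estimator, which is a common stand-in and not necessarily the method used in the paper (the abstract does not name one). All function names, array shapes, and parameter values here are hypothetical choices made for illustration.

```python
import numpy as np

def lagged_design(stimulus, n_lags):
    """Stack delayed copies of a (time, bands) stimulus into (time, n_lags * bands)."""
    n_times, n_bands = stimulus.shape
    design = np.zeros((n_times, n_bands * n_lags))
    for lag in range(n_lags):
        # Column block `lag` holds the stimulus delayed by `lag` samples.
        design[lag:, lag * n_bands:(lag + 1) * n_bands] = stimulus[:n_times - lag]
    return design

def fit_strf(stimulus, response, n_lags, ridge=1.0):
    """Estimate an STRF of shape (n_lags, bands) by ridge regression (an assumption)."""
    X = lagged_design(stimulus, n_lags)
    gram = X.T @ X + ridge * np.eye(X.shape[1])
    weights = np.linalg.solve(gram, X.T @ response)
    return weights.reshape(n_lags, stimulus.shape[1])

def predict(stimulus, strf):
    """Predict a single-channel response by filtering the stimulus with the STRF."""
    n_lags = strf.shape[0]
    return lagged_design(stimulus, n_lags) @ strf.reshape(-1)

# Synthetic example: a random "spectrogram" and a known STRF (no real MEG data).
rng = np.random.default_rng(0)
n_times, n_bands, n_lags = 2000, 4, 10
stimulus = rng.standard_normal((n_times, n_bands))
true_strf = rng.standard_normal((n_lags, n_bands))
response = predict(stimulus, true_strf) + 0.1 * rng.standard_normal(n_times)

estimated = fit_strf(stimulus, response, n_lags, ridge=1.0)
correlation = np.corrcoef(predict(stimulus, estimated), response)[0, 1]
print(f"prediction correlation: {correlation:.3f}")
```

In this framing, how faithfully the response follows the speech corresponds to the correlation between the predicted and recorded responses, and response latency corresponds to the lag at which the estimated filter peaks. With real recordings, the ridge parameter would normally be chosen by cross-validation, not fixed in advance.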

Citations

Journal Article

Linear Modeling of Neurophysiological Responses to Speech and Other Continuous Stimuli: Methodological Considerations for Applied Research

Michael J. Crosse, +8 more

- 22 Nov 2021 -

Frontiers in Neuroscience

TL;DR: In this paper, the authors focus on experimental design, data preprocessing and stimulus feature extraction, model design, training and evaluation, and interpretation of model weights, and demonstrate how to implement each stage in MATLAB using the mTRF toolbox.

Journal Article

Attention to natural auditory signals.

Emily Caporello Bluvas, +1 more

- 01 Nov 2013 -

Hearing Research

TL;DR: The role of selective attention in modulating auditory responses to complex natural stimuli in humans is reviewed, and it is suggested how the current understanding can be applied to the study of selective auditory attention in the context of natural signal processing at the level of single neurons and populations in animal models amenable to invasive neuroscience techniques.

Journal Article

Machine Learning Approaches to Analyze Speech-Evoked Neurophysiological Responses

Zilong Xie, +2 more

- 25 Mar 2019 -

Journal of Speech Language and Hearing R...

TL;DR: It is proposed that ML-based approaches can complement traditional analysis approaches to analyze neurophysiological responses to speech signals and provide a deeper understanding of natural speech and language processing using ecologically valid paradigms in both typical and clinical populations.

Posted Content

Late cortical tracking of ignored speech facilitates neural selectivity in acoustically challenging conditions

Lorenz Fiedler, +3 more

- 25 Jul 2018 -

bioRxiv

TL;DR: This work recorded and modelled the electroencephalographic response of 18 participants who attended to one of two simultaneously presented stories, while the SNR between the two talkers varied dynamically, and showed an increasing early-to-late attention-biased selectivity.

Journal Article

The Effects of Audiovisual Inputs on Solving the Cocktail Party Problem in the Human Brain: An fMRI Study.

Yuanqing Li, +5 more

- 01 Oct 2018 -

Cerebral Cortex

TL;DR: It is found that audiovisual inputs enhanced the neural representations of emotion features of the attended objects instead of the unattended objects, which might partially explain the benefits of audiovisual inputs for the brain to solve the cocktail party problem.

References

Book

Elements of information theory

Thomas M. Cover, +1 more

TL;DR: The author examines the role of entropy, inequality, and randomness in the design of codes and the construction of codes in the rapidly changing environment.

Journal Article

The cortical organization of speech processing

Gregory Hickok, +1 more

- 13 Apr 2007 -

Nature Reviews Neuroscience

TL;DR: A dual-stream model of speech processing is outlined that assumes that the ventral stream is largely bilaterally organized (although there are important computational differences between the left- and right-hemisphere systems) and that the dorsal stream is strongly left-hemisphere dominant.

Journal Article

Some Experiments on the Recognition of Speech, with One and with Two Ears

E. Colin Cherry

- 01 Sep 1953 -

Journal of the Acoustical Society of Ame...

TL;DR: In this paper, the relation between the messages received by the two ears was investigated, and two types of test were reported: (a) the behavior of a listener when presented with two speech signals simultaneously (statistical filtering problem) and (b) behavior when different speech signals are presented to his two ears.

Journal Article

Speech recognition with primarily temporal cues.

Robert V. Shannon, +4 more

- 13 Oct 1995 -

Science

TL;DR: Nearly perfect speech recognition was observed under conditions of greatly reduced spectral information; the presentation of a dynamic temporal pattern in only a few broad spectral regions is sufficient for the recognition of speech.

Journal Article

Electrical Signs of Selective Attention in the Human Brain

Steven A. Hillyard, +3 more

- 12 Oct 1973 -

Science

TL;DR: Auditory evoked potentials were recorded from the vertex of subjects who listened selectively to a series of tone pips in one ear and ignored concurrent tone pips in the other ear, in order to study the response set established to recognize infrequent, higher-pitched tone pips in the attended series.

Related Papers (5)

Emergence of neural encoding of auditory objects while listening to competing speakers

Nai Ding, +1 more

- 17 Jul 2012 -

Proceedings of the National Academy of S...

Attentional Selection in a Cocktail Party Environment Can Be Decoded from Single-Trial EEG

James O’Sullivan, +8 more

- 01 Jul 2015 -

Cerebral Cortex

Selective cortical representation of attended speaker in multi-talker speech perception

Nima Mesgarani, +1 more

- 10 May 2012 -

Nature

Low-Frequency Cortical Entrainment to Speech Reflects Phoneme-Level Processing

Giovanni M. Di Liberto, +2 more

- 05 Oct 2015 -

Current Biology

Mechanisms Underlying Selective Neuronal Tracking of Attended Speech at a “Cocktail Party”
