scispace - formally typeset
Open AccessJournal ArticleDOI

Neural coding of continuous speech in auditory cortex during monaural and dichotic listening

Nai Ding, +1 more
- 01 Jan 2012 - 
- Vol. 107, Iss: 1, pp 78-89
TLDR
These findings characterize how the spectrotemporal features of speech are encoded in human auditory cortex and establish a single-trial-based paradigm to study the neural basis underlying the cocktail party phenomenon.
Abstract
The cortical representation of the acoustic features of continuous speech is the foundation of speech perception. In this study, noninvasive magnetoencephalography (MEG) recordings are obtained from human subjects actively listening to spoken narratives, in both simple and cocktail party-like auditory scenes. By modeling how acoustic features of speech are encoded in ongoing MEG activity as a spectrotemporal response function, we demonstrate that the slow temporal modulations of speech in a broad spectral region are represented bilaterally in auditory cortex by a phase-locked temporal code. For speech presented monaurally to either ear, this phase-locked response is always more faithful in the right hemisphere, but with a shorter latency in the hemisphere contralateral to the stimulated ear. When different spoken narratives are presented to each ear simultaneously (dichotic listening), the resulting cortical neural activity precisely encodes the acoustic features of both of the spoken narratives, but slightly weakened and delayed compared with the monaural response. Critically, the early sensory response to the attended speech is considerably stronger than that to the unattended speech, demonstrating top-down attentional gain control. This attentional gain is substantial even during the subjects' very first exposure to the speech mixture and therefore largely independent of knowledge of the speech content. Together, these findings characterize how the spectrotemporal features of speech are encoded in human auditory cortex and establish a single-trial-based paradigm to study the neural basis underlying the cocktail party phenomenon.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Cortical tracking of hierarchical linguistic structures in connected speech

TL;DR: It is found that, during listening to connected speech, cortical activity of different timescales concurrently tracked the time course of abstract linguistic structures at different hierarchical levels, such as words, phrases and sentences.
Journal ArticleDOI

Emergence of neural encoding of auditory objects while listening to competing speakers

TL;DR: Recording from subjects selectively listening to one of two competing speakers using magnetoencephalography indicates that concurrent auditory objects, even if spectrotemporally overlapping and not resolvable at the auditory periphery, are neurally encoded individually in auditory cortex and emerge as fundamental representational units for top-down attentional modulation and bottom-up neural adaptation.
Journal ArticleDOI

Attentional Selection in a Cocktail Party Environment Can Be Decoded from Single-Trial EEG

TL;DR: It is shown that single-trial unaveraged EEG data can be decoded to determine attentional selection in a naturalistic multispeaker environment and a significant correlation between the EEG-based measure of attention and performance on a high-level attention task is shown.
Journal ArticleDOI

Speech rhythms and multiplexed oscillatory sensory coding in the human brain.

TL;DR: A neuroimaging study reveals how coupled brain oscillations at different frequencies align with quasi-rhythmic features of continuous speech such as prosody, syllables, and phonemes.
References
More filters
Journal ArticleDOI

Entrainment of Neuronal Oscillations as a Mechanism of Attentional Selection

TL;DR: It is shown that when attended stimuli are in a rhythmic stream, delta-band oscillations in the primary visual cortex entrain to the rhythm of the stream, resulting in increased response gain for task-relevant events and decreased reaction times.
Journal ArticleDOI

Low-frequency neuronal oscillations as instruments of sensory selection

TL;DR: The evidence for early sensory selection by oscillatory phase-amplitude modulations, its mechanisms and its perceptual and behavioral consequences are reviewed.
Journal ArticleDOI

The analysis of speech in different temporal integration windows: cerebral lateralization as 'asymmetric sampling in time'

TL;DR: The 'asymmetric sampling in time' hypothesis developed here provides a framework for understanding a range of psychophysical and neuropsychological data on speech perception in the context of a revised cortical functional anatomic model.
Journal ArticleDOI

Temporal Information in Speech: Acoustic, Auditory and Linguistic Aspects

TL;DR: A new framework for describing the acoustic structure of speech based purely on temporal aspects has been developed, which is said to be comprised of three main temporal features, based on dominant fluctuation rates: envelope, periodicity, and fine-structure.
Book

An Introduction to the Physiology of Hearing

TL;DR: The book is written so that those entering auditory research from very little background in auditory neuroscience are able to understand the current research issues and research literature and is an excellent reading companion to practitioners and scholars.
Related Papers (5)