1
|
Rupp KM, Hect JL, Harford EE, Holt LL, Ghuman AS, Abel TJ. A hierarchy of processing complexity and timescales for natural sounds in the human auditory cortex. Proc Natl Acad Sci U S A 2025; 122:e2412243122. [PMID: 40294254 PMCID: PMC12067213 DOI: 10.1073/pnas.2412243122] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2024] [Accepted: 03/21/2025] [Indexed: 04/30/2025] Open
Abstract
Efficient behavior is supported by humans' ability to rapidly recognize acoustically distinct sounds as members of a common category. Within the auditory cortex, critical unanswered questions remain regarding the organization and dynamics of sound categorization. We performed intracerebral recordings during epilepsy surgery evaluation as 20 patient-participants listened to natural sounds. We then built encoding models to predict neural responses using sound representations extracted from different layers within a deep neural network (DNN) pretrained to categorize sounds from acoustics. This approach yielded accurate models of neural responses throughout the auditory cortex. The complexity of a cortical site's representation (measured by the depth of the DNN layer that produced the best model) was closely related to its anatomical location, with shallow, middle, and deep layers associated with core (primary auditory cortex), lateral belt, and parabelt regions, respectively. Smoothly varying gradients of representational complexity existed within these regions, with complexity increasing along a posteromedial-to-anterolateral direction in core and lateral belt and along posterior-to-anterior and dorsal-to-ventral dimensions in parabelt. We then characterized the time (relative to sound onset) when feature representations emerged; this measure of temporal dynamics increased across the auditory hierarchy. Finally, we found separable effects of region and temporal dynamics on representational complexity: sites that took longer to begin encoding stimulus features had higher representational complexity independent of region, and downstream regions encoded more complex features independent of temporal dynamics. These findings suggest that hierarchies of timescales and complexity represent a functional organizational principle of the auditory stream underlying our ability to rapidly categorize sounds.
Collapse
Affiliation(s)
- Kyle M. Rupp
- Department of Neurological Surgery, University of Pittsburgh, PA15213
| | - Jasmine L. Hect
- Department of Neurological Surgery, University of Pittsburgh, PA15213
| | - Emily E. Harford
- Department of Neurological Surgery, University of Pittsburgh, PA15213
| | - Lori L. Holt
- Department of Psychology, The University of Texas at Austin, TX78712
| | | | - Taylor J. Abel
- Department of Neurological Surgery, University of Pittsburgh, PA15213
- Department of Bioengineering, University of Pittsburgh, PA15261
| |
Collapse
|
2
|
Harford EE, Holt LL, Abel TJ. Unveiling the development of human voice perception: Neurobiological mechanisms and pathophysiology. CURRENT RESEARCH IN NEUROBIOLOGY 2024; 6:100127. [PMID: 38511174 PMCID: PMC10950757 DOI: 10.1016/j.crneur.2024.100127] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2023] [Revised: 02/22/2024] [Accepted: 02/26/2024] [Indexed: 03/22/2024] Open
Abstract
The human voice is a critical stimulus for the auditory system that promotes social connection, informs the listener about identity and emotion, and acts as the carrier for spoken language. Research on voice processing in adults has informed our understanding of the unique status of the human voice in the mature auditory cortex and provided potential explanations for mechanisms that underly voice selectivity and identity processing. There is evidence that voice perception undergoes developmental change starting in infancy and extending through early adolescence. While even young infants recognize the voice of their mother, there is an apparent protracted course of development to reach adult-like selectivity for human voice over other sound categories and recognition of other talkers by voice. Gaps in the literature do not allow for an exact mapping of this trajectory or an adequate description of how voice processing and its neural underpinnings abilities evolve. This review provides a comprehensive account of developmental voice processing research published to date and discusses how this evidence fits with and contributes to current theoretical models proposed in the adult literature. We discuss how factors such as cognitive development, neural plasticity, perceptual narrowing, and language acquisition may contribute to the development of voice processing and its investigation in children. We also review evidence of voice processing abilities in premature birth, autism spectrum disorder, and phonagnosia to examine where and how deviations from the typical trajectory of development may manifest.
Collapse
Affiliation(s)
- Emily E. Harford
- Department of Neurological Surgery, University of Pittsburgh, USA
| | - Lori L. Holt
- Department of Psychology, The University of Texas at Austin, USA
| | - Taylor J. Abel
- Department of Neurological Surgery, University of Pittsburgh, USA
- Department of Bioengineering, University of Pittsburgh, USA
| |
Collapse
|
3
|
Enge A, Friederici AD, Skeide MA. A meta-analysis of fMRI studies of language comprehension in children. Neuroimage 2020; 215:116858. [PMID: 32304886 DOI: 10.1016/j.neuroimage.2020.116858] [Citation(s) in RCA: 42] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2020] [Revised: 04/10/2020] [Accepted: 04/11/2020] [Indexed: 12/14/2022] Open
Abstract
The neural representation of language comprehension has been examined in several meta-analyses of fMRI studies with human adults. To complement this work from a developmental perspective, we conducted a meta-analysis of fMRI studies of auditory language comprehension in human children. Our analysis included 27 independent experiments involving n = 625 children (49% girls) with a mean age of 8.9 years. Activation likelihood estimation and seed-based effect size mapping revealed activation peaks in the pars triangularis of the left inferior frontal gyrus and bilateral superior and middle temporal gyri. In contrast to this distribution of activation in children, previous work in adults found activation peaks in the pars opercularis of the left inferior frontal gyrus and more left-lateralized temporal activation peaks. Accordingly, brain responses during language comprehension may shift from bilateral temporal and left pars triangularis peaks in childhood to left temporal and pars opercularis peaks in adulthood. This shift could be related to the gradually increasing sensitivity of the developing brain to syntactic information.
Collapse
Affiliation(s)
- Alexander Enge
- Department of Neuropsychology, Max Planck Institute for Human Cognitive and Brain Sciences, 04103, Leipzig, Germany; Department of Psychology, Humboldt-Universität zu Berlin, 12489, Berlin, Germany
| | - Angela D Friederici
- Department of Neuropsychology, Max Planck Institute for Human Cognitive and Brain Sciences, 04103, Leipzig, Germany
| | - Michael A Skeide
- Department of Neuropsychology, Max Planck Institute for Human Cognitive and Brain Sciences, 04103, Leipzig, Germany.
| |
Collapse
|
4
|
Liu P, Cole PM, Gilmore RO, Pérez-Edgar KE, Vigeant MC, Moriarty P, Scherf KS. Young children's neural processing of their mother's voice: An fMRI study. Neuropsychologia 2019; 122:11-19. [PMID: 30528586 PMCID: PMC6334756 DOI: 10.1016/j.neuropsychologia.2018.12.003] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2018] [Revised: 11/13/2018] [Accepted: 12/03/2018] [Indexed: 12/20/2022]
Abstract
In addition to semantic content, human speech carries paralinguistic information that conveys important social cues such as a speaker's identity. For young children, their own mothers' voice is one of the most salient vocal inputs in their daily environment. Indeed, qualities of mothers' voices are shown to contribute to children's social development. Our knowledge of how the mother's voice is processed at the neural level, however, is limited. This study investigated whether the voice of a mother modulates activation in the network of regions activated by the human voice in young children differently than the voice of an unfamiliar mother. We collected fMRI data from 32 typically developing 7- and 8-year-olds as they listened to natural speech produced by their mother and another child's mother. We used emotionally-varied natural speech stimuli to approximate the range of children's day-to-day experience. We individually-defined functional ROIs in children's voice-sensitive neural network and then independently investigated the extent to which activation in these regions is modulated by speaker identity. The bilateral posterior auditory cortex, superior temporal gyrus (STG), and inferior frontal gyrus (IFG) exhibit enhanced activation in response to the voice of one's own mother versus that of an unfamiliar mother. The findings indicate that children process the voice of their own mother uniquely, and pave the way for future studies of how social information processing contributes to the trajectory of child social development.
Collapse
Affiliation(s)
- Pan Liu
- Department of Psychology, Child Study Center, The Pennsylvania State University, University Park, PA, USA
| | - Pamela M Cole
- Department of Psychology, Child Study Center, The Pennsylvania State University, University Park, PA, USA.
| | - Rick O Gilmore
- Department of Psychology, Child Study Center, The Pennsylvania State University, University Park, PA, USA
| | - Koraly E Pérez-Edgar
- Department of Psychology, Child Study Center, The Pennsylvania State University, University Park, PA, USA
| | - Michelle C Vigeant
- Graduate Program in Acoustics, The Pennsylvania State University, University Park, PA, USA
| | - Peter Moriarty
- Graduate Program in Acoustics, The Pennsylvania State University, University Park, PA, USA
| | - K Suzanne Scherf
- Department of Psychology, Child Study Center, The Pennsylvania State University, University Park, PA, USA
| |
Collapse
|
5
|
Zuk J, Perdue MV, Becker B, Yu X, Chang M, Raschle NM, Gaab N. Neural correlates of phonological processing: Disrupted in children with dyslexia and enhanced in musically trained children. Dev Cogn Neurosci 2018; 34:82-91. [PMID: 30103188 PMCID: PMC6481189 DOI: 10.1016/j.dcn.2018.07.001] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2017] [Revised: 06/27/2018] [Accepted: 07/13/2018] [Indexed: 11/24/2022] Open
Abstract
First fMRI investigation of phonological processing in musically trained children. Greater bilateral activation with music training in regions disrupted in dyslexia. Implications for music training to support compensatory neural network in dyslexia.
Phonological processing has been postulated as a core area of deficit among children with dyslexia. Reduced brain activation during phonological processing in children with dyslexia has been observed in left-hemispheric temporoparietal regions. Musical training has shown positive associations with phonological processing abilities, but the neural mechanisms underlying this relationship remain unspecified. The present research aims to distinguish neural correlates of phonological processing in school-age typically developing musically trained children, musically untrained children, and musically untrained children with dyslexia utilizing fMRI. A whole-brain ANCOVA, accounting for gender and nonverbal cognitive abilities, identified a main effect of group in bilateral temporoparietal regions. Subsequent region-of-interest analyses replicated temporoparietal hypoactivation in children with dyslexia relative to typically developing children. By contrast, musically trained children showed greater bilateral activation in temporoparietal regions when compared to each musically untrained group. Therefore, musical training shows associations with enhanced bilateral activation of left-hemispheric regions known to be important for reading. Findings suggest that engagement of these regions through musical training may underlie the putative positive effects of music on reading development. This supports the hypothesis that musical training may facilitate the development of a bilateral compensatory neural network, which aids children with atypical function in left-hemispheric temporoparietal regions.
Collapse
Affiliation(s)
- Jennifer Zuk
- Laboratories of Cognitive Neuroscience, Division of Developmental Medicine, Department of Medicine, Boston Children's Hospital, Boston, MA 02115, USA; Harvard Medical School, Boston, MA 02115, USA
| | - Meaghan V Perdue
- Laboratories of Cognitive Neuroscience, Division of Developmental Medicine, Department of Medicine, Boston Children's Hospital, Boston, MA 02115, USA; Department of Psychological Sciences, University of Connecticut, Storrs, CT 06268, USA
| | - Bryce Becker
- Laboratories of Cognitive Neuroscience, Division of Developmental Medicine, Department of Medicine, Boston Children's Hospital, Boston, MA 02115, USA
| | - Xi Yu
- Laboratories of Cognitive Neuroscience, Division of Developmental Medicine, Department of Medicine, Boston Children's Hospital, Boston, MA 02115, USA; Harvard Medical School, Boston, MA 02115, USA
| | - Michelle Chang
- Laboratories of Cognitive Neuroscience, Division of Developmental Medicine, Department of Medicine, Boston Children's Hospital, Boston, MA 02115, USA
| | - Nora Maria Raschle
- Laboratories of Cognitive Neuroscience, Division of Developmental Medicine, Department of Medicine, Boston Children's Hospital, Boston, MA 02115, USA; Department of Child and Adolescent Psychiatry, University of Basel, Psychiatric University Hospital, Basel, Switzerland
| | - Nadine Gaab
- Laboratories of Cognitive Neuroscience, Division of Developmental Medicine, Department of Medicine, Boston Children's Hospital, Boston, MA 02115, USA; Harvard Medical School, Boston, MA 02115, USA; Harvard Graduate School of Education, Cambridge, MA 02138, USA.
| |
Collapse
|
6
|
Remijn GB, Kikuchi M, Yoshimura Y, Shitamichi K, Ueno S, Tsubokawa T, Kojima H, Higashida H, Minabe Y. A Near-Infrared Spectroscopy Study on Cortical Hemodynamic Responses to Normal and Whispered Speech in 3- to 7-Year-Old Children. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2017; 60:465-470. [PMID: 28114676 DOI: 10.1044/2016_jslhr-h-15-0435] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/18/2015] [Accepted: 07/24/2016] [Indexed: 06/06/2023]
Abstract
PURPOSE The purpose of this study was to assess cortical hemodynamic response patterns in 3- to 7-year-old children listening to two speech modes: normally vocalized and whispered speech. Understanding whispered speech requires processing of the relatively weak, noisy signal, as well as the cognitive ability to understand the speaker's reason for whispering. METHOD Near-infrared spectroscopy (NIRS) was used to assess changes in cortical oxygenated hemoglobin from 16 typically developing children. RESULTS A profound difference in oxygenated hemoglobin levels between the speech modes was found over left ventral sensorimotor cortex. In particular, over areas that represent speech articulatory body parts and motion, such as the larynx, lips, and jaw, oxygenated hemoglobin was higher for whisper than for normal speech. The weaker stimulus, in terms of sound energy, thus induced the more profound hemodynamic response. This, moreover, occurred over areas involved in speech articulation, even though the children did not overtly articulate speech during measurements. CONCLUSION Because whisper is a special form of communication not often used in daily life, we suggest that the hemodynamic response difference over left ventral sensorimotor cortex resulted from inner (covert) practice or imagination of the different articulatory actions necessary to produce whisper as opposed to normal speech.
Collapse
Affiliation(s)
| | - Mitsuru Kikuchi
- Research Center for Child Mental Development, Kanazawa University, Kanazawa, Japan
| | - Yuko Yoshimura
- Research Center for Child Mental Development, Kanazawa University, Kanazawa, Japan
| | - Kiyomi Shitamichi
- Department of Psychiatry and Neurobiology, Graduate School of Medical Science, Kanazawa University, Kanazawa, Japan
| | - Sanae Ueno
- Department of Psychiatry and Neurobiology, Graduate School of Medical Science, Kanazawa University, Kanazawa, Japan
| | - Tsunehisa Tsubokawa
- Department of Anesthesiology, Graduate School of Medical Science, Kanazawa University, Kanazawa, Japan
| | - Haruyuki Kojima
- Department of Psychology, Kanazawa University, Kanazawa, Japan
| | - Haruhiro Higashida
- Research Center for Child Mental Development, Kanazawa University, Kanazawa, Japan
| | - Yoshio Minabe
- Department of Psychiatry and Neurobiology, Graduate School of Medical Science, Kanazawa University, Kanazawa, Japan
| |
Collapse
|
7
|
Sperdin HF, Schaer M. Aberrant Development of Speech Processing in Young Children with Autism: New Insights from Neuroimaging Biomarkers. Front Neurosci 2016; 10:393. [PMID: 27610073 PMCID: PMC4997090 DOI: 10.3389/fnins.2016.00393] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2016] [Accepted: 08/10/2016] [Indexed: 12/13/2022] Open
Abstract
From the time of birth, a newborn is continuously exposed and naturally attracted to human voices, and as he grows, he becomes increasingly responsive to these speech stimuli, which are strong drivers for his language development and knowledge acquisition about the world. In contrast, young children with autism spectrum disorder (ASD) are often insensitive to human voices, failing to orient and respond to them. Failure to attend to speech in turn results in altered development of language and social-communication skills. Here, we review the critical role of orienting to speech in ASD, as well as the neural substrates of human voice processing. Recent functional neuroimaging and electroencephalography studies demonstrate that aberrant voice processing could be a promising marker to identify ASD very early on. With the advent of refined brain imaging methods, coupled with the possibility of screening infants and toddlers, predictive brain function biomarkers are actively being examined and are starting to emerge. Their timely identification might not only help to differentiate between phenotypes, but also guide the clinicians in setting up appropriate therapies, and better predicting or quantifying long-term outcome.
Collapse
Affiliation(s)
- Holger F. Sperdin
- Office Médico-Pédagogique, Department of Psychiatry, University of Geneva School of MedicineGeneva, Switzerland
| | - Marie Schaer
- Office Médico-Pédagogique, Department of Psychiatry, University of Geneva School of MedicineGeneva, Switzerland
- Stanford Cognitive & Systems Neuroscience Laboratory, Stanford University School of MedicinePalo Alto, CA, USA
| |
Collapse
|
8
|
Jäncke L, Alahmadi N. Resting State EEG in Children With Learning Disabilities: An Independent Component Analysis Approach. Clin EEG Neurosci 2016; 47:24-36. [PMID: 26545819 DOI: 10.1177/1550059415612622] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/30/2015] [Accepted: 09/24/2015] [Indexed: 12/16/2022]
Abstract
In this study, the neurophysiological underpinnings of learning disabilities (LD) in children are examined using resting state EEG. We were particularly interested in the neurophysiological differences between children with learning disabilities not otherwise specified (LD-NOS), learning disabilities with verbal disabilities (LD-Verbal), and healthy control (HC) children. We applied 2 different approaches to examine the differences between the different groups. First, we calculated theta/beta and theta/alpha ratios in order to quantify the relationship between slow and fast EEG oscillations. Second, we used a recently developed method for analyzing spectral EEG, namely the group independent component analysis (gICA) model. Using these measures, we identified substantial differences between LD and HC children and between LD-NOS and LD-Verbal children in terms of their spectral EEG profiles. We obtained the following findings: (a) theta/beta and theta/alpha ratios were substantially larger in LD than in HC children, with no difference between LD-NOS and LD-Verbal children; (b) there was substantial slowing of EEG oscillations, especially for gICs located in frontal scalp positions, with LD-NOS children demonstrating the strongest slowing; (c) the estimated intracortical sources of these gICs were mostly located in brain areas involved in the control of executive functions, attention, planning, and language; and (d) the LD-Verbal children demonstrated substantial differences in EEG oscillations compared with LD-NOS children, and these differences were localized in language-related brain areas. The general pattern of atypical neurophysiological activation found in LD children suggests that they suffer from neurophysiological dysfunction in brain areas involved with the control of attention, executive functions, planning, and language functions. LD-Verbal children also demonstrate atypical activation, especially in language-related brain areas. These atypical neurophysiological activation patterns might provide a helpful guide for rehabilitation strategies to treat the deficiencies in these children with LD.
Collapse
Affiliation(s)
- Lutz Jäncke
- Department of Neuropsychology, University Zurich, Zurich, Switzerland
| | - Nsreen Alahmadi
- Department of Special Education, King Abdulaziz University, Jeddah, Saudi Arabia
| |
Collapse
|