1
|
Momtaz S, Bidelman GM. Effects of Stimulus Rate and Periodicity on Auditory Cortical Entrainment to Continuous Sounds. eNeuro 2024; 11:ENEURO.0027-23.2024. [PMID: 38253583 PMCID: PMC10913036 DOI: 10.1523/eneuro.0027-23.2024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2023] [Revised: 01/14/2024] [Accepted: 01/16/2024] [Indexed: 01/24/2024] Open
Abstract
The neural mechanisms underlying the exogenous coding and neural entrainment to repetitive auditory stimuli have seen a recent surge of interest. However, few studies have characterized how parametric changes in stimulus presentation alter entrained responses. We examined the degree to which the brain entrains to repeated speech (i.e., /ba/) and nonspeech (i.e., click) sounds using phase-locking value (PLV) analysis applied to multichannel human electroencephalogram (EEG) data. Passive cortico-acoustic tracking was investigated in N = 24 normal young adults utilizing EEG source analyses that isolated neural activity stemming from both auditory temporal cortices. We parametrically manipulated the rate and periodicity of repetitive, continuous speech and click stimuli to investigate how speed and jitter in ongoing sound streams affect oscillatory entrainment. Neuronal synchronization to speech was enhanced at 4.5 Hz (the putative universal rate of speech) and showed a differential pattern to that of clicks, particularly at higher rates. PLV to speech decreased with increasing jitter but remained superior to clicks. Surprisingly, PLV entrainment to clicks was invariant to periodicity manipulations. Our findings provide evidence that the brain's neural entrainment to complex sounds is enhanced and more sensitized when processing speech-like stimuli, even at the syllable level, relative to nonspeech sounds. The fact that this specialization is apparent even under passive listening suggests a priority of the auditory system for synchronizing to behaviorally relevant signals.
Collapse
Affiliation(s)
- Sara Momtaz
- School of Communication Sciences & Disorders, University of Memphis, Memphis, Tennessee 38152
- Boys Town National Research Hospital, Boys Town, Nebraska 68131
| | - Gavin M Bidelman
- Department of Speech, Language and Hearing Sciences, Indiana University, Bloomington, Indiana 47408
- Program in Neuroscience, Indiana University, Bloomington, Indiana 47405
| |
Collapse
|
2
|
Dura-Bernal S, Griffith EY, Barczak A, O'Connell MN, McGinnis T, Moreira JVS, Schroeder CE, Lytton WW, Lakatos P, Neymotin SA. Data-driven multiscale model of macaque auditory thalamocortical circuits reproduces in vivo dynamics. Cell Rep 2023; 42:113378. [PMID: 37925640 PMCID: PMC10727489 DOI: 10.1016/j.celrep.2023.113378] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2022] [Revised: 09/05/2023] [Accepted: 10/19/2023] [Indexed: 11/07/2023] Open
Abstract
We developed a detailed model of macaque auditory thalamocortical circuits, including primary auditory cortex (A1), medial geniculate body (MGB), and thalamic reticular nucleus, utilizing the NEURON simulator and NetPyNE tool. The A1 model simulates a cortical column with over 12,000 neurons and 25 million synapses, incorporating data on cell-type-specific neuron densities, morphology, and connectivity across six cortical layers. It is reciprocally connected to the MGB thalamus, which includes interneurons and core and matrix-layer-specific projections to A1. The model simulates multiscale measures, including physiological firing rates, local field potentials (LFPs), current source densities (CSDs), and electroencephalography (EEG) signals. Laminar CSD patterns, during spontaneous activity and in response to broadband noise stimulus trains, mirror experimental findings. Physiological oscillations emerge spontaneously across frequency bands comparable to those recorded in vivo. We elucidate population-specific contributions to observed oscillation events and relate them to firing and presynaptic input patterns. The model offers a quantitative theoretical framework to integrate and interpret experimental data and predict its underlying cellular and circuit mechanisms.
Collapse
Affiliation(s)
- Salvador Dura-Bernal
- Department of Physiology and Pharmacology, State University of New York (SUNY) Downstate Health Sciences University, Brooklyn, NY, USA; Center for Biomedical Imaging and Neuromodulation, Nathan S. Kline Institute for Psychiatric Research, Orangeburg, NY, USA.
| | - Erica Y Griffith
- Department of Physiology and Pharmacology, State University of New York (SUNY) Downstate Health Sciences University, Brooklyn, NY, USA; Center for Biomedical Imaging and Neuromodulation, Nathan S. Kline Institute for Psychiatric Research, Orangeburg, NY, USA.
| | - Annamaria Barczak
- Center for Biomedical Imaging and Neuromodulation, Nathan S. Kline Institute for Psychiatric Research, Orangeburg, NY, USA
| | - Monica N O'Connell
- Center for Biomedical Imaging and Neuromodulation, Nathan S. Kline Institute for Psychiatric Research, Orangeburg, NY, USA
| | - Tammy McGinnis
- Center for Biomedical Imaging and Neuromodulation, Nathan S. Kline Institute for Psychiatric Research, Orangeburg, NY, USA
| | - Joao V S Moreira
- Department of Physiology and Pharmacology, State University of New York (SUNY) Downstate Health Sciences University, Brooklyn, NY, USA
| | - Charles E Schroeder
- Center for Biomedical Imaging and Neuromodulation, Nathan S. Kline Institute for Psychiatric Research, Orangeburg, NY, USA; Departments of Psychiatry and Neurology, Columbia University Medical Center, New York, NY, USA
| | - William W Lytton
- Department of Physiology and Pharmacology, State University of New York (SUNY) Downstate Health Sciences University, Brooklyn, NY, USA; Kings County Hospital Center, Brooklyn, NY, USA
| | - Peter Lakatos
- Center for Biomedical Imaging and Neuromodulation, Nathan S. Kline Institute for Psychiatric Research, Orangeburg, NY, USA; Department Psychiatry, NYU Grossman School of Medicine, New York, NY, USA
| | - Samuel A Neymotin
- Center for Biomedical Imaging and Neuromodulation, Nathan S. Kline Institute for Psychiatric Research, Orangeburg, NY, USA; Department Psychiatry, NYU Grossman School of Medicine, New York, NY, USA.
| |
Collapse
|
3
|
Chien VSC, Wang P, Maess B, Fishman Y, Knösche TR. Laminar neural dynamics of auditory evoked responses: Computational modeling of local field potentials in auditory cortex of non-human primates. Neuroimage 2023; 281:120364. [PMID: 37683810 DOI: 10.1016/j.neuroimage.2023.120364] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2023] [Revised: 08/15/2023] [Accepted: 09/04/2023] [Indexed: 09/10/2023] Open
Abstract
Evoked neural responses to sensory stimuli have been extensively investigated in humans and animal models both to enhance our understanding of brain function and to aid in clinical diagnosis of neurological and neuropsychiatric conditions. Recording and imaging techniques such as electroencephalography (EEG), magnetoencephalography (MEG), local field potentials (LFPs), and calcium imaging provide complementary information about different aspects of brain activity at different spatial and temporal scales. Modeling and simulations provide a way to integrate these different types of information to clarify underlying neural mechanisms. In this study, we aimed to shed light on the neural dynamics underlying auditory evoked responses by fitting a rate-based model to LFPs recorded via multi-contact electrodes which simultaneously sampled neural activity across cortical laminae. Recordings included neural population responses to best-frequency (BF) and non-BF tones at four representative sites in primary auditory cortex (A1) of awake monkeys. The model considered major neural populations of excitatory, parvalbumin-expressing (PV), and somatostatin-expressing (SOM) neurons across layers 2/3, 4, and 5/6. Unknown parameters, including the connection strength between the populations, were fitted to the data. Our results revealed similar population dynamics, fitted model parameters, predicted equivalent current dipoles (ECD), tuning curves, and lateral inhibition profiles across recording sites and animals, in spite of quite different extracellular current distributions. We found that PV firing rates were higher in BF than in non-BF responses, mainly due to different strengths of tonotopic thalamic input, whereas SOM firing rates were higher in non-BF than in BF responses due to lateral inhibition. In conclusion, we demonstrate the feasibility of the model-fitting approach in identifying the contributions of cell-type specific population activity to stimulus-evoked LFPs across cortical laminae, providing a foundation for further investigations into the dynamics of neural circuits underlying cortical sensory processing.
Collapse
Affiliation(s)
- Vincent S C Chien
- Max Planck Institute for Human Cognitive and Brain Sciences, Germany; Institute of Computer Science of the Czech Academy of Sciences, Czech Republic
| | - Peng Wang
- Max Planck Institute for Human Cognitive and Brain Sciences, Germany; Institute of Psychology, University of Greifswald, Germany; Institute of Psychology, University of Regensburg, Germany
| | - Burkhard Maess
- Max Planck Institute for Human Cognitive and Brain Sciences, Germany
| | - Yonatan Fishman
- Departments of Neurology and Neuroscience, Albert Einstein College of Medicine, USA
| | - Thomas R Knösche
- Max Planck Institute for Human Cognitive and Brain Sciences, Germany.
| |
Collapse
|
4
|
Nakamura T, Dinh TH, Asai M, Matsumoto J, Nishimaru H, Setogawa T, Honda S, Yamada H, Mihara T, Nishijo H. Suppressive effects of ketamine on auditory steady-state responses in intact, awake macaques: A non-human primate model of schizophrenia. Brain Res Bull 2023; 193:84-94. [PMID: 36539101 DOI: 10.1016/j.brainresbull.2022.12.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2022] [Revised: 12/12/2022] [Accepted: 12/14/2022] [Indexed: 12/23/2022]
Abstract
Auditory steady-state responses (ASSRs) are recurrent neural activities entrained to regular cyclic auditory stimulation. ASSRs are altered in individuals with schizophrenia, and may be related to hypofunction of the N-methyl-D-aspartate (NMDA) glutamate receptor. Noncompetitive NMDA receptor antagonists, including ketamine, have been used in ASSR studies of rodent models of schizophrenia. Although animal studies using non-human primates are required to complement rodent studies, the effects of ketamine on ASSRs are unknown in intact awake non-human primates. In this study, after administration of vehicle or ketamine, click trains at 20-83.3 Hz were presented to elicit ASSRs during recording of electroencephalograms in intact, awake macaque monkeys. The results indicated that ASSRs quantified by event-related spectral perturbation and inter-trial coherence were maximal at 83.3 Hz after vehicle administration, and that ketamine reduced ASSRs at 58.8 and 83.3 Hz, but not at 20 and 40 Hz. The present results demonstrated a reduction of ASSRs by the NMDA receptor antagonist at optimal frequencies with maximal responses in intact, awake macaques, comparable to ASSR reduction in patients with schizophrenia. These findings suggest that ASSR can be used as a neurophysiological biomarker of the disturbance of gamma-oscillatory neural circuits in this ketamine model of schizophrenia using intact, awake macaques. Thus, this model with ASSRs would be useful in the investigation of human brain pathophysiology as well as in preclinical translational research.
Collapse
Affiliation(s)
- Tomoya Nakamura
- System Emotional Science, Faculty of Medicine, University of Toyama, Toyama 930-0194, Japan; Department of Anatomy, Faculty of Medicine, University of Toyama, Toyama 930-0194, Japan
| | - Trong Ha Dinh
- System Emotional Science, Faculty of Medicine, University of Toyama, Toyama 930-0194, Japan; Department of Physiology, Vietnam Military Medical University, Hanoi 100000, Viet Nam
| | - Makoto Asai
- Candidate Discovery Science Labs, Drug Discovery Research, Astellas Pharma Inc., Tsukuba, Ibaraki 305-8585, Japan
| | - Jumpei Matsumoto
- System Emotional Science, Faculty of Medicine, University of Toyama, Toyama 930-0194, Japan; Research Center for Idling Brain Science (RCIBS), University of Toyama, Toyama 930-0194, Japan
| | - Hiroshi Nishimaru
- System Emotional Science, Faculty of Medicine, University of Toyama, Toyama 930-0194, Japan; Research Center for Idling Brain Science (RCIBS), University of Toyama, Toyama 930-0194, Japan
| | - Tsuyoshi Setogawa
- System Emotional Science, Faculty of Medicine, University of Toyama, Toyama 930-0194, Japan; Research Center for Idling Brain Science (RCIBS), University of Toyama, Toyama 930-0194, Japan
| | - Sokichi Honda
- Candidate Discovery Science Labs, Drug Discovery Research, Astellas Pharma Inc., Tsukuba, Ibaraki 305-8585, Japan
| | - Hiroshi Yamada
- Candidate Discovery Science Labs, Drug Discovery Research, Astellas Pharma Inc., Tsukuba, Ibaraki 305-8585, Japan
| | - Takuma Mihara
- Candidate Discovery Science Labs, Drug Discovery Research, Astellas Pharma Inc., Tsukuba, Ibaraki 305-8585, Japan
| | - Hisao Nishijo
- System Emotional Science, Faculty of Medicine, University of Toyama, Toyama 930-0194, Japan; Research Center for Idling Brain Science (RCIBS), University of Toyama, Toyama 930-0194, Japan.
| |
Collapse
|
5
|
Simon JZ, Commuri V, Kulasingham JP. Time-locked auditory cortical responses in the high-gamma band: A window into primary auditory cortex. Front Neurosci 2022; 16:1075369. [PMID: 36570848 PMCID: PMC9773383 DOI: 10.3389/fnins.2022.1075369] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2022] [Accepted: 11/24/2022] [Indexed: 12/13/2022] Open
Abstract
Primary auditory cortex is a critical stage in the human auditory pathway, a gateway between subcortical and higher-level cortical areas. Receiving the output of all subcortical processing, it sends its output on to higher-level cortex. Non-invasive physiological recordings of primary auditory cortex using electroencephalography (EEG) and magnetoencephalography (MEG), however, may not have sufficient specificity to separate responses generated in primary auditory cortex from those generated in underlying subcortical areas or neighboring cortical areas. This limitation is important for investigations of effects of top-down processing (e.g., selective-attention-based) on primary auditory cortex: higher-level areas are known to be strongly influenced by top-down processes, but subcortical areas are often assumed to perform strictly bottom-up processing. Fortunately, recent advances have made it easier to isolate the neural activity of primary auditory cortex from other areas. In this perspective, we focus on time-locked responses to stimulus features in the high gamma band (70-150 Hz) and with early cortical latency (∼40 ms), intermediate between subcortical and higher-level areas. We review recent findings from physiological studies employing either repeated simple sounds or continuous speech, obtaining either a frequency following response (FFR) or temporal response function (TRF). The potential roles of top-down processing are underscored, and comparisons with invasive intracranial EEG (iEEG) and animal model recordings are made. We argue that MEG studies employing continuous speech stimuli may offer particular benefits, in that only a few minutes of speech generates robust high gamma responses from bilateral primary auditory cortex, and without measurable interference from subcortical or higher-level areas.
Collapse
Affiliation(s)
- Jonathan Z. Simon
- Department of Electrical and Computer Engineering, University of Maryland, College Park, College Park, MD, United States,Department of Biology, University of Maryland, College Park, College Park, MD, United States,Institute for Systems Research, University of Maryland, College Park, College Park, MD, United States,*Correspondence: Jonathan Z. Simon,
| | - Vrishab Commuri
- Department of Electrical and Computer Engineering, University of Maryland, College Park, College Park, MD, United States
| | | |
Collapse
|
6
|
Nakamura T, Dinh TH, Asai M, Nishimaru H, Matsumoto J, Setogawa T, Ichijo H, Honda S, Yamada H, Mihara T, Nishijo H. Characteristics of auditory steady-state responses to different click frequencies in awake intact macaques. BMC Neurosci 2022; 23:57. [PMID: 36180823 PMCID: PMC9524006 DOI: 10.1186/s12868-022-00741-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2022] [Accepted: 09/13/2022] [Indexed: 11/28/2022] Open
Abstract
Background Auditory steady-state responses (ASSRs) are periodic evoked responses to constant periodic auditory stimuli, such as click trains, and are suggested to be associated with higher cognitive functions in humans. Since ASSRs are disturbed in human psychiatric disorders, recording ASSRs from awake intact macaques would be beneficial to translational research as well as an understanding of human brain function and its pathology. However, ASSR has not been reported in awake macaques. Results Electroencephalograms (EEGs) were recorded from awake intact macaques, while click trains at 20–83.3 Hz were binaurally presented. EEGs were quantified based on event-related spectral perturbation (ERSP) and inter-trial coherence (ITC), and ASSRs were significantly demonstrated in terms of ERSP and ITC in awake intact macaques. A comparison of ASSRs among different click train frequencies indicated that ASSRs were maximal at 83.3 Hz. Furthermore, analyses of laterality indices of ASSRs showed that no laterality dominance of ASSRs was observed. Conclusions The present results demonstrated ASSRs, comparable to those in humans, in awake intact macaques. However, there were some differences in ASSRs between macaques and humans: macaques showed maximal ASSR responses to click frequencies higher than 40 Hz that has been reported to elicit maximal responses in humans, and showed no dominant laterality of ASSRs under the electrode montage in this study compared with humans with right hemisphere dominance. The future ASSR studies using awake intact macaques should be aware of these differences, and possible factors, to which these differences were ascribed, are discussed. Supplementary Information The online version contains supplementary material available at 10.1186/s12868-022-00741-9.
Collapse
Affiliation(s)
- Tomoya Nakamura
- System Emotional Science, Faculty of Medicine, University of Toyama, Sugitani2630, Toyama, 930-0194, Japan.,Department of Anatomy, Faculty of Medicine, University of Toyama, Toyama, 930-0194, Japan
| | - Trong Ha Dinh
- System Emotional Science, Faculty of Medicine, University of Toyama, Sugitani2630, Toyama, 930-0194, Japan.,Department of Physiology, Vietnam Military Medical University, Hanoi, 100000, Vietnam
| | - Makoto Asai
- Candidate Discovery Science Labs, Drug Discovery Research, Astellas Pharma Inc., Tsukuba, Ibaraki, 305-8585, Japan
| | - Hiroshi Nishimaru
- System Emotional Science, Faculty of Medicine, University of Toyama, Sugitani2630, Toyama, 930-0194, Japan.,Research Center for Idling Brain Science (RCIBS), University of Toyama, Toyama, 930-0194, Japan
| | - Jumpei Matsumoto
- System Emotional Science, Faculty of Medicine, University of Toyama, Sugitani2630, Toyama, 930-0194, Japan.,Research Center for Idling Brain Science (RCIBS), University of Toyama, Toyama, 930-0194, Japan
| | - Tsuyoshi Setogawa
- System Emotional Science, Faculty of Medicine, University of Toyama, Sugitani2630, Toyama, 930-0194, Japan.,Research Center for Idling Brain Science (RCIBS), University of Toyama, Toyama, 930-0194, Japan
| | - Hiroyuki Ichijo
- Department of Anatomy, Faculty of Medicine, University of Toyama, Toyama, 930-0194, Japan
| | - Sokichi Honda
- Candidate Discovery Science Labs, Drug Discovery Research, Astellas Pharma Inc., Tsukuba, Ibaraki, 305-8585, Japan
| | - Hiroshi Yamada
- Candidate Discovery Science Labs, Drug Discovery Research, Astellas Pharma Inc., Tsukuba, Ibaraki, 305-8585, Japan
| | - Takuma Mihara
- Candidate Discovery Science Labs, Drug Discovery Research, Astellas Pharma Inc., Tsukuba, Ibaraki, 305-8585, Japan
| | - Hisao Nishijo
- System Emotional Science, Faculty of Medicine, University of Toyama, Sugitani2630, Toyama, 930-0194, Japan. .,Research Center for Idling Brain Science (RCIBS), University of Toyama, Toyama, 930-0194, Japan.
| |
Collapse
|
7
|
Teichert T, Gnanateja GN, Sadagopan S, Chandrasekaran B. A Linear Superposition Model of Envelope and Frequency Following Responses May Help Identify Generators Based on Latency. NEUROBIOLOGY OF LANGUAGE (CAMBRIDGE, MASS.) 2022; 3:441-468. [PMID: 36909931 PMCID: PMC10003646 DOI: 10.1162/nol_a_00072] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/18/2023]
Abstract
Envelope and frequency-following responses (FFRENV and FFRTFS) are scalp-recorded electrophysiological potentials that closely follow the periodicity of complex sounds such as speech. These signals have been established as important biomarkers in speech and learning disorders. However, despite important advances, it has remained challenging to map altered FFRENV and FFRTFS to altered processing in specific brain regions. Here we explore the utility of a deconvolution approach based on the assumption that FFRENV and FFRTFS reflect the linear superposition of responses that are triggered by the glottal pulse in each cycle of the fundamental frequency (F0 responses). We tested the deconvolution method by applying it to FFRENV and FFRTFS of rhesus monkeys to human speech and click trains with time-varying pitch patterns. Our analyses show that F0ENV responses could be measured with high signal-to-noise ratio and featured several spectro-temporally and topographically distinct components that likely reflect the activation of brainstem (<5 ms; 200-1000 Hz), midbrain (5-15 ms; 100-250 Hz), and cortex (15-35 ms; ~90 Hz). In contrast, F0TFS responses contained only one spectro-temporal component that likely reflected activity in the midbrain. In summary, our results support the notion that the latency of F0 components map meaningfully onto successive processing stages. This opens the possibility that pathologically altered FFRENV or FFRTFS may be linked to altered F0ENV or F0TFS and from there to specific processing stages and ultimately spatially targeted interventions.
Collapse
Affiliation(s)
- Tobias Teichert
- Department of Psychiatry, University of Pittsburgh, Pittsburgh, PA, USA
- Department of Bioengineering, University of Pittsburgh, Pittsburgh, PA, USA
- Center for Neuroscience, University of Pittsburgh, Pittsburgh, PA, USA
| | - G. Nike Gnanateja
- Department of Communication Sciences and Disorders, University of Pittsburgh, Pittsburgh, PA, USA
| | - Srivatsun Sadagopan
- Department of Bioengineering, University of Pittsburgh, Pittsburgh, PA, USA
- Center for Neuroscience, University of Pittsburgh, Pittsburgh, PA, USA
- Department of Neurobiology, University of Pittsburgh, Pittsburgh, PA, USA
| | - Bharath Chandrasekaran
- Department of Communication Sciences and Disorders, University of Pittsburgh, Pittsburgh, PA, USA
- Department of Neurobiology, University of Pittsburgh, Pittsburgh, PA, USA
| |
Collapse
|
8
|
Brang D, Plass J, Sherman A, Stacey WC, Wasade VS, Grabowecky M, Ahn E, Towle VL, Tao JX, Wu S, Issa NP, Suzuki S. Visual cortex responds to sound onset and offset during passive listening. J Neurophysiol 2022; 127:1547-1563. [PMID: 35507478 DOI: 10.1152/jn.00164.2021] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
Sounds enhance our ability to detect, localize, and respond to co-occurring visual targets. Research suggests that sounds improve visual processing by resetting the phase of ongoing oscillations in visual cortex. However, it remains unclear what information is relayed from the auditory system to visual areas and if sounds modulate visual activity even in the absence of visual stimuli (e.g., during passive listening). Using intracranial electroencephalography (iEEG) in humans, we examined the sensitivity of visual cortex to three forms of auditory information during a passive listening task: auditory onset responses, auditory offset responses, and rhythmic entrainment to sounds. Because some auditory neurons respond to both sound onsets and offsets, visual timing and duration processing may benefit from each. Additionally, if auditory entrainment information is relayed to visual cortex, it could support the processing of complex stimulus dynamics that are aligned between auditory and visual stimuli. Results demonstrate that in visual cortex, amplitude-modulated sounds elicited transient onset and offset responses in multiple areas, but no entrainment to sound modulation frequencies. These findings suggest that activity in visual cortex (as measured with iEEG in response to auditory stimuli) may not be affected by temporally fine-grained auditory stimulus dynamics during passive listening (though it remains possible that this signal may be observable with simultaneous auditory-visual stimuli). Moreover, auditory responses were maximal in low-level visual cortex, potentially implicating a direct pathway for rapid interactions between auditory and visual cortices. This mechanism may facilitate perception by time-locking visual computations to environmental events marked by auditory discontinuities.
Collapse
Affiliation(s)
- David Brang
- Department of Psychology, University of Michigan, Ann Arbor, MI, United States
| | - John Plass
- Department of Psychology, University of Michigan, Ann Arbor, MI, United States
| | - Aleksandra Sherman
- Department of Cognitive Science, Occidental College, Los Angeles, CA, United States
| | - William C Stacey
- Department of Neurology, University of Michigan, Ann Arbor, MI, United States
| | | | - Marcia Grabowecky
- Department of Psychology, Northwestern University, Evanston, IL, United States
| | - EunSeon Ahn
- Department of Psychology, University of Michigan, Ann Arbor, MI, United States
| | - Vernon L Towle
- Department of Neurology, The University of Chicago, Chicago, IL, United States
| | - James X Tao
- Department of Neurology, The University of Chicago, Chicago, IL, United States
| | - Shasha Wu
- Department of Neurology, The University of Chicago, Chicago, IL, United States
| | - Naoum P Issa
- Department of Neurology, The University of Chicago, Chicago, IL, United States
| | - Satoru Suzuki
- Department of Psychology, Northwestern University, Evanston, IL, United States
| |
Collapse
|
9
|
Dheerendra P, Baumann S, Joly O, Balezeau F, Petkov CI, Thiele A, Griffiths TD. The Representation of Time Windows in Primate Auditory Cortex. Cereb Cortex 2021; 32:3568-3580. [PMID: 34875029 PMCID: PMC9376871 DOI: 10.1093/cercor/bhab434] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2020] [Revised: 11/04/2021] [Accepted: 11/05/2021] [Indexed: 11/13/2022] Open
Abstract
Whether human and nonhuman primates process the temporal dimension of sound similarly remains an open question. We examined the brain basis for the processing of acoustic time windows in rhesus macaques using stimuli simulating the spectrotemporal complexity of vocalizations. We conducted functional magnetic resonance imaging in awake macaques to identify the functional anatomy of response patterns to different time windows. We then contrasted it against the responses to identical stimuli used previously in humans. Despite a similar overall pattern, ranging from the processing of shorter time windows in core areas to longer time windows in lateral belt and parabelt areas, monkeys exhibited lower sensitivity to longer time windows than humans. This difference in neuronal sensitivity might be explained by a specialization of the human brain for processing longer time windows in speech.
Collapse
Affiliation(s)
- Pradeep Dheerendra
- Biosciences Institute, Newcastle University, Newcastle upon Tyne, NE2 4HH, UK.,Institute of Neuroscience and Psychology, University of Glasgow, Glasgow G128QB, UK
| | - Simon Baumann
- National Institute of Mental Health, NIH, Bethesda, MD 20892-1148, USA.,Department of Psychology, University of Turin, Torino 10124, Italy
| | - Olivier Joly
- Biosciences Institute, Newcastle University, Newcastle upon Tyne, NE2 4HH, UK
| | - Fabien Balezeau
- Biosciences Institute, Newcastle University, Newcastle upon Tyne, NE2 4HH, UK
| | | | - Alexander Thiele
- Biosciences Institute, Newcastle University, Newcastle upon Tyne, NE2 4HH, UK
| | - Timothy D Griffiths
- Biosciences Institute, Newcastle University, Newcastle upon Tyne, NE2 4HH, UK
| |
Collapse
|
10
|
Homma NY, Bajo VM. Lemniscal Corticothalamic Feedback in Auditory Scene Analysis. Front Neurosci 2021; 15:723893. [PMID: 34489635 PMCID: PMC8417129 DOI: 10.3389/fnins.2021.723893] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2021] [Accepted: 07/30/2021] [Indexed: 12/15/2022] Open
Abstract
Sound information is transmitted from the ear to central auditory stations of the brain via several nuclei. In addition to these ascending pathways there exist descending projections that can influence the information processing at each of these nuclei. A major descending pathway in the auditory system is the feedback projection from layer VI of the primary auditory cortex (A1) to the ventral division of medial geniculate body (MGBv) in the thalamus. The corticothalamic axons have small glutamatergic terminals that can modulate thalamic processing and thalamocortical information transmission. Corticothalamic neurons also provide input to GABAergic neurons of the thalamic reticular nucleus (TRN) that receives collaterals from the ascending thalamic axons. The balance of corticothalamic and TRN inputs has been shown to refine frequency tuning, firing patterns, and gating of MGBv neurons. Therefore, the thalamus is not merely a relay stage in the chain of auditory nuclei but does participate in complex aspects of sound processing that include top-down modulations. In this review, we aim (i) to examine how lemniscal corticothalamic feedback modulates responses in MGBv neurons, and (ii) to explore how the feedback contributes to auditory scene analysis, particularly on frequency and harmonic perception. Finally, we will discuss potential implications of the role of corticothalamic feedback in music and speech perception, where precise spectral and temporal processing is essential.
Collapse
Affiliation(s)
- Natsumi Y. Homma
- Center for Integrative Neuroscience, University of California, San Francisco, San Francisco, CA, United States
- Coleman Memorial Laboratory, Department of Otolaryngology – Head and Neck Surgery, University of California, San Francisco, San Francisco, CA, United States
| | - Victoria M. Bajo
- Department of Physiology, Anatomy and Genetics, University of Oxford, Oxford, United Kingdom
| |
Collapse
|
11
|
Tada M, Kirihara K, Koshiyama D, Fujioka M, Usui K, Uka T, Komatsu M, Kunii N, Araki T, Kasai K. Gamma-Band Auditory Steady-State Response as a Neurophysiological Marker for Excitation and Inhibition Balance: A Review for Understanding Schizophrenia and Other Neuropsychiatric Disorders. Clin EEG Neurosci 2020; 51:234-243. [PMID: 31402699 DOI: 10.1177/1550059419868872] [Citation(s) in RCA: 47] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Altered gamma oscillations have attracted considerable attention as an index of the excitation/inhibition (E/I) imbalance in schizophrenia and other neuropsychiatric disorders. The auditory steady-state response (ASSR) has been the most robust probe of abnormal gamma oscillatory dynamics in schizophrenia. Here, we review recent ASSR studies in patients with schizophrenia and other neuropsychiatric disorders. Preclinical ASSR research, which has contributed to the elucidation of the underlying pathophysiology of these diseases, is also discussed. The developmental trajectory of the ASSR has been explored and may show signs of the maturation and disruption of E/I balance in adolescence. Animal model studies have shown that synaptic interactions between parvalbumin-positive GABAergic interneurons and pyramidal neurons contribute to the regulation of E/I balance, which is related to the generation of gamma oscillation. Therefore, ASSR alteration may be a significant electrophysiological finding related to the E/I imbalance in neuropsychiatric disorders, which is a cross-disease feature and may reflect clinical staging. Future studies regarding ASSR generation, especially in nonhuman primate models, will advance our understanding of the brain circuit and the molecular mechanisms underlying neuropsychiatric disorders.
Collapse
Affiliation(s)
- Mariko Tada
- Department of Neuropsychiatry, Graduate School of Medicine, The University of Tokyo, Bunkyo-ku, Tokyo, Japan.,International Research Center for Neurointelligence (WPI-IRCN), UTIAS, The University of Tokyo, Bunkyo-ku, Tokyo, Japan
| | - Kenji Kirihara
- Department of Neuropsychiatry, Graduate School of Medicine, The University of Tokyo, Bunkyo-ku, Tokyo, Japan
| | - Daisuke Koshiyama
- Department of Neuropsychiatry, Graduate School of Medicine, The University of Tokyo, Bunkyo-ku, Tokyo, Japan
| | - Mao Fujioka
- Department of Neuropsychiatry, Graduate School of Medicine, The University of Tokyo, Bunkyo-ku, Tokyo, Japan
| | - Kaori Usui
- Department of Neuropsychiatry, Graduate School of Medicine, The University of Tokyo, Bunkyo-ku, Tokyo, Japan
| | - Takanori Uka
- Department of Integrative Physiology, Graduate School of Medicine, University of Yamanashi, Chuo, Yamanashi, Japan
| | - Misako Komatsu
- Laboratory for Molecular Analysis of Higher Brain Function, RIKEN Center for Brain Science, Hirosawa, Wako, Saitama, Japan
| | - Naoto Kunii
- Department of Neuropsychiatry, Graduate School of Medicine, The University of Tokyo, Bunkyo-ku, Tokyo, Japan.,Department of Neurosurgery, Graduate School of Medicine, The University of Tokyo, Bunkyo-ku, Tokyo, Japan
| | - Tsuyoshi Araki
- Department of Neuropsychiatry, Graduate School of Medicine, The University of Tokyo, Bunkyo-ku, Tokyo, Japan
| | - Kiyoto Kasai
- Department of Neuropsychiatry, Graduate School of Medicine, The University of Tokyo, Bunkyo-ku, Tokyo, Japan.,International Research Center for Neurointelligence (WPI-IRCN), UTIAS, The University of Tokyo, Bunkyo-ku, Tokyo, Japan
| |
Collapse
|
12
|
Deane KE, Brunk MGK, Curran AW, Zempeltzi MM, Ma J, Lin X, Abela F, Aksit S, Deliano M, Ohl FW, Happel MFK. Ketamine anaesthesia induces gain enhancement via recurrent excitation in granular input layers of the auditory cortex. J Physiol 2020; 598:2741-2755. [PMID: 32329905 DOI: 10.1113/jp279705] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2020] [Accepted: 04/16/2020] [Indexed: 11/08/2022] Open
Abstract
KEY POINTS Ketamine is a common anaesthetic agent used in research and more recently as medication in treatment of depression. It has known effects on inhibition of interneurons and cortical stimulus-locked responses, but the underlying functional network mechanisms are still elusive. Analysing population activity across all layers within the auditory cortex, we found that doses of this anaesthetic induce a stronger activation and stimulus-locked response to pure-tone stimuli. This cortical response is driven by gain enhancement of thalamocortical input processing selectively within granular layers due to an increased recurrent excitation. Time-frequency analysis indicates a higher broadband magnitude response and prolonged phase coherence in granular layers, possibly pointing to disinhibition of this recurrent excitation. These results further the understanding of ketamine's functional mechanisms, which will improve the ability to interpret physiological studies moving from anaesthetized to awake paradigms and may lead to the development of better ketamine-based depression treatments with lower side effects. ABSTRACT Ketamine is commonly used as an anaesthetic agent and has more recently gained attention as an antidepressant. It has been linked to increased stimulus-locked excitability, inhibition of interneurons and modulation of intrinsic neuronal oscillations. However, the functional network mechanisms are still elusive. A better understanding of these anaesthetic network effects may improve upon previous interpretations of seminal studies conducted under anaesthesia and have widespread relevance for neuroscience with awake and anaesthetized subjects as well as in medicine. Here, we investigated the effects of anaesthetic doses of ketamine (15 mg kg-1 h-1 i.p.) on the network activity after pure-tone stimulation within the auditory cortex of male Mongolian gerbils (Meriones unguiculatus). We used laminar current source density (CSD) analysis and subsequent layer-specific continuous wavelet analysis to investigate spatiotemporal response dynamics on cortical columnar processing in awake and ketamine-anaesthetized animals. We found thalamocortical input processing within granular layers III/IV to be significantly increased under ketamine. This layer-dependent gain enhancement under ketamine was not due to changes in cross-trial phase coherence but was rather attributed to a broadband increase in magnitude reflecting an increase in recurrent excitation. A time-frequency analysis was indicative of a prolonged period of stimulus-induced excitation possibly due to a reduced coupling of excitation and inhibition in granular input circuits - in line with the common hypothesis of cortical disinhibition via suppression of GABAergic interneurons.
Collapse
Affiliation(s)
- Katrina E Deane
- Leibniz Institute for Neurobiology, Magdeburg, D-39118, Germany
| | | | - Andrew W Curran
- Leibniz Institute for Neurobiology, Magdeburg, D-39118, Germany.,Graduate School of Life Science, Julius Maximilians University, Würzburg, D-97074, Germany
| | | | - Jing Ma
- Leibniz Institute for Neurobiology, Magdeburg, D-39118, Germany
| | - Xiao Lin
- Leibniz Institute for Neurobiology, Magdeburg, D-39118, Germany
| | - Francesca Abela
- Leibniz Institute for Neurobiology, Magdeburg, D-39118, Germany.,University of Pisa, Pisa, I-56126, Italy
| | - Sümeyra Aksit
- Leibniz Institute for Neurobiology, Magdeburg, D-39118, Germany
| | | | - Frank W Ohl
- Leibniz Institute for Neurobiology, Magdeburg, D-39118, Germany.,Institute of Biology, Otto von Guericke University, Magdeburg, D-39120, Germany.,Center for Behavioral Brain Sciences (CBBS), Magdeburg, 39106, Germany
| | - Max F K Happel
- Leibniz Institute for Neurobiology, Magdeburg, D-39118, Germany.,Institute of Biology, Otto von Guericke University, Magdeburg, D-39120, Germany.,Center for Behavioral Brain Sciences (CBBS), Magdeburg, 39106, Germany
| |
Collapse
|
13
|
Yi HG, Leonard MK, Chang EF. The Encoding of Speech Sounds in the Superior Temporal Gyrus. Neuron 2019; 102:1096-1110. [PMID: 31220442 PMCID: PMC6602075 DOI: 10.1016/j.neuron.2019.04.023] [Citation(s) in RCA: 160] [Impact Index Per Article: 32.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2019] [Revised: 04/08/2019] [Accepted: 04/16/2019] [Indexed: 01/02/2023]
Abstract
The human superior temporal gyrus (STG) is critical for extracting meaningful linguistic features from speech input. Local neural populations are tuned to acoustic-phonetic features of all consonants and vowels and to dynamic cues for intonational pitch. These populations are embedded throughout broader functional zones that are sensitive to amplitude-based temporal cues. Beyond speech features, STG representations are strongly modulated by learned knowledge and perceptual goals. Currently, a major challenge is to understand how these features are integrated across space and time in the brain during natural speech comprehension. We present a theory that temporally recurrent connections within STG generate context-dependent phonological representations, spanning longer temporal sequences relevant for coherent percepts of syllables, words, and phrases.
Collapse
Affiliation(s)
- Han Gyol Yi
- Department of Neurological Surgery, University of California, San Francisco, 675 Nelson Rising Lane, San Francisco, CA 94158, USA
| | - Matthew K Leonard
- Department of Neurological Surgery, University of California, San Francisco, 675 Nelson Rising Lane, San Francisco, CA 94158, USA
| | - Edward F Chang
- Department of Neurological Surgery, University of California, San Francisco, 675 Nelson Rising Lane, San Francisco, CA 94158, USA.
| |
Collapse
|
14
|
Zhu S, Allitt B, Samuel A, Lui L, Rosa MGP, Rajan R. Distributed representation of vocalization pitch in marmoset primary auditory cortex. Eur J Neurosci 2018; 49:179-198. [PMID: 30307660 DOI: 10.1111/ejn.14204] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2018] [Revised: 09/10/2018] [Accepted: 10/04/2018] [Indexed: 11/30/2022]
Abstract
The pitch of vocalizations is a key communication feature aiding recognition of individuals and separating sound sources in complex acoustic environments. The neural representation of the pitch of periodic sounds is well defined. However, many natural sounds, like complex vocalizations, contain rich, aperiodic or not strictly periodic frequency content and/or include high-frequency components, but still evoke a strong sense of pitch. Indeed, such sounds are the rule, not the exception but the cortical mechanisms for encoding pitch of such sounds are unknown. We investigated how neurons in the high-frequency representation of primary auditory cortex (A1) of marmosets encoded changes in pitch of four natural vocalizations, two centred around a dominant frequency similar to the neuron's best sensitivity and two around a much lower dominant frequency. Pitch was varied over a fine range that can be used by marmosets to differentiate individuals. The responses of most high-frequency A1 neurons were sensitive to pitch changes in all four vocalizations, with a smaller proportion of the neurons showing pitch-insensitive responses. Classically defined excitatory drive, from the neuron's monaural frequency response area, predicted responses to changes in vocalization pitch in <30% of neurons suggesting most pitch tuning observed is not simple frequency-level response. Moreover, 39% of A1 neurons showed call-invariant tuning of pitch. These results suggest that distributed activity across A1 can represent the pitch of natural sounds over a fine, functionally relevant range, and exhibits pitch tuning for vocalizations within and outside the classical neural tuning area.
Collapse
Affiliation(s)
- Shuyu Zhu
- Biomedicine Discovery Institute and Department of Physiology, Monash University, Clayton, Victoria, Australia.,Centre of Excellence in Integrative Brain Function, Australian Research Council, Clayton, Victoria, Australia
| | - Ben Allitt
- Biomedicine Discovery Institute and Department of Physiology, Monash University, Clayton, Victoria, Australia
| | - Anil Samuel
- Biomedicine Discovery Institute and Department of Physiology, Monash University, Clayton, Victoria, Australia
| | - Leo Lui
- Biomedicine Discovery Institute and Department of Physiology, Monash University, Clayton, Victoria, Australia.,Centre of Excellence in Integrative Brain Function, Australian Research Council, Clayton, Victoria, Australia
| | - Marcello G P Rosa
- Biomedicine Discovery Institute and Department of Physiology, Monash University, Clayton, Victoria, Australia.,Centre of Excellence in Integrative Brain Function, Australian Research Council, Clayton, Victoria, Australia
| | - Ramesh Rajan
- Biomedicine Discovery Institute and Department of Physiology, Monash University, Clayton, Victoria, Australia
| |
Collapse
|
15
|
Knyazeva S, Selezneva E, Gorkin A, Aggelopoulos NC, Brosch M. Neuronal Correlates of Auditory Streaming in Monkey Auditory Cortex for Tone Sequences without Spectral Differences. Front Integr Neurosci 2018; 12:4. [PMID: 29440999 PMCID: PMC5797536 DOI: 10.3389/fnint.2018.00004] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2017] [Accepted: 01/16/2018] [Indexed: 11/13/2022] Open
Abstract
This study finds a neuronal correlate of auditory perceptual streaming in the primary auditory cortex for sequences of tone complexes that have the same amplitude spectrum but a different phase spectrum. Our finding is based on microelectrode recordings of multiunit activity from 270 cortical sites in three awake macaque monkeys. The monkeys were presented with repeated sequences of a tone triplet that consisted of an A tone, a B tone, another A tone and then a pause. The A and B tones were composed of unresolved harmonics formed by adding the harmonics in cosine phase, in alternating phase, or in random phase. A previous psychophysical study on humans revealed that when the A and B tones are similar, humans integrate them into a single auditory stream; when the A and B tones are dissimilar, humans segregate them into separate auditory streams. We found that the similarity of neuronal rate responses to the triplets was highest when all A and B tones had cosine phase. Similarity was intermediate when the A tones had cosine phase and the B tones had alternating phase. Similarity was lowest when the A tones had cosine phase and the B tones had random phase. The present study corroborates and extends previous reports, showing similar correspondences between neuronal activity in the primary auditory cortex and auditory streaming of sound sequences. It also is consistent with Fishman’s population separation model of auditory streaming.
Collapse
Affiliation(s)
- Stanislava Knyazeva
- Speziallabor Primatenneurobiologie, Leibniz-Institute für Neurobiologie, Magdeburg, Germany
| | - Elena Selezneva
- Speziallabor Primatenneurobiologie, Leibniz-Institute für Neurobiologie, Magdeburg, Germany
| | - Alexander Gorkin
- Speziallabor Primatenneurobiologie, Leibniz-Institute für Neurobiologie, Magdeburg, Germany.,Laboratory of Psychophysiology, Institute of Psychology, Moscow, Russia
| | | | - Michael Brosch
- Speziallabor Primatenneurobiologie, Leibniz-Institute für Neurobiologie, Magdeburg, Germany.,Center for Behavioral Brain Sciences, Otto-von-Guericke-University, Magdeburg, Germany
| |
Collapse
|
16
|
Bianchi F, Hjortkjær J, Santurette S, Zatorre RJ, Siebner HR, Dau T. Subcortical and cortical correlates of pitch discrimination: Evidence for two levels of neuroplasticity in musicians. Neuroimage 2017; 163:398-412. [DOI: 10.1016/j.neuroimage.2017.07.057] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2017] [Revised: 07/11/2017] [Accepted: 07/27/2017] [Indexed: 10/19/2022] Open
|
17
|
Nourski KV, Banks MI, Steinschneider M, Rhone AE, Kawasaki H, Mueller RN, Todd MM, Howard MA. Electrocorticographic delineation of human auditory cortical fields based on effects of propofol anesthesia. Neuroimage 2017; 152:78-93. [PMID: 28254512 PMCID: PMC5432407 DOI: 10.1016/j.neuroimage.2017.02.061] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2016] [Revised: 02/13/2017] [Accepted: 02/21/2017] [Indexed: 12/20/2022] Open
Abstract
The functional organization of human auditory cortex remains incompletely characterized. While the posteromedial two thirds of Heschl's gyrus (HG) is generally considered to be part of core auditory cortex, additional subdivisions of HG remain speculative. To further delineate the hierarchical organization of human auditory cortex, we investigated regional heterogeneity in the modulation of auditory cortical responses under varying depths of anesthesia induced by propofol. Non-invasive studies have shown that propofol differentially affects auditory cortical activity, with a greater impact on non-core areas. Subjects were neurosurgical patients undergoing removal of intracranial electrodes placed to identify epileptic foci. Stimuli were 50Hz click trains, presented continuously during an awake baseline period, and subsequently, while propofol infusion was incrementally titrated to induce general anesthesia. Electrocorticographic recordings were made with depth electrodes implanted in HG and subdural grid electrodes implanted over superior temporal gyrus (STG). Depth of anesthesia was monitored using spectral entropy. Averaged evoked potentials (AEPs), frequency-following responses (FFRs) and high gamma (70-150Hz) event-related band power were used to characterize auditory cortical activity. Based on the changes in AEPs and FFRs during the induction of anesthesia, posteromedial HG could be divided into two subdivisions. In the most posteromedial aspect of the gyrus, the earliest AEP deflections were preserved and FFRs increased during induction. In contrast, the remainder of the posteromedial HG exhibited attenuation of both the AEP and the FFR. The anterolateral HG exhibited weaker activation characterized by broad, low-voltage AEPs and the absence of FFRs. Lateral STG exhibited limited activation by click trains, and FFRs there diminished during induction. Sustained high gamma activity was attenuated in the most posteromedial portion of HG, and was absent in all other regions. These differential patterns of auditory cortical activity during the induction of anesthesia may serve as useful physiological markers for field delineation. In this study, the posteromedial HG could be parcellated into at least two subdivisions. Preservation of the earliest AEP deflections and FFRs in the posteromedial HG likely reflects the persistence of feedforward synaptic activity generated by inputs from subcortical auditory pathways, including the medial geniculate nucleus.
Collapse
Affiliation(s)
- Kirill V Nourski
- Department of Neurosurgery, The University of Iowa, Iowa City, IA, USA.
| | - Matthew I Banks
- Department of Anesthesiology, University of Wisconsin - Madison, Madison, WI, USA
| | - Mitchell Steinschneider
- Departments of Neurology and Neuroscience, Albert Einstein College of Medicine, Bronx, NY, USA
| | - Ariane E Rhone
- Department of Neurosurgery, The University of Iowa, Iowa City, IA, USA
| | - Hiroto Kawasaki
- Department of Neurosurgery, The University of Iowa, Iowa City, IA, USA
| | - Rashmi N Mueller
- Department of Anesthesia, The University of Iowa, Iowa City, IA, USA
| | - Michael M Todd
- Department of Anesthesia, The University of Iowa, Iowa City, IA, USA; Department of Anesthesiology, University of Minnesota, Minneapolis, MN, USA
| | - Matthew A Howard
- Department of Neurosurgery, The University of Iowa, Iowa City, IA, USA; Pappajohn Biomedical Institute, The University of Iowa, Iowa City, IA, USA; Iowa Neuroscience Institute, The University of Iowa, Iowa City, IA, USA
| |
Collapse
|
18
|
Abrams DA, Nicol T, White-Schwoch T, Zecker S, Kraus N. Population responses in primary auditory cortex simultaneously represent the temporal envelope and periodicity features in natural speech. Hear Res 2017; 348:31-43. [PMID: 28216125 DOI: 10.1016/j.heares.2017.02.010] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/07/2016] [Revised: 02/04/2017] [Accepted: 02/13/2017] [Indexed: 10/20/2022]
Abstract
Speech perception relies on a listener's ability to simultaneously resolve multiple temporal features in the speech signal. Little is known regarding neural mechanisms that enable the simultaneous coding of concurrent temporal features in speech. Here we show that two categories of temporal features in speech, the low-frequency speech envelope and periodicity cues, are processed by distinct neural mechanisms within the same population of cortical neurons. We measured population activity in primary auditory cortex of anesthetized guinea pig in response to three variants of a naturally produced sentence. Results show that the envelope of population responses closely tracks the speech envelope, and this cortical activity more closely reflects wider bandwidths of the speech envelope compared to narrow bands. Additionally, neuronal populations represent the fundamental frequency of speech robustly with phase-locked responses. Importantly, these two temporal features of speech are simultaneously observed within neuronal ensembles in auditory cortex in response to clear, conversation, and compressed speech exemplars. Results show that auditory cortical neurons are adept at simultaneously resolving multiple temporal features in extended speech sentences using discrete coding mechanisms.
Collapse
Affiliation(s)
- Daniel A Abrams
- Auditory Neuroscience Laboratory, The Roxelyn and Richard Pepper Department of Communication Sciences and Disorders, Northwestern University, 2240 Campus Drive, Evanston, IL, 60208, USA.
| | - Trent Nicol
- Auditory Neuroscience Laboratory, The Roxelyn and Richard Pepper Department of Communication Sciences and Disorders, Northwestern University, 2240 Campus Drive, Evanston, IL, 60208, USA
| | - Travis White-Schwoch
- Auditory Neuroscience Laboratory, The Roxelyn and Richard Pepper Department of Communication Sciences and Disorders, Northwestern University, 2240 Campus Drive, Evanston, IL, 60208, USA
| | - Steven Zecker
- Auditory Neuroscience Laboratory, The Roxelyn and Richard Pepper Department of Communication Sciences and Disorders, Northwestern University, 2240 Campus Drive, Evanston, IL, 60208, USA
| | - Nina Kraus
- Auditory Neuroscience Laboratory, The Roxelyn and Richard Pepper Department of Communication Sciences and Disorders, Northwestern University, 2240 Campus Drive, Evanston, IL, 60208, USA; Departments of Neurobiology and Physiology, Northwestern University, 2240 Campus Drive, Evanston, IL, 60208, USA; Department of Otolaryngology, Northwestern University, 2240 Campus Drive, Evanston, IL, 60208, USA
| |
Collapse
|
19
|
Krishnan A, Suresh CH, Gandour JT. Changes in pitch height elicit both language-universal and language-dependent changes in neural representation of pitch in the brainstem and auditory cortex. Neuroscience 2017; 346:52-63. [PMID: 28108254 DOI: 10.1016/j.neuroscience.2017.01.013] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2016] [Revised: 12/09/2016] [Accepted: 01/08/2017] [Indexed: 11/24/2022]
Abstract
Language experience shapes encoding of pitch-relevant information at both brainstem and cortical levels of processing. Pitch height is a salient dimension that orders pitch from low to high. Herein we investigate the effects of language experience (Chinese, English) in the brainstem and cortex on (i) neural responses to variations in pitch height, (ii) presence of asymmetry in cortical pitch representation, and (iii) patterns of relative changes in magnitude of pitch height between these two levels of brain structure. Stimuli were three nonspeech homologs of Mandarin Tone 2 varying in pitch height only. The frequency-following response (FFR) and the cortical pitch-specific response (CPR) were recorded concurrently. At the Fz-linked T7/T8 site, peak latency of Na, Pb, and Nb decreased with increasing pitch height for both groups. Peak-to-peak amplitude of Na-Pb and Pb-Nb increased with increasing pitch height across groups. A language-dependent effect was restricted to Na-Pb; the Chinese had larger amplitude than the English group. At temporal sites (T7/T8), the Chinese group had larger amplitude, as compared to English, across stimuli, but also limited to the Na-Pb component and right temporal site. In the brainstem, F0 magnitude decreased with increasing pitch height; Chinese had larger magnitude across stimuli. A comparison of CPR and FFR responses revealed distinct patterns of relative changes in magnitude common to both groups. CPR amplitude increased and FFR amplitude decreased with increasing pitch height. Experience-dependent effects on CPR components vary as a function of neural sensitivity to pitch height within a particular temporal window (Na-Pb). Differences between the auditory brainstem and cortex imply distinct neural mechanisms for pitch extraction at both levels of brain structure.
Collapse
Affiliation(s)
- Ananthanarayan Krishnan
- Purdue University, Department of Speech Language Hearing Sciences, Lyles-Porter Hall, 715 Clinic Drive, West Lafayette, IN 47907-2122, USA.
| | - Chandan H Suresh
- Purdue University, Department of Speech Language Hearing Sciences, Lyles-Porter Hall, 715 Clinic Drive, West Lafayette, IN 47907-2122, USA.
| | - Jackson T Gandour
- Purdue University, Department of Speech Language Hearing Sciences, Lyles-Porter Hall, 715 Clinic Drive, West Lafayette, IN 47907-2122, USA.
| |
Collapse
|
20
|
Estimation of a transient response from steady-state responses by deconvolution with built-in constraints. J Theor Biol 2016; 404:143-159. [PMID: 27234643 DOI: 10.1016/j.jtbi.2016.05.032] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2015] [Revised: 05/11/2016] [Accepted: 05/23/2016] [Indexed: 11/23/2022]
Abstract
Evidence suggests that the steady-state response (SSR) elicited by a periodic train of auditory stimuli can largely be understood as a superposition of transient responses. This study is devoted to the problem of how to estimate that transient response from measured SSRs. The proposed method differs from previous approaches in that the solution can be constrained to be consistent with physiology-based prior knowledge or educated guesses. To achieve this goal, the transient response is not represented by a time series, but by a linear combination of auxiliary functions, called components. Constraints are introduced by assigning certain properties to the components. Only few parameters are required for that purpose, because the individual components are derived from a suitably designed mother component. After adjusting the components to the problem at hand, the component amplitudes are determined by optimizing the match between predicted and measured SSRs. This requires solving a linear inverse problem. A model simulation as well as an analysis of exemplary experimental data (auditory SSRs elicited by periodically presented clicks) prove the workability of the method. Since part of the theory is quite general, it would be relatively easy to refine and extend the method. Not only could responses other than SSRs be dealt with, it could also be realized that certain key parameters of the transient response, such as amplitude and delay, depend on stimulus repetition rate.
Collapse
|
21
|
Neural Representation of Concurrent Vowels in Macaque Primary Auditory Cortex. eNeuro 2016; 3:eN-NWR-0071-16. [PMID: 27294198 PMCID: PMC4901243 DOI: 10.1523/eneuro.0071-16.2016] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2016] [Accepted: 04/15/2016] [Indexed: 11/30/2022] Open
Abstract
Successful speech perception in real-world environments requires that the auditory system segregate competing voices that overlap in frequency and time into separate streams. Vowels are major constituents of speech and are comprised of frequencies (harmonics) that are integer multiples of a common fundamental frequency (F0). The pitch and identity of a vowel are determined by its F0 and spectral envelope (formant structure), respectively. When two spectrally overlapping vowels differing in F0 are presented concurrently, they can be readily perceived as two separate “auditory objects” with pitches at their respective F0s. A difference in pitch between two simultaneous vowels provides a powerful cue for their segregation, which in turn, facilitates their individual identification. The neural mechanisms underlying the segregation of concurrent vowels based on pitch differences are poorly understood. Here, we examine neural population responses in macaque primary auditory cortex (A1) to single and double concurrent vowels (/a/ and /i/) that differ in F0 such that they are heard as two separate auditory objects with distinct pitches. We find that neural population responses in A1 can resolve, via a rate-place code, lower harmonics of both single and double concurrent vowels. Furthermore, we show that the formant structures, and hence the identities, of single vowels can be reliably recovered from the neural representation of double concurrent vowels. We conclude that A1 contains sufficient spectral information to enable concurrent vowel segregation and identification by downstream cortical areas.
Collapse
|
22
|
Tang H, Crain S, Johnson BW. Dual temporal encoding mechanisms in human auditory cortex: Evidence from MEG and EEG. Neuroimage 2016; 128:32-43. [DOI: 10.1016/j.neuroimage.2015.12.053] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2015] [Revised: 12/01/2015] [Accepted: 12/30/2015] [Indexed: 11/25/2022] Open
|
23
|
Krishnan A, Gandour JT, Suresh CH. Language-experience plasticity in neural representation of changes in pitch salience. Brain Res 2016; 1637:102-117. [PMID: 26903418 DOI: 10.1016/j.brainres.2016.02.021] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2015] [Revised: 02/05/2016] [Accepted: 02/10/2016] [Indexed: 11/28/2022]
Abstract
Neural representation of pitch-relevant information at the brainstem and cortical levels of processing is influenced by language experience. A well-known attribute of pitch is its salience. Brainstem frequency following responses and cortical pitch specific responses, recorded concurrently, were elicited by a pitch salience continuum spanning weak to strong pitch of a dynamic, iterated rippled noise pitch contour-homolog of a Mandarin tone. Our aims were to assess how language experience (Chinese, English) affects i) enhancement of neural activity associated with pitch salience at brainstem and cortical levels, ii) the presence of asymmetry in cortical pitch representation, and iii) patterns of relative changes in magnitude along the pitch salience continuum. Peak latency (Fz: Na, Pb, and Nb) was shorter in the Chinese than the English group across the continuum. Peak-to-peak amplitude (Fz: Na-Pb, Pb-Nb) of the Chinese group grew larger with increasing pitch salience, but an experience-dependent advantage was limited to the Na-Pb component. At temporal sites (T7/T8), the larger amplitude of the Chinese group across the continuum was both limited to the Na-Pb component and the right temporal site. At the brainstem level, F0 magnitude gets larger as you increase pitch salience, and it too reveals Chinese superiority. A direct comparison of cortical and brainstem responses for the Chinese group reveals different patterns of relative changes in magnitude along the pitch salience continuum. Such differences may point to a transformation in pitch processing at the cortical level presumably mediated by local sensory and/or extrasensory influence overlaid on the brainstem output.
Collapse
Affiliation(s)
- Ananthanarayan Krishnan
- Department of Speech Language Hearing Sciences, Purdue University, Lyles Porter Hall, 715 Clinic Drive, West Lafayette, IN 47907-2122, USA.
| | - Jackson T Gandour
- Department of Speech Language Hearing Sciences, Purdue University, Lyles Porter Hall, 715 Clinic Drive, West Lafayette, IN 47907-2122, USA.
| | - Chandan H Suresh
- Department of Speech Language Hearing Sciences, Purdue University, Lyles Porter Hall, 715 Clinic Drive, West Lafayette, IN 47907-2122, USA.
| |
Collapse
|
24
|
Schaefer MK, Hechavarría JC, Kössl M. Quantification of mid and late evoked sinks in laminar current source density profiles of columns in the primary auditory cortex. Front Neural Circuits 2015; 9:52. [PMID: 26557058 PMCID: PMC4617414 DOI: 10.3389/fncir.2015.00052] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2015] [Accepted: 09/14/2015] [Indexed: 11/18/2022] Open
Abstract
Current source density (CSD) analysis assesses spatiotemporal synaptic activations at somatic and/or dendritic levels in the form of depolarizing current sinks. Whereas many studies have focused on the short (<50 ms) latency sinks, associated with thalamocortical projections, sinks with longer latencies have received less attention. Here, we analyzed laminar CSD patterns for the first 600 ms after stimulus onset in the primary auditory cortex of Mongolian gerbils. By applying an algorithm for contour calculation, three distinct mid and four late evoked sinks were identified in layers I, III, Va, VIa, and VIb. Our results further showed that the patterns of intracortical information-flow remained qualitatively similar for low and for high sound pressure level stimuli at the characteristic frequency (CF) as well as for stimuli ± 1 octave from CF. There were, however, differences associated with the strength, vertical extent, onset latency, and duration of the sinks for the four stimulation paradigms used. Stimuli one octave above the most sensitive frequency evoked a new, and quite reliable, sink in layer Va whereas low level stimulation led to the disappearance of the layer VIb sink. These data indicate the presence of input sources specifically activated in response to level and/or frequency parameters. Furthermore, spectral integration above vs. below the CF of neurons is asymmetric as illustrated by CSD profiles. These results are important because synaptic feedback associated with mid and late sinks—beginning at 50 ms post stimulus latency—is likely crucial for response modulation resulting from higher order processes like memory, learning or cognitive control.
Collapse
Affiliation(s)
- Markus K Schaefer
- Institute for Cell Biology and Neuroscience, AK Neurobiology and Biosensors, Goethe University Frankfurt/Main, Germany
| | - Julio C Hechavarría
- Institute for Cell Biology and Neuroscience, AK Neurobiology and Biosensors, Goethe University Frankfurt/Main, Germany
| | - Manfred Kössl
- Institute for Cell Biology and Neuroscience, AK Neurobiology and Biosensors, Goethe University Frankfurt/Main, Germany
| |
Collapse
|
25
|
Nourski KV, Steinschneider M, Rhone AE, Oya H, Kawasaki H, Howard MA, McMurray B. Sound identification in human auditory cortex: Differential contribution of local field potentials and high gamma power as revealed by direct intracranial recordings. BRAIN AND LANGUAGE 2015; 148:37-50. [PMID: 25819402 PMCID: PMC4556541 DOI: 10.1016/j.bandl.2015.03.003] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/17/2014] [Revised: 02/05/2015] [Accepted: 03/03/2015] [Indexed: 06/01/2023]
Abstract
High gamma power has become the principal means of assessing auditory cortical activation in human intracranial studies, albeit at the expense of low frequency local field potentials (LFPs). It is unclear whether limiting analyses to high gamma impedes ability of clarifying auditory cortical organization. We compared the two measures obtained from posterolateral superior temporal gyrus (PLST) and evaluated their relative utility in sound categorization. Subjects were neurosurgical patients undergoing invasive monitoring for medically refractory epilepsy. Stimuli (consonant-vowel syllables varying in voicing and place of articulation and control tones) elicited robust evoked potentials and high gamma activity on PLST. LFPs had greater across-subject variability, yet yielded higher classification accuracy, relative to high gamma power. Classification was enhanced by including temporal detail of LFPs and combining LFP and high gamma. We conclude that future studies should consider utilizing both LFP and high gamma when investigating the functional organization of human auditory cortex.
Collapse
Affiliation(s)
- Kirill V Nourski
- Department of Neurosurgery, The University of Iowa, Iowa City, IA 52242, USA.
| | - Mitchell Steinschneider
- Department of Neurology, Albert Einstein College of Medicine, New York, NY 10461, USA; Department of Neuroscience, Albert Einstein College of Medicine, New York, NY 10461, USA
| | - Ariane E Rhone
- Department of Neurosurgery, The University of Iowa, Iowa City, IA 52242, USA
| | - Hiroyuki Oya
- Department of Neurosurgery, The University of Iowa, Iowa City, IA 52242, USA
| | - Hiroto Kawasaki
- Department of Neurosurgery, The University of Iowa, Iowa City, IA 52242, USA
| | - Matthew A Howard
- Department of Neurosurgery, The University of Iowa, Iowa City, IA 52242, USA
| | - Bob McMurray
- Department of Psychology, The University of Iowa, Iowa City, IA 52242, USA; Department of Communication Sciences and Disorders, The University of Iowa, Iowa City, IA 52242, USA; Department of Linguistics, The University of Iowa, Iowa City, IA 52242, USA
| |
Collapse
|
26
|
Pallesen KJ, Bailey CJ, Brattico E, Gjedde A, Palva JM, Palva S. Experience Drives Synchronization: The phase and Amplitude Dynamics of Neural Oscillations to Musical Chords Are Differentially Modulated by Musical Expertise. PLoS One 2015; 10:e0134211. [PMID: 26291324 PMCID: PMC4546391 DOI: 10.1371/journal.pone.0134211] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2014] [Accepted: 07/07/2015] [Indexed: 11/18/2022] Open
Abstract
Musical expertise is associated with structural and functional changes in the brain that underlie facilitated auditory perception. We investigated whether the phase locking (PL) and amplitude modulations (AM) of neuronal oscillations in response to musical chords are correlated with musical expertise and whether they reflect the prototypicality of chords in Western tonal music. To this aim, we recorded magnetoencephalography (MEG) while musicians and non-musicians were presented with common prototypical major and minor chords, and with uncommon, non-prototypical dissonant and mistuned chords, while watching a silenced movie. We then analyzed the PL and AM of ongoing oscillations in the theta (4–8 Hz) alpha (8–14 Hz), beta- (14–30 Hz) and gamma- (30–80 Hz) bands to these chords. We found that musical expertise was associated with strengthened PL of ongoing oscillations to chords over a wide frequency range during the first 300 ms from stimulus onset, as opposed to increased alpha-band AM to chords over temporal MEG channels. In musicians, the gamma-band PL was strongest to non-prototypical compared to other chords, while in non-musicians PL was strongest to minor chords. In both musicians and non-musicians the long-latency (> 200 ms) gamma-band PL was also sensitive to chord identity, and particularly to the amplitude modulations (beats) of the dissonant chord. These findings suggest that musical expertise modulates oscillation PL to musical chords and that the strength of these modulations is dependent on chord prototypicality.
Collapse
Affiliation(s)
- Karen Johanne Pallesen
- Department of Neuroscience and Pharmacology, University of Copenhagen, Copenhagen, Denmark
- The Research Clinic for Functional Disorders and Psychosomatics, Aarhus University Hospital, Aarhus, Denmark
- Center of Functionally Integrative Neuroscience, Aarhus University, Aarhus, Denmark
- * E-mail:
| | | | - Elvira Brattico
- Helsinki Collegium for Advanced Studies, University of Helsinki, Helsinki, Finland
- Cognitive Brain Research Unit, Institute of Behavioral Science, University of Helsinki, Helsinki, Finland
| | - Albert Gjedde
- Department of Neuroscience and Pharmacology, University of Copenhagen, Copenhagen, Denmark
- Center of Functionally Integrative Neuroscience, Aarhus University, Aarhus, Denmark
- Pathophysiology and Experimental Tomography Center, Aarhus University Hospital, Aarhus, Denmark
| | - J. Matias Palva
- Neuroscience Center, University of Helsinki, Helsinki, Finland
| | - Satu Palva
- Neuroscience Center, University of Helsinki, Helsinki, Finland
- BioMag laboratory, HUS Medical Imaging Center, Helsinki University Central Hospital, Helsinki, Finland
| |
Collapse
|
27
|
Kato T, Fujita K, Kashimori Y. A neural mechanism of phase-locked responses to sinusoidally amplitude-modulated signals in the inferior colliculus. Biosystems 2015; 134:24-36. [PMID: 26032987 DOI: 10.1016/j.biosystems.2015.05.007] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2014] [Revised: 05/14/2015] [Accepted: 05/26/2015] [Indexed: 11/18/2022]
Abstract
The central nucleus of the inferior colliculus (ICc) is an auditory region that receives convergent inputs from a large number of lower auditory nuclei. ICc neurons phase-lock to low frequencies of sinusoidally amplitude-modulated (SAM) signals but have a different mechanism in the phase-locking from that in neurons of lower nuclei. In the mustached bat, the phase-locking ability in lower nuclei is created by the coincidence of phase-locked excitatory and inhibitory inputs that have slightly different latencies. In contrast, the phase-locking property of ICc neurons is little influenced by the blocking of inhibitory synapses. Moreover, ICc neurons exhibit different characteristics in the spike patterns and synchronicity, classified here by three types of ICc neurons, or sustained, onset, and non-onset phase-locking neurons. However it remains unclear how ICc neurons create the phase-locking ability and the different characteristics. To address this issue, we developed a model of ICc neuronal population. Using this model, we show that the phase-locking ability of ICc neurons to low SAM frequencies is created by an intrinsic membrane property of ICc neuron, limited by inhibitory ion channels. We also show that response characteristics of the three types of neurons arise from the difference in an inhibitory effect sensitive to SAM frequencies. Our model reproduces well the experimental results observed in the mustached bat. These findings provide necessary conditions of how ICc neurons can give rise to the phase-locking ability and characteristic responses to low SAM frequencies.
Collapse
Affiliation(s)
- Takayuki Kato
- Graduate School of Information Systems, Univ. of Electro-Communications, Chofu, Tokyo 182-8585 Japan
| | - Kazuhisa Fujita
- Dept. of Engineering Science, Univ. of Electro-Communications, Chofu, Tokyo 182-8585 Japan; National Institute of Technology, Tsuyama Collage, 654-1 Numa, Tsuyama, Okayama 708-8506, Japan.
| | - Yoshiki Kashimori
- Graduate School of Information Systems, Univ. of Electro-Communications, Chofu, Tokyo 182-8585 Japan; Dept. of Engineering Science, Univ. of Electro-Communications, Chofu, Tokyo 182-8585 Japan
| |
Collapse
|
28
|
Disruption of the auditory response to a regular click train by a single, extra click. Exp Brain Res 2015; 233:1875-92. [DOI: 10.1007/s00221-015-4260-6] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2014] [Accepted: 03/16/2015] [Indexed: 11/25/2022]
|
29
|
Krishnan A, Gandour JT, Ananthakrishnan S, Vijayaraghavan V. Language experience enhances early cortical pitch-dependent responses. JOURNAL OF NEUROLINGUISTICS 2015; 33:128-148. [PMID: 25506127 PMCID: PMC4261237 DOI: 10.1016/j.jneuroling.2014.08.002] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/04/2023]
Abstract
Pitch processing at cortical and subcortical stages of processing is shaped by language experience. We recently demonstrated that specific components of the cortical pitch response (CPR) index the more rapidly-changing portions of the high rising Tone 2 of Mandarin Chinese, in addition to marking pitch onset and sound offset. In this study, we examine how language experience (Mandarin vs. English) shapes the processing of different temporal attributes of pitch reflected in the CPR components using stimuli representative of within-category variants of Tone 2. Results showed that the magnitude of CPR components (Na-Pb and Pb-Nb) and the correlation between these two components and pitch acceleration were stronger for the Chinese listeners compared to English listeners for stimuli that fell within the range of Tone 2 citation forms. Discriminant function analysis revealed that the Na-Pb component was more than twice as important as Pb-Nb in grouping listeners by language affiliation. In addition, a stronger stimulus-dependent, rightward asymmetry was observed for the Chinese group at the temporal, but not frontal, electrode sites. This finding may reflect selective recruitment of experience-dependent, pitch-specific mechanisms in right auditory cortex to extract more complex, time-varying pitch patterns. Taken together, these findings suggest that long-term language experience shapes early sensory level processing of pitch in the auditory cortex, and that the sensitivity of the CPR may vary depending on the relative linguistic importance of specific temporal attributes of dynamic pitch.
Collapse
|
30
|
Abstract
This chapter provides an overview of current invasive recording methodology and experimental paradigms used in the studies of human auditory cortex. Invasive recordings can be obtained from neurosurgical patients undergoing clinical electrophysiologic evaluation for medically refractory epilepsy or brain tumors. This provides a unique research opportunity to study the human auditory cortex with high resolution both in time (milliseconds) and space (millimeters) and to generate valuable information about its organization and function. A historic overview presents the development of the experimental approaches from the pioneering works of Wilder Penfield to modern day. Practical issues regarding research subject population, stimulus presentation, data collection, and analysis are discussed for acute (intraoperative) and chronic experiments. Illustrative examples are provided from experimental paradigms, including studies of spectrotemporal processing, functional connectivity, and functional lesioning in human auditory cortex.
Collapse
Affiliation(s)
- Kirill V Nourski
- Department of Neurosurgery, University of Iowa, Iowa City, IA, USA.
| | - Matthew A Howard
- Department of Neurosurgery, University of Iowa, Iowa City, IA, USA
| |
Collapse
|
31
|
Krishnan A, Gandour JT, Suresh CH. Cortical pitch response components show differential sensitivity to native and nonnative pitch contours. BRAIN AND LANGUAGE 2014; 138:51-60. [PMID: 25306506 PMCID: PMC4335674 DOI: 10.1016/j.bandl.2014.09.005] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/28/2014] [Revised: 08/20/2014] [Accepted: 09/21/2014] [Indexed: 06/04/2023]
Abstract
The aim of this study is to evaluate how nonspeech pitch contours of varying shape influence latency and amplitude of cortical pitch-specific response (CPR) components differentially as a function of language experience. Stimuli included time-varying, high rising Mandarin Tone 2 (T2) and linear rising ramp (Linear), and steady-state (Flat). Both the latency and magnitude of CPR components were differentially modulated by (i) the overall trajectory of pitch contours (time-varying vs. steady-state), (ii) their pitch acceleration rates (changing vs. constant), and (iii) their linguistic status (lexical vs. non-lexical). T2 elicited larger amplitude than Linear in both language groups, but size of the effect was larger in Chinese than English. The magnitude of CPR components elicited by T2 were larger for Chinese than English at the right temporal electrode site. Using the CPR, we provide evidence in support of experience-dependent modulation of dynamic pitch contours at an early stage of sensory processing.
Collapse
Affiliation(s)
| | - Jackson T Gandour
- Department of Speech Language Hearing Sciences, Purdue University, USA.
| | - Chandan H Suresh
- Department of Speech Language Hearing Sciences, Purdue University, USA.
| |
Collapse
|
32
|
Fishman YI, Steinschneider M, Micheyl C. Neural representation of concurrent harmonic sounds in monkey primary auditory cortex: implications for models of auditory scene analysis. J Neurosci 2014; 34:12425-43. [PMID: 25209282 PMCID: PMC4160777 DOI: 10.1523/jneurosci.0025-14.2014] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2014] [Revised: 07/14/2014] [Accepted: 07/28/2014] [Indexed: 11/21/2022] Open
Abstract
The ability to attend to a particular sound in a noisy environment is an essential aspect of hearing. To accomplish this feat, the auditory system must segregate sounds that overlap in frequency and time. Many natural sounds, such as human voices, consist of harmonics of a common fundamental frequency (F0). Such harmonic complex tones (HCTs) evoke a pitch corresponding to their F0. A difference in pitch between simultaneous HCTs provides a powerful cue for their segregation. The neural mechanisms underlying concurrent sound segregation based on pitch differences are poorly understood. Here, we examined neural responses in monkey primary auditory cortex (A1) to two concurrent HCTs that differed in F0 such that they are heard as two separate "auditory objects" with distinct pitches. We found that A1 can resolve, via a rate-place code, the lower harmonics of both HCTs, a prerequisite for deriving their pitches and for their perceptual segregation. Onset asynchrony between the HCTs enhanced the neural representation of their harmonics, paralleling their improved perceptual segregation in humans. Pitches of the concurrent HCTs could also be temporally represented by neuronal phase-locking at their respective F0s. Furthermore, a model of A1 responses using harmonic templates could qualitatively reproduce psychophysical data on concurrent sound segregation in humans. Finally, we identified a possible intracortical homolog of the "object-related negativity" recorded noninvasively in humans, which correlates with the perceptual segregation of concurrent sounds. Findings indicate that A1 contains sufficient spectral and temporal information for segregating concurrent sounds based on differences in pitch.
Collapse
Affiliation(s)
- Yonatan I Fishman
- Departments of Neurology and Neuroscience, Albert Einstein College of Medicine, Bronx, New York 10461,
| | - Mitchell Steinschneider
- Departments of Neurology and Neuroscience, Albert Einstein College of Medicine, Bronx, New York 10461
| | - Christophe Micheyl
- Department of Psychology, University of Minnesota, Minneapolis, Minnesota 55455, and Starkey Hearing Research Center, Berkeley, California 94704
| |
Collapse
|
33
|
Fast transmission from the dopaminergic ventral midbrain to the sensory cortex of awake primates. Brain Struct Funct 2014; 220:3273-94. [PMID: 25084746 DOI: 10.1007/s00429-014-0855-0] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2014] [Accepted: 07/21/2014] [Indexed: 12/21/2022]
Abstract
Motivated by the increasing evidence that auditory cortex is under control of dopaminergic cell structures of the ventral midbrain, we studied how the ventral tegmental area and substantia nigra affect neuronal activity in auditory cortex. We electrically stimulated 567 deep brain sites in total within and in the vicinity of the two dopaminergic ventral midbrain structures and at the same time, recorded local field potentials and neuronal discharges in cortex. In experiments conducted on three awake macaque monkeys, we found that electrical stimulation of the dopaminergic ventral midbrain resulted in short-latency (~35 ms) phasic activations in all cortical layers of auditory cortex. We were also able to demonstrate similar activations in secondary somatosensory cortex and superior temporal polysensory cortex. The electrically evoked responses in these parts of sensory cortex were similar to those previously described for prefrontal cortex. Moreover, these phasic responses could be reversibly altered by the dopamine D1-receptor antagonist SCH23390 for several tens of minutes. Thus, we speculate that the dopaminergic ventral midbrain exerts a temporally precise, phasic influence on sensory cortex using fast-acting non-dopaminergic transmitters and that their effects are modulated by dopamine on a longer timescale. Our findings suggest that some of the information carried by the neuronal discharges in the dopaminergic ventral midbrain, such as the motivational value or the motivational salience, is transmitted to auditory cortex and other parts of sensory cortex. The mesocortical pathway may thus contribute to the representation of non-auditory events in the auditory cortex and to its associative functions.
Collapse
|
34
|
Abstract
A fundamental structure of sounds encountered in the natural environment is the harmonicity. Harmonicity is an essential component of music found in all cultures. It is also a unique feature of vocal communication sounds such as human speech and animal vocalizations. Harmonics in sounds are produced by a variety of acoustic generators and reflectors in the natural environment, including vocal apparatuses of humans and animal species as well as music instruments of many types. We live in an acoustic world full of harmonicity. Given the widespread existence of the harmonicity in many aspects of the hearing environment, it is natural to expect that it be reflected in the evolution and development of the auditory systems of both humans and animals, in particular the auditory cortex. Recent neuroimaging and neurophysiology experiments have identified regions of non-primary auditory cortex in humans and non-human primates that have selective responses to harmonic pitches. Accumulating evidence has also shown that neurons in many regions of the auditory cortex exhibit characteristic responses to harmonically related frequencies beyond the range of pitch. Together, these findings suggest that a fundamental organizational principle of auditory cortex is based on the harmonicity. Such an organization likely plays an important role in music processing by the brain. It may also form the basis of the preference for particular classes of music and voice sounds.
Collapse
Affiliation(s)
- Xiaoqin Wang
- Department of Biomedical Engineering, Johns Hopkins University School of MedicineBaltimore, MD, USA
- Tsinghua-Johns Hopkins Joint Center for Biomedical Engineering Research and Department of Biomedical Engineering, Tsinghua UniversityBeijing, China
| |
Collapse
|
35
|
Abstract
Some areas in auditory cortex respond preferentially to sounds that elicit pitch, such as musical sounds or voiced speech. This study used human electroencephalography (EEG) with an adaptation paradigm to investigate how pitch is represented within these areas and, in particular, whether the representation reflects the physical or perceptual dimensions of pitch. Physically, pitch corresponds to a single monotonic dimension: the repetition rate of the stimulus waveform. Perceptually, however, pitch has to be described with 2 dimensions, a monotonic, "pitch height," and a cyclical, "pitch chroma," dimension, to account for the similarity of the cycle of notes (c, d, e, etc.) across different octaves. The EEG adaptation effect mirrored the cyclicality of the pitch chroma dimension, suggesting that auditory cortex contains a representation of pitch chroma. Source analysis indicated that the centroid of this pitch chroma representation lies somewhat anterior and lateral to primary auditory cortex.
Collapse
Affiliation(s)
- Paul M. Briley
- MRC Institute of Hearing Research, Nottingham, UK
- Department of Psychology, University of York, York, UK
| | - Charlotte Breakey
- School of Biomedical Sciences, University of Nottingham, Nottingham, UK
| | | |
Collapse
|
36
|
Neural representation of harmonic complex tones in primary auditory cortex of the awake monkey. J Neurosci 2013; 33:10312-23. [PMID: 23785145 DOI: 10.1523/jneurosci.0020-13.2013] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
Many natural sounds are periodic and consist of frequencies (harmonics) that are integer multiples of a common fundamental frequency (F0). Such harmonic complex tones (HCTs) evoke a pitch corresponding to their F0, which plays a key role in the perception of speech and music. "Pitch-selective" neurons have been identified in non-primary auditory cortex of marmoset monkeys. Noninvasive studies point to a putative "pitch center" located in a homologous cortical region in humans. It remains unclear whether there is sufficient spectral and temporal information available at the level of primary auditory cortex (A1) to enable reliable pitch extraction in non-primary auditory cortex. Here we evaluated multiunit responses to HCTs in A1 of awake macaques using a stimulus design employed in auditory nerve studies of pitch encoding. The F0 of the HCTs was varied in small increments, such that harmonics of the HCTs fell either on the peak or on the sides of the neuronal pure tone tuning functions. Resultant response-amplitude-versus-harmonic-number functions ("rate-place profiles") displayed a periodic pattern reflecting the neuronal representation of individual HCT harmonics. Consistent with psychoacoustic findings in humans, lower harmonics were better resolved in rate-place profiles than higher harmonics. Lower F0s were also temporally represented by neuronal phase-locking to the periodic waveform of the HCTs. Findings indicate that population responses in A1 contain sufficient spectral and temporal information for extracting the pitch of HCTs by neurons in downstream cortical areas that receive their input from A1.
Collapse
|
37
|
Representation of speech in human auditory cortex: is it special? Hear Res 2013; 305:57-73. [PMID: 23792076 DOI: 10.1016/j.heares.2013.05.013] [Citation(s) in RCA: 68] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/30/2013] [Revised: 05/13/2013] [Accepted: 05/28/2013] [Indexed: 11/20/2022]
Abstract
Successful categorization of phonemes in speech requires that the brain analyze the acoustic signal along both spectral and temporal dimensions. Neural encoding of the stimulus amplitude envelope is critical for parsing the speech stream into syllabic units. Encoding of voice onset time (VOT) and place of articulation (POA), cues necessary for determining phonemic identity, occurs within shorter time frames. An unresolved question is whether the neural representation of speech is based on processing mechanisms that are unique to humans and shaped by learning and experience, or is based on rules governing general auditory processing that are also present in non-human animals. This question was examined by comparing the neural activity elicited by speech and other complex vocalizations in primary auditory cortex of macaques, who are limited vocal learners, with that in Heschl's gyrus, the putative location of primary auditory cortex in humans. Entrainment to the amplitude envelope is neither specific to humans nor to human speech. VOT is represented by responses time-locked to consonant release and voicing onset in both humans and monkeys. Temporal representation of VOT is observed both for isolated syllables and for syllables embedded in the more naturalistic context of running speech. The fundamental frequency of male speakers is represented by more rapid neural activity phase-locked to the glottal pulsation rate in both humans and monkeys. In both species, the differential representation of stop consonants varying in their POA can be predicted by the relationship between the frequency selectivity of neurons and the onset spectra of the speech sounds. These findings indicate that the neurophysiology of primary auditory cortex is similar in monkeys and humans despite their vastly different experience with human speech, and that Heschl's gyrus is engaged in general auditory, and not language-specific, processing. This article is part of a Special Issue entitled "Communication Sounds and the Brain: New Directions and Perspectives".
Collapse
|
38
|
The neuronal responses to repetitive acoustic pulses in different fields of the auditory cortex of awake rats. PLoS One 2013; 8:e64288. [PMID: 23696877 PMCID: PMC3655960 DOI: 10.1371/journal.pone.0064288] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2013] [Accepted: 04/10/2013] [Indexed: 11/19/2022] Open
Abstract
Cortical representation of time-varying features of acoustic signals is a fundamental issue of acoustic processing remaining unresolved. The rat is a widely used animal model for auditory cortical processing. Though some electrophysiological studies have investigated the neural responses to temporal repetitive sounds in the auditory cortex (AC) of rats, most of them were conducted under anesthetized condition. Recently, it has been shown that anesthesia could significantly alter the temporal patterns of neural response. For this reason, we systematically examined the single-unit neural responses to click-trains in the core region of rat AC under awake condition. Consistent with the reports on anesthetized rats, we confirmed the existence of characteristic tonotopic organizations, which were used to divide the AC into anterior auditory field (AAF), primary auditory cortex (A1) and posterior auditory field (PAF). We further found that the neuron's capability to synchronize to the temporal repetitive stimuli progressively decreased along the anterior-to-posterior direction of AC. The median of maximum synchronization rate was 64, 32 and 16 Hz in AAF, A1 and PAF, respectively. On the other hand, the percentage of neurons, which showed non-synchronized responses and could represent the stimulus repetition rate by the mean firing rate, increased from 7% in AAF and A1 to 20% in PAF. These results suggest that the temporal resolution of acoustic processing gradually increases from the anterior to posterior part of AC, and thus there may be a hierarchical stream along this direction of rat AC.
Collapse
|
39
|
Multiplexing stimulus information through rate and temporal codes in primate somatosensory cortex. PLoS Biol 2013; 11:e1001558. [PMID: 23667327 PMCID: PMC3646728 DOI: 10.1371/journal.pbio.1001558] [Citation(s) in RCA: 131] [Impact Index Per Article: 11.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2012] [Accepted: 03/27/2013] [Indexed: 12/03/2022] Open
Abstract
In somatosensory cortex, stimulus amplitude is represented at a relatively coarse temporal resolution, while stimulus frequency is represented by precisely timed action potentials. Our ability to perceive and discriminate textures relies on the transduction and processing of complex, high-frequency vibrations elicited in the fingertip as it is scanned across a surface. How naturalistic vibrations, and by extension texture, are encoded in the responses of neurons in primary somatosensory cortex (S1) is unknown. Combining single unit recordings in awake macaques and perceptual judgments obtained from human subjects, we show that vibratory amplitude is encoded in the strength of the response evoked in S1 neurons. In contrast, the frequency composition of the vibrations, up to 800 Hz, is not encoded in neuronal firing rates, but rather in the phase-locked responses of a subpopulation of neurons. Moreover, analysis of perceptual judgments suggests that spike timing not only conveys stimulus information but also shapes tactile perception. We conclude that information about the amplitude and frequency of natural vibrations is multiplexed at different time scales in S1, and encoded in the rate and temporal patterning of the response, respectively. When we slide our fingertip across a textured surface, small, complex, and high-frequency vibrations are elicited in the skin and our nervous system extracts information about texture from these vibrations. In this study, we investigate how texture-like vibrations are processed in primary somatosensory cortex (S1). First, we show that the time-varying amplitude of skin vibrations is encoded in the time-varying response rates of a subpopulation of S1 neurons. Second, we show that this same subpopulation of S1 neurons produces responses whose timing closely matches that of the vibrations: The frequency composition of the spiking patterns matches that of the stimulus, even for complex vibrations. We demonstrate that this temporal precision is behaviorally relevant by showing that the tactile perception of vibration is better predicted from neuronal responses when spike timing is taken into consideration than when it is not. The activity of S1 neurons is thus multiplexed at different time scales: Stimulus amplitude, which changes relatively slowly, is represented at a relatively coarse temporal resolution, while stimulus frequency is represented by precisely timed action potentials.
Collapse
|
40
|
Auditory cortex represents both pitch judgments and the corresponding acoustic cues. Curr Biol 2013; 23:620-5. [PMID: 23523247 PMCID: PMC3696731 DOI: 10.1016/j.cub.2013.03.003] [Citation(s) in RCA: 76] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2012] [Revised: 12/23/2012] [Accepted: 03/01/2013] [Indexed: 11/22/2022]
Abstract
The neural processing of sensory stimuli involves a transformation of physical stimulus parameters into perceptual features, and elucidating where and how this transformation occurs is one of the ultimate aims of sensory neurophysiology. Recent studies have shown that the firing of neurons in early sensory cortex can be modulated by multisensory interactions [1-5], motor behavior [1, 3, 6, 7], and reward feedback [1, 8, 9], but it remains unclear whether neural activity is more closely tied to perception, as indicated by behavioral choice, or to the physical properties of the stimulus. We investigated which of these properties are predominantly represented in auditory cortex by recording local field potentials (LFPs) and multiunit spiking activity in ferrets while they discriminated the pitch of artificial vowels. We found that auditory cortical activity is informative both about the fundamental frequency (F0) of a target sound and also about the pitch that the animals appear to perceive given their behavioral responses. Surprisingly, although the stimulus F0 was well represented at the onset of the target sound, neural activity throughout auditory cortex frequently predicted the reported pitch better than the target F0.
Collapse
|
41
|
Searching for the mismatch negativity in primary auditory cortex of the awake monkey: deviance detection or stimulus specific adaptation? J Neurosci 2013; 32:15747-58. [PMID: 23136414 DOI: 10.1523/jneurosci.2835-12.2012] [Citation(s) in RCA: 117] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
The mismatch negativity (MMN) is a preattentive component of the auditory event-related potential that is elicited by a change in a repetitive acoustic pattern. While MMN has been extensively used in human electrophysiological studies of auditory processing, the neural mechanisms and brain regions underlying its generation remain unclear. We investigate possible homologs of the MMN in macaque primary auditory cortex (A1) using a frequency oddball paradigm in which rare "deviant" tones are randomly interspersed among frequent "standard" tones. Standards and deviants had frequencies equal to the best frequency (BF) of the recorded neural population or to a frequency that evoked a response half the amplitude of the BF response. Early and later field potentials, current source density components, multiunit activity, and induced high-gamma band responses were larger when elicited by deviants than by standards of the same frequency. Laminar analysis indicated that differences between deviant and standard responses were more prominent in later activity, thus suggesting cortical amplification of initial responses driven by thalamocortical inputs. However, unlike the human MMN, larger deviant responses were characterized by the enhancement of "obligatory" responses rather than the introduction of new components. Furthermore, a control condition wherein deviants were interspersed among many tones of variable frequency replicated the larger responses to deviants under the oddball condition. Results suggest that differential responses under the oddball condition in macaque A1 reflect stimulus-specific adaptation rather than deviance detection per se. We conclude that neural mechanisms of deviance detection likely reside in cortical areas outside of A1.
Collapse
|
42
|
Abstract
Pitch, our perception of how high or low a sound is on a musical scale, is a fundamental perceptual attribute of sounds and is important for both music and speech. After more than a century of research, the exact mechanisms used by the auditory system to extract pitch are still being debated. Theoretically, pitch can be computed using either spectral or temporal acoustic features of a sound. We have investigated how cues derived from the temporal envelope and spectrum of an acoustic signal are used for pitch extraction in the common marmoset (Callithrix jacchus), a vocal primate species, by measuring pitch discrimination behaviorally and examining pitch-selective neuronal responses in auditory cortex. We find that pitch is extracted by marmosets using temporal envelope cues for lower pitch sounds composed of higher-order harmonics, whereas spectral cues are used for higher pitch sounds with lower-order harmonics. Our data support dual-pitch processing mechanisms, originally proposed by psychophysicists based on human studies, whereby pitch is extracted using a combination of temporal envelope and spectral cues.
Collapse
|
43
|
Hertrich I, Dietrich S, Ackermann H. Tracking the speech signal--time-locked MEG signals during perception of ultra-fast and moderately fast speech in blind and in sighted listeners. BRAIN AND LANGUAGE 2013; 124:9-21. [PMID: 23332808 DOI: 10.1016/j.bandl.2012.10.006] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/10/2011] [Revised: 09/28/2012] [Accepted: 10/15/2012] [Indexed: 06/01/2023]
Abstract
Blind people can learn to understand speech at ultra-high syllable rates (ca. 20 syllables/s), a capability associated with hemodynamic activation of the central-visual system. To further elucidate the neural mechanisms underlying this skill, magnetoencephalographic (MEG) measurements during listening to sentence utterances were cross-correlated with time courses derived from the speech signal (envelope, syllable onsets and pitch periodicity) to capture phase-locked MEG components (14 blind, 12 sighted subjects; speech rate=8 or 16 syllables/s, pre-defined source regions: auditory and visual cortex, inferior frontal gyrus). Blind individuals showed stronger phase locking in auditory cortex than sighted controls, and right-hemisphere visual cortex activity correlated with syllable onsets in case of ultra-fast speech. Furthermore, inferior-frontal MEG components time-locked to pitch periodicity displayed opposite lateralization effects in sighted (towards right hemisphere) and blind subjects (left). Thus, ultra-fast speech comprehension in blind individuals appears associated with changes in early signal-related processing mechanisms both within and outside the central-auditory terrain.
Collapse
Affiliation(s)
- Ingo Hertrich
- Department of General Neurology, Hertie Institute for Clinical Brain Research, University of Tübingen, Germany.
| | | | | |
Collapse
|
44
|
Nourski KV, Brugge JF, Reale RA, Kovach CK, Oya H, Kawasaki H, Jenison RL, Howard MA. Coding of repetitive transients by auditory cortex on posterolateral superior temporal gyrus in humans: an intracranial electrophysiology study. J Neurophysiol 2012; 109:1283-95. [PMID: 23236002 DOI: 10.1152/jn.00718.2012] [Citation(s) in RCA: 54] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
Evidence regarding the functional subdivisions of human auditory cortex has been slow to converge on a definite model. In part, this reflects inadequacies of current understanding of how the cortex represents temporal information in acoustic signals. To address this, we investigated spatiotemporal properties of auditory responses in human posterolateral superior temporal (PLST) gyrus to acoustic click-train stimuli using intracranial recordings from neurosurgical patients. Subjects were patients undergoing chronic invasive monitoring for refractory epilepsy. The subjects listened passively to acoustic click-train stimuli of varying durations (160 or 1,000 ms) and rates (4-200 Hz), delivered diotically via insert earphones. Multicontact subdural grids placed over the perisylvian cortex recorded intracranial electrocorticographic responses from PLST and surrounding areas. Analyses focused on averaged evoked potentials (AEPs) and high gamma (70-150 Hz) event-related band power (ERBP). Responses to click trains featured prominent AEP waveforms and increases in ERBP. The magnitude of AEPs and ERBP typically increased with click rate. Superimposed on the AEPs were frequency-following responses (FFRs), most prominent at 50-Hz click rates but still detectable at stimulus rates up to 200 Hz. Loci with the largest high gamma responses on PLST were often different from those sites that exhibited the strongest FFRs. The data indicate that responses of non-core auditory cortex of PLST represent temporal stimulus features in multiple ways. These include an isomorphic representation of periodicity (as measured by the FFR), a representation based on increases in non-phase-locked activity (as measured by high gamma ERBP), and spatially distributed patterns of activity.
Collapse
Affiliation(s)
- Kirill V Nourski
- Dept. of Neurosurgery, The Univ. of Iowa, Iowa City, IA 52242, USA.
| | | | | | | | | | | | | | | |
Collapse
|
45
|
Wang X, Walker KMM. Neural mechanisms for the abstraction and use of pitch information in auditory cortex. J Neurosci 2012; 32:13339-42. [PMID: 23015423 PMCID: PMC3752151 DOI: 10.1523/jneurosci.3814-12.2012] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2012] [Revised: 07/18/2012] [Accepted: 07/23/2012] [Indexed: 11/21/2022] Open
Abstract
Experiments in animals have provided an important complement to human studies of pitch perception by revealing how the activity of individual neurons represents harmonic complex and periodic sounds. Such studies have shown that the acoustical parameters associated with pitch are represented by the spiking responses of neurons in A1 (primary auditory cortex) and various higher auditory cortical fields. The responses of these neurons are also modulated by the timbre of sounds. In marmosets, a distinct region on the low-frequency border of primary and non-primary auditory cortex may provide pitch tuning that generalizes across timbre classes.
Collapse
Affiliation(s)
- Xiaoqin Wang
- Tsinghua-Johns Hopkins Joint Center for Biomedical Engineering Research and Department of Biomedical Engineering, Tsinghua University, Beijing 100084, China.
| | | |
Collapse
|
46
|
Hertrich I, Dietrich S, Trouvain J, Moos A, Ackermann H. Magnetic brain activity phase-locked to the envelope, the syllable onsets, and the fundamental frequency of a perceived speech signal. Psychophysiology 2011; 49:322-34. [PMID: 22175821 DOI: 10.1111/j.1469-8986.2011.01314.x] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2011] [Accepted: 09/08/2011] [Indexed: 11/27/2022]
Abstract
During speech perception, acoustic correlates of syllable structure and pitch periodicity are directly reflected in electrophysiological brain activity. Magnetoencephalography (MEG) recordings were made while 10 participants listened to natural or formant-synthesized speech at moderately fast or ultrafast rate. Cross-correlation analysis was applied to show brain activity time-locked to the speech envelope, to an acoustic marker of syllable onsets, and to pitch periodicity. The envelope yielded a right-lateralized M100-like response, syllable onsets gave rise to M50/M100-like fields with an additional anterior M50 component, and pitch (ca. 100 Hz) elicited a neural resonance bound to a central auditory source at a latency of 30 ms. The strength of these MEG components showed differential effects of syllable rate and natural versus synthetic speech. Presumingly, such phase-locking mechanisms serve as neuronal triggers for the extraction of information-bearing elements.
Collapse
Affiliation(s)
- Ingo Hertrich
- Department of General Neurology, Hertie Institute for Clinical Brain Research, University of Tübingen, Tübingen, Germany.
| | | | | | | | | |
Collapse
|
47
|
Nourski KV, Brugge JF. Representation of temporal sound features in the human auditory cortex. Rev Neurosci 2011; 22:187-203. [PMID: 21476940 DOI: 10.1515/rns.2011.016] [Citation(s) in RCA: 55] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Temporal information in acoustic signals is important for the perception of environmental sounds, including speech. This review focuses on several aspects of temporal processing within human auditory cortex and its relevance for the processing of speech sounds. Periodic non-speech sounds, such as trains of acoustic clicks and bursts of amplitude-modulated noise or tones, can elicit different percepts depending on the pulse repetition rate or modulation frequency. Such sounds provide convenient methodological tools to study representation of timing information in the auditory system. At low repetition rates of up to 8-10 Hz, each individual stimulus (a single click or a sinusoidal amplitude modulation cycle) within the sequence is perceived as a separate event. As repetition rates increase up to and above approximately 40 Hz, these events blend together, giving rise first to the percept of flutter and then to pitch. The extent to which neural responses of human auditory cortex encode temporal features of acoustic stimuli is discussed within the context of these perceptual classes of periodic stimuli and their relationship to speech sounds. Evidence for neural coding of temporal information at the level of the core auditory cortex in humans suggests possible physiological counterparts to perceptual categorical boundaries for periodic acoustic stimuli. Temporal coding is less evident in auditory cortical fields beyond the core. Finally, data suggest hemispheric asymmetry in temporal cortical processing.
Collapse
Affiliation(s)
- Kirill V Nourski
- Human Brain Research Laboratory, Department of Neurosurgery, The University of Iowa, 200 Hawkins Dr., Iowa City, IA 52242, USA.
| | | |
Collapse
|
48
|
Yin P, Johnson JS, O'Connor KN, Sutter ML. Coding of amplitude modulation in primary auditory cortex. J Neurophysiol 2010; 105:582-600. [PMID: 21148093 DOI: 10.1152/jn.00621.2010] [Citation(s) in RCA: 66] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
Conflicting results have led to different views about how temporal modulation is encoded in primary auditory cortex (A1). Some studies find a substantial population of neurons that change firing rate without synchronizing to temporal modulation, whereas other studies fail to see these nonsynchronized neurons. As a result, the role and scope of synchronized temporal and nonsynchronized rate codes in AM processing in A1 remains unresolved. We recorded A1 neurons' responses in awake macaques to sinusoidal AM noise. We find most (37-78%) neurons synchronize to at least one modulation frequency (MF) without exhibiting nonsynchronized responses. However, we find both exclusively nonsynchronized neurons (7-29%) and "mixed-mode" neurons (13-40%) that synchronize to at least one MF and fire nonsynchronously to at least one other. We introduce new measures for modulation encoding and temporal synchrony that can improve the analysis of how neurons encode temporal modulation. These include comparing AM responses to the responses to unmodulated sounds, and a vector strength measure that is suitable for single-trial analysis. Our data support a transformation from a temporally based population code of AM to a rate-based code as information ascends the auditory pathway. The number of mixed-mode neurons found in A1 indicates this transformation is not yet complete, and A1 neurons may carry multiplexed temporal and rate codes.
Collapse
Affiliation(s)
- Pingbo Yin
- Center for Neuroscience, University of California at Davis, 1544 Newton Court, Davis, CA 95618, USA
| | | | | | | |
Collapse
|
49
|
A possible role for a paralemniscal auditory pathway in the coding of slow temporal information. Hear Res 2010; 272:125-34. [PMID: 21094680 DOI: 10.1016/j.heares.2010.10.009] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/17/2010] [Revised: 09/27/2010] [Accepted: 10/19/2010] [Indexed: 11/20/2022]
Abstract
Low-frequency temporal information present in speech is critical for normal perception, however the neural mechanism underlying the differentiation of slow rates in acoustic signals is not known. Data from the rat trigeminal system suggest that the paralemniscal pathway may be specifically tuned to code low-frequency temporal information. We tested whether this phenomenon occurs in the auditory system by measuring the representation of temporal rate in lemniscal and paralemniscal auditory thalamus and cortex in guinea pig. Similar to the trigeminal system, responses measured in auditory thalamus indicate that slow rates are differentially represented in a paralemniscal pathway. In cortex, both lemniscal and paralemniscal neurons indicated sensitivity to slow rates. We speculate that a paralemniscal pathway in the auditory system may be specifically tuned to code low-frequency temporal information present in acoustic signals. These data suggest that somatosensory and auditory modalities have parallel sub-cortical pathways that separately process slow rates and the spatial representation of the sensory periphery.
Collapse
|
50
|
Neural correlates of auditory scene analysis based on inharmonicity in monkey primary auditory cortex. J Neurosci 2010; 30:12480-94. [PMID: 20844143 DOI: 10.1523/jneurosci.1780-10.2010] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
Segregation of concurrent sounds in complex acoustic environments is a fundamental feature of auditory scene analysis. A powerful cue used by the auditory system to segregate concurrent sounds, such as speakers' voices at a cocktail party, is inharmonicity. This can be demonstrated when a component of a harmonic complex tone is perceived as a separate tone "popping out" from the complex as a whole when it is sufficiently mistuned from its harmonic value. The neural bases of perceptual "pop out" of mistuned harmonics are unclear. We recorded multiunit activity from primary auditory cortex (A1) of behaving monkeys elicited by harmonic complex tones that were either "in tune" or that contained a mistuned third harmonic set at the best frequency of the neural populations. Responses to mistuned sounds were enhanced relative to responses to "in-tune" sounds, thus correlating with the enhanced perceptual salience of the mistuned component. Consistent with human psychophysics of "pop out," response enhancements increased with the degree of mistuning, were maximal for neural populations tuned to the frequency of the mistuned component, and were not observed under comparable stimulus conditions that do not elicit perceptual "pop out." Mistuning was also associated with changes in neuronal temporal response patterns phase locked to "beats" in the stimuli. Intracortical auditory evoked potentials paralleled noninvasive neurophysiological correlates of perceptual "pop out" in humans, further augmenting the translational relevance of the results. Findings suggest two complementary neural mechanisms for "pop out," based on the detection of local differences in activation level or coherence of temporal response patterns across A1.
Collapse
|