1. Hullett PW, Leonard MK, Gorno-Tempini ML, Mandelli ML, Chang EF. Parallel Encoding of Speech in Human Frontal and Temporal Lobes. bioRxiv 2024:2024.03.19.585648. [PMID: 38562883] [PMCID: PMC10983886] [DOI: 10.1101/2024.03.19.585648]
Abstract
Models of speech perception are centered around a hierarchy in which auditory representations in the thalamus propagate to primary auditory cortex, then to the lateral temporal cortex, and finally through dorsal and ventral pathways to sites in the frontal lobe. However, evidence for short latency speech responses and low-level spectrotemporal representations in frontal cortex raises the question of whether speech-evoked activity in frontal cortex strictly reflects downstream processing from lateral temporal cortex or whether there are direct parallel pathways from the thalamus or primary auditory cortex to the frontal lobe that supplement the traditional hierarchical architecture. Here, we used high-density direct cortical recordings, high-resolution diffusion tractography, and hemodynamic functional connectivity to evaluate for evidence of direct parallel inputs to frontal cortex from low-level areas. We found that neural populations in the frontal lobe show speech-evoked responses that are synchronous or occur earlier than responses in the lateral temporal cortex. These short latency frontal lobe neural populations encode spectrotemporal speech content indistinguishable from spectrotemporal encoding patterns observed in the lateral temporal lobe, suggesting parallel auditory speech representations reaching temporal and frontal cortex simultaneously. This is further supported by white matter tractography and functional connectivity patterns that connect the auditory nucleus of the thalamus (medial geniculate body) and the primary auditory cortex to the frontal lobe. Together, these results support the existence of a robust pathway of parallel inputs from low-level auditory areas to frontal lobe targets and illustrate long-range parallel architecture that works alongside the classical hierarchical speech network model.
2. López-Ramos D, Marrufo-Pérez MI, Eustaquio-Martín A, López-Bascuas LE, Lopez-Poveda EA. Adaptation to Noise in Spectrotemporal Modulation Detection and Word Recognition. Trends Hear 2024;28:23312165241266322. [PMID: 39267369] [PMCID: PMC11401146] [DOI: 10.1177/23312165241266322]
Abstract
Noise adaptation is the improvement in auditory function as the signal of interest is delayed in the noise. Here, we investigated whether noise adaptation occurs in spectral, temporal, and spectrotemporal modulation detection as well as in speech recognition. Eighteen normal-hearing adults participated in the experiments. In the modulation detection tasks, the signal was a 200-ms spectrally and/or temporally modulated ripple noise. The spectral modulation rate was two cycles per octave, the temporal modulation rate was 10 Hz, and the spectrotemporal modulations combined these two modulations, which resulted in a downward-moving ripple. A control experiment was performed to determine whether the results generalized to upward-moving ripples. In the speech recognition task, the signal consisted of disyllabic words, either unprocessed or vocoded to maintain only envelope cues. Modulation detection thresholds at 0 dB signal-to-noise ratio and speech reception thresholds were measured in quiet and in white noise (at 60 dB SPL) for noise-signal onset delays of 50 ms (early condition) and 800 ms (late condition). Adaptation was calculated as the threshold difference between the early and late conditions. Adaptation in word recognition was statistically significant for vocoded words (2.1 dB) but not for natural words (0.6 dB). Adaptation was found to be statistically significant in spectral (2.1 dB) and temporal (2.2 dB) modulation detection but not in spectrotemporal modulation detection (downward ripple: 0.0 dB, upward ripple: -0.4 dB). Findings suggest that noise adaptation in speech recognition is unrelated to improvements in the encoding of spectrotemporal modulation cues.
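A minimal numeric sketch of the adaptation measure defined above; the threshold values are hypothetical, and the sign convention (positive adaptation means a lower threshold in the late condition) is an assumption consistent with the description of adaptation as an improvement:
```python
# Sketch: adaptation = threshold in the early condition (50-ms delay)
# minus threshold in the late condition (800-ms delay).
# Lower thresholds mean better performance, so a positive value
# indicates improvement as the signal is delayed in the noise.

def adaptation_db(threshold_early_db: float, threshold_late_db: float) -> float:
    return threshold_early_db - threshold_late_db

# Hypothetical threshold values, chosen only to illustrate the arithmetic:
print(round(adaptation_db(threshold_early_db=-4.0, threshold_late_db=-6.1), 1))  # 2.1 dB of adaptation
```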
Affiliation(s)
- David López-Ramos
- Instituto de Neurociencias de Castilla y León, Universidad de Salamanca, Salamanca, Spain
- Instituto de Investigación Biomédica de Salamanca, Universidad de Salamanca, Salamanca, Spain
- Miriam I Marrufo-Pérez
- Instituto de Neurociencias de Castilla y León, Universidad de Salamanca, Salamanca, Spain
- Instituto de Investigación Biomédica de Salamanca, Universidad de Salamanca, Salamanca, Spain
- Almudena Eustaquio-Martín
- Instituto de Neurociencias de Castilla y León, Universidad de Salamanca, Salamanca, Spain
- Instituto de Investigación Biomédica de Salamanca, Universidad de Salamanca, Salamanca, Spain
- Luis E López-Bascuas
- Departamento de Psicología Experimental, Procesos Cognitivos y Logopedia, Universidad Complutense de Madrid, Madrid, Spain
- Enrique A Lopez-Poveda
- Instituto de Neurociencias de Castilla y León, Universidad de Salamanca, Salamanca, Spain
- Instituto de Investigación Biomédica de Salamanca, Universidad de Salamanca, Salamanca, Spain
- Departamento de Cirugía, Facultad de Medicina, Universidad de Salamanca, Salamanca, Spain
3. van den Berg MM, Busscher E, Borst JGG, Wong AB. Neuronal responses in mouse inferior colliculus correlate with behavioral detection of amplitude-modulated sound. J Neurophysiol 2023;130:524-546. [PMID: 37465872] [DOI: 10.1152/jn.00048.2023]
Abstract
Amplitude modulation (AM) is a common feature of natural sounds, including speech and animal vocalizations. Here, we used operant conditioning and in vivo electrophysiology to determine the AM detection threshold of mice as well as its underlying neuronal encoding. Mice were trained in a Go-NoGo task to detect the transition to AM within a noise stimulus designed to prevent the use of spectral side-bands or a change in intensity as alternative cues. Our results indicate that mice, compared with other species, detect high modulation frequencies up to 512 Hz well, but show much poorer performance at low frequencies. Our in vivo multielectrode recordings in the inferior colliculus (IC) of both anesthetized and awake mice revealed a few single units with remarkable phase-locking ability to 512 Hz modulation, but not sufficient to explain the good behavioral detection at that frequency. Using a model of the population response that combined dimensionality reduction with threshold detection, we reproduced the general band-pass characteristics of behavioral detection based on a subset of neurons showing the largest firing rate change (both increase and decrease) in response to AM, suggesting that these neurons are instrumental in the behavioral detection of AM stimuli by the mice.
NEW & NOTEWORTHY The amplitude of natural sounds, including speech and animal vocalizations, often shows characteristic modulations. We examined the relationship between neuronal responses in the mouse inferior colliculus and the behavioral detection of amplitude modulation (AM) in sound and modeled how the former can give rise to the latter. Our model suggests that behavioral detection can be well explained by the activity of a subset of neurons showing the largest firing rate changes in response to AM.
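One generic way to implement a population read-out of the kind described (dimensionality reduction followed by threshold detection) is sketched below; the PCA-plus-criterion pipeline, the simulated rates, and all parameter values are illustrative assumptions rather than the authors' actual model.
```python
import numpy as np

# Sketch: reduce a trials x neurons matrix of firing rates to one dimension,
# then call a trial "AM detected" when its projection exceeds a criterion
# derived from noise-only (unmodulated) trials.
rng = np.random.default_rng(0)
n_trials, n_neurons = 200, 50
rates_noise = rng.normal(10.0, 2.0, size=(n_trials, n_neurons))      # unmodulated noise trials
am_effect = np.linspace(-1.5, 1.5, n_neurons)                        # AM increases some rates, decreases others
rates_am = rng.normal(10.0, 2.0, size=(n_trials, n_neurons)) + am_effect

X = np.vstack([rates_noise, rates_am])
Xc = X - X.mean(axis=0)
_, _, vt = np.linalg.svd(Xc, full_matrices=False)
w = vt[0]
if (rates_am.mean(axis=0) - rates_noise.mean(axis=0)) @ w < 0:
    w = -w                                             # orient the axis toward the AM-evoked change

proj_noise, proj_am = Xc[:n_trials] @ w, Xc[n_trials:] @ w
criterion = proj_noise.mean() + 2 * proj_noise.std()   # detection threshold from noise trials
print(f"hit rate {np.mean(proj_am > criterion):.2f}, "
      f"false-alarm rate {np.mean(proj_noise > criterion):.2f}")
```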
Affiliation(s)
- Maurits M van den Berg
- Department of Neuroscience, Erasmus MC, University Medical Center Rotterdam, Rotterdam, The Netherlands
- Esmée Busscher
- Department of Neuroscience, Erasmus MC, University Medical Center Rotterdam, Rotterdam, The Netherlands
- J Gerard G Borst
- Department of Neuroscience, Erasmus MC, University Medical Center Rotterdam, Rotterdam, The Netherlands
- Aaron B Wong
- Department of Neuroscience, Erasmus MC, University Medical Center Rotterdam, Rotterdam, The Netherlands
4. He F, Stevenson IH, Escabí MA. Two stages of bandwidth scaling drives efficient neural coding of natural sounds. PLoS Comput Biol 2023;19:e1010862. [PMID: 36787338] [PMCID: PMC9970106] [DOI: 10.1371/journal.pcbi.1010862]
Abstract
Theories of efficient coding propose that the auditory system is optimized for the statistical structure of natural sounds, yet the transformations underlying optimal acoustic representations are not well understood. Using a database of natural sounds including human speech and a physiologically-inspired auditory model, we explore the consequences of peripheral (cochlear) and mid-level (auditory midbrain) filter tuning transformations on the representation of natural sound spectra and modulation statistics. Whereas Fourier-based sound decompositions have constant time-frequency resolution at all frequencies, cochlear and auditory midbrain filter bandwidths increase in proportion to the filter center frequency. This form of bandwidth scaling produces a systematic decrease in spectral resolution and increase in temporal resolution with increasing frequency. Here we demonstrate that cochlear bandwidth scaling produces a frequency-dependent gain that counteracts the tendency of natural sound power to decrease with frequency, resulting in a whitened output representation. Similarly, bandwidth scaling in mid-level auditory filters further enhances the representation of natural sounds by producing a whitened modulation power spectrum (MPS) with higher modulation entropy than both the cochlear outputs and the conventional Fourier MPS. These findings suggest that the tuning characteristics of the peripheral and mid-level auditory system together produce a whitened output representation in three dimensions (frequency, temporal and spectral modulation) that reduces redundancies and allows for a more efficient use of neural resources. This hierarchical multi-stage tuning strategy is thus likely optimized to extract available information and may underlie perceptual sensitivity to natural sounds.
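The whitening effect of bandwidth scaling can be illustrated with a toy calculation (an illustration of the general principle only, not the study's auditory model): for a spectrum whose power falls roughly as 1/f, filters whose bandwidth grows in proportion to center frequency (constant Q) collect approximately the same power in every channel, whereas constant-bandwidth, Fourier-like filters do not.
```python
import numpy as np

f = np.linspace(100.0, 10_000.0, 100_000)   # frequency axis, Hz
power = 1.0 / f                              # idealized 1/f natural-sound power spectrum
df = f[1] - f[0]

def band_power(cf, bw):
    """Power collected by a rectangular filter centered at cf with bandwidth bw."""
    band = (f >= cf - bw / 2) & (f <= cf + bw / 2)
    return power[band].sum() * df

centers = np.geomspace(200, 8000, 6)
constant_bw = [band_power(cf, 100.0) for cf in centers]    # Fourier-like: fixed 100-Hz bandwidth
constant_q = [band_power(cf, cf / 4.0) for cf in centers]  # cochlea-like: bandwidth = cf / Q, Q = 4

print("constant bandwidth:", np.round(constant_bw, 4))  # output power falls with frequency
print("constant Q:        ", np.round(constant_q, 4))   # output power is roughly flat ("whitened")
```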
Affiliation(s)
- Fengrong He
- Biomedical Engineering, University of Connecticut, Storrs, Connecticut, United States of America
- Ian H. Stevenson
- Biomedical Engineering, University of Connecticut, Storrs, Connecticut, United States of America
- Psychological Sciences, University of Connecticut, Storrs, Connecticut, United States of America
- The Connecticut Institute for Brain and Cognitive Sciences, University of Connecticut, Storrs, Connecticut, United States of America
- Monty A. Escabí
- Biomedical Engineering, University of Connecticut, Storrs, Connecticut, United States of America
- Psychological Sciences, University of Connecticut, Storrs, Connecticut, United States of America
- The Connecticut Institute for Brain and Cognitive Sciences, University of Connecticut, Storrs, Connecticut, United States of America
- Electrical and Computer Engineering, University of Connecticut, Storrs, Connecticut, United States of America
5. Akça M, Vuoskoski JK, Laeng B, Bishop L. Recognition of brief sounds in rapid serial auditory presentation. PLoS One 2023;18:e0284396. [PMID: 37053212] [PMCID: PMC10101377] [DOI: 10.1371/journal.pone.0284396]
Abstract
Two experiments were conducted to test the role of participant factors (i.e., musical sophistication, working memory capacity) and stimulus factors (i.e., sound duration, timbre) on auditory recognition using a rapid serial auditory presentation paradigm. Participants listened to a rapid stream of very brief sounds ranging from 30 to 150 milliseconds and were tested on their ability to distinguish the presence from the absence of a target sound selected from various sound sources placed amongst the distracters. Experiment 1a established that brief exposure to stimuli (60 to 150 milliseconds) does not necessarily correspond to impaired recognition. In Experiment 1b we found evidence that 30 milliseconds of exposure to the stimuli significantly impairs recognition of single auditory targets, but recognition of voice and sine-tone targets was impaired the least, suggesting that the lower limit required for successful recognition could be lower than 30 milliseconds for voice and sine-tone targets. Critically, the effect of sound duration on recognition completely disappeared when differences in musical sophistication were controlled for. Participants' working memory capacities did not seem to predict their recognition performance. Our behavioral results extend studies of the processing of brief timbres under temporal constraints by suggesting that musical sophistication may play a larger role than previously thought. These results can also provide a working hypothesis for future research, namely, that underlying neural mechanisms for the processing of various sound sources may have different temporal constraints.
Affiliation(s)
- Merve Akça
- RITMO Center for Interdisciplinary Studies in Rhythm, Time and Motion, University of Oslo, Oslo, Norway
- Department of Musicology, University of Oslo, Oslo, Norway
- Jonna Katariina Vuoskoski
- RITMO Center for Interdisciplinary Studies in Rhythm, Time and Motion, University of Oslo, Oslo, Norway
- Department of Musicology, University of Oslo, Oslo, Norway
- Department of Psychology, University of Oslo, Oslo, Norway
- Bruno Laeng
- RITMO Center for Interdisciplinary Studies in Rhythm, Time and Motion, University of Oslo, Oslo, Norway
- Department of Psychology, University of Oslo, Oslo, Norway
- Laura Bishop
- RITMO Center for Interdisciplinary Studies in Rhythm, Time and Motion, University of Oslo, Oslo, Norway
- Department of Musicology, University of Oslo, Oslo, Norway
6. Gentile Polese A, Nigam S, Hurley LM. 5-HT1A Receptors Alter Temporal Responses to Broadband Vocalizations in the Mouse Inferior Colliculus Through Response Suppression. Front Neural Circuits 2021;15:718348. [PMID: 34512276] [PMCID: PMC8430226] [DOI: 10.3389/fncir.2021.718348]
Abstract
Neuromodulatory systems may provide information on social context to auditory brain regions, but relatively few studies have assessed the effects of neuromodulation on auditory responses to acoustic social signals. To address this issue, we measured the influence of the serotonergic system on the responses of neurons in a mouse auditory midbrain nucleus, the inferior colliculus (IC), to vocal signals. Broadband vocalizations (BBVs) are human-audible signals produced by mice in distress as well as by female mice in opposite-sex interactions. The production of BBVs is context-dependent in that they are produced both at early stages of interactions as females physically reject males and at later stages as males mount females. Serotonin in the IC of males corresponds to these events, and is elevated more in males that experience less female rejection. We measured the responses of single IC neurons to five recorded examples of BBVs in anesthetized mice. We then locally activated the 5-HT1A receptor through iontophoretic application of 8-OH-DPAT. IC neurons showed little selectivity for different BBVs, but spike trains were characterized by local regions of high spike probability, which we called "response features." Response features varied across neurons and also across calls for individual neurons, ranging from 1 to 7 response features for responses of single neurons to single calls. 8-OH-DPAT suppressed spikes and also reduced the numbers of response features. The weakest response features were the most likely to disappear, suggestive of an "iceberg"-like effect in which activation of the 5-HT1A receptor suppressed weakly suprathreshold response features below the spiking threshold. Because serotonin in the IC is more likely to be elevated for mounting-associated BBVs than for rejection-associated BBVs, these effects of the 5-HT1A receptor could contribute to the differential auditory processing of BBVs in different behavioral subcontexts.
Affiliation(s)
- Arianna Gentile Polese
- Department of Cell and Developmental Biology, University of Colorado Anschutz Medical Campus, Aurora, CO, United States
- Department of Biology, Program in Neuroscience, Indiana University Bloomington, Bloomington, IN, United States
- Sunny Nigam
- Department of Neurobiology and Anatomy, McGovern Medical School, The University of Texas Health Science Center at Houston, Houston, TX, United States
- Department of Physics, Indiana University Bloomington, Bloomington, IN, United States
- Laura M. Hurley
- Department of Neurobiology and Anatomy, McGovern Medical School, The University of Texas Health Science Center at Houston, Houston, TX, United States
7. Boebinger D, Norman-Haignere SV, McDermott JH, Kanwisher N. Music-selective neural populations arise without musical training. J Neurophysiol 2021;125:2237-2263. [PMID: 33596723] [PMCID: PMC8285655] [DOI: 10.1152/jn.00588.2020]
Abstract
Recent work has shown that human auditory cortex contains neural populations anterior and posterior to primary auditory cortex that respond selectively to music. However, it is unknown how this selectivity for music arises. To test whether musical training is necessary, we measured fMRI responses to 192 natural sounds in 10 people with almost no musical training. When voxel responses were decomposed into underlying components, this group exhibited a music-selective component that was very similar in response profile and anatomical distribution to that previously seen in individuals with moderate musical training. We also found that musical genres that were less familiar to our participants (e.g., Balinese gamelan) produced strong responses within the music component, as did drum clips with rhythm but little melody, suggesting that these neural populations are broadly responsive to music as a whole. Our findings demonstrate that the signature properties of neural music selectivity do not require musical training to develop, showing that the music-selective neural populations are a fundamental and widespread property of the human brain.
NEW & NOTEWORTHY We show that music-selective neural populations are clearly present in people without musical training, demonstrating that they are a fundamental and widespread property of the human brain. Additionally, we show music-selective neural populations respond strongly to music from unfamiliar genres as well as music with rhythm but little pitch information, suggesting that they are broadly responsive to music as a whole.
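As a generic illustration of decomposing a voxels-by-sounds response matrix into shared components (here plain non-negative matrix factorization on synthetic data; the study used its own decomposition method, so this is only an analogy):
```python
import numpy as np
from sklearn.decomposition import NMF

rng = np.random.default_rng(1)
n_voxels, n_sounds, n_components = 500, 192, 6

# Synthetic data: each voxel's response to the sound set is a non-negative
# mixture of a few underlying response profiles (one of which could, in real
# data, respond selectively to music).
true_profiles = rng.gamma(2.0, 1.0, size=(n_components, n_sounds))
voxel_mixing = rng.gamma(1.0, 1.0, size=(n_voxels, n_components))
responses = (voxel_mixing @ true_profiles
             + rng.normal(0.0, 0.1, size=(n_voxels, n_sounds))).clip(min=0)

model = NMF(n_components=n_components, init="nndsvda", max_iter=500, random_state=0)
voxel_weights = model.fit_transform(responses)   # voxels x components
response_profiles = model.components_            # components x sounds
print(voxel_weights.shape, response_profiles.shape)
```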
Affiliation(s)
- Dana Boebinger
- Speech and Hearing Bioscience and Technology, Harvard University, Cambridge, Massachusetts
- Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, Massachusetts
- McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, Massachusetts
- Sam V Norman-Haignere
- Laboratoire des Systèmes Perceptifs, Département d'Études Cognitives, École Normale Supérieure, PSL Research University, CNRS, Paris, France
- Zuckerman Institute for Brain Research, Columbia University, New York, New York
- Josh H McDermott
- Speech and Hearing Bioscience and Technology, Harvard University, Cambridge, Massachusetts
- Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, Massachusetts
- McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, Massachusetts
- Center for Brains, Minds, and Machines, Massachusetts Institute of Technology, Cambridge, Massachusetts
- Nancy Kanwisher
- Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, Massachusetts
- McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, Massachusetts
- Center for Brains, Minds, and Machines, Massachusetts Institute of Technology, Cambridge, Massachusetts
8. Laeng B, Flaaten CB, Walle KM, Hochkeppler A, Specht K. "Mickey Mousing" in the Brain: Motion-Sound Synesthesia and the Subcortical Substrate of Audio-Visual Integration. Front Hum Neurosci 2021;15:605166. [PMID: 33658913] [PMCID: PMC7917298] [DOI: 10.3389/fnhum.2021.605166]
Abstract
Motion-sound synesthesia is characterized by illusory auditory sensations linked to the pattern and rhythms of motion (dubbed "Mickey Mousing" as in cinema) of visually experienced but soundless objects, like an optical flow array, a ball bouncing or a horse galloping. In an MRI study with a group of three synesthetes and a group of eighteen control participants, we found structural changes in the brains of synesthetes in the subcortical multisensory areas of the superior and inferior colliculi. In addition, functional magnetic resonance imaging data showed activity in motion-sensitive regions, as well as temporal and occipital areas, and the cerebellum. However, the synesthetes had a higher activation within the left and right cuneus, with stronger activations when viewing optical flow stimuli. There was also a general difference in connectivity of the colliculi with the above-mentioned regions between the two groups. These findings implicate low-level mechanisms within the human neuroaxis as a substrate for local connectivity and cross activity between perceptual processes that are "distant" in terms of cortical topography. The present findings underline the importance of considering the role of subcortical systems and their connectivity to multimodal regions of the cortex, and they strengthen a parsimonious account of synesthesia, at least for the visual-auditory type.
Affiliation(s)
- Bruno Laeng
- Department of Psychology, University of Oslo, Oslo, Norway
- RITMO Centre for Interdisciplinary Studies in Rhythm, Time and Motion, University of Oslo, Oslo, Norway
- Camilla Barthel Flaaten
- Department of Psychology, University of Oslo, Oslo, Norway
- NORMENT Centre for Research on Mental Disorders, Division of Mental Health and Addiction, University of Oslo and Oslo University Hospital, Oslo, Norway
- Kjersti Maehlum Walle
- Department of Psychology, University of Oslo, Oslo, Norway
- Norwegian Institute of Public Health, Oslo, Norway
- Anne Hochkeppler
- German Centre for Neurodegenerative Diseases (DZNE), Magdeburg, Germany
- Department of Biological and Medical Psychology, University of Bergen, Bergen, Norway
- Karsten Specht
- Department of Biological and Medical Psychology, University of Bergen, Bergen, Norway
- Department of Education, UiT/The Arctic University of Norway, Tromsø, Norway
- Mohn Medical Imaging and Visualization Centre, Haukeland University Hospital, Bergen, Norway
9. Bernstein LR, Trahiotis C. Binaural detection as a joint function of masker bandwidth, masker interaural correlation, and interaural time delay: Empirical data and modeling. J Acoust Soc Am 2020;148:3481. [PMID: 33379873] [DOI: 10.1121/10.0002869]
Abstract
Empirical data are reported demonstrating how binaural detection is affected by joint variation of masker bandwidth, masker interaural correlation, and interaural time delay (ITD) of both masker and tonal signal. Most of the data were obtained with stimuli centered at 500 Hz; supplemental measures were obtained with stimuli centered at 4 kHz. The results indicate that as the interaural correlation of the masker (ρ) is decreased there is (1) an overall increase in threshold signal-to-noise ratio (S/N) and (2) a progressively smaller effect on threshold S/N as ITD is increased. All of the data were accounted for quite accurately using the same quantitative, interaural cross-correlation-based model that was recently shown to account for binaural detection and discrimination data obtained in previous experiments. Importantly, the new data were predicted and explained using values of model parameters that were identical or very close to those found to predict accurately the earlier data. The success of the enterprise attests to the robustness of the approach and the generality of the model's ability to make accurate predictions of binaural performance over a wide range of historically important stimulus conditions.
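The core quantity in interaural cross-correlation models of this type is the normalized correlation of the left- and right-ear waveforms as a function of internal delay; a minimal sketch follows (the authors' full model also includes peripheral filtering and other stages not shown here):
```python
import numpy as np

def interaural_correlation(left, right, fs, max_delay_s=0.001):
    """Normalized correlation of left(t) with right(t + tau) over internal delays tau."""
    max_lag = int(round(max_delay_s * fs))
    lags = np.arange(-max_lag, max_lag + 1)
    rho = []
    for k in lags:
        if k >= 0:
            l, r = left[:len(left) - k] if k else left, right[k:]
        else:
            l, r = left[-k:], right[:len(right) + k]
        rho.append(np.dot(l, r) / np.sqrt(np.dot(l, l) * np.dot(r, r)))
    return lags / fs, np.array(rho)

# Example: a 500-Hz tone with a 500-microsecond ITD (right ear delayed).
fs = 48_000
t = np.arange(0, 0.3, 1 / fs)
left = np.sin(2 * np.pi * 500 * t)
right = np.sin(2 * np.pi * 500 * (t - 500e-6))
delays, rho = interaural_correlation(left, right, fs)
print(delays[np.argmax(rho)])   # approximately 5e-4 s, the imposed ITD
```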
Affiliation(s)
- Leslie R Bernstein
- Departments of Neuroscience and Surgery (Otolaryngology), University of Connecticut Health Center, Farmington, Connecticut 06030, USA
- Constantine Trahiotis
- Departments of Neuroscience and Surgery (Otolaryngology), University of Connecticut Health Center, Farmington, Connecticut 06030, USA
10. Egorova MA, Akimov AG, Khorunzhii GD, Ehret G. Frequency response areas of neurons in the mouse inferior colliculus. III. Time-domain responses: Constancy, dynamics, and precision in relation to spectral resolution, and perception in the time domain. PLoS One 2020;15:e0240853. [PMID: 33104718] [PMCID: PMC7588072] [DOI: 10.1371/journal.pone.0240853]
Abstract
The auditory midbrain (central nucleus of inferior colliculus, ICC) receives multiple brainstem projections and recodes auditory information for perception in higher centers. Many neural response characteristics are represented in gradients (maps) in the three-dimensional ICC space. Map overlap suggests that neurons, depending on their ICC location, encode information in several domains simultaneously by different aspects of their responses. Thus, interdependence of coding, e.g. in spectral and temporal domains, seems to be a general ICC principle. Studies on covariation of response properties and possible impact on sound perception are, however, rare. Here, we evaluated tone-evoked single-neuron activity from the mouse ICC and compared shapes of excitatory frequency-response areas (including strength and shape of inhibition within and around the excitatory area; classes I, II, III) with types of temporal response patterns and first-spike response latencies. Analyses showed covariation of sharpness of frequency tuning with constancy and precision of responding to tone onsets. The highest precision (first-spike latency jitter < 1 ms) and stable phasic responses throughout frequency-response areas were found mainly in class III neurons with broad frequency tuning, which were least influenced by inhibition. Class II neurons with narrow frequency tuning and dominating inhibitory influence were unsuitable for time-domain coding with high precision. The ICC center seems specialized for high spectral resolution (class II presence), whereas lateral parts seem specialized for consistently precise responding to sound onsets (class III presence). Further, the variation of tone-response latencies in the frequency-response areas of individual neurons with phasic, tonic, phasic-tonic, or pauser responses gave rise to the definition of a core area, which represented a time window of about 20 ms from tone onset for tone-onset responding of the whole ICC. This time window corresponds to the roughly 20-ms shortest time interval found to be critical in several auditory perceptual tasks in humans and mice.
Affiliation(s)
- Marina A. Egorova
- Sechenov Institute of Evolutionary Physiology and Biochemistry, Russian Academy of Sciences, St. Petersburg, Russia
- Alexander G. Akimov
- Sechenov Institute of Evolutionary Physiology and Biochemistry, Russian Academy of Sciences, St. Petersburg, Russia
- Gleb D. Khorunzhii
- Sechenov Institute of Evolutionary Physiology and Biochemistry, Russian Academy of Sciences, St. Petersburg, Russia
- Günter Ehret
- Institute of Neurobiology, University of Ulm, Ulm, Germany
11. Logerot P, Smith PF, Wild M, Kubke MF. Auditory processing in the zebra finch midbrain: single unit responses and effect of rearing experience. PeerJ 2020;8:e9363. [PMID: 32775046] [PMCID: PMC7384439] [DOI: 10.7717/peerj.9363]
Abstract
In birds the auditory system plays a key role in providing the sensory input used to discriminate between conspecific and heterospecific vocal signals. In those species that are known to learn their vocalizations, for example, songbirds, it is generally considered that this ability arises and is manifest in the forebrain, although there is no a priori reason why brainstem components of the auditory system could not also play an important part. To test this assumption, we used groups of normally reared and cross-fostered zebra finches that had previously been shown in behavioural experiments to reduce their preference for conspecific songs subsequent to cross-fostering experience with Bengalese finches, a related species with a distinctly different song. The question we asked, therefore, is whether this experiential change also changes the bias in favour of conspecific song displayed by auditory midbrain units of normally raised zebra finches. By recording the responses of single units in MLd to a variety of zebra finch and Bengalese finch songs in both normally reared and cross-fostered zebra finches, we provide a positive answer to this question. That is, the difference in response to conspecific and heterospecific songs seen in normally reared zebra finches is reduced following cross-fostering. In birds the virtual absence of mammalian-like cortical projections upon auditory brainstem nuclei argues against the interpretation that MLd units change, as observed in the present experiments, as a result of top-down influences on sensory processing. Instead, it appears that MLd units can be influenced significantly by sensory inputs arising directly from a change in auditory experience during development.
Affiliation(s)
- Priscilla Logerot
- Anatomy and Medical Imaging, University of Auckland, Auckland, New Zealand
- Paul F. Smith
- Dept. of Pharmacology and Toxicology, School of Biomedical Sciences, Brain Health Research Centre, Brain Research New Zealand, and Eisdell Moore Centre, University of Otago, Dunedin, New Zealand
- Martin Wild
- Anatomy and Medical Imaging and Eisdell Moore Centre, University of Auckland, Auckland, New Zealand
- M. Fabiana Kubke
- Anatomy and Medical Imaging, Centre for Brain Research and Eisdell Moore Centre, University of Auckland, Auckland, New Zealand
12. Spiking network optimized for word recognition in noise predicts auditory system hierarchy. PLoS Comput Biol 2020;16:e1007558. [PMID: 32559204] [PMCID: PMC7329140] [DOI: 10.1371/journal.pcbi.1007558]
Abstract
The auditory neural code is resilient to acoustic variability and capable of recognizing sounds amongst competing sound sources, yet the transformations enabling noise-robust abilities are largely unknown. We report that a hierarchical spiking neural network (HSNN) optimized to maximize word recognition accuracy in noise and multiple talkers predicts the organizational hierarchy of the ascending auditory pathway. Comparisons with data from the auditory nerve, midbrain, thalamus, and cortex reveal that the optimal HSNN predicts several transformations of the ascending auditory pathway, including a sequential loss of temporal resolution and synchronization ability, increasing sparseness, and selectivity. The optimal organizational scheme enhances performance by selectively filtering out noise and fast temporal cues, such as voicing periodicity, that are not directly relevant to the word recognition task. An identical network arranged to enable high information transfer fails to predict auditory pathway organization and has substantially poorer performance. Furthermore, conventional single-layer linear and nonlinear receptive field networks that capture the overall feature extraction of the HSNN fail to achieve similar performance. The findings suggest that the auditory pathway hierarchy and its sequential nonlinear feature extraction computations enhance relevant cues while removing non-informative sources of noise, thus enhancing the representation of sounds in noise-impoverished conditions. The brain's ability to recognize sounds in the presence of competing sounds or background noise is essential for everyday hearing tasks. How the brain accomplishes noise resiliency, however, is poorly understood. Using neural recordings from the ascending auditory pathway and an auditory spiking network model trained for sound recognition in noise, we explore the computational strategies that enable noise robustness. Our results suggest that the hierarchical feature organization of the ascending auditory pathway and the resulting computations are critical for sound recognition in the presence of noise.
13. Little DF, Snyder JS, Elhilali M. Ensemble modeling of auditory streaming reveals potential sources of bistability across the perceptual hierarchy. PLoS Comput Biol 2020;16:e1007746. [PMID: 32275706] [PMCID: PMC7185718] [DOI: 10.1371/journal.pcbi.1007746]
Abstract
Perceptual bistability, the spontaneous, irregular fluctuation of perception between two interpretations of a stimulus, occurs when observing a large variety of ambiguous stimulus configurations. This phenomenon has the potential to serve as a tool for, among other things, understanding how function varies across individuals due to the large individual differences that manifest during perceptual bistability. Yet it remains difficult to interpret the functional processes at work, without knowing where bistability arises during perception. In this study we explore the hypothesis that bistability originates from multiple sources distributed across the perceptual hierarchy. We develop a hierarchical model of auditory processing comprising three distinct levels: a Peripheral, tonotopic analysis, a Central analysis computing features found more centrally in the auditory system, and an Object analysis, where sounds are segmented into different streams. We model bistable perception within this system by applying adaptation, inhibition and noise into one or all of the three levels of the hierarchy. We evaluate a large ensemble of variations of this hierarchical model, where each model has a different configuration of adaptation, inhibition and noise. This approach avoids the assumption that a single configuration must be invoked to explain the data. Each model is evaluated based on its ability to replicate two hallmarks of bistability during auditory streaming: the selectivity of bistability to specific stimulus configurations, and the characteristic log-normal pattern of perceptual switches. Consistent with a distributed origin, a broad range of model parameters across this hierarchy lead to a plausible form of perceptual bistability.
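A minimal two-population competition model containing the three ingredients named above (adaptation, inhibition, and noise) switches irregularly between dominance states; this sketch is a generic illustration of the mechanism, not the ensemble of hierarchical models evaluated in the study, and all parameter values are arbitrary.
```python
import numpy as np

rng = np.random.default_rng(0)
dt, T = 1e-3, 60.0                        # time step and duration (s)
steps = int(T / dt)
tau, tau_a = 0.01, 1.5                    # fast activity and slow adaptation time constants (s)
beta, phi, sigma = 1.5, 1.0, 0.1          # mutual inhibition, adaptation strength, noise level
r = np.array([0.5, 0.4])                  # firing rates of two competing populations
a = np.zeros(2)                           # adaptation variables
dominant = np.zeros(steps, dtype=int)

for i in range(steps):
    drive = 1.0 - beta * r[::-1] - phi * a               # input minus cross-inhibition and adaptation
    r += dt / tau * (np.clip(drive, 0.0, None) - r) + sigma * np.sqrt(dt) * rng.standard_normal(2)
    r = np.clip(r, 0.0, None)
    a += dt / tau_a * (r - a)
    dominant[i] = int(r[1] > r[0])

switch_times = np.flatnonzero(np.diff(dominant)) * dt
durations = np.diff(switch_times)
print(f"{durations.size} dominance durations, mean {durations.mean():.2f} s")
```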
Affiliation(s)
- David F. Little
- Department of Electrical and Computer Engineering, Johns Hopkins University, Baltimore, Maryland, United States of America
- Joel S. Snyder
- Department of Psychology, University of Nevada, Las Vegas; Las Vegas, Nevada, United States of America
- Mounya Elhilali
- Department of Electrical and Computer Engineering, Johns Hopkins University, Baltimore, Maryland, United States of America
14. Albouy P, Benjamin L, Morillon B, Zatorre RJ. Distinct sensitivity to spectrotemporal modulation supports brain asymmetry for speech and melody. Science 2020;367:1043-1047. [DOI: 10.1126/science.aaz3468]
Abstract
Does brain asymmetry for speech and music emerge from acoustical cues or from domain-specific neural networks? We selectively filtered temporal or spectral modulations in sung speech stimuli for which verbal and melodic content was crossed and balanced. Perception of speech decreased only with degradation of temporal information, whereas perception of melodies decreased only with spectral degradation. Functional magnetic resonance imaging data showed that the neural decoding of speech and melodies depends on activity patterns in left and right auditory regions, respectively. This asymmetry is supported by specific sensitivity to spectrotemporal modulation rates within each region. Finally, the effects of degradation on perception were paralleled by their effects on neural classification. Our results suggest a match between acoustical properties of communicative signals and neural specializations adapted to that purpose.
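Selective degradation of temporal or spectral modulations, as described above, can be sketched by low-pass filtering the two-dimensional modulation spectrum of a spectrogram; this is a generic illustration, and the cutoff values, helper name, and random input below are assumptions rather than the study's actual filtering procedure or stimuli.
```python
import numpy as np

def filter_modulations(spectrogram, dt, df, max_temporal_hz=None, max_spectral_cpo=None):
    """Low-pass the 2-D modulation spectrum of a (frequency x time) spectrogram.

    dt: time step between spectrogram frames (s)
    df: spacing between spectral channels (octaves, so spectral modulation is in cycles/octave)
    """
    mps = np.fft.fft2(spectrogram)
    temporal_rates = np.fft.fftfreq(spectrogram.shape[1], d=dt)    # Hz
    spectral_rates = np.fft.fftfreq(spectrogram.shape[0], d=df)    # cycles/octave
    keep = np.ones_like(mps, dtype=bool)
    if max_temporal_hz is not None:
        keep &= np.abs(temporal_rates)[None, :] <= max_temporal_hz
    if max_spectral_cpo is not None:
        keep &= np.abs(spectral_rates)[:, None] <= max_spectral_cpo
    return np.real(np.fft.ifft2(np.where(keep, mps, 0)))

# Example: remove fast temporal modulations (> 2 Hz) from a random spectrogram.
spec = np.random.default_rng(0).random((64, 500))   # 64 channels x 500 frames
degraded = filter_modulations(spec, dt=0.01, df=1 / 8, max_temporal_hz=2.0)
print(degraded.shape)
```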
Affiliation(s)
- Philippe Albouy
- Cognitive Neuroscience Unit, Montreal Neurological Institute, McGill University, Montreal, QC, Canada
- International Laboratory for Brain, Music and Sound Research (BRAMS); Centre for Research in Brain, Language and Music; Centre for Interdisciplinary Research in Music, Media, and Technology, Montreal, QC, Canada
- CERVO Brain Research Centre, School of Psychology, Laval University, Quebec, QC, Canada
- Lucas Benjamin
- Cognitive Neuroscience Unit, Montreal Neurological Institute, McGill University, Montreal, QC, Canada
- Benjamin Morillon
- Aix Marseille University, Inserm, INS, Institut de Neurosciences des Systèmes, Marseille, France
- Robert J. Zatorre
- Cognitive Neuroscience Unit, Montreal Neurological Institute, McGill University, Montreal, QC, Canada
- International Laboratory for Brain, Music and Sound Research (BRAMS); Centre for Research in Brain, Language and Music; Centre for Interdisciplinary Research in Music, Media, and Technology, Montreal, QC, Canada
15. Ito T. Different coding strategy of sound information between GABAergic and glutamatergic neurons in the auditory midbrain. J Physiol 2020;598:1039-1072. [DOI: 10.1113/jp279296]
Affiliation(s)
- Tetsufumi Ito
- Department of Anatomy, Kanazawa Medical University, Uchinada, Ishikawa 920-0293, Japan
- Research and Education Program for Life Science, University of Fukui, Fukui, Fukui 910-8507, Japan
16. Gourévitch B, Mahrt EJ, Bakay W, Elde C, Portfors CV. GABAA receptors contribute more to rate than temporal coding in the IC of awake mice. J Neurophysiol 2020;123:134-148. [PMID: 31721644] [DOI: 10.1152/jn.00377.2019]
Abstract
Speech is our most important form of communication, yet we have a poor understanding of how communication sounds are processed by the brain. Mice make great model organisms to study neural processing of communication sounds because of their rich repertoire of social vocalizations and because they have brain structures analogous to humans, such as the auditory midbrain nucleus inferior colliculus (IC). Although the combined roles of GABAergic and glycinergic inhibition on vocalization selectivity in the IC have been studied to a limited degree, the discrete contributions of GABAergic inhibition have only rarely been examined. In this study, we examined how GABAergic inhibition contributes to shaping responses to pure tones as well as selectivity to complex sounds in the IC of awake mice. In our set of long-latency neurons, we found that GABAergic inhibition extends the evoked firing rate range of IC neurons by lowering the baseline firing rate but maintaining the highest probability of firing rate. GABAergic inhibition also prevented IC neurons from bursting in a spontaneous state. Finally, we found that although GABAergic inhibition shaped the spectrotemporal response to vocalizations in a nonlinear fashion, it did not affect the neural code needed to discriminate vocalizations, based either on spiking patterns or on firing rate. Overall, our results emphasize that even if GABAergic inhibition generally decreases the firing rate, it does so while maintaining or extending the abilities of neurons in the IC to code the wide variety of sounds that mammals are exposed to in their daily lives.
NEW & NOTEWORTHY GABAergic inhibition adds nonlinearity to neuronal response curves. This increases the neuronal range of evoked firing rate by reducing baseline firing. GABAergic inhibition prevents bursting responses from neurons in a spontaneous state, reducing noise in the temporal coding of the neuron. This could result in improved signal transmission to the cortex.
Affiliation(s)
- Boris Gourévitch
- Institut de l'Audition, Institut Pasteur, INSERM, Sorbonne Université, F-75012 Paris, France
- CNRS, France
- Elena J Mahrt
- School of Biological Sciences, Washington State University, Vancouver, Washington
- Warren Bakay
- Institut de l'Audition, Institut Pasteur, INSERM, Sorbonne Université, F-75012 Paris, France
- Cameron Elde
- School of Biological Sciences, Washington State University, Vancouver, Washington
- Christine V Portfors
- School of Biological Sciences, Washington State University, Vancouver, Washington
17. Su Y, Delgutte B. Pitch of harmonic complex tones: rate and temporal coding of envelope repetition rate in inferior colliculus of unanesthetized rabbits. J Neurophysiol 2019;122:2468-2485. [PMID: 31664871] [DOI: 10.1152/jn.00512.2019]
Abstract
Harmonic complex tones (HCTs) found in speech, music, and animal vocalizations evoke strong pitch percepts at their fundamental frequencies. The strongest pitches are produced by HCTs that contain harmonics resolved by cochlear frequency analysis, but HCTs containing solely unresolved harmonics also evoke a weaker pitch at their envelope repetition rate (ERR). In the auditory periphery, neurons phase lock to the stimulus envelope, but this temporal representation of ERR degrades and gives way to rate codes along the ascending auditory pathway. To assess the role of the inferior colliculus (IC) in such transformations, we recorded IC neuron responses to HCT and sinusoidally modulated broadband noise (SAMN) with varying ERR from unanesthetized rabbits. Different interharmonic phase relationships of HCT were used to manipulate the temporal envelope without changing the power spectrum. Many IC neurons demonstrated band-pass rate tuning to ERR between 60 and 1,600 Hz for HCT and between 40 and 500 Hz for SAMN. The tuning was not related to the pure-tone best frequency of neurons but was dependent on the shape of the stimulus envelope, indicating a temporal rather than spectral origin. A phenomenological model suggests that the tuning may arise from peripheral temporal response patterns via synaptic inhibition. We also characterized temporal coding to ERR. Some IC neurons could phase lock to the stimulus envelope up to 900 Hz for either HCT or SAMN, but phase locking was weaker with SAMN. Together, the rate code and the temporal code represent a wide range of ERR, providing strong cues for the pitch of unresolved harmonics.
NEW & NOTEWORTHY Envelope repetition rate (ERR) provides crucial cues for pitch perception of frequency components that are not individually resolved by the cochlea, but the neural representation of ERR for stimuli containing many harmonics is poorly characterized. Here we show that the pitch of stimuli with unresolved harmonics is represented by both a rate code and a temporal code for ERR in auditory midbrain neurons and propose possible underlying neural mechanisms with a computational model.
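Phase locking to the envelope repetition rate is conventionally quantified by vector strength; a minimal sketch with hypothetical spike times (not data from the study):
```python
import numpy as np

def vector_strength(spike_times_s, modulation_freq_hz):
    """Vector strength: 1 = perfect phase locking to the envelope period, 0 = none."""
    phases = 2 * np.pi * modulation_freq_hz * np.asarray(spike_times_s)
    return np.abs(np.mean(np.exp(1j * phases)))

# Hypothetical spike trains for a 160-Hz envelope repetition rate:
rng = np.random.default_rng(0)
locked = np.arange(200) / 160 + rng.normal(0, 0.3e-3, 200)   # spikes clustered at one envelope phase
random = rng.uniform(0, 200 / 160, 200)                       # spikes unrelated to the envelope
print(vector_strength(locked, 160))   # close to 1
print(vector_strength(random, 160))   # close to 0
```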
Affiliation(s)
- Yaqing Su
- Eaton-Peabody Labs, Massachusetts Eye and Ear, Boston, Massachusetts
- Department of Biomedical Engineering, Boston University, Boston, Massachusetts
- Bertrand Delgutte
- Eaton-Peabody Labs, Massachusetts Eye and Ear, Boston, Massachusetts
- Department of Otolaryngology, Harvard Medical School, Boston, Massachusetts
18. Sadeghi M, Zhai X, Stevenson IH, Escabí MA. A neural ensemble correlation code for sound category identification. PLoS Biol 2019;17:e3000449. [PMID: 31574079] [PMCID: PMC6788721] [DOI: 10.1371/journal.pbio.3000449]
Abstract
Humans and other animals effortlessly identify natural sounds and categorize them into behaviorally relevant categories. Yet, the acoustic features and neural transformations that enable sound recognition and the formation of perceptual categories are largely unknown. Here, using multichannel neural recordings in the auditory midbrain of unanesthetized female rabbits, we first demonstrate that neural ensemble activity in the auditory midbrain displays highly structured correlations that vary with distinct natural sound stimuli. These stimulus-driven correlations can be used to accurately identify individual sounds using single-response trials, even when the sounds do not differ in their spectral content. Combining neural recordings and an auditory model, we then show how correlations between frequency-organized auditory channels can contribute to discrimination of not just individual sounds but sound categories. For both the model and neural data, spectral and temporal correlations achieved similar categorization performance and appear to contribute equally. Moreover, both the neural and model classifiers achieve their best task performance when they accumulate evidence over a time frame of approximately 1-2 seconds, mirroring human perceptual trends. These results together suggest that time-frequency correlations in sounds may be reflected in the correlations between auditory midbrain ensembles and that these correlations may play an important role in the identification and categorization of natural sounds.
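The across-channel correlation structure described above can be sketched directly from a frequency-organized response matrix (a generic illustration; the study's neural data and classifier involve considerably more structure than this):
```python
import numpy as np

def spectral_and_temporal_correlations(channels_x_time, max_lag=20):
    """Correlation structure of a frequency-organized response matrix.

    Returns (spectral_corr, temporal_corr):
      spectral_corr: channel x channel correlation matrix
      temporal_corr: channel x lag autocorrelation of each channel
    """
    X = np.asarray(channels_x_time, dtype=float)
    spectral_corr = np.corrcoef(X)
    Xc = X - X.mean(axis=1, keepdims=True)
    var = (Xc ** 2).mean(axis=1)
    temporal_corr = np.stack([
        (Xc[:, :-lag] * Xc[:, lag:]).mean(axis=1) / var if lag else np.ones(len(X))
        for lag in range(max_lag)
    ], axis=1)
    return spectral_corr, temporal_corr

# Example with a random 32-channel response; real sounds would impose structure here.
resp = np.random.default_rng(0).normal(size=(32, 2000))
s_corr, t_corr = spectral_and_temporal_correlations(resp)
print(s_corr.shape, t_corr.shape)   # (32, 32) (32, 20)
```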
Affiliation(s)
- Mina Sadeghi
- Department of Electrical and Computer Engineering, University of Connecticut, Storrs, Connecticut, United States of America
- Xiu Zhai
- Department of Electrical and Computer Engineering, University of Connecticut, Storrs, Connecticut, United States of America
- Department of Biomedical Engineering, University of Connecticut, Storrs, Connecticut, United States of America
- Ian H. Stevenson
- Department of Biomedical Engineering, University of Connecticut, Storrs, Connecticut, United States of America
- Department of Psychological Sciences, University of Connecticut, Storrs, Connecticut, United States of America
- Monty A. Escabí
- Department of Electrical and Computer Engineering, University of Connecticut, Storrs, Connecticut, United States of America
- Department of Biomedical Engineering, University of Connecticut, Storrs, Connecticut, United States of America
- Department of Psychological Sciences, University of Connecticut, Storrs, Connecticut, United States of America
19. Chen C, Read HL, Escabí MA. A temporal integration mechanism enhances frequency selectivity of broadband inputs to inferior colliculus. PLoS Biol 2019;17:e2005861. [PMID: 31233489] [PMCID: PMC6611646] [DOI: 10.1371/journal.pbio.2005861]
Abstract
Accurately resolving frequency components in sounds is essential for sound recognition, yet there is little direct evidence for how frequency selectivity is preserved or newly created across auditory structures. We demonstrate that prepotentials (PPs) with physiological properties resembling presynaptic potentials from broadly tuned brainstem inputs can be recorded concurrently with postsynaptic action potentials in inferior colliculus (IC). These putative brainstem inputs (PBIs) are broadly tuned and exhibit delayed and spectrally interleaved excitation and inhibition not present in the simultaneously recorded IC neurons (ICNs). A sharpening of tuning is accomplished locally at the expense of spike-timing precision through nonlinear temporal integration of broadband inputs. A neuron model replicates the finding and demonstrates that temporal integration alone can degrade timing precision while enhancing frequency tuning through interference of spectrally in- and out-of-phase inputs. These findings suggest that, in contrast to current models that require local inhibition, frequency selectivity can be sharpened through temporal integration, thus supporting an alternative computational strategy to quickly refine frequency selectivity.
Affiliation(s)
- Chen Chen
- Electrical and Computer Engineering, University of Connecticut, Storrs, Connecticut, United States of America
- Heather L. Read
- Biomedical Engineering, University of Connecticut, Storrs, Connecticut, United States of America
- Department of Psychological Sciences, University of Connecticut, Storrs, Connecticut, United States of America
- Monty A. Escabí
- Electrical and Computer Engineering, University of Connecticut, Storrs, Connecticut, United States of America
- Biomedical Engineering, University of Connecticut, Storrs, Connecticut, United States of America
- Department of Psychological Sciences, University of Connecticut, Storrs, Connecticut, United States of America
20. Goyer D, Silveira MA, George AP, Beebe NL, Edelbrock RM, Malinski PT, Schofield BR, Roberts MT. A novel class of inferior colliculus principal neurons labeled in vasoactive intestinal peptide-Cre mice. eLife 2019;8:43770. [PMID: 30998185] [PMCID: PMC6516826] [DOI: 10.7554/elife.43770]
Abstract
Located in the midbrain, the inferior colliculus (IC) is the hub of the central auditory system. Although the IC plays important roles in speech processing, sound localization, and other auditory computations, the organization of the IC microcircuitry remains largely unknown. Using a multifaceted approach in mice, we have identified vasoactive intestinal peptide (VIP) neurons as a novel class of IC principal neurons. VIP neurons are glutamatergic stellate cells with sustained firing patterns. Their extensive axons project to long-range targets including the auditory thalamus, auditory brainstem, superior colliculus, and periaqueductal gray. Using optogenetic circuit mapping, we found that VIP neurons integrate input from the contralateral IC and the dorsal cochlear nucleus. The dorsal cochlear nucleus also drove feedforward inhibition to VIP neurons, indicating that inhibitory circuits within the IC shape the temporal integration of ascending inputs. Thus, VIP neurons are well-positioned to influence auditory computations in a number of brain regions.
Affiliation(s)
- David Goyer
- Kresge Hearing Research Institute, Department of Otolaryngology - Head and Neck Surgery, University of Michigan, Ann Arbor, United States
- Marina A Silveira
- Kresge Hearing Research Institute, Department of Otolaryngology - Head and Neck Surgery, University of Michigan, Ann Arbor, United States
- Alexander P George
- Kresge Hearing Research Institute, Department of Otolaryngology - Head and Neck Surgery, University of Michigan, Ann Arbor, United States
- Nichole L Beebe
- Department of Anatomy and Neurobiology, Northeast Ohio Medical University, Rootstown, United States
- Ryan M Edelbrock
- Department of Anatomy and Neurobiology, Northeast Ohio Medical University, Rootstown, United States
- Peter T Malinski
- Kresge Hearing Research Institute, Department of Otolaryngology - Head and Neck Surgery, University of Michigan, Ann Arbor, United States
- Brett R Schofield
- Department of Anatomy and Neurobiology, Northeast Ohio Medical University, Rootstown, United States
- Michael T Roberts
- Kresge Hearing Research Institute, Department of Otolaryngology - Head and Neck Surgery, University of Michigan, Ann Arbor, United States
21. Bernstein LR, Trahiotis C. No more than "slight" hearing loss and degradations in binaural processing. J Acoust Soc Am 2019;145:2094. [PMID: 31046341] [DOI: 10.1121/1.5096652]
Abstract
Listeners having, at most, "slight" hearing loss may exhibit substantial deficits in binaural detection [Bernstein and Trahiotis. (2016). J. Acoust. Soc. Am. 140, 3540-3548; (2018). J. Acoust. Soc. Am. 144, 292-307]. This study assessed whether such listeners also exhibit deficits discriminating interaural temporal disparities (ITDs) or interaural intensitive disparities (IIDs) and whether any deficits observed in those discrimination tasks would be accounted for by the interaural cross-correlation based model that successfully accounts for binaural detection. Thresholds were measured for detection of tones masked by noise in the NoSπ configuration and discrimination of ITD or IID. Gaussian noises (100 Hz wide) served as maskers in the detection task and as reference and target stimuli in the discrimination tasks. Stimuli were centered at 500 Hz or 4 kHz. The latter were transpositions of stimuli centered at 125 Hz. Results demonstrate that listeners having, at most, slight hearing loss and who exhibit deficits in binaural detection, also exhibit deficits in ITD- and IID-discrimination. Coupled with appropriate decision variables, the cross-correlation-based model that accounts for elevated binaural detection thresholds among such listeners also accounted for their elevated ITD- and IID-thresholds. The deficits in all three tasks appear to stem from increased levels of stimulus-dependent, additive internal noise.
Collapse
Affiliation(s)
- Leslie R Bernstein
- Departments of Neuroscience and Surgery (Otolaryngology), University of Connecticut Health Center, Farmington, Connecticut 06030, USA
| | - Constantine Trahiotis
- Departments of Neuroscience and Surgery (Otolaryngology), University of Connecticut Health Center, Farmington, Connecticut 06030, USA
| |
Collapse
|
22
|
Zhang Q, Hu X, Hong B, Zhang B. A hierarchical sparse coding model predicts acoustic feature encoding in both auditory midbrain and cortex. PLoS Comput Biol 2019; 15:e1006766. [PMID: 30742609 PMCID: PMC6386396 DOI: 10.1371/journal.pcbi.1006766] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2018] [Revised: 02/22/2019] [Accepted: 12/21/2018] [Indexed: 12/03/2022] Open
Abstract
The auditory pathway consists of multiple stages, from the cochlear nucleus to the auditory cortex. Neurons acting at different stages have different functions and exhibit different response properties. It is unclear whether these stages share a common encoding mechanism. We trained an unsupervised deep learning model consisting of alternating sparse coding and max pooling layers on cochleogram-filtered human speech. Evaluation of the response properties revealed that computing units in lower layers exhibited spectro-temporal receptive fields (STRFs) similar to those of inferior colliculus neurons measured in physiological experiments, including properties such as sound onset and termination, checkerboard pattern, and spectral motion. Units in upper layers tended to be tuned to phonetic features such as plosivity and nasality, resembling the results of field recording in human auditory cortex. Variation of the sparseness level of the units in each higher layer revealed a positive correlation between the sparseness level and the strength of phonetic feature encoding. The activities of the units in the top layer, but not other layers, correlated with the dynamics of the first two formants (F1, F2) of all phonemes, indicating the encoding of phoneme dynamics in these units. These results suggest that the principles of sparse coding and max pooling may be universal in the human auditory pathway. When speech enters the ear, it is subjected to a series of processing stages prior to arriving at the auditory cortex. Neurons acting at different processing stages have different response properties. For example, at the auditory midbrain, a neuron may specifically detect the onsets of a frequency component in the speech, whereas in the auditory cortex, a neuron may specifically detect phonetic features. The encoding mechanisms underlying these neuronal functions remain unclear. To address this issue, we designed a hierarchical sparse coding model, inspired by the sparse activity of neurons in the sensory system, to learn features in speech signals. We found that the computing units in different layers exhibited hierarchical extraction of speech sound features, similar to those of neurons in the auditory midbrain and auditory cortex, although the computational principles in these layers were the same. The results suggest that sparse coding and max pooling represent universal computational principles throughout the auditory pathway.
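As a rough illustration of the model's two building blocks, the sketch below implements one sparse-coding step (ISTA with an L1 penalty) followed by non-overlapping max pooling. The random "cochleogram patch" and dictionary are stand-ins; the study's network, training procedure, and layer sizes are not reproduced here.

```python
import numpy as np

rng = np.random.default_rng(0)

def soft_threshold(x, thr):
    return np.sign(x) * np.maximum(np.abs(x) - thr, 0.0)

def sparse_code_ista(x, D, lam=0.1, n_iter=100):
    """Sparse code x ≈ D @ z with an L1 penalty, via ISTA."""
    L = np.linalg.norm(D, 2) ** 2          # Lipschitz constant of the gradient
    z = np.zeros(D.shape[1])
    for _ in range(n_iter):
        z = soft_threshold(z + D.T @ (x - D @ z) / L, lam / L)
    return z

def max_pool(z, pool=2):
    """Non-overlapping max pooling over groups of units."""
    return z[: len(z) // pool * pool].reshape(-1, pool).max(axis=1)

# Stand-in for a cochleogram patch (frequency x time window flattened to a vector)
patch = rng.standard_normal(64)

# Stand-in dictionary of learned spectrotemporal features (columns = features)
D = rng.standard_normal((64, 128))
D /= np.linalg.norm(D, axis=0)

z = sparse_code_ista(patch, D)
pooled = max_pool(z)
print("active units:", np.count_nonzero(z), "of", z.size)
print("pooled layer size:", pooled.size)
```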
Collapse
Affiliation(s)
- Qingtian Zhang
- Department of Computer Science and Technology, Tsinghua University, Beijing, China
| | - Xiaolin Hu
- Department of Computer Science and Technology, Tsinghua University, Beijing, China
- Center for Brain-Inspired Computing Research (CBICR), Tsinghua University, Beijing, China
| | - Bo Hong
- School of Medicine, Tsinghua University, Beijing, China
| | - Bo Zhang
- Department of Computer Science and Technology, Tsinghua University, Beijing, China
- Center for Brain-Inspired Computing Research (CBICR), Tsinghua University, Beijing, China
| |
Collapse
|
23
|
Moerel M, De Martino F, Uğurbil K, Formisano E, Yacoub E. Evaluating the Columnar Stability of Acoustic Processing in the Human Auditory Cortex. J Neurosci 2018; 38:7822-7832. [PMID: 30185539 PMCID: PMC6125808 DOI: 10.1523/jneurosci.3576-17.2018] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2017] [Revised: 07/10/2018] [Accepted: 07/11/2018] [Indexed: 12/27/2022] Open
Abstract
Using ultra-high field fMRI, we explored the cortical depth-dependent stability of acoustic feature preference in human auditory cortex. We collected responses from human auditory cortex (subjects from either sex) to a large number of natural sounds at submillimeter spatial resolution, and observed that these responses were well explained by a model that assumes neuronal population tuning to frequency-specific spectrotemporal modulations. We observed a relatively stable (columnar) tuning to frequency and temporal modulations. However, spectral modulation tuning was variable throughout the cortical depth. This difference in columnar stability between feature maps could not be explained by a difference in map smoothness, as the preference along the cortical sheet varied in a similar manner for the different feature maps. Furthermore, tuning to all three features was more columnar in primary than nonprimary auditory cortex. The observed overall lack of overlapping columnar regions across acoustic feature maps suggests, especially for primary auditory cortex, a coding strategy in which across cortical depths tuning to some features is kept stable, whereas tuning to other features systematically varies. SIGNIFICANCE STATEMENT In the human auditory cortex, sound aspects are processed in large-scale maps. Invasive animal studies show that an additional processing organization may be implemented orthogonal to the cortical sheet (i.e., in the columnar direction), but it is unknown whether observed organizational principles apply to the human auditory cortex. Combining ultra-high field fMRI with natural sounds, we explore the columnar organization of various sound aspects. Our results suggest that the human auditory cortex contains a modular coding strategy, where, for each module, several sound aspects act as an anchor along which computations are performed while the processing of another sound aspect undergoes a transformation. This strategy may serve to optimally represent the content of our complex acoustic natural environment.
Collapse
Affiliation(s)
- Michelle Moerel
- Maastricht Centre for Systems Biology and
- Department of Cognitive Neuroscience, Faculty of Psychology and Neuroscience, Maastricht University, 6200 MD Maastricht, The Netherlands
- Maastricht Brain Imaging Center, 6200 MD Maastricht, The Netherlands, and
| | - Federico De Martino
- Department of Cognitive Neuroscience, Faculty of Psychology and Neuroscience, Maastricht University, 6200 MD Maastricht, The Netherlands
- Maastricht Brain Imaging Center, 6200 MD Maastricht, The Netherlands, and
- Center for Magnetic Resonance Research, Department of Radiology, University of Minnesota, Minneapolis, Minnesota 55455
| | - Kâmil Uğurbil
- Center for Magnetic Resonance Research, Department of Radiology, University of Minnesota, Minneapolis, Minnesota 55455
| | - Elia Formisano
- Maastricht Centre for Systems Biology and
- Department of Cognitive Neuroscience, Faculty of Psychology and Neuroscience, Maastricht University, 6200 MD Maastricht, The Netherlands
- Maastricht Brain Imaging Center, 6200 MD Maastricht, The Netherlands, and
| | - Essa Yacoub
- Center for Magnetic Resonance Research, Department of Radiology, University of Minnesota, Minneapolis, Minnesota 55455
| |
Collapse
|
24
|
Khatami F, Wöhr M, Read HL, Escabí MA. Origins of scale invariance in vocalization sequences and speech. PLoS Comput Biol 2018; 14:e1005996. [PMID: 29659561 PMCID: PMC5919684 DOI: 10.1371/journal.pcbi.1005996] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2017] [Revised: 04/26/2018] [Accepted: 01/23/2018] [Indexed: 11/18/2022] Open
Abstract
To communicate effectively animals need to detect temporal vocalization cues that vary over several orders of magnitude in their amplitude and frequency content. This large range of temporal cues is evident in the power-law scale-invariant relationship between the power of temporal fluctuations in sounds and the sound modulation frequency (f). Though various forms of scale invariance have been described for natural sounds, the origins and implications of scale invariant phenomenon remain unknown. Using animal vocalization sequences, including continuous human speech, and a stochastic model of temporal amplitude fluctuations we demonstrate that temporal acoustic edges are the primary acoustic cue accounting for the scale invariant phenomenon. The modulation spectrum of vocalization sequences and the model both exhibit a dual regime lowpass structure with a flat region at low modulation frequencies and scale invariant 1/f2 trend for high modulation frequencies. Moreover, we find a time-frequency tradeoff between the average vocalization duration of each vocalization sequence and the cutoff frequency beyond which scale invariant behavior is observed. These results indicate that temporal edges are universal features responsible for scale invariance in vocalized sounds. This is significant since temporal acoustic edges are salient perceptually and the auditory system could exploit such statistical regularities to minimize redundancies and generate compact neural representations of vocalized sounds. The efficient coding hypothesis posits that the brain encodes sensory signals efficiently in order to reduce metabolic cost and preserve behaviorally relevant environment information. In audition, recognition and coding depends on the brain’s ability to accurately and efficiently encode statistical regularities that are prevalent in natural sounds. Similarly, efficient audio coding and compression schemes attempt to preserve salient sound qualities while minimizing data bandwidth. A widely observed statistical regularity in nearly all natural sounds is the presence of scale invariance where the power of amplitude fluctuations is inversely related to the sound amplitude modulation frequency. In this study, we explore the physical sound cues responsible for the scale invariant phenomenon previously observed. We demonstrate that for animal vocalizations, including human speech, the scale invariant behavior is fully accounted by the presence of temporal acoustic edges that are largely created by opening and closing of the oral cavity and which mark the beginning and end of isolated vocalizations. The findings thus identify a single physical cue responsible for the universal scale invariant phenomenon that the brain can exploit to optimize coding and perception of vocalized sounds.
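The dual-regime modulation spectrum described here can be illustrated directly. The sketch below (with assumed vocalization and gap durations, not the authors' corpus or model parameters) generates a stochastic train of rectangular "vocalizations" whose abrupt onsets and offsets are the temporal edges, computes the envelope modulation spectrum, and fits the high-frequency slope, which should fall near the 1/f² prediction.

```python
import numpy as np

rng = np.random.default_rng(0)
fs = 1000                       # envelope sampling rate (Hz), assumed
dur = 200.0                     # sequence duration (s), assumed
n = int(fs * dur)

# Stochastic sequence of rectangular "vocalizations": abrupt onsets/offsets (temporal edges)
env = np.zeros(n)
t = 0
while t < n:
    voc_len = int(rng.exponential(0.3) * fs) + 1    # mean vocalization ~0.3 s (assumed)
    gap_len = int(rng.exponential(0.5) * fs) + 1    # mean silent gap ~0.5 s (assumed)
    env[t:t + voc_len] = 1.0
    t += voc_len + gap_len

# Modulation spectrum: power of envelope fluctuations vs. modulation frequency f
spec = np.abs(np.fft.rfft(env - env.mean())) ** 2
f = np.fft.rfftfreq(n, 1 / fs)

# Fit the log-log slope above the cutoff (~1 / mean vocalization duration);
# sharp temporal edges predict a slope near -2, i.e. a 1/f^2 trend
hi = (f > 10) & (f < 100)
slope = np.polyfit(np.log(f[hi]), np.log(spec[hi] + 1e-20), 1)[0]
print(f"high-frequency modulation-spectrum slope ≈ {slope:.2f} (1/f^2 predicts -2)")
```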
Collapse
Affiliation(s)
- Fatemeh Khatami
- Biomedical Engineering, University of Connecticut, Storrs, Connecticut, United States of America
| | - Markus Wöhr
- Behavioral Neuroscience, Experimental and Biological Psychology, Faculty of Psychology, Philipps-University of Marburg, Marburg, Germany
- Center for Mind, Brain, and Behavior (CMBB), Philipps-University of Marburg, Marburg, Germany
| | - Heather L. Read
- Biomedical Engineering, University of Connecticut, Storrs, Connecticut, United States of America
- Department of Psychological Sciences, University of Connecticut, Storrs, Connecticut, United States of America
| | - Monty A. Escabí
- Biomedical Engineering, University of Connecticut, Storrs, Connecticut, United States of America
- Department of Psychological Sciences, University of Connecticut, Storrs, Connecticut, United States of America
- Electrical and Computer Engineering, University of Connecticut, Storrs, Connecticut, United States of America
| |
Collapse
|
25
|
McCreery D, Yadev K, Han M. Responses of neurons in the feline inferior colliculus to modulated electrical stimuli applied on and within the ventral cochlear nucleus; Implications for an advanced auditory brainstem implant. Hear Res 2018; 363:85-97. [PMID: 29573880 DOI: 10.1016/j.heares.2018.03.009] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/29/2017] [Revised: 03/01/2018] [Accepted: 03/06/2018] [Indexed: 11/25/2022]
Abstract
Auditory brainstem implants (ABIs) can restore useful hearing to persons with deafness who cannot benefit from cochlear implants. However, the quality of hearing restored by ABIs rarely is comparable to that provided by cochlear implants in persons for whom those are appropriate. In an animal model, we evaluated elements of a prototype of an ABI in which the functions of macroelectrodes on the surface of the dorsal cochlear nucleus would be integrated with the function of multiple penetrating microelectrodes implanted into the ventral cochlear nucleus. The surface electrodes would convey most of the range of loudness percepts while the intranuclear microelectrodes would sharpen and focus pitch percepts. In the present study, stimulating electrodes were implanted chronically on the surface of the animal's dorsal cochlear nucleus (DCN) and also within its ventral cochlear nucleus (VCN). Recording microelectrodes were implanted into the central nucleus of the inferior colliculus (ICC). The electrical stimuli were sinusoidally modulated stimulus pulse trains applied on the DCN and within the VCN. Temporal encoding of neuronal responses was quantified as vector strength (VS) and as full-cycle rate of neuronal activity in the ICC. VS and full-cycle AP rate were measured for 4 stimulation modes: continuous and transient amplitude modulation of the stimulus pulse trains, each delivered via the macroelectrode on the surface of the DCN and then by the intranuclear penetrating microelectrodes. In the proposed clinical device the functions of the surface and intranuclear microelectrodes could best be integrated if there is minimal variation in the neuronal responses across the range of modulation depth, modulation frequencies, and across the four stimulation modes. In this study VS did vary as much as 34% across modulation frequency and modulation depth within a stimulation mode, and up to 40% between modulation modes. However, these intra- and inter-mode variances differed for different stimulation rates, and at 500 Hz the inter-mode differences in VS across the range of modulation frequencies and modulation depths were ≤24% and the intra-modal differences were ≤15%. The findings were generally similar for rate encoding of modulation depth, although the depth of transient amplitude modulation delivered by the surface electrode was weakly encoded as full-cycle rate. Overall, our findings support the concept of a clinical ABI that employs surface stimulation and intranuclear microstimulation in an integrated manner.
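Vector strength summarizes phase locking to the modulation waveform as VS = |sum_k exp(i 2π f t_k)| / N. The sketch below computes it, together with a simplified spike-rate-over-whole-cycles measure standing in for the full-cycle rate metric, for a simulated modulation-following spike train; the modulation frequency, rates, and modulation depth are illustrative assumptions, not values from the study.

```python
import numpy as np

def vector_strength(spike_times, mod_freq):
    """Vector strength of spike times (s) relative to a modulation frequency (Hz)."""
    phases = 2 * np.pi * mod_freq * np.asarray(spike_times)
    return np.abs(np.exp(1j * phases).mean())

def full_cycle_rate(spike_times, mod_freq, duration):
    """Mean spike rate computed over whole modulation cycles (simplified stand-in)."""
    n_cycles = int(np.floor(duration * mod_freq))
    t_max = n_cycles / mod_freq
    return np.sum(np.asarray(spike_times) < t_max) / t_max

rng = np.random.default_rng(0)
fm, dur = 40.0, 1.0             # 40-Hz modulation, 1-s response (assumed)

# Simulated ICC spike train: inhomogeneous Poisson process, rate modulated at fm
dt = 1e-4
t = np.arange(0, dur, dt)
rate = 50 * (1 + 0.8 * np.sin(2 * np.pi * fm * t))     # spikes/s
spikes = t[rng.random(t.size) < rate * dt]

print(f"vector strength : {vector_strength(spikes, fm):.2f}")
print(f"full-cycle rate : {full_cycle_rate(spikes, fm, dur):.1f} spikes/s")
```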
Collapse
Affiliation(s)
- Douglas McCreery
- Neural Engineering Program at Huntington Medical Research Institutes, 734 Fairmount Ave, Pasadena, CA 91105, USA.
| | - Kamal Yadev
- Rigetti Computing, 775 Heinz Avenue, Berkeley, CA 94710, USA.
| | - Martin Han
- Biomedical Engineering Department, School of Engineering & Institute of Material Sciences, The University of Connecticut at Storrs, 260 Glenbrook Rd, Unit 3247, Storrs, Connecticut 06269-3247, USA.
| |
Collapse
|
26
|
Cluster-based analysis improves predictive validity of spike-triggered receptive field estimates. PLoS One 2017; 12:e0183914. [PMID: 28877194 PMCID: PMC5587334 DOI: 10.1371/journal.pone.0183914] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2016] [Accepted: 08/14/2017] [Indexed: 11/19/2022] Open
Abstract
Spectrotemporal receptive field (STRF) characterization is a central goal of auditory physiology. STRFs are often approximated by the spike-triggered average (STA), which reflects the average stimulus preceding a spike. In many cases, the raw STA is subjected to a threshold defined by gain values expected by chance. However, such correction methods have not been universally adopted, and the consequences of specific gain-thresholding approaches have not been investigated systematically. Here, we evaluate two classes of statistical correction techniques, using the resulting STRF estimates to predict responses to a novel validation stimulus. The first, more traditional technique eliminated STRF pixels (time-frequency bins) with gain values expected by chance. This correction method yielded significant increases in prediction accuracy, including when the threshold setting was optimized for each unit. The second technique was a two-step thresholding procedure wherein clusters of contiguous pixels surviving an initial gain threshold were then subjected to a cluster mass threshold based on summed pixel values. This approach significantly improved upon even the best gain-thresholding techniques. Additional analyses suggested that allowing threshold settings to vary independently for excitatory and inhibitory subfields of the STRF resulted in only marginal additional gains, at best. In summary, augmenting reverse correlation techniques with principled statistical correction choices increased prediction accuracy by over 80% for multi-unit STRFs and by over 40% for single-unit STRFs, furthering the interpretational relevance of the recovered spectrotemporal filters for auditory systems analysis.
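A minimal sketch of the two-step procedure described above: estimate the STRF as a spike-triggered average, apply a per-pixel gain threshold, label contiguous surviving pixels, and retain only clusters whose summed magnitude exceeds a cluster-mass threshold. The planted receptive field, threshold settings, and toy stimulus are assumptions for illustration, not the paper's data or exact statistics.

```python
import numpy as np
from scipy.ndimage import label

def spike_triggered_average(stimulus, spike_bins, n_lags):
    """STA: mean stimulus spectrogram segment preceding each spike.
    stimulus: (n_freq, n_time) array; spike_bins: spike counts per time bin."""
    n_freq, _ = stimulus.shape
    sta = np.zeros((n_freq, n_lags))
    n_spikes = 0
    for t in np.nonzero(spike_bins)[0]:
        if t >= n_lags:
            sta += spike_bins[t] * stimulus[:, t - n_lags:t]
            n_spikes += spike_bins[t]
    return sta / max(n_spikes, 1)

def cluster_mass_threshold(sta, gain_thr, mass_thr):
    """Two-step correction: per-pixel gain threshold, then cluster-mass threshold."""
    mask = np.abs(sta) > gain_thr                 # step 1: pixel-wise gain threshold
    labels, n_clusters = label(mask)              # contiguous surviving pixels
    cleaned = np.zeros_like(sta)
    for k in range(1, n_clusters + 1):
        cluster = labels == k
        if np.abs(sta[cluster]).sum() > mass_thr: # step 2: summed-pixel (mass) threshold
            cleaned[cluster] = sta[cluster]
    return cleaned

rng = np.random.default_rng(0)
stim = rng.standard_normal((32, 5000))            # toy dynamic broadband spectrogram

# Toy neuron: spikes driven by a planted excitatory subfield in the preceding 20 bins
kernel = np.zeros((32, 20))
kernel[10:14, 5:9] = 1.0
drive = np.array([(kernel * stim[:, t - 20:t]).sum() if t >= 20 else 0.0 for t in range(5000)])
spikes = (rng.random(5000) < 0.2 * (drive > drive.std())).astype(int)

sta = spike_triggered_average(stim, spikes, n_lags=20)
strf = cluster_mass_threshold(sta, gain_thr=2 * sta.std(), mass_thr=10 * sta.std())
print("pixels surviving cluster correction:", np.count_nonzero(strf))
```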
Collapse
|
27
|
Santoro R, Moerel M, De Martino F, Valente G, Ugurbil K, Yacoub E, Formisano E. Reconstructing the spectrotemporal modulations of real-life sounds from fMRI response patterns. Proc Natl Acad Sci U S A 2017; 114:4799-4804. [PMID: 28420788 PMCID: PMC5422795 DOI: 10.1073/pnas.1617622114] [Citation(s) in RCA: 59] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Ethological views of brain functioning suggest that sound representations and computations in the auditory neural system are optimized finely to process and discriminate behaviorally relevant acoustic features and sounds (e.g., spectrotemporal modulations in the songs of zebra finches). Here, we show that modeling of neural sound representations in terms of frequency-specific spectrotemporal modulations enables accurate and specific reconstruction of real-life sounds from high-resolution functional magnetic resonance imaging (fMRI) response patterns in the human auditory cortex. Region-based analyses indicated that response patterns in separate portions of the auditory cortex are informative of distinctive sets of spectrotemporal modulations. Most relevantly, results revealed that in early auditory regions, and progressively more in surrounding regions, temporal modulations in a range relevant for speech analysis (∼2-4 Hz) were reconstructed more faithfully than other temporal modulations. In early auditory regions, this effect was frequency-dependent and only present for lower frequencies (<∼2 kHz), whereas for higher frequencies, reconstruction accuracy was higher for faster temporal modulations. Further analyses suggested that auditory cortical processing optimized for the fine-grained discrimination of speech and vocal sounds underlies this enhanced reconstruction accuracy. In sum, the present study introduces an approach to embed models of neural sound representations in the analysis of fMRI response patterns. Furthermore, it reveals that, in the human brain, even general purpose and fundamental neural processing mechanisms are shaped by the physical features of real-world stimuli that are most relevant for behavior (i.e., speech, voice).
Collapse
Affiliation(s)
- Roberta Santoro
- Department of Cognitive Neuroscience, Faculty of Psychology and Neuroscience, Maastricht University, 6200 MD Maastricht, The Netherlands
- Maastricht Brain Imaging Center, 6200 MD Maastricht, The Netherlands
- Brain and Language Laboratory, Department of Clinical Neuroscience, University Medical School, University of Geneva, CH-1211 Geneva, Switzerland
| | - Michelle Moerel
- Department of Cognitive Neuroscience, Faculty of Psychology and Neuroscience, Maastricht University, 6200 MD Maastricht, The Netherlands
- Maastricht Brain Imaging Center, 6200 MD Maastricht, The Netherlands
- Center for Magnetic Resonance Research, Department of Radiology, University of Minnesota, Minneapolis, MN 55455
- Maastricht Centre for Systems Biology, Maastricht University, 6200 MD Maastricht, The Netherlands
| | - Federico De Martino
- Department of Cognitive Neuroscience, Faculty of Psychology and Neuroscience, Maastricht University, 6200 MD Maastricht, The Netherlands
- Maastricht Brain Imaging Center, 6200 MD Maastricht, The Netherlands
| | - Giancarlo Valente
- Department of Cognitive Neuroscience, Faculty of Psychology and Neuroscience, Maastricht University, 6200 MD Maastricht, The Netherlands
- Maastricht Brain Imaging Center, 6200 MD Maastricht, The Netherlands
| | - Kamil Ugurbil
- Center for Magnetic Resonance Research, Department of Radiology, University of Minnesota, Minneapolis, MN 55455
| | - Essa Yacoub
- Center for Magnetic Resonance Research, Department of Radiology, University of Minnesota, Minneapolis, MN 55455
| | - Elia Formisano
- Department of Cognitive Neuroscience, Faculty of Psychology and Neuroscience, Maastricht University, 6200 MD Maastricht, The Netherlands;
- Maastricht Brain Imaging Center, 6200 MD Maastricht, The Netherlands
- Maastricht Centre for Systems Biology, Maastricht University, 6200 MD Maastricht, The Netherlands
| |
Collapse
|
28
|
Oetjen A, Verhey JL. Characteristics of spectro-temporal modulation frequency selectivity in humans. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2017; 141:1887. [PMID: 28372116 DOI: 10.1121/1.4976537] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
There is increasing evidence that the auditory system shows frequency selectivity for spectro-temporal modulations. A recent study of the authors has shown spectro-temporal modulation masking patterns that were in agreement with the hypothesis of spectro-temporal modulation filters in the human auditory system [Oetjen and Verhey (2015). J. Acoust. Soc. Am. 137(2), 714-723]. In the present study, that experimental data and additional data were used to model this spectro-temporal frequency selectivity. The additional data were collected to investigate to what extent the spectro-temporal modulation-frequency selectivity results from a combination of a purely temporal amplitude-modulation filter and a purely spectral amplitude-modulation filter. In contrast to the previous study, thresholds were measured for masker and target modulations with opposite directions, i.e., an upward pointing target modulation and a downward pointing masker modulation. The comparison of this data set with previous corresponding data with the same direction from target and masker modulations indicate that a specific spectro-temporal modulation filter is required to simulate all aspects of spectro-temporal modulation frequency selectivity. A model using a modified Gabor filter with a purely temporal and a purely spectral filter predicts the spectro-temporal modulation masking data.
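For orientation, a spectro-temporal Gabor filter of the kind invoked here can be written as a Gaussian envelope times a ripple carrier over (octave, time); the sign pairing of the temporal rate and spectral density sets the upward or downward sweep direction. The sketch below is a generic construction with assumed bandwidth parameters, not the authors' fitted "modified Gabor" filter.

```python
import numpy as np

def st_gabor(rate_hz, scale_cyc_oct, dur=0.25, n_oct=4.0,
             dt=0.005, d_oct=0.1, sigma_t=0.06, sigma_f=1.0):
    """Spectro-temporal Gabor filter over (octaves x time).
    rate_hz: temporal modulation; scale_cyc_oct: spectral modulation.
    With both positive, constant-phase lines sweep downward in frequency over time."""
    t = np.arange(-dur / 2, dur / 2, dt)
    x = np.arange(-n_oct / 2, n_oct / 2, d_oct)      # spectral axis, octaves
    T, X = np.meshgrid(t, x)
    envelope = np.exp(-T**2 / (2 * sigma_t**2) - X**2 / (2 * sigma_f**2))
    carrier = np.cos(2 * np.pi * (rate_hz * T + scale_cyc_oct * X))
    return envelope * carrier

# A downward-moving ripple detector: 8 Hz temporal, 1 cycle/octave spectral modulation
filt = st_gabor(rate_hz=8.0, scale_cyc_oct=1.0)
print(filt.shape)   # (spectral bins, time bins)
```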
Collapse
Affiliation(s)
- Arne Oetjen
- Acoustics group, Carl von Ossietzky University Oldenburg, Carl von Ossietzky Strasse 9-11, 26129 Oldenburg, Germany
| | - Jesko L Verhey
- Department of Experimental Audiology, Otto von Guericke University Magdeburg, 39120 Magdeburg, Germany
| |
Collapse
|
29
|
Yassin L, Pecka M, Kajopoulos J, Gleiss H, Li L, Leibold C, Felmy F. Differences in synaptic and intrinsic properties result in topographic heterogeneity of temporal processing of neurons within the inferior colliculus. Hear Res 2016; 341:79-90. [DOI: 10.1016/j.heares.2016.08.005] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/08/2016] [Revised: 08/15/2016] [Accepted: 08/16/2016] [Indexed: 10/21/2022]
|
30
|
Synchrony, connectivity, and functional similarity in auditory midbrain local circuits. Neuroscience 2016; 335:30-53. [PMID: 27544405 DOI: 10.1016/j.neuroscience.2016.08.024] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2016] [Revised: 08/08/2016] [Accepted: 08/10/2016] [Indexed: 11/21/2022]
Abstract
The central nucleus of the inferior colliculus (ICC) contains a laminar structure that functions as an organizing substrate of ascending inputs and local processing. While topographic distributions of ICC response parameters within and across laminae have been reported, the functional micro-organization of the ICC is less well understood. For pairs of neighboring ICC neurons, we examined the nature of functional connectivity and receptive field preferences to gain a better understanding of the structure and function of local circuits. By recording from pairs of adjacent neurons and presenting pure-tone and dynamic broad-band stimulation, we estimated functional connectivity and local differences in frequency response areas (FRAs), spectrotemporal receptive fields (STRFs), nonlinear input/output functions, and single-spike information. From the cross-covariance functions we identified putative unidirectional as well as bidirectional excitatory/inhibitory interactions. STRFs of neighboring neurons strongly conserve best frequency, and moderately agree in STRF similarity, bandwidth, temporal response type, best modulation frequency, nonlinearity structure, and degree of information processing. Excitatory connectivity was stronger and temporally more precise than for inhibitory connections. Neither connection strength nor degree of synchrony correlated with receptive field parameters. The functional similarity of local pairs of ICC neurons was substantially less than for local pairs in the granular layers of primary auditory cortex (AI). These results imply that while the ICC is an obligatory nexus of ascending information, local neurons are comparatively weakly connected and exhibit considerable receptive field variability, potentially reflecting the heterogeneity of converging inputs to ICC functional zones.
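Functional connectivity of this kind is typically read off cross-covariance functions between simultaneously recorded spike trains: an asymmetric short-latency peak suggests a putative unidirectional excitatory interaction. The sketch below computes a binned cross-covariance and demonstrates it on simulated trains in which one unit follows the other at about 2 ms; the bin size, firing rates, and the lack of a shift-predictor correction are simplifying assumptions.

```python
import numpy as np

def cross_covariance(spikes_a, spikes_b, bin_s=0.001, max_lag_s=0.02, duration=None):
    """Cross-covariance of two spike trains (times in s) as a function of lag (a -> b)."""
    duration = duration or max(spikes_a.max(), spikes_b.max())
    edges = np.arange(0, duration + bin_s, bin_s)
    a = np.histogram(spikes_a, edges)[0].astype(float)
    b = np.histogram(spikes_b, edges)[0].astype(float)
    a -= a.mean()
    b -= b.mean()
    max_lag = int(max_lag_s / bin_s)
    lags = np.arange(-max_lag, max_lag + 1)
    cc = np.array([np.dot(a[:len(a) - k] if k >= 0 else a[-k:],
                          b[k:] if k >= 0 else b[:len(b) + k])
                   for k in lags]) / len(a)
    return lags * bin_s, cc

rng = np.random.default_rng(0)
dur = 100.0
spikes_a = np.sort(rng.uniform(0, dur, 2000))                # "presynaptic-like" unit
follow = spikes_a[rng.random(spikes_a.size) < 0.3] + 0.002   # 30% of spikes relayed at +2 ms
spikes_b = np.sort(np.concatenate([rng.uniform(0, dur, 1500), follow]))

lags, cc = cross_covariance(spikes_a, spikes_b, duration=dur)
print("peak lag (ms):", 1000 * lags[np.argmax(cc)])          # expect ≈ +2 ms (a drives b)
```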
Collapse
|
31
|
Human Superior Temporal Gyrus Organization of Spectrotemporal Modulation Tuning Derived from Speech Stimuli. J Neurosci 2016; 36:2014-26. [PMID: 26865624 DOI: 10.1523/jneurosci.1779-15.2016] [Citation(s) in RCA: 95] [Impact Index Per Article: 11.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
The human superior temporal gyrus (STG) is critical for speech perception, yet the organization of spectrotemporal processing of speech within the STG is not well understood. Here, to characterize the spatial organization of spectrotemporal processing of speech across human STG, we use high-density cortical surface field potential recordings while participants listened to natural continuous speech. While synthetic broad-band stimuli did not yield sustained activation of the STG, spectrotemporal receptive fields could be reconstructed from vigorous responses to speech stimuli. We find that the human STG displays a robust anterior-posterior spatial distribution of spectrotemporal tuning in which the posterior STG is tuned for temporally fast varying speech sounds that have relatively constant energy across the frequency axis (low spectral modulation) while the anterior STG is tuned for temporally slow varying speech sounds that have a high degree of spectral variation across the frequency axis (high spectral modulation). This work illustrates organization of spectrotemporal processing in the human STG, and illuminates processing of ethologically relevant speech signals in a region of the brain specialized for speech perception. SIGNIFICANCE STATEMENT Considerable evidence has implicated the human superior temporal gyrus (STG) in speech processing. However, the gross organization of spectrotemporal processing of speech within the STG is not well characterized. Here we use natural speech stimuli and advanced receptive field characterization methods to show that spectrotemporal features within speech are well organized along the posterior-to-anterior axis of the human STG. These findings demonstrate robust functional organization based on spectrotemporal modulation content, and illustrate that much of the encoded information in the STG represents the physical acoustic properties of speech stimuli.
Collapse
|
32
|
Norman-Haignere S, Kanwisher NG, McDermott JH. Distinct Cortical Pathways for Music and Speech Revealed by Hypothesis-Free Voxel Decomposition. Neuron 2016; 88:1281-1296. [PMID: 26687225 DOI: 10.1016/j.neuron.2015.11.035] [Citation(s) in RCA: 198] [Impact Index Per Article: 24.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2015] [Revised: 10/03/2015] [Accepted: 11/23/2015] [Indexed: 11/19/2022]
Abstract
The organization of human auditory cortex remains unresolved, due in part to the small stimulus sets common to fMRI studies and the overlap of neural populations within voxels. To address these challenges, we measured fMRI responses to 165 natural sounds and inferred canonical response profiles ("components") whose weighted combinations explained voxel responses throughout auditory cortex. This analysis revealed six components, each with interpretable response characteristics despite being unconstrained by prior functional hypotheses. Four components embodied selectivity for particular acoustic features (frequency, spectrotemporal modulation, pitch). Two others exhibited pronounced selectivity for music and speech, respectively, and were not explainable by standard acoustic features. Anatomically, music and speech selectivity concentrated in distinct regions of non-primary auditory cortex. However, music selectivity was weak in raw voxel responses, and its detection required a decomposition method. Voxel decomposition identifies primary dimensions of response variation across natural sounds, revealing distinct cortical pathways for music and speech.
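The decomposition idea, expressing each voxel's responses to many sounds as a weighted combination of a few canonical response profiles, can be sketched with an off-the-shelf non-negative matrix factorization as a stand-in; the study's own hypothesis-free decomposition method differs, and the toy data below are simulated rather than measured fMRI responses.

```python
import numpy as np
from sklearn.decomposition import NMF

rng = np.random.default_rng(0)
n_sounds, n_voxels, n_comp = 165, 2000, 6

# Toy data: voxel responses are non-negative mixtures of a few canonical response profiles
true_profiles = rng.gamma(2.0, 1.0, (n_sounds, n_comp))
true_weights = rng.gamma(1.0, 1.0, (n_comp, n_voxels))
data = true_profiles @ true_weights + 0.1 * rng.random((n_sounds, n_voxels))

# Decompose the sound-by-voxel matrix into component response profiles and voxel weights
model = NMF(n_components=n_comp, init="nndsvda", max_iter=500, random_state=0)
profiles = model.fit_transform(data)     # (sounds x components): canonical response profiles
weights = model.components_              # (components x voxels): anatomical weight maps
print(profiles.shape, weights.shape)
```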
Collapse
Affiliation(s)
| | - Nancy G Kanwisher
- Department of Brain and Cognitive Sciences, MIT
- McGovern Institute for Brain Science, MIT
| | | |
Collapse
|
33
|
Lee CM, Osman AF, Volgushev M, Escabí MA, Read HL. Neural spike-timing patterns vary with sound shape and periodicity in three auditory cortical fields. J Neurophysiol 2016; 115:1886-904. [PMID: 26843599 DOI: 10.1152/jn.00784.2015] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2015] [Accepted: 01/29/2016] [Indexed: 11/22/2022] Open
Abstract
Mammals perceive a wide range of temporal cues in natural sounds, and the auditory cortex is essential for their detection and discrimination. The rat primary (A1), ventral (VAF), and caudal suprarhinal (cSRAF) auditory cortical fields have separate thalamocortical pathways that may support unique temporal cue sensitivities. To explore this, we record responses of single neurons in the three fields to variations in envelope shape and modulation frequency of periodic noise sequences. Spike rate, relative synchrony, and first-spike latency metrics have previously been used to quantify neural sensitivities to temporal sound cues; however, such metrics do not measure absolute spike timing of sustained responses to sound shape. To address this, in this study we quantify two forms of spike-timing precision, jitter, and reliability. In all three fields, we find that jitter decreases logarithmically with increase in the basis spline (B-spline) cutoff frequency used to shape the sound envelope. In contrast, reliability decreases logarithmically with increase in sound envelope modulation frequency. In A1, jitter and reliability vary independently, whereas in ventral cortical fields, jitter and reliability covary. Jitter time scales increase (A1 < VAF < cSRAF) and modulation frequency upper cutoffs decrease (A1 > VAF > cSRAF) with ventral progression from A1. These results suggest a transition from independent encoding of shape and periodicity sound cues on short time scales in A1 to a joint encoding of these same cues on longer time scales in ventral nonprimary cortices.
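A simplified, event-based illustration of the two metrics (not the correlation-based estimators used in the study): jitter is taken as the spread of spike times around each stimulus event across trials, and reliability as the fraction of trials producing a spike near each event. The event spacing, firing probability, and timing scatter below are assumed values.

```python
import numpy as np

rng = np.random.default_rng(0)

# Simulated responses to a periodic sound: one "event" per stimulus cycle; spikes occur
# with probability p_fire and Gaussian timing scatter around each event
n_trials, n_events = 50, 20
event_times = np.arange(n_events) * 0.1            # events every 100 ms (assumed)
p_fire, sigma = 0.8, 0.002                         # 2 ms timing scatter (assumed)

trials = []
for _ in range(n_trials):
    fired = rng.random(n_events) < p_fire
    trials.append(event_times[fired] + rng.normal(0, sigma, fired.sum()))

# Jitter: spread of spike times around each event, pooled over events
jitters = []
for ev in event_times:
    times = np.concatenate([tr[np.abs(tr - ev) < 0.02] for tr in trials])
    if times.size > 1:
        jitters.append(times.std())
print(f"jitter      ≈ {1e3 * np.mean(jitters):.2f} ms")

# Reliability: fraction of trials with a spike within a window around each event
hits = [np.mean([np.any(np.abs(tr - ev) < 0.02) for tr in trials]) for ev in event_times]
print(f"reliability ≈ {np.mean(hits):.2f}")
```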
Collapse
Affiliation(s)
- Christopher M Lee
- Department of Psychology, University of Connecticut, Storrs, Connecticut
| | - Ahmad F Osman
- Department of Biomedical Engineering, University of Connecticut, Storrs, Connecticut; and
| | - Maxim Volgushev
- Department of Psychology, University of Connecticut, Storrs, Connecticut
| | - Monty A Escabí
- Department of Biomedical Engineering, University of Connecticut, Storrs, Connecticut; and Department of Electrical and Computer Engineering, University of Connecticut, Storrs, Connecticut
| | - Heather L Read
- Department of Psychology, University of Connecticut, Storrs, Connecticut; Department of Biomedical Engineering, University of Connecticut, Storrs, Connecticut; and
| |
Collapse
|
34
|
Lyzwa D, Herrmann JM, Wörgötter F. Natural Vocalizations in the Mammalian Inferior Colliculus are Broadly Encoded by a Small Number of Independent Multi-Units. Front Neural Circuits 2016; 9:91. [PMID: 26869890 PMCID: PMC4740783 DOI: 10.3389/fncir.2015.00091] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2015] [Accepted: 12/28/2015] [Indexed: 11/18/2022] Open
Abstract
How complex natural sounds are represented by the main converging center of the auditory midbrain, the central inferior colliculus, is an open question. We applied neural discrimination to determine the variation of detailed encoding of individual vocalizations across the best frequency gradient of the central inferior colliculus. The analysis was based on collective responses from several neurons. These multi-unit spike trains were recorded from guinea pigs exposed to a spectrotemporally rich set of eleven species-specific vocalizations. Spike trains of disparate units from the same recording were combined in order to investigate whether groups of multi-unit clusters represent the whole set of vocalizations more reliably than only one unit, and whether temporal response correlations between them facilitate an unambiguous neural representation of the vocalizations. We found a spatial distribution of the capability to accurately encode groups of vocalizations across the best frequency gradient. Different vocalizations are optimally discriminated at different locations of the best frequency gradient. Furthermore, groups of a few multi-unit clusters yield improved discrimination over only one multi-unit cluster between all tested vocalizations. However, temporal response correlations between units do not yield better discrimination. Our study is based on a large set of units of simultaneously recorded responses from several guinea pigs and electrode insertion positions. Our findings suggest a broadly distributed code for behaviorally relevant vocalizations in the mammalian inferior colliculus. Responses from a few non-interacting units are sufficient to faithfully represent the whole set of studied vocalizations with diverse spectrotemporal properties.
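Neural discrimination analyses of this sort can be sketched as leave-one-out template matching: each single-trial response is assigned to the vocalization whose trial-averaged response it most resembles. The version below uses binned rate vectors and Euclidean distance on simulated responses; the study's spike-train classifier and distance measure are not reproduced here.

```python
import numpy as np

rng = np.random.default_rng(0)
n_vocs, n_trials, n_bins = 11, 20, 100

# Simulated multi-unit responses: each vocalization evokes a distinct temporal rate profile
profiles = rng.gamma(2.0, 1.0, (n_vocs, n_bins))
responses = rng.poisson(profiles[:, None, :], (n_vocs, n_trials, n_bins))

def discriminate(responses):
    """Leave-one-out nearest-template classification; returns percent correct."""
    n_vocs, n_trials, _ = responses.shape
    correct = 0
    for v in range(n_vocs):
        for tr in range(n_trials):
            test = responses[v, tr]
            templates = responses.sum(axis=1, dtype=float)
            templates[v] -= test                       # leave the test trial out
            templates /= np.where(np.arange(n_vocs) == v, n_trials - 1, n_trials)[:, None]
            pred = np.argmin(((templates - test) ** 2).sum(axis=1))
            correct += pred == v
    return 100.0 * correct / (n_vocs * n_trials)

print(f"discrimination: {discriminate(responses):.1f}% correct (chance ≈ {100 / n_vocs:.1f}%)")
```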
Collapse
Affiliation(s)
- Dominika Lyzwa
- Max Planck Institute for Dynamics and Self-Organization, Göttingen, Germany
- Institute for Nonlinear Dynamics, Physics Department, Georg-August-University, Göttingen, Germany
- Bernstein Focus Neurotechnology, Göttingen, Germany
| | - J. Michael Herrmann
- Bernstein Focus Neurotechnology, Göttingen, Germany
- Institute of Perception, Action and Behavior, School of Informatics, University of Edinburgh, Edinburgh, UK
| | - Florentin Wörgötter
- Bernstein Focus Neurotechnology, Göttingen, Germany
- Institute for Physics - Biophysics, Georg-August-University, Göttingen, Germany
| |
Collapse
|
35
|
Schnupp JWH, Garcia-Lazaro JA, Lesica NA. Periodotopy in the gerbil inferior colliculus: local clustering rather than a gradient map. Front Neural Circuits 2015; 9:37. [PMID: 26379508 PMCID: PMC4550179 DOI: 10.3389/fncir.2015.00037] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2015] [Accepted: 07/07/2015] [Indexed: 11/13/2022] Open
Abstract
Periodicities in sound waveforms are widespread, and shape important perceptual attributes of sound including rhythm and pitch. Previous studies have indicated that, in the inferior colliculus (IC), a key processing stage in the auditory midbrain, neurons tuned to different periodicities might be arranged along a periodotopic axis which runs approximately orthogonal to the tonotopic axis. Here we map out the topography of frequency and periodicity tuning in the IC of gerbils in unprecedented detail, using pure tones and different periodic sounds, including click trains, sinusoidally amplitude modulated (SAM) noise and iterated rippled noise. We found that while the tonotopic map exhibited a clear and highly reproducible gradient across all animals, periodotopic maps varied greatly across different types of periodic sound and from animal to animal. Furthermore, periodotopic gradients typically explained only about 10% of the variance in modulation tuning between recording sites. However, there was a strong local clustering of periodicity tuning at a spatial scale of ca. 0.5 mm, which also differed from animal to animal.
Collapse
Affiliation(s)
- Jan W H Schnupp
- Department of Physiology, Anatomy and Genetics, University of Oxford, Oxford, UK
| | | | | |
Collapse
|
36
|
Lindeberg T, Friberg A. Idealized computational models for auditory receptive fields. PLoS One 2015; 10:e0119032. [PMID: 25822973 PMCID: PMC4379182 DOI: 10.1371/journal.pone.0119032] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2014] [Accepted: 01/24/2015] [Indexed: 11/19/2022] Open
Abstract
We present a theory by which idealized models of auditory receptive fields can be derived in a principled axiomatic manner, from a set of structural properties to (i) enable invariance of receptive field responses under natural sound transformations and (ii) ensure internal consistency between spectro-temporal receptive fields at different temporal and spectral scales. For defining a time-frequency transformation of a purely temporal sound signal, it is shown that the framework allows for a new way of deriving the Gabor and Gammatone filters as well as a novel family of generalized Gammatone filters, with additional degrees of freedom to obtain different trade-offs between the spectral selectivity and the temporal delay of time-causal temporal window functions. When applied to the definition of a second-layer of receptive fields from a spectrogram, it is shown that the framework leads to two canonical families of spectro-temporal receptive fields, in terms of spectro-temporal derivatives of either spectro-temporal Gaussian kernels for non-causal time or a cascade of time-causal first-order integrators over the temporal domain and a Gaussian filter over the logspectral domain. For each filter family, the spectro-temporal receptive fields can be either separable over the time-frequency domain or be adapted to local glissando transformations that represent variations in logarithmic frequencies over time. Within each domain of either non-causal or time-causal time, these receptive field families are derived by uniqueness from the assumptions. It is demonstrated how the presented framework allows for computation of basic auditory features for audio processing and that it leads to predictions about auditory receptive fields with good qualitative similarity to biological receptive fields measured in the inferior colliculus (ICC) and primary auditory cortex (A1) of mammals.
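The classical Gammatone filter that the framework rederives (and generalizes) has the impulse response t^(n-1)·exp(-2πb·ERB(f)·t)·cos(2πft). A brief sketch, using the standard Glasberg and Moore ERB formula and conventional parameter values as assumptions, builds a small filterbank of such impulse responses; the paper's generalized Gammatone family adds further degrees of freedom not shown here.

```python
import numpy as np

def erb(fc):
    """Equivalent rectangular bandwidth (Hz) at centre frequency fc (Glasberg & Moore, 1990)."""
    return 24.7 * (4.37 * fc / 1000.0 + 1.0)

def gammatone_ir(fc, fs=16000, dur=0.05, order=4, b=1.019):
    """Gammatone impulse response: t^(n-1) exp(-2*pi*b*ERB(fc)*t) cos(2*pi*fc*t)."""
    t = np.arange(int(dur * fs)) / fs
    g = t ** (order - 1) * np.exp(-2 * np.pi * b * erb(fc) * t) * np.cos(2 * np.pi * fc * t)
    return g / np.max(np.abs(g))

# A small gammatone filterbank spanning 100 Hz to 4 kHz (log-spaced centre frequencies)
centre_freqs = np.geomspace(100, 4000, 16)
bank = np.stack([gammatone_ir(fc) for fc in centre_freqs])
print(bank.shape)   # (filters, samples)
```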
Collapse
Affiliation(s)
- Tony Lindeberg
- Department of Computational Biology, School of Computer Science and Communication, KTH Royal Institute of Technology, Stockholm, Sweden
| | - Anders Friberg
- Department of Speech, Music and Hearing, School of Computer Science and Communication, KTH Royal Institute of Technology, Stockholm, Sweden
| |
Collapse
|
37
|
King J, Insanally M, Jin M, Martins ARO, D'amour JA, Froemke RC. Rodent auditory perception: Critical band limitations and plasticity. Neuroscience 2015; 296:55-65. [PMID: 25827498 DOI: 10.1016/j.neuroscience.2015.03.053] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2014] [Revised: 03/20/2015] [Accepted: 03/22/2015] [Indexed: 10/23/2022]
Abstract
What do animals hear? While it remains challenging to adequately assess sensory perception in animal models, it is important to determine perceptual abilities in model systems to understand how physiological processes and plasticity relate to perception, learning, and cognition. Here we discuss hearing in rodents, reviewing previous and recent behavioral experiments querying acoustic perception in rats and mice, and examining the relation between behavioral data and electrophysiological recordings from the central auditory system. We focus on measurements of critical bands, which are psychoacoustic phenomena that seem to have a neural basis in the functional organization of the cochlea and the inferior colliculus. We then discuss how behavioral training, brain stimulation, and neuropathology impact auditory processing and perception.
Collapse
Affiliation(s)
- J King
- Skirball Institute for Biomolecular Medicine, New York University School of Medicine, New York, NY, USA; Department of Otolaryngology, New York University School of Medicine, New York, NY, USA; Department of Neuroscience and Physiology, New York University School of Medicine, New York, NY, USA; Center for Neural Science, New York University, New York, NY, USA
| | - M Insanally
- Skirball Institute for Biomolecular Medicine, New York University School of Medicine, New York, NY, USA; Department of Otolaryngology, New York University School of Medicine, New York, NY, USA; Department of Neuroscience and Physiology, New York University School of Medicine, New York, NY, USA; Center for Neural Science, New York University, New York, NY, USA
| | - M Jin
- Skirball Institute for Biomolecular Medicine, New York University School of Medicine, New York, NY, USA; Department of Otolaryngology, New York University School of Medicine, New York, NY, USA; Department of Neuroscience and Physiology, New York University School of Medicine, New York, NY, USA; Center for Neural Science, New York University, New York, NY, USA
| | - A R O Martins
- Skirball Institute for Biomolecular Medicine, New York University School of Medicine, New York, NY, USA; Department of Otolaryngology, New York University School of Medicine, New York, NY, USA; Department of Neuroscience and Physiology, New York University School of Medicine, New York, NY, USA; Center for Neural Science, New York University, New York, NY, USA; PhD Programme in Experimental Biology and Biomedicine, Center for Neurosciences and Cell Biology, University of Coimbra, Portugal
| | - J A D'amour
- Skirball Institute for Biomolecular Medicine, New York University School of Medicine, New York, NY, USA; Department of Otolaryngology, New York University School of Medicine, New York, NY, USA; Department of Neuroscience and Physiology, New York University School of Medicine, New York, NY, USA; Center for Neural Science, New York University, New York, NY, USA
| | - R C Froemke
- Skirball Institute for Biomolecular Medicine, New York University School of Medicine, New York, NY, USA; Department of Otolaryngology, New York University School of Medicine, New York, NY, USA; Department of Neuroscience and Physiology, New York University School of Medicine, New York, NY, USA; Center for Neural Science, New York University, New York, NY, USA.
| |
Collapse
|
38
|
Oetjen A, Verhey JL. Spectro-temporal modulation masking patterns reveal frequency selectivity. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2015; 137:714-723. [PMID: 25698006 DOI: 10.1121/1.4906171] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/04/2023]
Abstract
The present study investigated the possibility that the human auditory system demonstrates frequency selectivity to spectro-temporal amplitude modulations. Threshold modulation depth for detecting sinusoidal spectro-temporal modulations was measured using a generalized masked threshold pattern paradigm with narrowband masker modulations. Four target spectro-temporal modulations were examined, differing in their temporal and spectral modulation frequencies: a temporal modulation of -8, 8, or 16 Hz combined with a spectral modulation of 1 cycle/octave and a temporal modulation of 4 Hz combined with a spectral modulation of 0.5 cycles/octave. The temporal center frequencies of the masker modulation ranged from 0.25 to 4 times the target temporal modulation. The spectral masker-modulation center-frequencies were 0, 0.5, 1, 1.5, and 2 times the target spectral modulation. For all target modulations, the pattern of average thresholds for the eight normal-hearing listeners was consistent with the hypothesis of a spectro-temporal modulation filter. Such a pattern of modulation-frequency sensitivity was predicted on the basis of psychoacoustical data for purely temporal amplitude modulations and purely spectral amplitude modulations. An analysis of separability indicates that, for the present data set, selectivity in the spectro-temporal modulation domain can be described by a combination of a purely spectral and a purely temporal modulation filter function.
Collapse
Affiliation(s)
- Arne Oetjen
- Acoustics Group, Carl von Ossietzky University Oldenburg, Carl von Ossietzky Str. 9-11, 26129 Oldenburg, Germany
| | - Jesko L Verhey
- Department of Experimental Audiology, Otto von Guericke University Magdeburg, 39120 Magdeburg, Germany
| |
Collapse
|
39
|
Escabí MA, Read HL, Viventi J, Kim DH, Higgins NC, Storace DA, Liu ASK, Gifford AM, Burke JF, Campisi M, Kim YS, Avrin AE, Spiegel Jan VD, Huang Y, Li M, Wu J, Rogers JA, Litt B, Cohen YE. A high-density, high-channel count, multiplexed μECoG array for auditory-cortex recordings. J Neurophysiol 2014; 112:1566-83. [PMID: 24920021 DOI: 10.1152/jn.00179.2013] [Citation(s) in RCA: 56] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
Our understanding of the large-scale population dynamics of neural activity is limited, in part, by our inability to record simultaneously from large regions of the cortex. Here, we validated the use of a large-scale active microelectrode array that simultaneously records 196 multiplexed micro-electrocorticographic (μECoG) signals from the cortical surface at a very high density (1,600 electrodes/cm²). We compared μECoG measurements in auditory cortex using a custom "active" electrode array to those recorded using a conventional "passive" μECoG array. Both of these array responses were also compared with data recorded via intrinsic optical imaging, which is a standard methodology for recording sound-evoked cortical activity. Custom active μECoG arrays generated more veridical representations of the tonotopic organization of the auditory cortex than current commercially available passive μECoG arrays. Furthermore, the cortical representation could be measured efficiently with the active arrays, requiring as little as 13.5 s of neural data acquisition. Next, we generated spectrotemporal receptive fields from the recorded neural activity on the active μECoG array and identified functional organizational principles comparable to those observed using intrinsic metabolic imaging and single-neuron recordings. This new electrode array technology has the potential for large-scale, temporally precise monitoring and mapping of the cortex, without the use of invasive penetrating electrodes.
Collapse
Affiliation(s)
- Monty A Escabí
- Department of Psychology, University of Connecticut, Storrs, Connecticut; Department of Biomedical Engineering, University of Connecticut, Storrs, Connecticut; Department of Electrical Engineering, University of Connecticut, Storrs, Connecticut
| | - Heather L Read
- Department of Psychology, University of Connecticut, Storrs, Connecticut; Department of Biomedical Engineering, University of Connecticut, Storrs, Connecticut
| | - Jonathan Viventi
- Center for Neural Science, New York University, New York, New York; Department of Electrical and Computer Engineering, Polytechnic Institute of New York University, Brooklyn, New York
| | - Dae-Hyeong Kim
- Center for Nanoparticle Research of Institute for Basic Science, School of Chemical and Biological Engineering, Seoul National University, Seoul, Republic of Korea
| | - Nathan C Higgins
- Department of Psychology, University of Connecticut, Storrs, Connecticut
| | - Douglas A Storace
- Department of Psychology, University of Connecticut, Storrs, Connecticut
| | - Andrew S K Liu
- Bioengineering Graduate Group, University of Pennsylvania, Philadelphia, Pennsylvania
| | - Adam M Gifford
- Neuroscience Graduate Group, University of Pennsylvania, Philadelphia, Pennsylvania
| | - John F Burke
- Neuroscience Graduate Group, University of Pennsylvania, Philadelphia, Pennsylvania
| | - Matthew Campisi
- Department of Electrical and Computer Engineering, Polytechnic Institute of New York University, Brooklyn, New York
| | - Yun-Soung Kim
- Department of Materials Science and Engineering, Beckman Institute for Advanced Science and Technology and Frederick Seitz Materials Research Laboratory, University of Illinois at Urbana-Champaign, Urbana, Illinois
| | - Andrew E Avrin
- Department of Electrical and Systems Engineering, University of Pennsylvania, Philadelphia, Pennsylvania
- Jan Van der Spiegel
- Department of Electrical and Systems Engineering, University of Pennsylvania, Philadelphia, Pennsylvania
| | - Yonggang Huang
- Departments of Mechanical Engineering and Civil and Environmental Engineering, Northwestern University, Evanston, Illinois
| | - Ming Li
- State Key Laboratory of Structural Analysis for Industrial Equipment, Dalian University of Technology, Dalian, China
| | - Jian Wu
- Department of Engineering Mechanics, Tsinghua University, Beijing, China
| | - John A Rogers
- Department of Materials Science and Engineering, Beckman Institute for Advanced Science and Technology and Frederick Seitz Materials Research Laboratory, University of Illinois at Urbana-Champaign, Urbana, Illinois
| | - Brian Litt
- Department of Bioengineering, University of Pennsylvania, Philadelphia, Pennsylvania; Department of Neurology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania
| | - Yale E Cohen
- Department of Bioengineering, University of Pennsylvania, Philadelphia, Pennsylvania; Department of Neuroscience, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania; and Department of Otorhinolaryngology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania
| |
Collapse
|
40
|
Fontaine B, MacLeod KM, Lubejko ST, Steinberg LJ, Köppl C, Peña JL. Emergence of band-pass filtering through adaptive spiking in the owl's cochlear nucleus. J Neurophysiol 2014; 112:430-45. [PMID: 24790170 DOI: 10.1152/jn.00132.2014] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
In the visual, auditory, and electrosensory modalities, stimuli are defined by first- and second-order attributes. The fast time-pressure signal of a sound, a first-order attribute, is important, for instance, in sound localization and pitch perception, while its slow amplitude-modulated envelope, a second-order attribute, can be used for sound recognition. Ascending the auditory pathway from ear to midbrain, neurons increasingly show a preference for the envelope and are most sensitive to particular envelope modulation frequencies, a tuning considered important for encoding sound identity. The level at which this tuning property emerges along the pathway varies across species, and the mechanism of how this occurs is a matter of debate. In this paper, we target the transition between auditory nerve fibers and the cochlear nucleus angularis (NA). While the owl's auditory nerve fibers simultaneously encode the fast and slow attributes of a sound, one synapse further, NA neurons encode the envelope more efficiently than the auditory nerve. Using in vivo and in vitro electrophysiology and computational analysis, we show that a single-cell mechanism inducing spike threshold adaptation can explain the difference in neural filtering between the two areas. We show that spike threshold adaptation can explain the increased selectivity to modulation frequency, as input level increases in NA. These results demonstrate that a spike generation nonlinearity can modulate the tuning to second-order stimulus features, without invoking network or synaptic mechanisms.
Collapse
Affiliation(s)
- Bertrand Fontaine
- Dominick Purpura Department of Neuroscience, Albert Einstein College of Medicine, Bronx, New York;
| | - Katrina M MacLeod
- Department of Biology, Neuroscience and Cognitive Science Program, University of Maryland, College Park, Maryland; and
| | - Susan T Lubejko
- Department of Biology, Neuroscience and Cognitive Science Program, University of Maryland, College Park, Maryland; and
| | - Louisa J Steinberg
- Dominick Purpura Department of Neuroscience, Albert Einstein College of Medicine, Bronx, New York
| | - Christine Köppl
- Cluster of Excellence "Hearing4all" and Research Center Neurosensory Science and Department of Neuroscience School of Medicine and Health Science, Carl von Ossietzky University, Oldenburg, Germany
| | - Jose L Peña
- Dominick Purpura Department of Neuroscience, Albert Einstein College of Medicine, Bronx, New York
| |
Collapse
|
41
|
Schafer PB, Jin DZ. Noise-Robust Speech Recognition Through Auditory Feature Detection and Spike Sequence Decoding. Neural Comput 2014; 26:523-56. [DOI: 10.1162/neco_a_00557] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]
Abstract
Speech recognition in noisy conditions is a major challenge for computer systems, but the human brain performs it routinely and accurately. Automatic speech recognition (ASR) systems that are inspired by neuroscience can potentially bridge the performance gap between humans and machines. We present a system for noise-robust isolated word recognition that works by decoding sequences of spikes from a population of simulated auditory feature-detecting neurons. Each neuron is trained to respond selectively to a brief spectrotemporal pattern, or feature, drawn from the simulated auditory nerve response to speech. The neural population conveys the time-dependent structure of a sound by its sequence of spikes. We compare two methods for decoding the spike sequences—one using a hidden Markov model–based recognizer, the other using a novel template-based recognition scheme. In the latter case, words are recognized by comparing their spike sequences to template sequences obtained from clean training data, using a similarity measure based on the length of the longest common sub-sequence. Using isolated spoken digits from the AURORA-2 database, we show that our combined system outperforms a state-of-the-art robust speech recognizer at low signal-to-noise ratios. Both the spike-based encoding scheme and the template-based decoding offer gains in noise robustness over traditional speech recognition methods. Our system highlights potential advantages of spike-based acoustic coding and provides a biologically motivated framework for robust ASR development.
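The template-based decoder's similarity measure is the length of the longest common subsequence between a test spike sequence and each word's template sequence. The sketch below implements the standard dynamic-programming LCS and a toy classifier; the feature-detector labels and template words are invented placeholders, not the AURORA-2 setup or the authors' trained feature detectors.

```python
def lcs_length(seq_a, seq_b):
    """Length of the longest common subsequence of two label sequences (O(len_a*len_b) DP)."""
    dp = [[0] * (len(seq_b) + 1) for _ in range(len(seq_a) + 1)]
    for i, a in enumerate(seq_a, 1):
        for j, b in enumerate(seq_b, 1):
            dp[i][j] = dp[i - 1][j - 1] + 1 if a == b else max(dp[i - 1][j], dp[i][j - 1])
    return dp[-1][-1]

def classify(test_seq, templates):
    """Assign the word whose template shares the longest common subsequence with the test."""
    return max(templates, key=lambda w: lcs_length(test_seq, templates[w]))

# Toy templates: each word is a characteristic ordering of feature-detector spike labels
templates = {"one": list("ABCFDE"), "two": list("CCADBE"), "three": list("FEDABC")}
noisy_test = list("ABFDXE")       # a noisy utterance of "one" with a dropped/extra spike
print(classify(noisy_test, templates))   # -> "one"
```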
Affiliation(s)
- Phillip B. Schafer
- Department of Physics and Center for Neural Engineering, The Pennsylvania State University, University Park, PA 16802, U.S.A.
- Dezhe Z. Jin
- Department of Physics and Center for Neural Engineering, The Pennsylvania State University, University Park, PA 16802, U.S.A.
42
Bernstein LR, Trahiotis C. Sensitivity to envelope-based interaural delays at high frequencies: center frequency affects the envelope rate-limitation. J Acoust Soc Am 2014; 135:808-816. [PMID: 25234889 PMCID: PMC3985968 DOI: 10.1121/1.4861251] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/22/2013] [Revised: 12/12/2013] [Accepted: 12/16/2013] [Indexed: 05/31/2023]
Abstract
Sensitivity to ongoing interaural temporal disparities (ITDs) was measured using bandpass-filtered pulse trains centered at 4600, 6500, or 9200 Hz. Save for minor differences in the exact center frequencies, the target stimuli were those employed by Majdak and Laback [J. Acoust. Soc. Am. 125, 3903-3913 (2009)]. At each center frequency, threshold ITD was measured for pulse repetition rates ranging from 64 to 609 Hz. The results and quantitative predictions by a cross-correlation-based model indicated that (1) at most pulse repetition rates, threshold ITD increased with center frequency, (2) the cutoff frequency of the putative envelope low-pass filter that determines sensitivity to ITD at high envelope rates appears to be inversely related to center frequency, and (3) both outcomes were accounted for by assuming that, independent of the center frequency, the listeners' decision variable was a constant criterion change in interaural correlation of the stimuli as processed internally. The finding of an inverse relation between center frequency and the envelope rate limitation, while consistent with much prior literature, runs counter to the conclusion reached by Majdak and Laback.
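As a hedged illustration of the decision variable invoked above (a criterion change in the interaural correlation of internally filtered envelopes), the sketch below extracts Hilbert envelopes, applies an assumed 150 Hz envelope low-pass filter, and computes their normalized zero-lag correlation. The cutoff, filter order, and toy stimulus are placeholders, not values from this study or its model.

```python
import numpy as np
from scipy.signal import hilbert, butter, filtfilt

def interaural_envelope_correlation(left, right, fs, env_cutoff_hz=150.0):
    """Normalized zero-lag correlation of the two ears' low-pass-filtered
    Hilbert envelopes. env_cutoff_hz is an assumed envelope rate limit."""
    b, a = butter(2, env_cutoff_hz / (fs / 2), btype="low")
    env_l = filtfilt(b, a, np.abs(hilbert(left)))
    env_r = filtfilt(b, a, np.abs(hilbert(right)))
    env_l -= env_l.mean()
    env_r -= env_r.mean()
    return np.dot(env_l, env_r) / np.sqrt(np.dot(env_l, env_l) * np.dot(env_r, env_r))

# Toy stimulus: 4-kHz carrier, 300-Hz sinusoidal AM, with an envelope ITD of 250 us.
fs, dur, itd = 48000, 0.2, 250e-6
t = np.arange(0, dur, 1 / fs)
carrier = np.sin(2 * np.pi * 4000 * t)
left = (1 + np.sin(2 * np.pi * 300 * t)) * carrier
right = (1 + np.sin(2 * np.pi * 300 * (t - itd))) * carrier
print(interaural_envelope_correlation(left, right, fs))
```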
Affiliation(s)
- Leslie R Bernstein
- Departments of Neuroscience and Surgery (Otolaryngology), University of Connecticut Health Center, Farmington, Connecticut 06030
- Constantine Trahiotis
- Departments of Neuroscience and Surgery (Otolaryngology), University of Connecticut Health Center, Farmington, Connecticut 06030
43
Santoro R, Moerel M, De Martino F, Goebel R, Ugurbil K, Yacoub E, Formisano E. Encoding of natural sounds at multiple spectral and temporal resolutions in the human auditory cortex. PLoS Comput Biol 2014; 10:e1003412. [PMID: 24391486 PMCID: PMC3879146 DOI: 10.1371/journal.pcbi.1003412] [Citation(s) in RCA: 121] [Impact Index Per Article: 12.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2013] [Accepted: 11/12/2013] [Indexed: 11/18/2022] Open
Abstract
Functional neuroimaging research provides detailed observations of the response patterns that natural sounds (e.g., human voices and speech, animal cries, environmental sounds) evoke in the human brain. The computational and representational mechanisms underlying these observations, however, remain largely unknown. Here we combine high spatial resolution (3 and 7 Tesla) functional magnetic resonance imaging (fMRI) with computational modeling to reveal how natural sounds are represented in the human brain. We compare competing models of sound representations and select the model that most accurately predicts fMRI response patterns to natural sounds. Our results show that the cortical encoding of natural sounds entails the formation of multiple representations of sound spectrograms with different degrees of spectral and temporal resolution. The cortex derives these multi-resolution representations through frequency-specific neural processing channels and through the combined analysis of the spectral and temporal modulations in the spectrogram. Furthermore, our findings suggest that a spectral-temporal resolution trade-off may govern the modulation tuning of neuronal populations throughout the auditory cortex. Specifically, our fMRI results suggest that neuronal populations in posterior/dorsal auditory regions preferentially encode coarse spectral information with high temporal precision, whereas neuronal populations in anterior/ventral auditory regions preferentially encode fine-grained spectral information with low temporal precision. We propose that such a multi-resolution analysis may be crucially relevant for flexible and behaviorally relevant sound processing and may constitute one of the computational underpinnings of functional specialization in auditory cortex.

How does the human brain analyze natural sounds? Previous functional neuroimaging research could only describe the response patterns that sounds evoke in the human brain at the level of preferential regional activations. A comprehensive account of the neural basis of human hearing, however, requires deriving computational models that are able to provide quantitative predictions of brain responses to natural sounds. Here, we make a significant step in this direction by combining functional magnetic resonance imaging (fMRI) with computational modeling. We compare competing computational models of sound representations and select the model that most accurately predicts the measured fMRI response patterns. The computational models describe the processing of three relevant properties of natural sounds: frequency, temporal modulations, and spectral modulations. We find that a model that represents spectral and temporal modulations jointly and in a frequency-dependent fashion provides the best account of fMRI responses, and that the functional specialization of auditory cortical fields can be partially accounted for by their modulation tuning. Our results provide insights into how natural sounds are encoded in human auditory cortex, and our methodological approach constitutes an advance in the way this question can be addressed in future studies.
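For orientation, the sketch below computes a crude joint modulation power spectrum, the kind of spectrotemporal-modulation description the winning model builds on, by taking a 2-D Fourier transform of a log-magnitude spectrogram. The window sizes, the linear-frequency axis, and the test signal are simplifying assumptions; this is not the authors' encoding model.

```python
import numpy as np
from scipy.signal import spectrogram

def modulation_power_spectrum(x, fs, nperseg=512, noverlap=384):
    """Crude joint modulation spectrum: 2-D FFT of a log-magnitude spectrogram.
    Columns index temporal modulation (Hz); rows index spectral modulation
    (cycles/Hz on a linear axis; a real model would use a log-frequency axis
    to obtain cycles/octave)."""
    f, t, sxx = spectrogram(x, fs=fs, nperseg=nperseg, noverlap=noverlap)
    log_s = np.log(sxx + 1e-12)
    log_s -= log_s.mean()                       # remove DC before the 2-D FFT
    mps = np.abs(np.fft.fftshift(np.fft.fft2(log_s))) ** 2
    frame_rate = fs / (nperseg - noverlap)      # spectrogram frames per second
    temp_mod = np.fft.fftshift(np.fft.fftfreq(len(t), d=1 / frame_rate))
    spec_mod = np.fft.fftshift(np.fft.fftfreq(len(f), d=f[1] - f[0]))
    return spec_mod, temp_mod, mps

# Toy input: white noise with a 10-Hz amplitude modulation.
fs = 16000
t = np.arange(0, 2.0, 1 / fs)
x = (1 + 0.8 * np.sin(2 * np.pi * 10 * t)) * np.random.randn(t.size)
spec_mod, temp_mod, mps = modulation_power_spectrum(x, fs)
peak = np.unravel_index(np.argmax(mps), mps.shape)
print("temporal modulation at global peak:", abs(temp_mod[peak[1]]), "Hz")
```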
Affiliation(s)
- Roberta Santoro
- Department of Cognitive Neuroscience, Faculty of Psychology and Neuroscience, Maastricht University, Maastricht, The Netherlands
- Maastricht Brain Imaging Center (MBIC), Maastricht, The Netherlands
- Michelle Moerel
- Department of Cognitive Neuroscience, Faculty of Psychology and Neuroscience, Maastricht University, Maastricht, The Netherlands
- Maastricht Brain Imaging Center (MBIC), Maastricht, The Netherlands
- Federico De Martino
- Department of Cognitive Neuroscience, Faculty of Psychology and Neuroscience, Maastricht University, Maastricht, The Netherlands
- Maastricht Brain Imaging Center (MBIC), Maastricht, The Netherlands
- Center for Magnetic Resonance Research, Department of Radiology, University of Minnesota, Minneapolis, Minnesota, United States of America
- Rainer Goebel
- Department of Cognitive Neuroscience, Faculty of Psychology and Neuroscience, Maastricht University, Maastricht, The Netherlands
- Maastricht Brain Imaging Center (MBIC), Maastricht, The Netherlands
- Department of Neuroimaging and Neuromodeling, Netherlands Institute for Neuroscience, Royal Netherlands Academy of Arts and Sciences (KNAW), Amsterdam, The Netherlands
- Kamil Ugurbil
- Center for Magnetic Resonance Research, Department of Radiology, University of Minnesota, Minneapolis, Minnesota, United States of America
- Essa Yacoub
- Center for Magnetic Resonance Research, Department of Radiology, University of Minnesota, Minneapolis, Minnesota, United States of America
- Elia Formisano
- Department of Cognitive Neuroscience, Faculty of Psychology and Neuroscience, Maastricht University, Maastricht, The Netherlands
- Maastricht Brain Imaging Center (MBIC), Maastricht, The Netherlands
44
Atencio CA, Shih JY, Schreiner CE, Cheung SW. Primary auditory cortical responses to electrical stimulation of the thalamus. J Neurophysiol 2013; 111:1077-87. [PMID: 24335216 DOI: 10.1152/jn.00749.2012] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
Cochlear implant electrical stimulation of the auditory system to rehabilitate deafness has been remarkably successful. Its deployment requires both an intact auditory nerve and a suitably patent cochlear lumen. When disease makes these prerequisites unattainable, as in neurofibromatosis type II and cochlear obliterans, alternative treatment targets are considered. Electrical stimulation of the cochlear nucleus and midbrain in humans has delivered encouraging clinical outcomes, buttressing the promise of central auditory prostheses to mitigate deafness in those who are not candidates for cochlear implantation. In this study we explored another possible implant target: the auditory thalamus. In anesthetized cats, we first presented pure tones to determine frequency preferences of thalamic and cortical sites. We then electrically stimulated tonotopically organized thalamic sites while recording from primary auditory cortical sites using a multichannel recording probe. Cathode-leading biphasic thalamic stimulation thresholds that evoked cortical responses were much lower than those reported for cochlear and midbrain stimulation. Cortical activation dynamic ranges were similar to those reported for cochlear stimulation, but they were narrower than those found through midbrain stimulation. Our results imply that thalamic stimulation can activate auditory cortex at low electrical current levels and suggest that an auditory thalamic implant may be a viable central auditory prosthesis.
Affiliation(s)
- Craig A Atencio
- Coleman Memorial Laboratory, Department of Otolaryngology-Head and Neck Surgery, University of California, San Francisco, California
45
McCreery D, Han M, Pikov V, Yadav K, Pannu S. Encoding of the amplitude modulation of pulsatile electrical stimulation in the feline cochlear nucleus by neurons in the inferior colliculus; effects of stimulus pulse rate. J Neural Eng 2013; 10:056010. [PMID: 23928683 DOI: 10.1088/1741-2560/10/5/056010] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
OBJECTIVES: Persons without a functional auditory nerve cannot benefit from cochlear implants, but some hearing can be restored by an auditory brainstem implant (ABI) with stimulating electrodes implanted on the surface of the cochlear nucleus (CN). Most users benefit from their ABI, but speech recognition tends to be poorer than for users of cochlear implants. Psychophysical studies suggest that poor modulation detection may contribute to the limited performance of ABI users. In a cat model, we determined how the pulse rate of the electrical stimulus applied within or on the CN affects temporal and rate encoding of amplitude modulation (AM) by neurons in the central nucleus of the inferior colliculus (ICC).
APPROACH: Stimulating microelectrodes were implanted chronically in and on the cats' CN, and multi-site recording microelectrodes were implanted chronically into the ICC. Encoding of AM pulse trains by neurons in the ICC was characterized as vector strength (VS), the synchrony of neural activity with the AM, and as the mean rate of neuronal action potentials (neuronal spike rate, NSR).
MAIN RESULTS: For intranuclear microstimulation, encoding of AM as VS was up to 3 dB greater when stimulus pulse rate was increased from 250 to 500 pps, but only for neuronal units with low best acoustic frequencies and when the electrical stimulation was modulated at low frequencies (10-20 Hz). For stimulation on the surface of the CN, VS was similar at 250 and 500 pps, and the dynamic range of the VS was reduced for pulse rates greater than 250 pps. Modulation depth was encoded strongly as VS when the maximum stimulus amplitude was held constant across a range of modulation depths. This 'constant maximum' protocol allows enhancement of modulation depth while preserving overall dynamic range. However, modulation depth was not encoded as strongly as NSR.
SIGNIFICANCE: The findings have implications for improved sound processors for present and future ABIs. The performance of ABIs may benefit from using pulse rates greater than those presently used in most ABIs, and from sound processing strategies that enhance the modulation depth of the electrical stimulus while preserving dynamic range.
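The temporal metric used above, vector strength (VS), maps each spike to a unit vector at its phase within the modulation cycle and takes the length of the mean vector (1 for perfect phase locking, 0 for none). A small sketch with made-up spike times rather than the study's data:

```python
import numpy as np

def vector_strength(spike_times, mod_freq_hz):
    """Vector strength of phase locking to a modulation frequency:
    VS = | mean over spikes of exp(i * 2*pi*f*t) |."""
    phases = 2 * np.pi * mod_freq_hz * np.asarray(spike_times)
    return np.abs(np.mean(np.exp(1j * phases)))

# Made-up example: spikes clustered near one phase of a 20-Hz AM cycle
# give high VS; uniformly scattered spikes give VS near 0.
rng = np.random.default_rng(0)
locked = np.arange(0, 1.0, 0.05) + rng.normal(0, 0.002, 20)   # near cycle onsets
scattered = rng.uniform(0, 1.0, 20)
print("locked VS   :", round(vector_strength(locked, 20.0), 3))
print("scattered VS:", round(vector_strength(scattered, 20.0), 3))
```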
46
Fontaine B, Steinberg LJ, Peña JL. Sound envelope extraction in cochlear nucleus neurons: modulation filterbank and cellular mechanism. BMC Neurosci 2013; 14(Suppl 1):P312. [PMCID: PMC3704833 DOI: 10.1186/1471-2202-14-s1-p312] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
47
Chandrasekaran L, Xiao Y, Sivaramakrishnan S. Functional architecture of the inferior colliculus revealed with voltage-sensitive dyes. Front Neural Circuits 2013; 7:41. [PMID: 23518906 PMCID: PMC3602642 DOI: 10.3389/fncir.2013.00041] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2012] [Accepted: 02/28/2013] [Indexed: 11/22/2022] Open
Abstract
We used optical imaging with voltage-sensitive dyes to investigate the spatio-temporal dynamics of synaptically evoked activity in brain slices of the inferior colliculus (IC). Responses in transverse slices which preserve cross-frequency connections and in modified sagittal slices that preserve connections within frequency laminae were evoked by activating the lateral lemniscal tract. Comparing activity between small and large populations of cells revealed response areas in the central nucleus of the IC that were similar in magnitude but graded temporally. In transverse sections, these response areas are summed to generate a topographic response profile. Activity through the commissure to the contralateral IC required an excitation threshold that was reached when GABAergic inhibition was blocked. Within laminae, module interaction created temporal homeostasis. Diffuse activity evoked by a single lemniscal shock re-organized into distinct spatial and temporal compartments when stimulus trains were used, and generated a directional activity profile within the lamina. Using different stimulus patterns to activate subsets of microcircuits in the central nucleus of the IC, we found that localized responses evoked by low-frequency stimulus trains spread extensively when train frequency was increased, suggesting recruitment of silent microcircuits. Long stimulus trains activated a circuit specific to post-inhibitory rebound neurons. Rebound microcircuits were defined by a focal point of initiation that spread to an annular ring that oscillated between inhibition and excitation. We propose that much of the computing power of the IC is derived from local circuits, some of which are cell-type specific. These circuits organize activity within and across frequency laminae, and are critical in determining the stimulus-selectivity of auditory coding.
Affiliation(s)
- Lakshmi Chandrasekaran
- Department of Anatomy and Neurobiology, Northeast Ohio Medical University Rootstown, OH, USA
48
Bernstein LR, Trahiotis C. When and how envelope "rate-limitations" affect processing of interaural temporal disparities conveyed by high-frequency stimuli. Adv Exp Med Biol 2013; 787:263-71. [PMID: 23716232 DOI: 10.1007/978-1-4614-1590-9_30] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]
Abstract
The purpose of this chapter is to bring together historical and current findings that reveal the presence, influence, and operation of a type of envelope “rate-limitation.” The rate-limitation has been revealed in both monaural and binaural experiments. Specifically, there appears to be a low-pass envelope-filtering process that (1) functionally attenuates fluctuations of the envelope above about 150 Hz and (2) is not attributable to peripheral band-pass filtering. We show a variety of empirical outcomes and theoretical analyses that converge to demonstrate and to describe how this type of filtering constrains the processing of interaural temporal disparities (ITDs) conveyed by the envelopes of high-frequency stimuli in experiments concerning binaural detection. Included are recent behavioral and neurophysiological findings regarding how such filtering may vary with the center frequency of the stimulus.
Affiliation(s)
- Leslie R Bernstein
- Department of Neuroscience and Surgery, University of Connecticut Health Center, Farmington, CT 06030, USA.
49
Chechik G, Nelken I. Auditory abstraction from spectro-temporal features to coding auditory entities. Proc Natl Acad Sci U S A 2012; 109:18968-73. [PMID: 23112145 DOI: 10.1073/pnas.1111242109] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The auditory system extracts behaviorally relevant information from acoustic stimuli. The average activity in auditory cortex is known to be sensitive to spectro-temporal patterns in sounds. However, it is not known whether the auditory cortex also processes more abstract features of sounds, which may be more behaviorally relevant than spectro-temporal patterns. Using recordings from three stations of the auditory pathway, the inferior colliculus (IC), the ventral division of the medial geniculate body (MGB) of the thalamus, and the primary auditory cortex (A1) of the cat in response to natural sounds, we compared the amount of information that spikes contained about two aspects of the stimuli: spectro-temporal patterns, and abstract entities present in the same stimuli such as a bird chirp, its echoes, and the ambient noise. IC spikes conveyed on average approximately the same amount of information about spectro-temporal patterns as they conveyed about abstract auditory entities, but A1 and the MGB neurons conveyed on average three times more information about abstract auditory entities than about spectro-temporal patterns. Thus, the majority of neurons in auditory thalamus and cortex coded well the presence of abstract entities in the sounds without containing much information about their spectro-temporal structure, suggesting that they are sensitive to abstract features in these sounds.
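The comparison above hinges on estimating mutual information between spike responses and two different stimulus descriptions. The sketch below uses a plain plug-in (histogram) estimator on fabricated toy labels; the authors' actual information estimator and any bias corrections are not reproduced here.

```python
import numpy as np

def mutual_information(x_labels, y_labels):
    """Plug-in mutual information (in bits) between two discrete label arrays.
    No bias correction; adequate only as a toy illustration."""
    x_labels = np.asarray(x_labels)
    y_labels = np.asarray(y_labels)
    mi = 0.0
    for xv in np.unique(x_labels):
        px = np.mean(x_labels == xv)
        for yv in np.unique(y_labels):
            py = np.mean(y_labels == yv)
            pxy = np.mean((x_labels == xv) & (y_labels == yv))
            if pxy > 0:
                mi += pxy * np.log2(pxy / (px * py))
    return mi

# Toy data: binned spike counts that track an "auditory entity" label (chirp /
# echo / noise) more closely than an unrelated spectrotemporal-bin label.
rng = np.random.default_rng(1)
entity = rng.integers(0, 3, 2000)                          # 3 abstract entities
spikes = np.clip(entity + rng.integers(0, 2, 2000), 0, 3)  # counts follow entity
spectro_bin = rng.integers(0, 3, 2000)                     # unrelated feature label
print("I(spikes; entity)  ~", round(mutual_information(spikes, entity), 2), "bits")
print("I(spikes; spectro) ~", round(mutual_information(spikes, spectro_bin), 2), "bits")
```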
50
Chen C, Rodriguez FC, Read HL, Escabí MA. Spectrotemporal sound preferences of neighboring inferior colliculus neurons: implications for local circuitry and processing. Front Neural Circuits 2012; 6:62. [PMID: 23060750 PMCID: PMC3461703 DOI: 10.3389/fncir.2012.00062] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2012] [Accepted: 08/19/2012] [Indexed: 11/13/2022] Open
Abstract
How do local circuits in the inferior colliculus (IC) process and transform spectral and temporal sound information? Using a four-tetrode array, we examined the functional properties of the IC and metrics of its microcircuitry by recording neural activity from neighboring single neurons in the cat. Spectral and temporal response preferences were compared for neurons found on the same and adjacent tetrodes, as well as across distant recording sites. We found that neighboring neurons had similar preferences, while neurons recorded across distant sites were less similar. Best frequency (BF) was the most correlated parameter between neighboring neurons, and BF differences exhibited unique clustering at ~0.3 octave intervals, indicative of the frequency-band laminae. Other spectral and temporal parameters of the receptive fields were more similar for neighboring neurons than for those at distant sites, and receptive field similarity was larger for neurons with small differences in BF. Furthermore, correlated firing was stronger for neighboring neuron pairs and increased with proximity and decreasing BF difference. Thus, although response selectivities are quite diverse in the IC, spectral and temporal preferences within a local microcircuit are functionally quite similar. This suggests a scheme where local circuits are organized into zones that are specialized for processing distinct spectrotemporal cues.
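Two of the pairwise metrics compared above, best-frequency difference in octaves and receptive-field similarity, can be written compactly. The sketch below shows one plausible formulation (Pearson correlation of flattened spectrotemporal receptive fields) on synthetic Gaussian receptive fields; the exact similarity index used in the study may differ.

```python
import numpy as np

def bf_difference_octaves(bf1_hz, bf2_hz):
    """Absolute best-frequency difference expressed in octaves."""
    return abs(np.log2(bf1_hz / bf2_hz))

def strf_similarity(strf_a, strf_b):
    """Receptive-field similarity as the Pearson correlation between the
    flattened spectrotemporal receptive fields (one common convention)."""
    a = strf_a.ravel() - strf_a.mean()
    b = strf_b.ravel() - strf_b.mean()
    return float(np.dot(a, b) / np.sqrt(np.dot(a, a) * np.dot(b, b)))

# Synthetic example: two Gaussian-bump STRFs whose spectral centers differ slightly.
freq_bins, time_bins = np.meshgrid(np.arange(40), np.arange(30), indexing="ij")
strf1 = np.exp(-((freq_bins - 20) ** 2 + (time_bins - 15) ** 2) / 30.0)
strf2 = np.exp(-((freq_bins - 24) ** 2 + (time_bins - 15) ** 2) / 30.0)
print("BF difference  :", round(bf_difference_octaves(4000, 4800), 2), "octaves")
print("STRF similarity:", round(strf_similarity(strf1, strf2), 2))
```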
Affiliation(s)
- Chen Chen
- Department of Electrical and Computer Engineering, University of Connecticut Storrs, CT, USA