101
|
Livezey JA, Bouchard KE, Chang EF. Deep learning as a tool for neural data analysis: Speech classification and cross-frequency coupling in human sensorimotor cortex. PLoS Comput Biol 2019; 15:e1007091. [PMID: 31525179 PMCID: PMC6762206 DOI: 10.1371/journal.pcbi.1007091] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2018] [Revised: 09/26/2019] [Accepted: 05/10/2019] [Indexed: 11/26/2022] Open
Abstract
A fundamental challenge in neuroscience is to understand what structure in the world is represented in spatially distributed patterns of neural activity from multiple single-trial measurements. This is often accomplished by learning a simple, linear transformations between neural features and features of the sensory stimuli or motor task. While successful in some early sensory processing areas, linear mappings are unlikely to be ideal tools for elucidating nonlinear, hierarchical representations of higher-order brain areas during complex tasks, such as the production of speech by humans. Here, we apply deep networks to predict produced speech syllables from a dataset of high gamma cortical surface electric potentials recorded from human sensorimotor cortex. We find that deep networks had higher decoding prediction accuracy compared to baseline models. Having established that deep networks extract more task relevant information from neural data sets relative to linear models (i.e., higher predictive accuracy), we next sought to demonstrate their utility as a data analysis tool for neuroscience. We first show that deep network's confusions revealed hierarchical latent structure in the neural data, which recapitulated the underlying articulatory nature of speech motor control. We next broadened the frequency features beyond high-gamma and identified a novel high-gamma-to-beta coupling during speech production. Finally, we used deep networks to compare task-relevant information in different neural frequency bands, and found that the high-gamma band contains the vast majority of information relevant for the speech prediction task, with little-to-no additional contribution from lower-frequency amplitudes. Together, these results demonstrate the utility of deep networks as a data analysis tool for basic and applied neuroscience.
Collapse
Affiliation(s)
- Jesse A. Livezey
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, California, United States of America
- Redwood Center for Theoretical Neuroscience, University of California, Berkeley, Berkeley, California, United States of America
| | - Kristofer E. Bouchard
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, California, United States of America
- Redwood Center for Theoretical Neuroscience, University of California, Berkeley, Berkeley, California, United States of America
- Helen Wills Neuroscience Institute, University of California, Berkeley, Berkeley, California, United States of America
| | - Edward F. Chang
- Department of Neurological Surgery and Department of Physiology, University of California, San Francisco, San Francisco, California, United States of America
- Center for Integrative Neuroscience, University of California, San Francisco, San Francisco, California, United States of America
- UCSF Epilepsy Center, University of California, San Francisco, San Francisco, California, United States of America
| |
Collapse
|
102
|
Zhang C, Qiao K, Wang L, Tong L, Hu G, Zhang RY, Yan B. A visual encoding model based on deep neural networks and transfer learning for brain activity measured by functional magnetic resonance imaging. J Neurosci Methods 2019; 325:108318. [DOI: 10.1016/j.jneumeth.2019.108318] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2019] [Revised: 03/29/2019] [Accepted: 06/16/2019] [Indexed: 11/28/2022]
|
103
|
Occam’s Razor for Big Data? On Detecting Quality in Large Unstructured Datasets. APPLIED SCIENCES-BASEL 2019. [DOI: 10.3390/app9153065] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
Detecting quality in large unstructured datasets requires capacities far beyond the limits of human perception and communicability and, as a result, there is an emerging trend towards increasingly complex analytic solutions in data science to cope with this problem. This new trend towards analytic complexity represents a severe challenge for the principle of parsimony (Occam’s razor) in science. This review article combines insight from various domains such as physics, computational science, data engineering, and cognitive science to review the specific properties of big data. Problems for detecting data quality without losing the principle of parsimony are then highlighted on the basis of specific examples. Computational building block approaches for data clustering can help to deal with large unstructured datasets in minimized computation time, and meaning can be extracted rapidly from large sets of unstructured image or video data parsimoniously through relatively simple unsupervised machine learning algorithms. Why we still massively lack in expertise for exploiting big data wisely to extract relevant information for specific tasks, recognize patterns and generate new information, or simply store and further process large amounts of sensor data is then reviewed, and examples illustrating why we need subjective views and pragmatic methods to analyze big data contents are brought forward. The review concludes on how cultural differences between East and West are likely to affect the course of big data analytics, and the development of increasingly autonomous artificial intelligence (AI) aimed at coping with the big data deluge in the near future.
Collapse
|
104
|
Hansen BC, Field DJ, Greene MR, Olson C, Miskovic V. Towards a state-space geometry of neural responses to natural scenes: A steady-state approach. Neuroimage 2019; 201:116027. [PMID: 31325643 DOI: 10.1016/j.neuroimage.2019.116027] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2019] [Revised: 06/13/2019] [Accepted: 07/16/2019] [Indexed: 10/26/2022] Open
Abstract
Our understanding of information processing by the mammalian visual system has come through a variety of techniques ranging from psychophysics and fMRI to single unit recording and EEG. Each technique provides unique insights into the processing framework of the early visual system. Here, we focus on the nature of the information that is carried by steady state visual evoked potentials (SSVEPs). To study the information provided by SSVEPs, we presented human participants with a population of natural scenes and measured the relative SSVEP response. Rather than focus on particular features of this signal, we focused on the full state-space of possible responses and investigated how the evoked responses are mapped onto this space. Our results show that it is possible to map the relatively high-dimensional signal carried by SSVEPs onto a 2-dimensional space with little loss. We also show that a simple biologically plausible model can account for a high proportion of the explainable variance (~73%) in that space. Finally, we describe a technique for measuring the mutual information that is available about images from SSVEPs. The techniques introduced here represent a new approach to understanding the nature of the information carried by SSVEPs. Crucially, this approach is general and can provide a means of comparing results across different neural recording methods. Altogether, our study sheds light on the encoding principles of early vision and provides a much needed reference point for understanding subsequent transformations of the early visual response space to deeper knowledge structures that link different visual environments.
Collapse
Affiliation(s)
- Bruce C Hansen
- Colgate University, Department of Psychological & Brain Sciences, Neuroscience Program, Hamilton, NY, USA.
| | - David J Field
- Cornell University, Department of Psychology, Ithaca, NY, USA
| | | | - Cassady Olson
- Colgate University, Department of Psychological & Brain Sciences, Neuroscience Program, Hamilton, NY, USA; Current Address: University of Chicago, Committee on Computational Neuroscience, Chicago, IL, USA
| | - Vladimir Miskovic
- State University of New York at Binghamton, Department of Psychology, Binghamton, NY, USA
| |
Collapse
|
105
|
Stringer C, Pachitariu M, Steinmetz N, Carandini M, Harris KD. High-dimensional geometry of population responses in visual cortex. Nature 2019; 571:361-365. [PMID: 31243367 PMCID: PMC6642054 DOI: 10.1038/s41586-019-1346-5] [Citation(s) in RCA: 279] [Impact Index Per Article: 46.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2018] [Accepted: 05/29/2019] [Indexed: 01/13/2023]
Abstract
A neuronal population encodes information most efficiently when its stimulus responses are high-dimensional and uncorrelated, and most robustly when they are lower-dimensional and correlated. Here we analysed the dimensionality of the encoding of natural images by large populations of neurons in the visual cortex of awake mice. The evoked population activity was high-dimensional, and correlations obeyed an unexpected power law: the nth principal component variance scaled as 1/n. This scaling was not inherited from the power law spectrum of natural images, because it persisted after stimulus whitening. We proved mathematically that if the variance spectrum was to decay more slowly then the population code could not be smooth, allowing small changes in input to dominate population activity. The theory also predicts larger power-law exponents for lower-dimensional stimulus ensembles, which we validated experimentally. These results suggest that coding smoothness may represent a fundamental constraint that determines correlations in neural population codes.
Collapse
Affiliation(s)
- Carsen Stringer
- HHMI Janelia Research Campus, Ashburn, VA, USA.
- UCL Gatsby Computational Neuroscience Unit, University College London, London, UK.
| | - Marius Pachitariu
- HHMI Janelia Research Campus, Ashburn, VA, USA.
- UCL Institute of Neurology, University College London, London, UK.
| | - Nicholas Steinmetz
- UCL Institute of Neurology, University College London, London, UK
- Department of Biological Structure, University of Washington, Seattle, WA, USA
| | - Matteo Carandini
- UCL Institute of Ophthalmology, University College London, London, UK
| | - Kenneth D Harris
- UCL Institute of Neurology, University College London, London, UK.
| |
Collapse
|
106
|
Task-dependent functional organizations of the visual ventral stream. Sci Rep 2019; 9:9316. [PMID: 31249350 PMCID: PMC6597703 DOI: 10.1038/s41598-019-45707-w] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2019] [Accepted: 06/12/2019] [Indexed: 11/08/2022] Open
Abstract
The visual hierarchy of the ventral stream has been widely studied. However, it remains unclear how the hierarchical system organizes its functional coupling during top-down cognitive process. The present fMRI study investigated task-dependent functional connectivity along the ventral stream, while twenty-eight participants performed object recognition tasks that required different types of visual processing: i) searching or ii) memorizing visual objects embedded in natural scene images or iii) free viewing of the same images. Utilizing a seed-based approach that explicitly compared task-specific BOLD time-series, we identified task-dependent functional connectivity of the visual ventral stream, demonstrating different correlation structures. Searching for a target object manifested both correlated and anti-correlated structures, separating the visual areas V1 and V4 from the posterior part of the inferior temporal cortex (PIT). In contrast, the ventral stream structure remained correlated during memorizing objects, but increased the correlation between the right V4 and PIT. On the other hand, V1 and V4 showed task-dependent activation, whereas PIT was deactivated. These results highlight the context-dependent nature of the visual ventral stream and shed light on how the visual hierarchy is selectively organized to bias object recognition toward features of interest.
Collapse
|
107
|
Cortical Tracking of Complex Sound Envelopes: Modeling the Changes in Response with Intensity. eNeuro 2019; 6:ENEURO.0082-19.2019. [PMID: 31171606 PMCID: PMC6597859 DOI: 10.1523/eneuro.0082-19.2019] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2019] [Revised: 05/02/2019] [Accepted: 05/03/2019] [Indexed: 11/21/2022] Open
Abstract
Characterizing how the brain responds to stimuli has been a goal of sensory neuroscience for decades. One key approach has been to fit linear models to describe the relationship between sensory inputs and neural responses. This has included models aimed at predicting spike trains, local field potentials, BOLD responses, and EEG/MEG. In the case of EEG/MEG, one explicit use of this linear modeling approach has been the fitting of so-called temporal response functions (TRFs). TRFs have been used to study how auditory cortex tracks the amplitude envelope of acoustic stimuli, including continuous speech. However, such linear models typically assume that variations in the amplitude of the stimulus feature (i.e., the envelope) produce variations in the magnitude but not the latency or morphology of the resulting neural response. Here, we show that by amplitude binning the stimulus envelope, and then using it to fit a multivariate TRF, we can better account for these amplitude-dependent changes, and that this leads to a significant improvement in model performance for both amplitude-modulated noise and continuous speech in humans. We also show that this performance can be further improved through the inclusion of an additional envelope representation that emphasizes onsets and positive changes in the stimulus, consistent with the idea that while some neurons track the entire envelope, others respond preferentially to onsets in the stimulus. We contend that these results have practical implications for researchers interested in modeling brain responses to amplitude modulated sounds.
Collapse
|
108
|
Beyeler M, Rounds EL, Carlson KD, Dutt N, Krichmar JL. Neural correlates of sparse coding and dimensionality reduction. PLoS Comput Biol 2019; 15:e1006908. [PMID: 31246948 PMCID: PMC6597036 DOI: 10.1371/journal.pcbi.1006908] [Citation(s) in RCA: 39] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
Abstract
Supported by recent computational studies, there is increasing evidence that a wide range of neuronal responses can be understood as an emergent property of nonnegative sparse coding (NSC), an efficient population coding scheme based on dimensionality reduction and sparsity constraints. We review evidence that NSC might be employed by sensory areas to efficiently encode external stimulus spaces, by some associative areas to conjunctively represent multiple behaviorally relevant variables, and possibly by the basal ganglia to coordinate movement. In addition, NSC might provide a useful theoretical framework under which to understand the often complex and nonintuitive response properties of neurons in other brain areas. Although NSC might not apply to all brain areas (for example, motor or executive function areas) the success of NSC-based models, especially in sensory areas, warrants further investigation for neural correlates in other regions.
Collapse
Affiliation(s)
- Michael Beyeler
- Department of Psychology, University of Washington, Seattle, Washington, United States of America
- Institute for Neuroengineering, University of Washington, Seattle, Washington, United States of America
- eScience Institute, University of Washington, Seattle, Washington, United States of America
- Department of Computer Science, University of California, Irvine, California, United States of America
| | - Emily L. Rounds
- Department of Cognitive Sciences, University of California, Irvine, California, United States of America
| | - Kristofor D. Carlson
- Department of Cognitive Sciences, University of California, Irvine, California, United States of America
- Sandia National Laboratories, Albuquerque, New Mexico, United States of America
| | - Nikil Dutt
- Department of Computer Science, University of California, Irvine, California, United States of America
- Department of Cognitive Sciences, University of California, Irvine, California, United States of America
| | - Jeffrey L. Krichmar
- Department of Computer Science, University of California, Irvine, California, United States of America
- Department of Cognitive Sciences, University of California, Irvine, California, United States of America
| |
Collapse
|
109
|
Kupers ER, Carrasco M, Winawer J. Modeling visual performance differences 'around' the visual field: A computational observer approach. PLoS Comput Biol 2019; 15:e1007063. [PMID: 31125331 PMCID: PMC6553792 DOI: 10.1371/journal.pcbi.1007063] [Citation(s) in RCA: 26] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2018] [Revised: 06/06/2019] [Accepted: 05/02/2019] [Indexed: 01/25/2023] Open
Abstract
Visual performance depends on polar angle, even when eccentricity is held constant; on many psychophysical tasks observers perform best when stimuli are presented on the horizontal meridian, worst on the upper vertical, and intermediate on the lower vertical meridian. This variation in performance 'around' the visual field can be as pronounced as that of doubling the stimulus eccentricity. The causes of these asymmetries in performance are largely unknown. Some factors in the eye, e.g. cone density, are positively correlated with the reported variations in visual performance with polar angle. However, the question remains whether these correlations can quantitatively explain the perceptual differences observed 'around' the visual field. To investigate the extent to which the earliest stages of vision-optical quality and cone density-contribute to performance differences with polar angle, we created a computational observer model. The model uses the open-source software package ISETBIO to simulate an orientation discrimination task for which visual performance differs with polar angle. The model starts from the photons emitted by a display, which pass through simulated human optics with fixational eye movements, followed by cone isomerizations in the retina. Finally, we classify stimulus orientation using a support vector machine to learn a linear classifier on the photon absorptions. To account for the 30% increase in contrast thresholds for upper vertical compared to horizontal meridian, as observed psychophysically on the same task, our computational observer model would require either an increase of ~7 diopters of defocus or a reduction of 500% in cone density. These values far exceed the actual variations as a function of polar angle observed in human eyes. Therefore, we conclude that these factors in the eye only account for a small fraction of differences in visual performance with polar angle. Substantial additional asymmetries must arise in later retinal and/or cortical processing.
Collapse
Affiliation(s)
- Eline R. Kupers
- Department of Psychology, New York University, New York, New York, United States of America
| | - Marisa Carrasco
- Department of Psychology, New York University, New York, New York, United States of America
- Center for Neural Science, New York University, New York, New York, United States of America
| | - Jonathan Winawer
- Department of Psychology, New York University, New York, New York, United States of America
- Center for Neural Science, New York University, New York, New York, United States of America
| |
Collapse
|
110
|
Cadena SA, Denfield GH, Walker EY, Gatys LA, Tolias AS, Bethge M, Ecker AS. Deep convolutional models improve predictions of macaque V1 responses to natural images. PLoS Comput Biol 2019; 15:e1006897. [PMID: 31013278 PMCID: PMC6499433 DOI: 10.1371/journal.pcbi.1006897] [Citation(s) in RCA: 140] [Impact Index Per Article: 23.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2018] [Revised: 05/03/2019] [Accepted: 02/21/2019] [Indexed: 11/18/2022] Open
Abstract
Despite great efforts over several decades, our best models of primary visual cortex (V1) still predict spiking activity quite poorly when probed with natural stimuli, highlighting our limited understanding of the nonlinear computations in V1. Recently, two approaches based on deep learning have emerged for modeling these nonlinear computations: transfer learning from artificial neural networks trained on object recognition and data-driven convolutional neural network models trained end-to-end on large populations of neurons. Here, we test the ability of both approaches to predict spiking activity in response to natural images in V1 of awake monkeys. We found that the transfer learning approach performed similarly well to the data-driven approach and both outperformed classical linear-nonlinear and wavelet-based feature representations that build on existing theories of V1. Notably, transfer learning using a pre-trained feature space required substantially less experimental time to achieve the same performance. In conclusion, multi-layer convolutional neural networks (CNNs) set the new state of the art for predicting neural responses to natural images in primate V1 and deep features learned for object recognition are better explanations for V1 computation than all previous filter bank theories. This finding strengthens the necessity of V1 models that are multiple nonlinearities away from the image domain and it supports the idea of explaining early visual cortex based on high-level functional goals.
Collapse
Affiliation(s)
- Santiago A. Cadena
- Centre for Integrative Neuroscience and Institute for Theoretical Physics, University of Tübingen, Tübingen, Germany
- Bernstein Center for Computational Neuroscience, Tübingen, Germany
- Center for Neuroscience and Artificial Intelligence, Baylor College of Medicine, Houston, Texas, United States of America
| | - George H. Denfield
- Center for Neuroscience and Artificial Intelligence, Baylor College of Medicine, Houston, Texas, United States of America
- Department of Neuroscience, Baylor College of Medicine, Houston, Houston, Texas, United States of America
| | - Edgar Y. Walker
- Center for Neuroscience and Artificial Intelligence, Baylor College of Medicine, Houston, Texas, United States of America
- Department of Neuroscience, Baylor College of Medicine, Houston, Houston, Texas, United States of America
| | - Leon A. Gatys
- Centre for Integrative Neuroscience and Institute for Theoretical Physics, University of Tübingen, Tübingen, Germany
- Bernstein Center for Computational Neuroscience, Tübingen, Germany
| | - Andreas S. Tolias
- Bernstein Center for Computational Neuroscience, Tübingen, Germany
- Center for Neuroscience and Artificial Intelligence, Baylor College of Medicine, Houston, Texas, United States of America
- Department of Neuroscience, Baylor College of Medicine, Houston, Houston, Texas, United States of America
- Department of Electrical and Computer Engineering, Rice University, Houston, Houston, Texas, United States of America
| | - Matthias Bethge
- Centre for Integrative Neuroscience and Institute for Theoretical Physics, University of Tübingen, Tübingen, Germany
- Bernstein Center for Computational Neuroscience, Tübingen, Germany
- Center for Neuroscience and Artificial Intelligence, Baylor College of Medicine, Houston, Texas, United States of America
- Max Planck Institute for Biological Cybernetics, Tübingen, Germany
| | - Alexander S. Ecker
- Centre for Integrative Neuroscience and Institute for Theoretical Physics, University of Tübingen, Tübingen, Germany
- Bernstein Center for Computational Neuroscience, Tübingen, Germany
- Center for Neuroscience and Artificial Intelligence, Baylor College of Medicine, Houston, Texas, United States of America
| |
Collapse
|
111
|
Suppressive Traveling Waves Shape Representations of Illusory Motion in Primary Visual Cortex of Awake Primate. J Neurosci 2019; 39:4282-4298. [PMID: 30886010 DOI: 10.1523/jneurosci.2792-18.2019] [Citation(s) in RCA: 33] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2018] [Revised: 03/14/2019] [Accepted: 03/14/2019] [Indexed: 12/13/2022] Open
Abstract
How does the brain link visual stimuli across space and time? Visual illusions provide an experimental paradigm to study these processes. When two stationary dots are flashed in close spatial and temporal succession, human observers experience a percept of apparent motion. Large spatiotemporal separation challenges the visual system to keep track of object identity along the apparent motion path, the so-called "correspondence problem." Here, we use voltage-sensitive dye imaging in primary visual cortex (V1) of awake monkeys to show that intracortical connections within V1 can solve this issue by shaping cortical dynamics to represent the illusory motion. We find that the appearance of the second stimulus in V1 creates a systematic suppressive wave traveling toward the retinotopic representation of the first. Using a computational model, we show that the suppressive wave is the emergent property of a recurrent gain control fed by the intracortical network. This suppressive wave acts to explain away ambiguous correspondence problems and contributes to precisely encode the expected motion velocity at the surface of V1. Together, these results demonstrate that the nonlinear dynamics within retinotopic maps can shape cortical representations of illusory motion. Understanding these dynamics will shed light on how the brain links sensory stimuli across space and time, by preformatting population responses for a straightforward read-out by downstream areas.SIGNIFICANCE STATEMENT Traveling waves have recently been observed in different animal species, brain areas, and behavioral states. However, it is still unclear what are their functional roles. In the case of cortical visual processing, waves propagate across retinotopic maps and can hereby generate interactions between spatially and temporally separated instances of feedforward driven activity. Such interactions could participate in processing long-range apparent motion stimuli, an illusion for which no clear neuronal mechanisms have yet been proposed. Using this paradigm in awake monkeys, we show that suppressive traveling waves produce a spatiotemporal normalization of apparent motion stimuli. Our study suggests that cortical waves shape the representation of illusory moving stimulus within retinotopic maps for a straightforward read-out by downstream areas.
Collapse
|
112
|
Ukita J, Yoshida T, Ohki K. Characterisation of nonlinear receptive fields of visual neurons by convolutional neural network. Sci Rep 2019; 9:3791. [PMID: 30846783 PMCID: PMC6405885 DOI: 10.1038/s41598-019-40535-4] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2018] [Accepted: 02/19/2019] [Indexed: 12/31/2022] Open
Abstract
A comprehensive understanding of the stimulus-response properties of individual neurons is necessary to crack the neural code of sensory cortices. However, a barrier to achieving this goal is the difficulty of analysing the nonlinearity of neuronal responses. Here, by incorporating convolutional neural network (CNN) for encoding models of neurons in the visual cortex, we developed a new method of nonlinear response characterisation, especially nonlinear estimation of receptive fields (RFs), without assumptions regarding the type of nonlinearity. Briefly, after training CNN to predict the visual responses to natural images, we synthesised the RF image such that the image would predictively evoke a maximum response. We first demonstrated the proof-of-principle using a dataset of simulated cells with various types of nonlinearity. We could visualise RFs with various types of nonlinearity, such as shift-invariant RFs or rotation-invariant RFs, suggesting that the method may be applicable to neurons with complex nonlinearities in higher visual areas. Next, we applied the method to a dataset of neurons in mouse V1. We could visualise simple-cell-like or complex-cell-like (shift-invariant) RFs and quantify the degree of shift-invariance. These results suggest that CNN encoding model is useful in nonlinear response analyses of visual neurons and potentially of any sensory neurons.
Collapse
Affiliation(s)
- Jumpei Ukita
- Department of Physiology, The University of Tokyo School of Medicine, Bunkyo-ku, Tokyo, Japan.
| | - Takashi Yoshida
- Department of Physiology, The University of Tokyo School of Medicine, Bunkyo-ku, Tokyo, Japan
- Department of Molecular Physiology, Graduate School of Medical Sciences, Kyushu University, Higashi-ku, Fukuoka, Japan
| | - Kenichi Ohki
- Department of Physiology, The University of Tokyo School of Medicine, Bunkyo-ku, Tokyo, Japan.
- Department of Molecular Physiology, Graduate School of Medical Sciences, Kyushu University, Higashi-ku, Fukuoka, Japan.
- International Research Center for Neurointelligence (WPI-IRCN), The University of Tokyo, Bunkyo-ku, Tokyo, Japan.
| |
Collapse
|
113
|
Popovkina DV, Bair W, Pasupathy A. Modeling diverse responses to filled and outline shapes in macaque V4. J Neurophysiol 2019; 121:1059-1077. [PMID: 30699004 DOI: 10.1152/jn.00456.2018] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
Visual area V4 is an important midlevel cortical processing stage that subserves object recognition in primates. Studies investigating shape coding in V4 have largely probed neuronal responses with filled shapes, i.e., shapes defined by both a boundary and an interior fill. As a result, we do not know whether form-selective V4 responses are dictated by boundary features alone or if interior fill is also important. We studied 43 V4 neurons in two male macaque monkeys ( Macaca mulatta) with a set of 362 filled shapes and their corresponding outlines to determine how interior fill modulates neuronal responses in shape-selective neurons. Only a minority of neurons exhibited similar response strength and shape preferences for filled and outline stimuli. A majority responded preferentially to one stimulus category (either filled or outline shapes) and poorly to the other. Our findings are inconsistent with predictions of the hierarchical-max (HMax) V4 model that builds form selectivity from oriented boundary features and takes little account of attributes related to object surface, such as the phase of the boundary edge. We modified the V4 HMax model to include sensitivity to interior fill by either removing phase-pooling or introducing unoriented units at the V1 level; both modifications better explained our data without increasing the number of free parameters. Overall, our results suggest that boundary orientation and interior surface information are both maintained until at least the midlevel visual representation, consistent with the idea that object fill is important for recognition and perception in natural vision. NEW & NOTEWORTHY The shape of an object's boundary is critical for identification; consistent with this idea, models of object recognition predict that filled and outline versions of a shape are encoded similarly. We report that many neurons in a midlevel visual cortical area respond differently to filled and outline shapes and modify a biologically plausible model to account for our data. Our results suggest that representations of boundary shape and surface fill are interrelated in visual cortex.
Collapse
Affiliation(s)
- Dina V Popovkina
- Department of Biological Structure, Washington National Primate Research Center, University of Washington , Seattle, Washington
| | - Wyeth Bair
- Department of Biological Structure, Washington National Primate Research Center, University of Washington , Seattle, Washington
| | - Anitha Pasupathy
- Department of Biological Structure, Washington National Primate Research Center, University of Washington , Seattle, Washington
| |
Collapse
|
114
|
Speed-Selectivity in Retinal Ganglion Cells is Sharpened by Broad Spatial Frequency, Naturalistic Stimuli. Sci Rep 2019; 9:456. [PMID: 30679564 PMCID: PMC6345785 DOI: 10.1038/s41598-018-36861-8] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2018] [Accepted: 11/09/2018] [Indexed: 11/28/2022] Open
Abstract
Motion detection represents one of the critical tasks of the visual system and has motivated a large body of research. However, it remains unclear precisely why the response of retinal ganglion cells (RGCs) to simple artificial stimuli does not predict their response to complex, naturalistic stimuli. To explore this topic, we use Motion Clouds (MC), which are synthetic textures that preserve properties of natural images and are merely parameterized, in particular by modulating the spatiotemporal spectrum complexity of the stimulus by adjusting the frequency bandwidths. By stimulating the retina of the diurnal rodent, Octodon degus with MC we show that the RGCs respond to increasingly complex stimuli by narrowing their adjustment curves in response to movement. At the level of the population, complex stimuli produce a sparser code while preserving movement information; therefore, the stimuli are encoded more efficiently. Interestingly, these properties were observed throughout different populations of RGCs. Thus, our results reveal that the response at the level of RGCs is modulated by the naturalness of the stimulus - in particular for motion - which suggests that the tuning to the statistics of natural images already emerges at the level of the retina.
Collapse
|
115
|
Turner MH, Sanchez Giraldo LG, Schwartz O, Rieke F. Stimulus- and goal-oriented frameworks for understanding natural vision. Nat Neurosci 2019; 22:15-24. [PMID: 30531846 PMCID: PMC8378293 DOI: 10.1038/s41593-018-0284-0] [Citation(s) in RCA: 32] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2018] [Accepted: 10/22/2018] [Indexed: 12/21/2022]
Abstract
Our knowledge of sensory processing has advanced dramatically in the last few decades, but this understanding remains far from complete, especially for stimuli with the large dynamic range and strong temporal and spatial correlations characteristic of natural visual inputs. Here we describe some of the issues that make understanding the encoding of natural images a challenge. We highlight two broad strategies for approaching this problem: a stimulus-oriented framework and a goal-oriented one. Different contexts can call for one framework or the other. Looking forward, recent advances, particularly those based in machine learning, show promise in borrowing key strengths of both frameworks and by doing so illuminating a path to a more comprehensive understanding of the encoding of natural stimuli.
Collapse
Affiliation(s)
- Maxwell H Turner
- Department of Physiology and Biophysics, University of Washington, Seattle, WA, USA
- Graduate Program in Neuroscience, University of Washington, Seattle, WA, USA
| | | | - Odelia Schwartz
- Department of Computer Science, University of Miami, Coral Gables, FL, USA
| | - Fred Rieke
- Department of Physiology and Biophysics, University of Washington, Seattle, WA, USA.
| |
Collapse
|
116
|
Neri P. The empirical characteristics of human pattern vision defy theoretically-driven expectations. PLoS Comput Biol 2018; 14:e1006585. [PMID: 30513091 PMCID: PMC6294397 DOI: 10.1371/journal.pcbi.1006585] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2018] [Revised: 12/14/2018] [Accepted: 10/17/2018] [Indexed: 11/19/2022] Open
Abstract
Contrast is the most fundamental property of images. Consequently, any comprehensive model of biological vision must incorporate this attribute and provide a veritable description of its impact on visual perception. Current theoretical and computational models predict that vision should modify its characteristics at low contrast: for example, it should become broader (more lowpass) to protect from noise, as often demonstrated by individual neurons. We find that the opposite is true for human discrimination of elementary image elements: vision becomes sharper, not broader, as contrast approaches threshold levels. Furthermore, it suffers from increased internal variability at low contrast and it transitions from a surprisingly linear regime at high contrast to a markedly nonlinear processing mode in the low-contrast range. These characteristics are hard-wired in that they happen on a single trial without memory or expectation. Overall, the empirical results urge caution when attempting to interpret human vision from the standpoint of optimality and related theoretical constructs. Direct measurements of this phenomenon indicate that the actual constraints derive from intrinsic architectural features, such as the co-existence of complex-cell-like and simple-cell-like components. Small circuits built around these elements can indeed account for the empirical results, but do not appear to operate in a manner that conforms to optimality even approximately. More generally, our results provide a compelling demonstration of how far we still are from securing an adequate computational account of the most basic operations carried out by human vision.
Collapse
Affiliation(s)
- Peter Neri
- Laboratoire des Systèmes Perceptifs, Département d’études cognitives, École normale supérieure, PSL University, CNRS, 75005 Paris, France
| |
Collapse
|
117
|
Leek EC, Roberts MV, Dundon NM, Pegna AJ. Early sensitivity of evoked potentials to surface and volumetric structure during the visual perception of three-dimensional object shape. Eur J Neurosci 2018; 52:4453-4467. [PMID: 30447162 DOI: 10.1111/ejn.14270] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2017] [Revised: 10/11/2018] [Accepted: 10/15/2018] [Indexed: 11/26/2022]
Abstract
This study used event-related potentials (ERPs) to elucidate how the human visual system processes three-dimensional (3-D) object shape structure. In particular, we examined whether the perceptual mechanisms that support the analysis of 3-D shape are differentially sensitive to higher order surface and volumetric part structure. Observers performed a whole-part novel object matching task in which part stimuli comprised sub-regions of closed edge contour, surfaces or volumetric parts. Behavioural response latency data showed an advantage in matching surfaces and volumetric parts to whole objects over contours, but no difference between surfaces and volumes. ERPs were analysed using a convergence of approaches based on stimulus dependent amplitude modulations of evoked potentials, topographic segmentation, and spatial frequency oscillations. The results showed early differential perceptual processing of contours, surfaces, and volumetric part stimuli. This was first reliably observed over occipitoparietal electrodes during the N1 (140-200 ms) with a mean peak latency of 170 ms, and continued on subsequent P2 (220-260 ms) and N2 (260-320 ms) components. The differential sensitivity in perceptual processing during the N1 was accompanied by distinct microstate patterns that distinguished among contours, surfaces and volumes, and predominant theta band activity around 4-7 Hz over right occipitoparietal and orbitofrontal sites. These results provide the first evidence of early differential perceptual processing of higher order surface and volumetric shape structure within the first 200 ms of stimulus processing. The findings challenge theoretical models of object recognition that do not attribute functional significance to surface and volumetric object structure during visual perception.
Collapse
Affiliation(s)
- E Charles Leek
- School of Psychology, Institute of Life and Human Sciences, University of Liverpool, Liverpool, L69 7ZA, UK
| | | | - Neil M Dundon
- Brain Imaging Center, Department of Psychological and Brain Sciences, University of California, Santa Barbara, CA, USA.,Department of Child and Adolescent Psychiatry, Psychotherapy and Psychosomatics, University of Freiburg, Freiburg, Germany
| | - Alan J Pegna
- School of Psychology, University of Queensland, Saint Lucia, Qld, Australia
| |
Collapse
|
118
|
Flow stimuli reveal ecologically appropriate responses in mouse visual cortex. Proc Natl Acad Sci U S A 2018; 115:11304-11309. [PMID: 30327345 DOI: 10.1073/pnas.1811265115] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023] Open
Abstract
Assessments of the mouse visual system based on spatial-frequency analysis imply that its visual capacity is low, with few neurons responding to spatial frequencies greater than 0.5 cycles per degree. However, visually mediated behaviors, such as prey capture, suggest that the mouse visual system is more precise. We introduce a stimulus class-visual flow patterns-that is more like what the mouse would encounter in the natural world than are sine-wave gratings but is more tractable for analysis than are natural images. We used 128-site silicon microelectrodes to measure the simultaneous responses of single neurons in the primary visual cortex (V1) of alert mice. While holding temporal-frequency content fixed, we explored a class of drifting patterns of black or white dots that have energy only at higher spatial frequencies. These flow stimuli evoke strong visually mediated responses well beyond those predicted by spatial-frequency analysis. Flow responses predominate in higher spatial-frequency ranges (0.15-1.6 cycles per degree), many are orientation or direction selective, and flow responses of many neurons depend strongly on sign of contrast. Many cells exhibit distributed responses across our stimulus ensemble. Together, these results challenge conventional linear approaches to visual processing and expand our understanding of the mouse's visual capacity to behaviorally relevant ranges.
Collapse
|
119
|
Abstract
Generalized linear models (GLMs) have a wide range of applications in systems neuroscience describing the encoding of stimulus and behavioral variables, as well as the dynamics of single neurons. However, in any given experiment, many variables that have an impact on neural activity are not observed or not modeled. Here we demonstrate, in both theory and practice, how these omitted variables can result in biased parameter estimates for the effects that are included. In three case studies, we estimate tuning functions for common experiments in motor cortex, hippocampus, and visual cortex. We find that including traditionally omitted variables changes estimates of the original parameters and that modulation originally attributed to one variable is reduced after new variables are included. In GLMs describing single-neuron dynamics, we then demonstrate how postspike history effects can also be biased by omitted variables. Here we find that omitted variable bias can lead to mistaken conclusions about the stability of single-neuron firing. Omitted variable bias can appear in any model with confounders-where omitted variables modulate neural activity and the effects of the omitted variables covary with the included effects. Understanding how and to what extent omitted variable bias affects parameter estimates is likely to be important for interpreting the parameters and predictions of many neural encoding models.
Collapse
Affiliation(s)
- Ian H Stevenson
- Department of Psychological Sciences, Department of Biomedical Engineering, and CT Institute for Brain and Cognitive Sciences, University of Connecticut, Storrs, CT 06269, U.S.A.
| |
Collapse
|
120
|
Kay KN. Principles for models of neural information processing. Neuroimage 2018; 180:101-109. [DOI: 10.1016/j.neuroimage.2017.08.016] [Citation(s) in RCA: 45] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2017] [Revised: 08/02/2017] [Accepted: 08/03/2017] [Indexed: 11/25/2022] Open
|
121
|
Abstract
Contemporary neuroscience suggests that perception is perhaps best understood as a dynamically iterative process that does not honor cleanly segregated "bottom-up" or "top-down" streams. We argue that there is substantial empirical support for the idea that affective influences infiltrate the earliest reaches of sensory processing and even that primitive internal affective dimensions (e.g., goodness-to-badness) are represented alongside physical dimensions of the external world.
Collapse
|
122
|
Maheswaranathan N, Kastner DB, Baccus SA, Ganguli S. Inferring hidden structure in multilayered neural circuits. PLoS Comput Biol 2018; 14:e1006291. [PMID: 30138312 PMCID: PMC6124781 DOI: 10.1371/journal.pcbi.1006291] [Citation(s) in RCA: 37] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2017] [Revised: 09/05/2018] [Accepted: 06/09/2018] [Indexed: 01/26/2023] Open
Abstract
A central challenge in sensory neuroscience involves understanding how neural circuits shape computations across cascaded cell layers. Here we attempt to reconstruct the response properties of experimentally unobserved neurons in the interior of a multilayered neural circuit, using cascaded linear-nonlinear (LN-LN) models. We combine non-smooth regularization with proximal consensus algorithms to overcome difficulties in fitting such models that arise from the high dimensionality of their parameter space. We apply this framework to retinal ganglion cell processing, learning LN-LN models of retinal circuitry consisting of thousands of parameters, using 40 minutes of responses to white noise. Our models demonstrate a 53% improvement in predicting ganglion cell spikes over classical linear-nonlinear (LN) models. Internal nonlinear subunits of the model match properties of retinal bipolar cells in both receptive field structure and number. Subunits have consistently high thresholds, supressing all but a small fraction of inputs, leading to sparse activity patterns in which only one subunit drives ganglion cell spiking at any time. From the model’s parameters, we predict that the removal of visual redundancies through stimulus decorrelation across space, a central tenet of efficient coding theory, originates primarily from bipolar cell synapses. Furthermore, the composite nonlinear computation performed by retinal circuitry corresponds to a boolean OR function applied to bipolar cell feature detectors. Our methods are statistically and computationally efficient, enabling us to rapidly learn hierarchical non-linear models as well as efficiently compute widely used descriptive statistics such as the spike triggered average (STA) and covariance (STC) for high dimensional stimuli. This general computational framework may aid in extracting principles of nonlinear hierarchical sensory processing across diverse modalities from limited data. Computation in neural circuits arises from the cascaded processing of inputs through multiple cell layers. Each of these cell layers performs operations such as filtering and thresholding in order to shape a circuit’s output. It remains a challenge to describe both the computations and the mechanisms that mediate them given limited data recorded from a neural circuit. A standard approach to describing circuit computation involves building quantitative encoding models that predict the circuit response given its input, but these often fail to map in an interpretable way onto mechanisms within the circuit. In this work, we build two layer linear-nonlinear cascade models (LN-LN) in order to describe how the retinal output is shaped by nonlinear mechanisms in the inner retina. We find that these LN-LN models, fit to ganglion cell recordings alone, identify filters and nonlinearities that are readily mapped onto individual circuit components inside the retina, namely bipolar cells and the bipolar-to-ganglion cell synaptic threshold. This work demonstrates how combining simple prior knowledge of circuit properties with partial experimental recordings of a neural circuit’s output can yield interpretable models of the entire circuit computation, including parts of the circuit that are hidden or not directly observed in neural recordings.
Collapse
Affiliation(s)
- Niru Maheswaranathan
- Neurosciences Graduate Program, Stanford University, Stanford, California, United States of America
| | - David B. Kastner
- Neurosciences Graduate Program, Stanford University, Stanford, California, United States of America
| | - Stephen A. Baccus
- Department of Neurobiology, Stanford University, Stanford, California, United States of America
| | - Surya Ganguli
- Department of Applied Physics, Stanford University, Stanford, California, United States of America
- * E-mail:
| |
Collapse
|
123
|
Kong Q, Han J, Zeng Y, Xu B. Efficient coding matters in the organization of the early visual system. Neural Netw 2018; 105:218-226. [PMID: 29870929 DOI: 10.1016/j.neunet.2018.04.019] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2017] [Revised: 02/24/2018] [Accepted: 04/26/2018] [Indexed: 11/27/2022]
Abstract
Individual areas in the brain are organized into a hierarchical network as a result of evolution. Previous work indicated that the receptive fields (RFs) of individual areas have been evolved to favor metabolically efficient neural codes. In this paper, we propose that not only the RFs of individual areas, but also the organization of adjacent neurons and the hierarchical structure composed of these areas have been evolved to support efficient coding. To verify this hypothesis, we introduce a feed-forward three-layer network to simulate the early stages of human visual system. We emphasize that the network is not a purely feed-forward one since it also includes intra-layer connections, which are essential but usually ignored in the literature. Simulation results strongly reveal that (1) the obtained RFs of the simulated retinal ganglion cells (RGCs) or neurons in the lateral geniculate nucleus (LGN) and V1 simple neurons are consistent to the neurophysiological data; (2) the responses of closer RGCs are more correlated, and V1 simple neurons with similar orientations prefer to cluster together; (3) the hierarchical organization of the early visual system is beneficial for saving energy, which accords with the requirement of metabolically efficient neural coding in the process of human brain evolution.
Collapse
Affiliation(s)
- Qingqun Kong
- Institute of Automation, Chinese Academy of Sciences, Beijing, 100190, China; University of Chinese Academy of Sciences, Beijing, 100049, China.
| | - Jiuqi Han
- Institute of Automation, Chinese Academy of Sciences, Beijing, 100190, China; Department of Neural Engineering and Biological Interdisciplinary Studies, Institute of Military Cognition and Brain Sciences, Academy of Military Medical Sciences, Beijing, 100850, China
| | - Yi Zeng
- Institute of Automation, Chinese Academy of Sciences, Beijing, 100190, China; University of Chinese Academy of Sciences, Beijing, 100049, China; Center for Excellence in Brain Science and Intelligence Technology, Chinese Academy of Sciences, Shanghai, 200031, China; National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing, 100190, China.
| | - Bo Xu
- Institute of Automation, Chinese Academy of Sciences, Beijing, 100190, China; University of Chinese Academy of Sciences, Beijing, 100049, China; Center for Excellence in Brain Science and Intelligence Technology, Chinese Academy of Sciences, Shanghai, 200031, China
| |
Collapse
|
124
|
Zhang Y, Lee TS, Li M, Liu F, Tang S. Convolutional neural network models of V1 responses to complex patterns. J Comput Neurosci 2018; 46:33-54. [PMID: 29869761 DOI: 10.1007/s10827-018-0687-7] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2017] [Revised: 04/26/2018] [Accepted: 04/30/2018] [Indexed: 11/30/2022]
Abstract
In this study, we evaluated the convolutional neural network (CNN) method for modeling V1 neurons of awake macaque monkeys in response to a large set of complex pattern stimuli. CNN models outperformed all the other baseline models, such as Gabor-based standard models for V1 cells and various variants of generalized linear models. We then systematically dissected different components of the CNN and found two key factors that made CNNs outperform other models: thresholding nonlinearity and convolution. In addition, we fitted our data using a pre-trained deep CNN via transfer learning. The deep CNN's higher layers, which encode more complex patterns, outperformed lower ones, and this result was consistent with our earlier work on the complexity of V1 neural code. Our study systematically evaluates the relative merits of different CNN components in the context of V1 neuron modeling.
Collapse
Affiliation(s)
- Yimeng Zhang
- Center for the Neural Basis of Cognition and Computer Science Department, Carnegie Mellon University, Pittsburgh, PA, 15213, USA.
| | - Tai Sing Lee
- Center for the Neural Basis of Cognition and Computer Science Department, Carnegie Mellon University, Pittsburgh, PA, 15213, USA
| | - Ming Li
- Peking University School of Life Sciences and Peking-Tsinghua Center for Life Sciences, Beijing, 100871, China.,IDG/McGovern Institute for Brain Research at Peking University, Beijing, 100871, China
| | - Fang Liu
- Peking University School of Life Sciences and Peking-Tsinghua Center for Life Sciences, Beijing, 100871, China.,IDG/McGovern Institute for Brain Research at Peking University, Beijing, 100871, China
| | - Shiming Tang
- Peking University School of Life Sciences and Peking-Tsinghua Center for Life Sciences, Beijing, 100871, China. .,IDG/McGovern Institute for Brain Research at Peking University, Beijing, 100871, China.
| |
Collapse
|
125
|
Khan AG, Hofer SB. Contextual signals in visual cortex. Curr Opin Neurobiol 2018; 52:131-138. [PMID: 29883940 DOI: 10.1016/j.conb.2018.05.003] [Citation(s) in RCA: 46] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2018] [Accepted: 05/11/2018] [Indexed: 11/15/2022]
Abstract
Vision is an active process. What we perceive strongly depends on our actions, intentions and expectations. During visual processing, these internal signals therefore need to be integrated with the visual information from the retina. The mechanisms of how this is achieved by the visual system are still poorly understood. Advances in recording and manipulating neuronal activity in specific cell types and axonal projections together with tools for circuit tracing are beginning to shed light on the neuronal circuit mechanisms of how internal, contextual signals shape sensory representations. Here we review recent work, primarily in mice, that has advanced our understanding of these processes, focusing on contextual signals related to locomotion, behavioural relevance and predictions.
Collapse
Affiliation(s)
- Adil G Khan
- Centre for Developmental Neurobiology, Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, UK
| | - Sonja B Hofer
- Sainsbury Wellcome Centre for Neural Circuits and Behaviour, University College London, London, UK.
| |
Collapse
|
126
|
Papale P, Leo A, Cecchetti L, Handjaras G, Kay KN, Pietrini P, Ricciardi E. Foreground-Background Segmentation Revealed during Natural Image Viewing. eNeuro 2018; 5:ENEURO.0075-18.2018. [PMID: 29951579 PMCID: PMC6019392 DOI: 10.1523/eneuro.0075-18.2018] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2018] [Revised: 05/15/2018] [Accepted: 05/15/2018] [Indexed: 11/21/2022] Open
Abstract
One of the major challenges in visual neuroscience is represented by foreground-background segmentation. Data from nonhuman primates show that segmentation leads to two distinct, but associated processes: the enhancement of neural activity during figure processing (i.e., foreground enhancement) and the suppression of background-related activity (i.e., background suppression). To study foreground-background segmentation in ecological conditions, we introduce a novel method based on parametric modulation of low-level image properties followed by application of simple computational image-processing models. By correlating the outcome of this procedure with human fMRI activity, measured during passive viewing of 334 natural images, we produced easily interpretable "correlation images" from visual populations. Results show evidence of foreground enhancement in all tested regions, from V1 to lateral occipital complex (LOC), while background suppression occurs in V4 and LOC only. Correlation images derived from V4 and LOC revealed a preserved spatial resolution of foreground textures, indicating a richer representation of the salient part of natural images, rather than a simplistic model of object shape. Our results indicate that scene segmentation occurs during natural viewing, even when individuals are not required to perform any particular task.
Collapse
Affiliation(s)
- Paolo Papale
- Molecular Mind Lab, IMT School for Advanced Studies Lucca, Lucca, 55100 Italy
| | - Andrea Leo
- Molecular Mind Lab, IMT School for Advanced Studies Lucca, Lucca, 55100 Italy
| | - Luca Cecchetti
- Molecular Mind Lab, IMT School for Advanced Studies Lucca, Lucca, 55100 Italy
| | - Giacomo Handjaras
- Molecular Mind Lab, IMT School for Advanced Studies Lucca, Lucca, 55100 Italy
| | - Kendrick N. Kay
- Center for Magnetic Resonance Research, Department of Radiology, University of Minnesota, Twin Cities, Minneapolis, MN, 55455
| | - Pietro Pietrini
- Molecular Mind Lab, IMT School for Advanced Studies Lucca, Lucca, 55100 Italy
| | - Emiliano Ricciardi
- Molecular Mind Lab, IMT School for Advanced Studies Lucca, Lucca, 55100 Italy
| |
Collapse
|
127
|
Reichenthal A, Ben-Tov M, Segev R. Coding Schemes in the Archerfish Optic Tectum. Front Neural Circuits 2018; 12:18. [PMID: 29559898 PMCID: PMC5845554 DOI: 10.3389/fncir.2018.00018] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2017] [Accepted: 02/13/2018] [Indexed: 01/11/2023] Open
Abstract
Many studies have yielded valuable knowledge on the early visual system but it is biased since the studies have focused on terrestrial mammals alone. Here, to better account for visual systems in different environments and animal classes, we studied the structure of early visual processing in the archerfish which harnesses its extreme visual ability to hunt by shooting water jets at prey hanging on vegetation above the water. Thus, the archerfish provides a unique opportunity to study visual processing in a vertebrate which is an expert vision-guided predator with a very different brain structure than mammals. The receptive field structures in the archerfish (both sexes) optic tectum, the main visual processing region in the fish brain, were measured and linear non-linear cascades were used to analyze their properties. The findings indicate that the spatial receptive field structures lie on a continuum between circular and elliptical shapes. In addition, the cells' functional properties display a richness of response characteristics, since many cells could be captured by more than a single linear filter. Finally, the non-linear response functions that link linear filters and neuronal responses were found to be similar to the non-linear functions of models that describe terrestrial mammalian single cell activity. Overall our results help to better understand the early visual processing system across vertebrates.
Collapse
Affiliation(s)
- Adam Reichenthal
- Life Sciences Department and Zlotowski Center for Neuroscience, Ben Gurion University of the Negev, Beersheba, Israel
| | - Mor Ben-Tov
- Department of Neurobiology, Duke University, Durham, NC, United States
| | - Ronen Segev
- Life Sciences Department and Zlotowski Center for Neuroscience, Ben Gurion University of the Negev, Beersheba, Israel
| |
Collapse
|
128
|
Muller L, Chavane F, Reynolds J, Sejnowski TJ. Cortical travelling waves: mechanisms and computational principles. Nat Rev Neurosci 2018; 19:255-268. [PMID: 29563572 DOI: 10.1038/nrn.2018.20] [Citation(s) in RCA: 283] [Impact Index Per Article: 40.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Multichannel recording technologies have revealed travelling waves of neural activity in multiple sensory, motor and cognitive systems. These waves can be spontaneously generated by recurrent circuits or evoked by external stimuli. They travel along brain networks at multiple scales, transiently modulating spiking and excitability as they pass. Here, we review recent experimental findings that have found evidence for travelling waves at single-area (mesoscopic) and whole-brain (macroscopic) scales. We place these findings in the context of the current theoretical understanding of wave generation and propagation in recurrent networks. During the large low-frequency rhythms of sleep or the relatively desynchronized state of the awake cortex, travelling waves may serve a variety of functions, from long-term memory consolidation to processing of dynamic visual stimuli. We explore new avenues for experimental and computational understanding of the role of spatiotemporal activity patterns in the cortex.
Collapse
Affiliation(s)
- Lyle Muller
- Salk Institute for Biological Studies, La Jolla, CA, USA
| | - Frédéric Chavane
- Institut de Neurosciences de la Timone (INT), Centre National de la Recherche Scientifique (CNRS) and Aix-Marseille Université, Marseille, France
| | - John Reynolds
- Salk Institute for Biological Studies, La Jolla, CA, USA
| | - Terrence J Sejnowski
- Salk Institute for Biological Studies, La Jolla, CA, USA.,Division of Biological Sciences, University of California, La Jolla, CA, USA
| |
Collapse
|
129
|
Duan Y, Yakovleva A, Norcia AM. Determinants of neural responses to disparity in natural scenes. J Vis 2018; 18:21. [PMID: 29677337 PMCID: PMC6097643 DOI: 10.1167/18.3.21] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2017] [Accepted: 02/05/2018] [Indexed: 11/24/2022] Open
Abstract
We studied disparity-evoked responses in natural scenes using high-density electroencephalography (EEG) in an event-related design. Thirty natural scenes that mainly included outdoor settings with trees and buildings were used. Twenty-four subjects viewed a series of trials composed of sequential two-alternative temporal forced-choice presentation of two different versions (two-dimensional [2D] vs. three-dimensional [3D]) of the same scene interleaved by a scrambled image with the same power spectrum. Scenes were viewed orthostereoscopically at 3 m through a pair of shutter glasses. After each trial, participants indicated with a key press which version of the scene was 3D. Performance on the discrimination was >90%. Participants who were more accurate also tended to respond faster; scenes that were reported more accurately as 3D also led to faster reaction times. We compared visual evoked potentials elicited by scrambled, 2D, and 3D scenes using reliable component analysis to reduce dimensionality. The disparity-evoked response to natural scene stimuli, measured from the difference potential between 2D and 3D scenes, comprised a sustained relative negativity in the dominant response component. The magnitude of the disparity-specific response was correlated with the observer's stereoacuity. Scenes with more homogeneous depth maps also tended to elicit large disparity-specific responses. Finally, the magnitude of the disparity-specific response was correlated with the magnitude of the differential response between scrambled and 2D scenes, suggesting that monocular higher-order scene statistics modulate disparity-specific responses.
Collapse
Affiliation(s)
- Yiran Duan
- Department of Psychology, Stanford University, Stanford, CA, USA
| | | | - Anthony M Norcia
- Department of Psychology, Stanford University, Stanford, CA, USA
| |
Collapse
|
130
|
Tuncel Y, Başaklar T, Ider YZ. Period doubling behavior in human steady state visual evoked potentials. Biomed Phys Eng Express 2018. [DOI: 10.1088/2057-1976/aaa78f] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
|
131
|
Lazar AA, Ukani NH, Zhou Y. Sparse Functional Identification of Complex Cells from Spike Times and the Decoding of Visual Stimuli. JOURNAL OF MATHEMATICAL NEUROSCIENCE 2018; 8:2. [PMID: 29349664 PMCID: PMC5773573 DOI: 10.1186/s13408-017-0057-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/19/2017] [Accepted: 12/29/2017] [Indexed: 05/21/2023]
Abstract
We investigate the sparse functional identification of complex cells and the decoding of spatio-temporal visual stimuli encoded by an ensemble of complex cells. The reconstruction algorithm is formulated as a rank minimization problem that significantly reduces the number of sampling measurements (spikes) required for decoding. We also establish the duality between sparse decoding and functional identification and provide algorithms for identification of low-rank dendritic stimulus processors. The duality enables us to efficiently evaluate our functional identification algorithms by reconstructing novel stimuli in the input space. Finally, we demonstrate that our identification algorithms substantially outperform the generalized quadratic model, the nonlinear input model, and the widely used spike-triggered covariance algorithm.
Collapse
Affiliation(s)
- Aurel A. Lazar
- Department of Electrical Engineering, Columbia University, 500 W 120th Street, Mudd 1300, New York, NY 10027 USA
| | - Nikul H. Ukani
- Department of Electrical Engineering, Columbia University, 500 W 120th Street, Mudd 1300, New York, NY 10027 USA
| | - Yiyin Zhou
- Department of Electrical Engineering, Columbia University, 500 W 120th Street, Mudd 1300, New York, NY 10027 USA
| |
Collapse
|
132
|
Rasmussen R, Yonehara K. Circuit Mechanisms Governing Local vs. Global Motion Processing in Mouse Visual Cortex. Front Neural Circuits 2017; 11:109. [PMID: 29311845 PMCID: PMC5743699 DOI: 10.3389/fncir.2017.00109] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2017] [Accepted: 12/14/2017] [Indexed: 11/21/2022] Open
Abstract
A withstanding question in neuroscience is how neural circuits encode representations and perceptions of the external world. A particularly well-defined visual computation is the representation of global object motion by pattern direction-selective (PDS) cells from convergence of motion of local components represented by component direction-selective (CDS) cells. However, how PDS and CDS cells develop their distinct response properties is still unresolved. The visual cortex of the mouse is an attractive model for experimentally solving this issue due to the large molecular and genetic toolbox available. Although mouse visual cortex lacks the highly ordered orientation columns of primates, it is organized in functional sub-networks and contains striate- and extrastriate areas like its primate counterparts. In this Perspective article, we provide an overview of the experimental and theoretical literature on global motion processing based on works in primates and mice. Lastly, we propose what types of experiments could illuminate what circuit mechanisms are governing cortical global visual motion processing. We propose that PDS cells in mouse visual cortex appear as the perfect arena for delineating and solving how individual sensory features extracted by neural circuits in peripheral brain areas are integrated to build our rich cohesive sensory experiences.
Collapse
Affiliation(s)
- Rune Rasmussen
- The Danish Research Institute of Translational Neuroscience-DANDRITE, Nordic EMBL Partnership for Molecular Medicine, Department of Biomedicine, Aarhus University, Aarhus, Denmark
| | - Keisuke Yonehara
- The Danish Research Institute of Translational Neuroscience-DANDRITE, Nordic EMBL Partnership for Molecular Medicine, Department of Biomedicine, Aarhus University, Aarhus, Denmark
| |
Collapse
|
133
|
Tang S, Lee TS, Li M, Zhang Y, Xu Y, Liu F, Teo B, Jiang H. Complex Pattern Selectivity in Macaque Primary Visual Cortex Revealed by Large-Scale Two-Photon Imaging. Curr Biol 2017; 28:38-48.e3. [PMID: 29249660 DOI: 10.1016/j.cub.2017.11.039] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2017] [Revised: 10/30/2017] [Accepted: 11/17/2017] [Indexed: 10/18/2022]
Abstract
Visual objects contain rich local high-order patterns such as curvature, corners, and junctions. In the standard hierarchical model of visual object recognition, V1 neurons were commonly assumed to code local orientation components of those high-order patterns. Here, by using two-photon imaging in awake macaques and systematically characterizing V1 neuronal responses to an extensive set of stimuli, we found a large percentage of neurons in the V1 superficial layer responded more strongly to complex patterns, such as corners, junctions, and curvature, than to their oriented line or edge components. Our results suggest that those individual V1 neurons could play the role in detecting local high-order visual patterns in the early stage of object recognition hierarchy.
Collapse
Affiliation(s)
- Shiming Tang
- Peking University School of Life Sciences and Peking-Tsinghua Center for Life Sciences, Beijing 100871, China; IDG/McGovern Institute for Brain Research at Peking University, Beijing 100871, China; Key Laboratory of Machine Perception (Ministry of Education), Peking University, Beijing 100871, China.
| | - Tai Sing Lee
- Center for the Neural Basis of Cognition and Computer Science Department, Carnegie Mellon University, Pittsburgh, PA 15213, USA.
| | - Ming Li
- Peking University School of Life Sciences and Peking-Tsinghua Center for Life Sciences, Beijing 100871, China; IDG/McGovern Institute for Brain Research at Peking University, Beijing 100871, China; Key Laboratory of Machine Perception (Ministry of Education), Peking University, Beijing 100871, China
| | - Yimeng Zhang
- Center for the Neural Basis of Cognition and Computer Science Department, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Yue Xu
- Center for the Neural Basis of Cognition and Computer Science Department, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Fang Liu
- Peking University School of Life Sciences and Peking-Tsinghua Center for Life Sciences, Beijing 100871, China; IDG/McGovern Institute for Brain Research at Peking University, Beijing 100871, China; Key Laboratory of Machine Perception (Ministry of Education), Peking University, Beijing 100871, China
| | - Benjamin Teo
- Center for the Neural Basis of Cognition and Computer Science Department, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Hongfei Jiang
- Peking University School of Life Sciences and Peking-Tsinghua Center for Life Sciences, Beijing 100871, China; IDG/McGovern Institute for Brain Research at Peking University, Beijing 100871, China; Key Laboratory of Machine Perception (Ministry of Education), Peking University, Beijing 100871, China
| |
Collapse
|
134
|
Ferrari U, Gardella C, Marre O, Mora T. Closed-Loop Estimation of Retinal Network Sensitivity by Local Empirical Linearization. eNeuro 2017; 4:ENEURO.0166-17.2017. [PMID: 29379871 PMCID: PMC5783239 DOI: 10.1523/eneuro.0166-17.2017] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2017] [Revised: 10/12/2017] [Accepted: 10/16/2017] [Indexed: 11/28/2022] Open
Abstract
Understanding how sensory systems process information depends crucially on identifying which features of the stimulus drive the response of sensory neurons, and which ones leave their response invariant. This task is made difficult by the many nonlinearities that shape sensory processing. Here, we present a novel perturbative approach to understand information processing by sensory neurons, where we linearize their collective response locally in stimulus space. We added small perturbations to reference stimuli and tested if they triggered visible changes in the responses, adapting their amplitude according to the previous responses with closed-loop experiments. We developed a local linear model that accurately predicts the sensitivity of the neural responses to these perturbations. Applying this approach to the rat retina, we estimated the optimal performance of a neural decoder and showed that the nonlinear sensitivity of the retina is consistent with an efficient encoding of stimulus information. Our approach can be used to characterize experimentally the sensitivity of neural systems to external stimuli locally, quantify experimentally the capacity of neural networks to encode sensory information, and relate their activity to behavior.
Collapse
Affiliation(s)
- Ulisse Ferrari
- Institut de la Vision, Sorbonne Université, INSERM, CNRS, 17 rue Moreau, 75012 Paris, France
| | - Christophe Gardella
- Institut de la Vision, Sorbonne Université, INSERM, CNRS, 17 rue Moreau, 75012 Paris, France
- Laboratoire de physique statistique, CNRS, Sorbonne Université, Université Paris-Diderot and École normale supérieure (PSL), 24, rue Lhomond, 75005 Paris, France
| | - Olivier Marre
- Institut de la Vision, Sorbonne Université, INSERM, CNRS, 17 rue Moreau, 75012 Paris, France
| | - Thierry Mora
- Laboratoire de physique statistique, CNRS, Sorbonne Université, Université Paris-Diderot and École normale supérieure (PSL), 24, rue Lhomond, 75005 Paris, France
| |
Collapse
|
135
|
Nonnenmacher M, Behrens C, Berens P, Bethge M, Macke JH. Signatures of criticality arise from random subsampling in simple population models. PLoS Comput Biol 2017; 13:e1005718. [PMID: 28972970 PMCID: PMC5640238 DOI: 10.1371/journal.pcbi.1005718] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2017] [Revised: 10/13/2017] [Accepted: 08/01/2017] [Indexed: 11/18/2022] Open
Abstract
The rise of large-scale recordings of neuronal activity has fueled the hope to gain new insights into the collective activity of neural ensembles. How can one link the statistics of neural population activity to underlying principles and theories? One attempt to interpret such data builds upon analogies to the behaviour of collective systems in statistical physics. Divergence of the specific heat-a measure of population statistics derived from thermodynamics-has been used to suggest that neural populations are optimized to operate at a "critical point". However, these findings have been challenged by theoretical studies which have shown that common inputs can lead to diverging specific heat. Here, we connect "signatures of criticality", and in particular the divergence of specific heat, back to statistics of neural population activity commonly studied in neural coding: firing rates and pairwise correlations. We show that the specific heat diverges whenever the average correlation strength does not depend on population size. This is necessarily true when data with correlations is randomly subsampled during the analysis process, irrespective of the detailed structure or origin of correlations. We also show how the characteristic shape of specific heat capacity curves depends on firing rates and correlations, using both analytically tractable models and numerical simulations of a canonical feed-forward population model. To analyze these simulations, we develop efficient methods for characterizing large-scale neural population activity with maximum entropy models. We find that, consistent with experimental findings, increases in firing rates and correlation directly lead to more pronounced signatures. Thus, previous reports of thermodynamical criticality in neural populations based on the analysis of specific heat can be explained by average firing rates and correlations, and are not indicative of an optimized coding strategy. We conclude that a reliable interpretation of statistical tests for theories of neural coding is possible only in reference to relevant ground-truth models.
Collapse
Affiliation(s)
- Marcel Nonnenmacher
- Research Center caesar, an associate of the Max Planck Society, Bonn, Germany
- Max Planck Institute for Biological Cybernetics, Tübingen, Germany
- Bernstein Center for Computational Neuroscience, Tübingen, Germany
| | - Christian Behrens
- Bernstein Center for Computational Neuroscience, Tübingen, Germany
- Centre for Integrative Neuroscience, University of Tübingen, Tübingen, Germany
- Institute for Ophthalmic Research, University of Tübingen, Tübingen, Germany
| | - Philipp Berens
- Bernstein Center for Computational Neuroscience, Tübingen, Germany
- Centre for Integrative Neuroscience, University of Tübingen, Tübingen, Germany
- Institute for Ophthalmic Research, University of Tübingen, Tübingen, Germany
| | - Matthias Bethge
- Max Planck Institute for Biological Cybernetics, Tübingen, Germany
- Bernstein Center for Computational Neuroscience, Tübingen, Germany
- Centre for Integrative Neuroscience, University of Tübingen, Tübingen, Germany
- Institute of Theoretical Physics, University of Tübingen, Tübingen, Germany
| | - Jakob H. Macke
- Research Center caesar, an associate of the Max Planck Society, Bonn, Germany
- Max Planck Institute for Biological Cybernetics, Tübingen, Germany
- Bernstein Center for Computational Neuroscience, Tübingen, Germany
| |
Collapse
|
136
|
Image identification from brain activity using the population receptive field model. PLoS One 2017; 12:e0183295. [PMID: 28922355 PMCID: PMC5603170 DOI: 10.1371/journal.pone.0183295] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2016] [Accepted: 08/02/2017] [Indexed: 11/19/2022] Open
Abstract
A goal of computational models is not only to explain experimental data but also to make new predictions. A current focus of computational neuroimaging is to predict features of the presented stimulus from measured brain signals. These computational neuroimaging approaches may be agnostic about the underlying neural processes or may be biologically inspired. Here, we use the biologically inspired population receptive field (pRF) approach to identify presented images from fMRI recordings of the visual cortex, using an explicit model of the underlying neural response selectivity. The advantage of the pRF-model is its simplicity: it is defined by a handful of parameters, which can be estimated from fMRI data that was collected within half an hour. Using 7T MRI, we measured responses elicited by different visual stimuli: (i) conventional pRF mapping stimuli, (ii) semi-random synthetic images and (iii) natural images. The pRF mapping stimuli were used to estimate the pRF-properties of each cortical location in early visual cortex. Next, we used these pRFs to identify which synthetic or natural images was presented to the subject from the fMRI responses. We show that image identification using V1 responses is far above chance, both for the synthetic and natural images. Thus, we can identify visual images, including natural images, using the most fundamental low-parameter pRF model estimated from conventional pRF mapping stimuli. This allows broader application of image identification.
Collapse
|
137
|
Khani MH, Gollisch T. Diversity in spatial scope of contrast adaptation among mouse retinal ganglion cells. J Neurophysiol 2017; 118:3024-3043. [PMID: 28904106 PMCID: PMC5712662 DOI: 10.1152/jn.00529.2017] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2017] [Revised: 09/07/2017] [Accepted: 09/07/2017] [Indexed: 02/05/2023] Open
Abstract
Retinal ganglion cells adapt to changes in visual contrast by adjusting their response kinetics and sensitivity. While much work has focused on the time scales of these adaptation processes, less is known about the spatial scale of contrast adaptation. For example, do small, localized contrast changes affect a cell's signal processing across its entire receptive field? Previous investigations have provided conflicting evidence, suggesting that contrast adaptation occurs either locally within subregions of a ganglion cell's receptive field or globally over the receptive field in its entirety. Here, we investigated the spatial extent of contrast adaptation in ganglion cells of the isolated mouse retina through multielectrode-array recordings. We applied visual stimuli so that ganglion cell receptive fields contained regions where the average contrast level changed periodically as well as regions with constant average contrast level. This allowed us to analyze temporal stimulus integration and sensitivity separately for stimulus regions with and without contrast changes. We found that the spatial scope of contrast adaptation depends strongly on cell identity, with some ganglion cells displaying clear local adaptation, whereas others, in particular large transient ganglion cells, adapted globally to contrast changes. Thus, the spatial scope of contrast adaptation in mouse retinal ganglion cells appears to be cell-type specific. This could reflect differences in mechanisms of contrast adaptation and may contribute to the functional diversity of different ganglion cell types.NEW & NOTEWORTHY Understanding whether adaptation of a neuron in a sensory system can occur locally inside the receptive field or whether it always globally affects the entire receptive field is important for understanding how the neuron processes complex sensory stimuli. For mouse retinal ganglion cells, we here show that both local and global contrast adaptation exist and that this diversity in spatial scope can contribute to the functional diversity of retinal ganglion cell types.
Collapse
Affiliation(s)
- Mohammad Hossein Khani
- University Medical Center Göttingen, Dept. of Ophthalmology and Bernstein Center for Computational Neuroscience Göttingen, Göttingen, Germany; and.,International Max Planck Research School for Neuroscience, Göttingen, Germany
| | - Tim Gollisch
- University Medical Center Göttingen, Dept. of Ophthalmology and Bernstein Center for Computational Neuroscience Göttingen, Göttingen, Germany; and
| |
Collapse
|
138
|
Cessac B, Kornprobst P, Kraria S, Nasser H, Pamplona D, Portelli G, Viéville T. PRANAS: A New Platform for Retinal Analysis and Simulation. Front Neuroinform 2017; 11:49. [PMID: 28919854 PMCID: PMC5585572 DOI: 10.3389/fninf.2017.00049] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2016] [Accepted: 07/17/2017] [Indexed: 01/28/2023] Open
Abstract
The retina encodes visual scenes by trains of action potentials that are sent to the brain via the optic nerve. In this paper, we describe a new free access user-end software allowing to better understand this coding. It is called PRANAS (https://pranas.inria.fr), standing for Platform for Retinal ANalysis And Simulation. PRANAS targets neuroscientists and modelers by providing a unique set of retina-related tools. PRANAS integrates a retina simulator allowing large scale simulations while keeping a strong biological plausibility and a toolbox for the analysis of spike train population statistics. The statistical method (entropy maximization under constraints) takes into account both spatial and temporal correlations as constraints, allowing to analyze the effects of memory on statistics. PRANAS also integrates a tool computing and representing in 3D (time-space) receptive fields. All these tools are accessible through a friendly graphical user interface. The most CPU-costly of them have been implemented to run in parallel.
Collapse
Affiliation(s)
- Bruno Cessac
- Biovision Team, Inria, Université Côte d'AzurSophia Antipolis, France
| | - Pierre Kornprobst
- Biovision Team, Inria, Université Côte d'AzurSophia Antipolis, France
| | - Selim Kraria
- Biovision Team, Inria, Université Côte d'AzurSophia Antipolis, France
| | - Hassan Nasser
- Biovision Team, Inria, Université Côte d'AzurSophia Antipolis, France
| | - Daniela Pamplona
- Biovision Team, Inria, Université Côte d'AzurSophia Antipolis, France
| | - Geoffrey Portelli
- Biovision Team, Inria, Université Côte d'AzurSophia Antipolis, France
| | | |
Collapse
|
139
|
Sawada T, Petrov AA. The divisive normalization model of V1 neurons: a comprehensive comparison of physiological data and model predictions. J Neurophysiol 2017; 118:3051-3091. [PMID: 28835531 DOI: 10.1152/jn.00821.2016] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2016] [Revised: 08/21/2017] [Accepted: 08/21/2017] [Indexed: 01/24/2023] Open
Abstract
The physiological responses of simple and complex cells in the primary visual cortex (V1) have been studied extensively and modeled at different levels. At the functional level, the divisive normalization model (DNM; Heeger DJ. Vis Neurosci 9: 181-197, 1992) has accounted for a wide range of single-cell recordings in terms of a combination of linear filtering, nonlinear rectification, and divisive normalization. We propose standardizing the formulation of the DNM and implementing it in software that takes static grayscale images as inputs and produces firing rate responses as outputs. We also review a comprehensive suite of 30 empirical phenomena and report a series of simulation experiments that qualitatively replicate dozens of key experiments with a standard parameter set consistent with physiological measurements. This systematic approach identifies novel falsifiable predictions of the DNM. We show how the model simultaneously satisfies the conflicting desiderata of flexibility and falsifiability. Our key idea is that, while adjustable parameters are needed to accommodate the diversity across neurons, they must be fixed for a given individual neuron. This requirement introduces falsifiable constraints when this single neuron is probed with multiple stimuli. We also present mathematical analyses and simulation experiments that explicate some of these constraints.
Collapse
Affiliation(s)
- Tadamasa Sawada
- School of Psychology, National Research University Higher School of Economics, Moscow, Russia; and
| | | |
Collapse
|
140
|
Neri P. Object segmentation controls image reconstruction from natural scenes. PLoS Biol 2017; 15:e1002611. [PMID: 28827801 PMCID: PMC5565198 DOI: 10.1371/journal.pbio.1002611] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2016] [Accepted: 07/25/2017] [Indexed: 11/22/2022] Open
Abstract
The structure of the physical world projects images onto our eyes. However, those images are often poorly representative of environmental structure: well-defined boundaries within the eye may correspond to irrelevant features of the physical world, while critical features of the physical world may be nearly invisible at the retinal projection. The challenge for the visual cortex is to sort these two types of features according to their utility in ultimately reconstructing percepts and interpreting the constituents of the scene. We describe a novel paradigm that enabled us to selectively evaluate the relative role played by these two feature classes in signal reconstruction from corrupted images. Our measurements demonstrate that this process is quickly dominated by the inferred structure of the environment, and only minimally controlled by variations of raw image content. The inferential mechanism is spatially global and its impact on early visual cortex is fast. Furthermore, it retunes local visual processing for more efficient feature extraction without altering the intrinsic transduction noise. The basic properties of this process can be partially captured by a combination of small-scale circuit models and large-scale network architectures. Taken together, our results challenge compartmentalized notions of bottom-up/top-down perception and suggest instead that these two modes are best viewed as an integrated perceptual mechanism. Biological vision is designed to discover the structure of the environment around us. To do this, it relies on ambiguous and often misleading information from the eyes: the boundary of a critical object may be invisible against a background of similar appearance, and may be overlooked in favour of the sharp contour projected by an irrelevant shadow. It remains unclear how human vision sorts different image features according to their relevance to the layout of objects within the scene. We demonstrate that vision achieves this goal via a specialized perceptual system for object segmentation that is one and the same with the feature extraction system: immediately after information is relayed to cortex by the eyes, the process of reconstructing image content from local features is controlled by a dedicated inferential mechanism that attempts to recover the underlying environmental structure; perception is quickly organized around the operation of this mechanism, which becomes the primary contextual influence on image reconstruction. The integrated nature of this perceptual mechanism defies current notions of separate top-down and bottom-up processes, offering a fresh view of how human vision operates on natural signals.
Collapse
Affiliation(s)
- Peter Neri
- Laboratoire des Systèmes Perceptifs, Département d'études cognitives, Ecole Normale Supérieure, PSL Research University, CNRS, Paris, France
- * E-mail:
| |
Collapse
|
141
|
Isik L, Singer J, Madsen JR, Kanwisher N, Kreiman G. What is changing when: Decoding visual information in movies from human intracranial recordings. Neuroimage 2017; 180:147-159. [PMID: 28823828 DOI: 10.1016/j.neuroimage.2017.08.027] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2017] [Revised: 08/07/2017] [Accepted: 08/08/2017] [Indexed: 11/19/2022] Open
Abstract
The majority of visual recognition studies have focused on the neural responses to repeated presentations of static stimuli with abrupt and well-defined onset and offset times. In contrast, natural vision involves unique renderings of visual inputs that are continuously changing without explicitly defined temporal transitions. Here we considered commercial movies as a coarse proxy to natural vision. We recorded intracranial field potential signals from 1,284 electrodes implanted in 15 patients with epilepsy while the subjects passively viewed commercial movies. We could rapidly detect large changes in the visual inputs within approximately 100 ms of their occurrence, using exclusively field potential signals from ventral visual cortical areas including the inferior temporal gyrus and inferior occipital gyrus. Furthermore, we could decode the content of those visual changes even in a single movie presentation, generalizing across the wide range of transformations present in a movie. These results present a methodological framework for studying cognition during dynamic and natural vision.
Collapse
Affiliation(s)
- Leyla Isik
- Department of Ophthalmology, Boston Children's Hospital, Harvard Medical School, Boston, MA, United States; Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA, United States.
| | - Jedediah Singer
- Department of Ophthalmology, Boston Children's Hospital, Harvard Medical School, Boston, MA, United States
| | - Joseph R Madsen
- Department of Neurosurgery, Boston Children's Hospital, Harvard Medical School, Boston, MA, United States
| | - Nancy Kanwisher
- Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA, United States
| | - Gabriel Kreiman
- Department of Ophthalmology, Boston Children's Hospital, Harvard Medical School, Boston, MA, United States
| |
Collapse
|
142
|
Synaptic Contributions to Receptive Field Structure and Response Properties in the Rodent Lateral Geniculate Nucleus of the Thalamus. J Neurosci 2017; 36:10949-10963. [PMID: 27798177 DOI: 10.1523/jneurosci.1045-16.2016] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2016] [Accepted: 08/17/2016] [Indexed: 11/21/2022] Open
Abstract
Comparative physiological and anatomical studies have greatly advanced our understanding of sensory systems. Many lines of evidence show that the murine lateral geniculate nucleus (LGN) has unique attributes, compared with other species such as cat and monkey. For example, in rodent, thalamic receptive field structure is markedly diverse, and many cells are sensitive to stimulus orientation and direction. To explore shared and different strategies of synaptic integration across species, we made whole-cell recordings in vivo from the murine LGN during the presentation of visual stimuli, analyzed the results with different computational approaches, and compared our findings with those from cat. As for carnivores, murine cells with classical center-surround receptive fields had a "push-pull" structure of excitation and inhibition within a given On or Off subregion. These cells compose the largest single population in the murine LGN (∼40%), indicating that push-pull is key in the form vision pathway across species. For two cell types with overlapping On and Off responses, which recalled either W3 or suppressed-by-contrast ganglion cells in murine retina, inhibition took a different form and was most pronounced for spatially extensive stimuli. Other On-Off cells were selective for stimulus orientation and direction. In these cases, retinal inputs were tuned and, for oriented cells, the second-order subunit of the receptive field predicted the preferred angle. By contrast, suppression was not tuned and appeared to sharpen stimulus selectivity. Together, our results provide new perspectives on the role of excitation and inhibition in retinothalamic processing. SIGNIFICANCE STATEMENT We explored the murine lateral geniculate nucleus from a comparative physiological perspective. In cat, most retinal cells have center-surround receptive fields and push-pull excitation and inhibition, including neurons with the smallest (highest acuity) receptive fields. The same is true for thalamic relay cells. In mouse retina, the most numerous cell type has the smallest receptive fields but lacks push-pull. The most common receptive field in rodent thalamus, however, is center-surround with push-pull. Thus, receptive field structure supersedes size per se for form vision. Further, for many orientation-selective cells, the second-order component of the receptive field aligned with stimulus preference, whereas suppression was untuned. Thus, inhibition may improve spatial resolution and sharpen other forms of selectivity in rodent lateral geniculate nucleus.
Collapse
|
143
|
Hermes D, Nguyen M, Winawer J. Neuronal synchrony and the relation between the blood-oxygen-level dependent response and the local field potential. PLoS Biol 2017; 15:e2001461. [PMID: 28742093 PMCID: PMC5524566 DOI: 10.1371/journal.pbio.2001461] [Citation(s) in RCA: 56] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2016] [Accepted: 06/22/2017] [Indexed: 01/07/2023] Open
Abstract
The most widespread measures of human brain activity are the blood-oxygen-level dependent (BOLD) signal and surface field potential. Prior studies report a variety of relationships between these signals. To develop an understanding of how to interpret these signals and the relationship between them, we developed a model of (a) neuronal population responses and (b) transformations from neuronal responses into the functional magnetic resonance imaging (fMRI) BOLD signal and electrocorticographic (ECoG) field potential. Rather than seeking a transformation between the two measures directly, this approach interprets each measure with respect to the underlying neuronal population responses. This model accounts for the relationship between BOLD and ECoG data from human visual cortex in V1, V2, and V3, with the model predictions and data matching in three ways: across stimuli, the BOLD amplitude and ECoG broadband power were positively correlated, the BOLD amplitude and alpha power (8-13 Hz) were negatively correlated, and the BOLD amplitude and narrowband gamma power (30-80 Hz) were uncorrelated. The two measures provide complementary information about human brain activity, and we infer that features of the field potential that are uncorrelated with BOLD arise largely from changes in synchrony, rather than level, of neuronal activity.
Collapse
Affiliation(s)
- Dora Hermes
- Department of Psychology, New York University, New York, New York, United States of America
- Brain Center Rudolf Magnus, Department of Neurology & Neurosurgery, University Medical Center Utrecht, Utrecht, the Netherlands
- Department of Psychology, Stanford University, Stanford, California, United States of America
| | - Mai Nguyen
- Department of Psychology, Princeton University, Princeton, New Jersey, United States of America
| | - Jonathan Winawer
- Department of Psychology, New York University, New York, New York, United States of America
| |
Collapse
|
144
|
Chaplin TA, Allitt BJ, Hagan MA, Price NSC, Rajan R, Rosa MGP, Lui LL. Sensitivity of neurons in the middle temporal area of marmoset monkeys to random dot motion. J Neurophysiol 2017. [PMID: 28637812 DOI: 10.1152/jn.00065.2017] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023] Open
Abstract
Neurons in the middle temporal area (MT) of the primate cerebral cortex respond to moving visual stimuli. The sensitivity of MT neurons to motion signals can be characterized by using random-dot stimuli, in which the strength of the motion signal is manipulated by adding different levels of noise (elements that move in random directions). In macaques, this has allowed the calculation of "neurometric" thresholds. We characterized the responses of MT neurons in sufentanil/nitrous oxide-anesthetized marmoset monkeys, a species that has attracted considerable recent interest as an animal model for vision research. We found that MT neurons show a wide range of neurometric thresholds and that the responses of the most sensitive neurons could account for the behavioral performance of macaques and humans. We also investigated factors that contributed to the wide range of observed thresholds. The difference in firing rate between responses to motion in the preferred and null directions was the most effective predictor of neurometric threshold, whereas the direction tuning bandwidth had no correlation with the threshold. We also showed that it is possible to obtain reliable estimates of neurometric thresholds using stimuli that were not highly optimized for each neuron, as is often necessary when recording from large populations of neurons with different receptive field concurrently, as was the case in this study. These results demonstrate that marmoset MT shows an essential physiological similarity to macaque MT and suggest that its neurons are capable of representing motion signals that allow for comparable motion-in-noise judgments.NEW & NOTEWORTHY We report the activity of neurons in marmoset MT in response to random-dot motion stimuli of varying coherence. The information carried by individual MT neurons was comparable to that of the macaque, and the maximum firing rates were a strong predictor of sensitivity. Our study provides key information regarding the neural basis of motion perception in the marmoset, a small primate species that is becoming increasingly popular as an experimental model.
Collapse
Affiliation(s)
- Tristan A Chaplin
- Neuroscience Program, Biomedicine Discovery Institute and Department of Physiology, Monash University, Clayton, Victoria, Australia; and.,ARC Centre of Excellence for Integrative Brain Function, Monash University Node, Victoria, Australia
| | - Benjamin J Allitt
- Neuroscience Program, Biomedicine Discovery Institute and Department of Physiology, Monash University, Clayton, Victoria, Australia; and
| | - Maureen A Hagan
- Neuroscience Program, Biomedicine Discovery Institute and Department of Physiology, Monash University, Clayton, Victoria, Australia; and.,ARC Centre of Excellence for Integrative Brain Function, Monash University Node, Victoria, Australia
| | - Nicholas S C Price
- Neuroscience Program, Biomedicine Discovery Institute and Department of Physiology, Monash University, Clayton, Victoria, Australia; and.,ARC Centre of Excellence for Integrative Brain Function, Monash University Node, Victoria, Australia
| | - Ramesh Rajan
- Neuroscience Program, Biomedicine Discovery Institute and Department of Physiology, Monash University, Clayton, Victoria, Australia; and.,ARC Centre of Excellence for Integrative Brain Function, Monash University Node, Victoria, Australia
| | - Marcello G P Rosa
- Neuroscience Program, Biomedicine Discovery Institute and Department of Physiology, Monash University, Clayton, Victoria, Australia; and.,ARC Centre of Excellence for Integrative Brain Function, Monash University Node, Victoria, Australia
| | - Leo L Lui
- Neuroscience Program, Biomedicine Discovery Institute and Department of Physiology, Monash University, Clayton, Victoria, Australia; and .,ARC Centre of Excellence for Integrative Brain Function, Monash University Node, Victoria, Australia
| |
Collapse
|
145
|
Tarawneh G, Nityananda V, Rosner R, Errington S, Herbert W, Cumming BG, Read JCA, Serrano-Pedraza I. Invisible noise obscures visible signal in insect motion detection. Sci Rep 2017; 7:3496. [PMID: 28615659 PMCID: PMC5471215 DOI: 10.1038/s41598-017-03732-7] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2017] [Accepted: 05/03/2017] [Indexed: 11/09/2022] Open
Abstract
The motion energy model is the standard account of motion detection in animals from beetles to humans. Despite this common basis, we show here that a difference in the early stages of visual processing between mammals and insects leads this model to make radically different behavioural predictions. In insects, early filtering is spatially lowpass, which makes the surprising prediction that motion detection can be impaired by "invisible" noise, i.e. noise at a spatial frequency that elicits no response when presented on its own as a signal. We confirm this prediction using the optomotor response of praying mantis Sphodromantis lineola. This does not occur in mammals, where spatially bandpass early filtering means that linear systems techniques, such as deriving channel sensitivity from masking functions, remain approximately valid. Counter-intuitive effects such as masking by invisible noise may occur in neural circuits wherever a nonlinearity is followed by a difference operation.
Collapse
Affiliation(s)
- Ghaith Tarawneh
- Institute of Neuroscience, Henry Wellcome Building for Neuroecology, Newcastle University, Framlington Place, Newcastle upon Tyne, NE2 4HH, United Kingdom.
| | - Vivek Nityananda
- Institute of Neuroscience, Henry Wellcome Building for Neuroecology, Newcastle University, Framlington Place, Newcastle upon Tyne, NE2 4HH, United Kingdom
| | - Ronny Rosner
- Institute of Neuroscience, Henry Wellcome Building for Neuroecology, Newcastle University, Framlington Place, Newcastle upon Tyne, NE2 4HH, United Kingdom
| | - Steven Errington
- Institute of Neuroscience, Henry Wellcome Building for Neuroecology, Newcastle University, Framlington Place, Newcastle upon Tyne, NE2 4HH, United Kingdom
| | - William Herbert
- Institute of Neuroscience, Henry Wellcome Building for Neuroecology, Newcastle University, Framlington Place, Newcastle upon Tyne, NE2 4HH, United Kingdom
| | - Bruce G Cumming
- Laboratory of Sensorimotor Research, National Eye Institute, National Institutes of Health, Bldg 49 Room 2A50, Bethesda, MD, 20892-4435, USA
| | - Jenny C A Read
- Institute of Neuroscience, Henry Wellcome Building for Neuroecology, Newcastle University, Framlington Place, Newcastle upon Tyne, NE2 4HH, United Kingdom
| | | |
Collapse
|
146
|
Effects of Stimulus Size and Contrast on the Initial Primary Visual Cortical Response in Humans. Brain Topogr 2017; 30:450-460. [PMID: 28474167 DOI: 10.1007/s10548-016-0530-2] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2016] [Accepted: 10/11/2016] [Indexed: 10/19/2022]
Abstract
Decades of intracranial electrophysiological investigation into the primary visual cortex (V1) have produced many fundamental insights into the computations carried out in low-level visual circuits of the brain. Some of the most important work has been simply concerned with the precise measurement of neural response variations as a function of elementary stimulus attributes such as contrast and size. Surprisingly, such simple but fundamental characterization of V1 responses has not been carried out in human electrophysiology. Here we report such a detailed characterization for the initial "C1" component of the scalp-recorded visual evoked potential (VEP). The C1 is known to be dominantly generated by initial afferent activation in V1, but is difficult to record reliably due to interindividual anatomical variability. We used pattern-pulse multifocal VEP mapping to identify a stimulus position that activates the left lower calcarine bank in each individual, and afterwards measured robust negative C1s over posterior midline scalp to gratings presented sequentially at that location. We found clear and systematic increases in C1 peak amplitude and decreases in peak latency with increasing size as well as with increasing contrast. With a sample of 15 subjects and ~180 trials per condition, reliable C1 amplitudes of -0.46 µV were evoked at as low a contrast as 3.13% and as large as -4.82 µV at 100% contrast, using stimuli of 3.33° diameter. A practical implication is that by placing sufficiently-sized stimuli to target favorable calcarine cortical loci, robust V1 responses can be measured at contrasts close to perceptual thresholds, which could greatly facilitate principled studies of early visual perception and attention.
Collapse
|
147
|
Selby B, Tripp B. Extending the Stabilized Supralinear Network model for binocular image processing. Neural Netw 2017; 90:29-41. [PMID: 28388471 DOI: 10.1016/j.neunet.2017.03.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2016] [Revised: 12/23/2016] [Accepted: 03/03/2017] [Indexed: 11/29/2022]
Abstract
The visual cortex is both extensive and intricate. Computational models are needed to clarify the relationships between its local mechanisms and high-level functions. The Stabilized Supralinear Network (SSN) model was recently shown to account for many receptive field phenomena in V1, and also to predict subtle receptive field properties that were subsequently confirmed in vivo. In this study, we performed a preliminary exploration of whether the SSN is suitable for incorporation into large, functional models of the visual cortex, considering both its extensibility and computational tractability. First, whereas the SSN receives abstract orientation signals as input, we extended it to receive images (through a linear-nonlinear stage), and found that the extended version behaved similarly. Secondly, whereas the SSN had previously been studied in a monocular context, we found that it could also reproduce data on interocular transfer of surround suppression. Finally, we reformulated the SSN as a convolutional neural network, and found that it scaled well on parallel hardware. These results provide additional support for the plausibility of the SSN as a model of lateral interactions in V1, and suggest that the SSN is well suited as a component of complex vision models. Future work will use the SSN to explore relationships between local network interactions and sophisticated vision processes in large networks.
Collapse
Affiliation(s)
- Ben Selby
- Department of Systems Design Engineering, University of Waterloo, 200 University Ave W., Waterloo, Ontario, Canada N2L 3G1; Centre for Theoretical Neuroscience, University of Waterloo, 200 University Ave W., Waterloo, Ontario, Canada N2L 3G1.
| | - Bryan Tripp
- Department of Systems Design Engineering, University of Waterloo, 200 University Ave W., Waterloo, Ontario, Canada N2L 3G1; Centre for Theoretical Neuroscience, University of Waterloo, 200 University Ave W., Waterloo, Ontario, Canada N2L 3G1.
| |
Collapse
|
148
|
Bytautiene J, Baranauskas G. Rat superior colliculus neurons respond to large visual stimuli flashed outside the classical receptive field. PLoS One 2017; 12:e0174409. [PMID: 28379979 PMCID: PMC5381878 DOI: 10.1371/journal.pone.0174409] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2016] [Accepted: 03/08/2017] [Indexed: 11/18/2022] Open
Abstract
Spatial integration of visual stimuli is a crucial step in visual information processing yet it is often unclear where this integration takes place in the visual system. In the superficial layers of the superior colliculus that form an early stage in visual information processing, neurons are known to have relatively small visual receptive fields, suggesting limited spatial integration. Here it is shown that at least for rats this conclusion may be wrong. Extracellular recordings in urethane-anaesthetized young adult rats (1.5–2 months old) showed that large stimuli of over 10° could evoke detectable responses well outside the borders of ‘classical’ receptive fields determined by employing 2° – 3.5° stimuli. The presence of responses to large stimuli well outside these ‘classical’ receptive fields could not be explained neither by partial overlap between the visual stimulus and the receptive field, nor by reflections or light dispersion from the stimulation site. However, very low frequency (<0.1 Hz) residual responses to small stimuli presented outside the receptive field may explain the obtained results if we assume that the frequency of action potentials during a response to a stimulus outside RF is proportional to the stimulus area. Thus, responses to large stimuli outside RF may be predicted by scaling according to the stimulus area of the responses to small stimuli. These data demonstrate that neurons in the superficial layers of the superior colliculus are capable of integrating visual stimuli over much larger area than it can be deduced from the classical receptive field.
Collapse
Affiliation(s)
- Juntaute Bytautiene
- Neurophysiology laboratory, Neuroscience Institute, Lithuanian University of Health Sciences, Kaunas, Lithuania
| | - Gytis Baranauskas
- Neurophysiology laboratory, Neuroscience Institute, Lithuanian University of Health Sciences, Kaunas, Lithuania
- * E-mail:
| |
Collapse
|
149
|
Kay KN, Yeatman JD. Bottom-up and top-down computations in word- and face-selective cortex. eLife 2017; 6. [PMID: 28226243 PMCID: PMC5358981 DOI: 10.7554/elife.22341] [Citation(s) in RCA: 99] [Impact Index Per Article: 12.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2016] [Accepted: 02/19/2017] [Indexed: 11/13/2022] Open
Abstract
The ability to read a page of text or recognize a person's face depends on category-selective visual regions in ventral temporal cortex (VTC). To understand how these regions mediate word and face recognition, it is necessary to characterize how stimuli are represented and how this representation is used in the execution of a cognitive task. Here, we show that the response of a category-selective region in VTC can be computed as the degree to which the low-level properties of the stimulus match a category template. Moreover, we show that during execution of a task, the bottom-up representation is scaled by the intraparietal sulcus (IPS), and that the level of IPS engagement reflects the cognitive demands of the task. These results provide an account of neural processing in VTC in the form of a model that addresses both bottom-up and top-down effects and quantitatively predicts VTC responses.
Collapse
Affiliation(s)
- Kendrick N Kay
- Center for Magnetic Resonance Research, Department of Radiology, University of Minnesota, Minneapolis, United States
| | - Jason D Yeatman
- Institute for Learning and Brain Sciences, University of Washington, Seattle, United States.,Department of Speech and Hearing Sciences, University of Washington, Seattle, United States
| |
Collapse
|
150
|
Temporally Diverse Excitation Generates Direction-Selective Responses in ON- and OFF-Type Retinal Starburst Amacrine Cells. Cell Rep 2017; 18:1356-1365. [DOI: 10.1016/j.celrep.2017.01.026] [Citation(s) in RCA: 43] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2016] [Revised: 12/05/2016] [Accepted: 01/11/2017] [Indexed: 01/06/2023] Open
|