1
|
Yuasa K, Groen IIA, Piantoni G, Montenegro S, Flinker A, Devore S, Devinsky O, Doyle W, Dugan P, Friedman D, Ramsey N, Petridou N, Winawer J. Precise Spatial Tuning of Visually Driven Alpha Oscillations in Human Visual Cortex. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2023.02.11.528137. [PMID: 36865223 PMCID: PMC9979988 DOI: 10.1101/2023.02.11.528137] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/15/2023]
Abstract
Neuronal oscillations at about 10 Hz, called alpha oscillations, are often thought to arise from synchronous activity across occipital cortex, reflecting general cognitive states such as arousal and alertness. However, there is also evidence that modulation of alpha oscillations in visual cortex can be spatially specific. Here, we used intracranial electrodes in human patients to measure alpha oscillations in response to visual stimuli whose location varied systematically across the visual field. We separated the alpha oscillatory power from broadband power changes. The variation in alpha oscillatory power with stimulus position was then fit by a population receptive field (pRF) model. We find that the alpha pRFs have similar center locations to pRFs estimated from broadband power (70-180 Hz) but are several times larger. The results demonstrate that alpha suppression in human visual cortex can be precisely tuned. Finally, we show how the pattern of alpha responses can explain several features of exogenous visual attention. Significance Statement The alpha oscillation is the largest electrical signal generated by the human brain. An important question in systems neuroscience is the degree to which this oscillation reflects system-wide states and behaviors such as arousal, alertness, and attention, versus much more specific functions in the routing and processing of information. We examined alpha oscillations at high spatial precision in human patients with intracranial electrodes implanted over visual cortex. We discovered a surprisingly high spatial specificity of visually driven alpha oscillations, which we quantified with receptive field models. We further use our discoveries about properties of the alpha response to show a link between these oscillations and the spread of visual attention.Grant support: NIH R01 MH111417 (Petridou, Winawer, Ramsey, Devinsky); JSPS Overseas Research Fellowship (Yuasa)The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.
Collapse
|
2
|
Ha J, Broderick WF, Kay K, Winawer J. Spatial Frequency Maps in Human Visual Cortex: A Replication and Extension. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.01.21.634150. [PMID: 39896600 PMCID: PMC11785079 DOI: 10.1101/2025.01.21.634150] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/04/2025]
Abstract
In a step toward developing a model of human primary visual cortex, a recent study introduced a model of spatial frequency tuning in V1 (Broderick, Simoncelli, & Winawer, 2022). The model is compact, using just 9 parameters to predict BOLD response amplitude for locations across all of V1 as a function of stimulus orientation and spatial frequency. Here we replicated this analysis in a new dataset, the 'nsdsynthetic' supplement to the Natural Scenes Dataset (Allen et al., 2022), to assess generalization of model parameters. Furthermore, we extended the analyses to extrastriate maps V2 and V3. For each retinotopic map in the 8 NSD subjects, we fit the 9-parameter model. Despite many experimental differences between NSD and the original study, including stimulus size, experimental design, and MR field strength, there was good agreement in most model parameters. The dependence of preferred spatial frequency on eccentricity in V1 was similar between NSD and Broderick et al. Moreover, the effect of absolute stimulus orientation on spatial frequency maps was similar: higher preferred spatial frequency for horizontal and cardinal orientations compared to vertical and oblique orientations in both studies. The extension to extrastriate maps revealed that the biggest change in tuning between maps was in bandwidth: the bandwidth in spatial frequency tuning increased by 70% from V1 to V2 and 100% from V1 to V3, paralleling known increases in receptive field size. Together, the results show robust reproducibility and bring us closer to a systematic characterization of spatial encoding in the human visual system.
Collapse
Affiliation(s)
- Jiyeong Ha
- Department of Psychology and Center for Neural, New York University, NY, USA
| | | | - Kendrick Kay
- Center for Magnetic Resonance Research (CMRR), Department of Radiology, University of Minnesota, Minneapolis, MN, USA
| | - Jonathan Winawer
- Department of Psychology and Center for Neural, New York University, NY, USA
| |
Collapse
|
3
|
Tünçok E, Carrasco M, Winawer J. Spatial attention alters visual cortical representation during target anticipation. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.03.02.583127. [PMID: 38496524 PMCID: PMC10942396 DOI: 10.1101/2024.03.02.583127] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/19/2024]
Abstract
Attention enables us to efficiently and flexibly interact with the environment by prioritizing some image features in preparation for responding to a stimulus. Using a concurrent psychophysics- fMRI experiment, we investigated how covert spatial attention affects responses in human visual cortex prior to target onset, and how it affects subsequent behavioral performance. Performance improved at cued locations and worsened at uncued locations, relative to distributed attention, demonstrating a selective tradeoff in processing. Pre-target BOLD responses in cortical visual field maps changed in two ways: First, there was a stimulus-independent baseline shift, positive in map locations near the cued location and negative elsewhere, paralleling the behavioral results. Second, population receptive field centers shifted toward the attended location. Both effects increased in higher visual areas. Together, the results show that spatial attention has large effects on visual cortex prior to target appearance, altering neural response properties throughout and across multiple visual field maps.
Collapse
|
4
|
Thayer DD, Sprague TC. Feature-Specific Salience Maps in Human Cortex. J Neurosci 2023; 43:8785-8800. [PMID: 37907257 PMCID: PMC10727177 DOI: 10.1523/jneurosci.1104-23.2023] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2023] [Revised: 09/29/2023] [Accepted: 10/24/2023] [Indexed: 11/02/2023] Open
Abstract
Priority map theory is a leading framework for understanding how various aspects of stimulus displays and task demands guide visual attention. Per this theory, the visual system computes a priority map, which is a representation of visual space indexing the relative importance, or priority, of locations in the environment. Priority is computed based on both salience, defined based on image-computable properties; and relevance, defined by an individual's current goals, and is used to direct attention to the highest-priority locations for further processing. Computational theories suggest that priority maps identify salient locations based on individual feature dimensions (e.g., color, motion), which are integrated into an aggregate priority map. While widely accepted, a core assumption of this framework, the existence of independent feature dimension maps in visual cortex, remains untested. Here, we tested the hypothesis that retinotopic regions selective for specific feature dimensions (color or motion) in human cortex act as neural feature dimension maps, indexing salient locations based on their preferred feature. We used fMRI activation patterns to reconstruct spatial maps while male and female human participants viewed stimuli with salient regions defined by relative color or motion direction. Activation in reconstructed spatial maps was localized to the salient stimulus position in the display. Moreover, the strength of the stimulus representation was strongest in the ROI selective for the salience-defining feature. Together, these results suggest that feature-selective extrastriate visual regions highlight salient locations based on local feature contrast within their preferred feature dimensions, supporting their role as neural feature dimension maps.SIGNIFICANCE STATEMENT Identifying salient information is important for navigating the world. For example, it is critical to detect a quickly approaching car when crossing the street. Leading models of computer vision and visual search rely on compartmentalized salience computations based on individual features; however, there has been no direct empirical demonstration identifying neural regions as responsible for performing these dissociable operations. Here, we provide evidence of a critical double dissociation that neural activation patterns from color-selective regions prioritize the location of color-defined salience while minimally representing motion-defined salience, whereas motion-selective regions show the complementary result. These findings reveal that specialized cortical regions act as neural "feature dimension maps" that are used to index salient locations based on specific features to guide attention.
Collapse
Affiliation(s)
- Daniel D Thayer
- Department of Psychological and Brain Sciences, University of California-Santa Barbara, Santa Barbara, California 93106
| | - Thomas C Sprague
- Department of Psychological and Brain Sciences, University of California-Santa Barbara, Santa Barbara, California 93106
| |
Collapse
|
5
|
Fang Z, Bloem IM, Olsson C, Ma WJ, Winawer J. Normalization by orientation-tuned surround in human V1-V3. PLoS Comput Biol 2023; 19:e1011704. [PMID: 38150484 PMCID: PMC10793941 DOI: 10.1371/journal.pcbi.1011704] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2021] [Revised: 01/17/2024] [Accepted: 11/20/2023] [Indexed: 12/29/2023] Open
Abstract
An influential account of neuronal responses in primary visual cortex is the normalized energy model. This model is often implemented as a multi-stage computation. The first stage is linear filtering. The second stage is the extraction of contrast energy, whereby a complex cell computes the squared and summed outputs of a pair of the linear filters in quadrature phase. The third stage is normalization, in which a local population of complex cells mutually inhibit one another. Because the population includes cells tuned to a range of orientations and spatial frequencies, the result is that the responses are effectively normalized by the local stimulus contrast. Here, using evidence from human functional MRI, we show that the classical model fails to account for the relative responses to two classes of stimuli: straight, parallel, band-passed contours (gratings), and curved, band-passed contours (snakes). The snakes elicit fMRI responses that are about twice as large as the gratings, yet a traditional divisive normalization model predicts responses that are about the same. Motivated by these observations and others from the literature, we implement a divisive normalization model in which cells matched in orientation tuning ("tuned normalization") preferentially inhibit each other. We first show that this model accounts for differential responses to these two classes of stimuli. We then show that the model successfully generalizes to other band-pass textures, both in V1 and in extrastriate cortex (V2 and V3). We conclude that even in primary visual cortex, complex features of images such as the degree of heterogeneity, can have large effects on neural responses.
Collapse
Affiliation(s)
- Zeming Fang
- Department of Psychology and Center for Neural Science, New York University, New York City, New York, United States of America
- Department of Cognitive Science, Rensselaer Polytechnic Institute, Troy, New York, United States of America
| | - Ilona M. Bloem
- Department of Psychology and Center for Neural Science, New York University, New York City, New York, United States of America
| | - Catherine Olsson
- Department of Psychology and Center for Neural Science, New York University, New York City, New York, United States of America
| | - Wei Ji Ma
- Department of Psychology and Center for Neural Science, New York University, New York City, New York, United States of America
| | - Jonathan Winawer
- Department of Psychology and Center for Neural Science, New York University, New York City, New York, United States of America
| |
Collapse
|
6
|
Asghar M, Sanchez-Panchuelo R, Schluppeck D, Francis S. Two-Dimensional Population Receptive Field Mapping of Human Primary Somatosensory Cortex. Brain Topogr 2023; 36:816-834. [PMID: 37634160 PMCID: PMC10522535 DOI: 10.1007/s10548-023-01000-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2023] [Accepted: 08/09/2023] [Indexed: 08/29/2023]
Abstract
Functional magnetic resonance imaging can provide detailed maps of how sensory space is mapped in the human brain. Here, we use a novel 16 stimulator setup (a 4 × 4 grid) to measure two-dimensional sensory maps of between and within-digit (D2-D4) space using high spatial-resolution (1.25 mm isotropic) imaging at 7 Tesla together with population receptive field (pRF) mapping in 10 participants. Using a 2D Gaussian pRF model, we capture maps of the coverage of digits D2-D5 across Brodmann areas and estimate pRF size and shape. In addition, we compare results to previous studies that used fewer stimulators by constraining pRF models to a 1D Gaussian Between Digit or 1D Gaussian Within Digit model. We show that pRFs across somatosensory areas tend to have a strong preference to cover the within-digit axis. We show an increase in pRF size moving from D2-D5. We quantify pRF shapes in Brodmann area (BA) 3b, 3a, 1, 2 and show differences in pRF size in Brodmann areas 3a-2, with larger estimates for BA2. Generally, the 2D Gaussian pRF model better represents pRF coverage maps generated by our data, which itself is produced from a 2D stimulation grid.
Collapse
Affiliation(s)
- Michael Asghar
- Sir Peter Mansfield Imaging Centre, School of Physics and Astronomy, University of Nottingham, Nottingham, UK.
| | - Rosa Sanchez-Panchuelo
- Sir Peter Mansfield Imaging Centre, School of Physics and Astronomy, University of Nottingham, Nottingham, UK
- University Hospitals Birmingham NHS Foundation Trust, Nottingham, UK
| | | | - Susan Francis
- Sir Peter Mansfield Imaging Centre, School of Physics and Astronomy, University of Nottingham, Nottingham, UK
- NIHR Nottingham Biomedical Research Centre, Nottingham University Hospitals NHS Trust and the University of Nottingham, Nottingham, UK
| |
Collapse
|
7
|
Penacchio O, Otazu X, Wilkins AJ, Haigh SM. A mechanistic account of visual discomfort. Front Neurosci 2023; 17:1200661. [PMID: 37547142 PMCID: PMC10397803 DOI: 10.3389/fnins.2023.1200661] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2023] [Accepted: 06/27/2023] [Indexed: 08/08/2023] Open
Abstract
Much of the neural machinery of the early visual cortex, from the extraction of local orientations to contextual modulations through lateral interactions, is thought to have developed to provide a sparse encoding of contour in natural scenes, allowing the brain to process efficiently most of the visual scenes we are exposed to. Certain visual stimuli, however, cause visual stress, a set of adverse effects ranging from simple discomfort to migraine attacks, and epileptic seizures in the extreme, all phenomena linked with an excessive metabolic demand. The theory of efficient coding suggests a link between excessive metabolic demand and images that deviate from natural statistics. Yet, the mechanisms linking energy demand and image spatial content in discomfort remain elusive. Here, we used theories of visual coding that link image spatial structure and brain activation to characterize the response to images observers reported as uncomfortable in a biologically based neurodynamic model of the early visual cortex that included excitatory and inhibitory layers to implement contextual influences. We found three clear markers of aversive images: a larger overall activation in the model, a less sparse response, and a more unbalanced distribution of activity across spatial orientations. When the ratio of excitation over inhibition was increased in the model, a phenomenon hypothesised to underlie interindividual differences in susceptibility to visual discomfort, the three markers of discomfort progressively shifted toward values typical of the response to uncomfortable stimuli. Overall, these findings propose a unifying mechanistic explanation for why there are differences between images and between observers, suggesting how visual input and idiosyncratic hyperexcitability give rise to abnormal brain responses that result in visual stress.
Collapse
Affiliation(s)
- Olivier Penacchio
- Department of Computer Science, Universitat Autònoma de Barcelona, Bellaterra, Spain
- Computer Vision Center, Universitat Autònoma de Barcelona, Bellaterra, Spain
- School of Psychology and Neuroscience, University of St Andrews, St Andrews, United Kingdom
| | - Xavier Otazu
- Department of Computer Science, Universitat Autònoma de Barcelona, Bellaterra, Spain
- Computer Vision Center, Universitat Autònoma de Barcelona, Bellaterra, Spain
| | - Arnold J. Wilkins
- Department of Psychology, University of Essex, Colchester, United Kingdom
| | - Sarah M. Haigh
- Department of Psychology, University of Nevada Reno, Reno, NV, United States
- Institute for Neuroscience, University of Nevada Reno, Reno, NV, United States
| |
Collapse
|
8
|
Doostani N, Hossein-Zadeh GA, Vaziri-Pashkam M. The normalization model predicts responses in the human visual cortex during object-based attention. eLife 2023; 12:e75726. [PMID: 37163571 PMCID: PMC10229119 DOI: 10.7554/elife.75726] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2021] [Accepted: 04/25/2023] [Indexed: 05/12/2023] Open
Abstract
Divisive normalization of the neural responses by the activity of the neighboring neurons has been proposed as a fundamental operation in the nervous system based on its success in predicting neural responses recorded in primate electrophysiology studies. Nevertheless, experimental evidence for the existence of this operation in the human brain is still scant. Here, using functional MRI, we examined the role of normalization across the visual hierarchy in the human visual cortex. Using stimuli form the two categories of human bodies and houses, we presented objects in isolation or in clutter and asked participants to attend or ignore the stimuli. Focusing on the primary visual area V1, the object-selective regions LO and pFs, the body-selective region EBA, and the scene-selective region PPA, we first modeled single-voxel responses using a weighted sum, a weighted average, and a normalization model and demonstrated that although the weighted sum and weighted average models also made acceptable predictions in some conditions, the response to multiple stimuli could generally be better described by a model that takes normalization into account. We then determined the observed effects of attention on cortical responses and demonstrated that these effects were predicted by the normalization model, but not by the weighted sum or the weighted average models. Our results thus provide evidence that the normalization model can predict responses to objects across shifts of visual attention, suggesting the role of normalization as a fundamental operation in the human brain.
Collapse
Affiliation(s)
- Narges Doostani
- School of Cognitive Sciences, Institute for Research in Fundamental SciencesTehranIran
| | - Gholam-Ali Hossein-Zadeh
- School of Cognitive Sciences, Institute for Research in Fundamental SciencesTehranIran
- School of Electrical Engineering, University of TehranTehranIran
| | - Maryam Vaziri-Pashkam
- Laboratory of Brain and Cognition, National Institute of Mental HealthBethesdaUnited States
| |
Collapse
|
9
|
van Dijk JA, de Jong MC, Piantoni G, Fracasso A, Vansteensel MJ, Groen IIA, Petridou N, Dumoulin SO. Intracranial recordings show evidence of numerosity tuning in human parietal cortex. PLoS One 2022; 17:e0272087. [PMID: 35921261 PMCID: PMC9348694 DOI: 10.1371/journal.pone.0272087] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2021] [Accepted: 07/12/2022] [Indexed: 11/25/2022] Open
Abstract
Numerosity is the set size of a group of items. Numerosity perception is a trait shared across numerous species. Numerosity-selective neural populations are thought to underlie numerosity perception. These neurons have been identified primarily using electrical recordings in animal models and blood oxygen level dependent (BOLD) functional magnetic resonance imaging (fMRI) in humans. Here we use electrical intracranial recordings to investigate numerosity tuning in humans, focusing on high-frequency transient activations. These recordings combine a high spatial and temporal resolution and can bridge the gap between animal models and human recordings. In line with previous studies, we find numerosity-tuned responses at parietal sites in two out of three participants. Neuronal populations at these locations did not respond to other visual stimuli, i.e. faces, houses, and letters, in contrast to several occipital sites. Our findings further corroborate the specificity of numerosity tuning of in parietal cortex, and further link fMRI results and electrophysiological recordings.
Collapse
Affiliation(s)
- Jelle A. van Dijk
- Spinoza Centre for Neuroimaging, Amsterdam, The Netherlands
- Experimental Psychology, Utrecht University, Utrecht, The Netherlands
| | - Maartje C. de Jong
- Spinoza Centre for Neuroimaging, Amsterdam, The Netherlands
- Department of Psychology, University of Amsterdam, Amsterdam, The Netherlands
- Amsterdam Brain and Cognition (ABC), University of Amsterdam, Amsterdam, The Netherlands
| | - Gio Piantoni
- Radiology Department, Imaging Division, Center for Image Sciences, University Medical Center Utrecht, The Netherlands
| | - Alessio Fracasso
- Spinoza Centre for Neuroimaging, Amsterdam, The Netherlands
- Radiology Department, Imaging Division, Center for Image Sciences, University Medical Center Utrecht, The Netherlands
- Institute of Neuroscience and Psychology, University of Glasgow, Glasgow, United Kingdom
| | - Mariska J. Vansteensel
- UMC Utrecht Brain Center, Department Neurology and Neurosurgery, UMC Utrecht, Utrecht, The Netherlands
| | - Iris. I. A. Groen
- Informatics Institute, University of Amsterdam, Amsterdam, The Netherlands
- Department of Psychology, New York University, New York, United States of America
| | - Natalia Petridou
- Radiology Department, Imaging Division, Center for Image Sciences, University Medical Center Utrecht, The Netherlands
| | - Serge O. Dumoulin
- Spinoza Centre for Neuroimaging, Amsterdam, The Netherlands
- Experimental Psychology, Utrecht University, Utrecht, The Netherlands
- Experimental and Applied Psychology, VU University, Amsterdam, The Netherlands
| |
Collapse
|
10
|
Burlingham CS, Mirbagheri S, Heeger DJ. A unified model of the task-evoked pupil response. SCIENCE ADVANCES 2022; 8:eabi9979. [PMID: 35442730 PMCID: PMC9020670 DOI: 10.1126/sciadv.abi9979] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/14/2021] [Accepted: 03/04/2022] [Indexed: 06/14/2023]
Abstract
The pupil dilates and reconstricts following task events. It is popular to model this task-evoked pupil response as a linear transformation of event-locked impulses, whose amplitudes are used as estimates of arousal. We show that this model is incorrect and propose an alternative model based on the physiological finding that a common neural input drives saccades and pupil size. The estimates of arousal from our model agreed with key predictions: Arousal scaled with task difficulty and behavioral performance but was invariant to small differences in trial duration. Moreover, the model offers a unified explanation for a wide range of phenomena: entrainment of pupil size and saccades to task timing, modulation of pupil response amplitude and noise with task difficulty, reaction time-dependent modulation of pupil response timing and amplitude, a constrictory pupil response time-locked to saccades, and task-dependent distortion of this saccade-locked pupil response.
Collapse
Affiliation(s)
| | - Saghar Mirbagheri
- Graduate Program in Neuroscience, University of Washington, Seattle, WA 98195, USA
| | - David J. Heeger
- Department of Psychology, New York University, New York, NY 10003, USA
- Center for Neural Science, New York University, New York, NY 10003, USA
| |
Collapse
|
11
|
Broderick WF, Simoncelli EP, Winawer J. Mapping spatial frequency preferences across human primary visual cortex. J Vis 2022; 22:3. [PMID: 35266962 PMCID: PMC8934567 DOI: 10.1167/jov.22.4.3] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2021] [Accepted: 01/23/2022] [Indexed: 01/13/2023] Open
Abstract
Neurons in primate visual cortex (area V1) are tuned for spatial frequency, in a manner that depends on their position in the visual field. Several studies have examined this dependency using functional magnetic resonance imaging (fMRI), reporting preferred spatial frequencies (tuning curve peaks) of V1 voxels as a function of eccentricity, but their results differ by as much as two octaves, presumably owing to differences in stimuli, measurements, and analysis methodology. Here, we characterize spatial frequency tuning at a millimeter resolution within the human primary visual cortex, across stimulus orientation and visual field locations. We measured fMRI responses to a novel set of stimuli, constructed as sinusoidal gratings in log-polar coordinates, which include circular, radial, and spiral geometries. For each individual stimulus, the local spatial frequency varies inversely with eccentricity, and for any given location in the visual field, the full set of stimuli span a broad range of spatial frequencies and orientations. Over the measured range of eccentricities, the preferred spatial frequency is well-fit by a function that varies as the inverse of the eccentricity plus a small constant. We also find small but systematic effects of local stimulus orientation, defined in both absolute coordinates and relative to visual field location. Specifically, peak spatial frequency is higher for pinwheel than annular stimuli and for horizontal than vertical stimuli.
Collapse
Affiliation(s)
- William F Broderick
- Center for Neural Science, New York University, New York, NY, USA
- https://wfbroderick.com/
| | - Eero P Simoncelli
- Center for Neural Science, and Courant Institue for Mathematical Sciences, New York University, New York, NY, USA
- Flatiron Institute, Simons Foundation, USA
| | - Jonathan Winawer
- Department of Psychology, New York University, New York, NY, USA
| |
Collapse
|
12
|
Du B, Cheng X, Duan Y, Ning H. fMRI Brain Decoding and Its Applications in Brain-Computer Interface: A Survey. Brain Sci 2022; 12:228. [PMID: 35203991 PMCID: PMC8869956 DOI: 10.3390/brainsci12020228] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2021] [Revised: 01/29/2022] [Accepted: 01/30/2022] [Indexed: 11/25/2022] Open
Abstract
Brain neural activity decoding is an important branch of neuroscience research and a key technology for the brain-computer interface (BCI). Researchers initially developed simple linear models and machine learning algorithms to classify and recognize brain activities. With the great success of deep learning on image recognition and generation, deep neural networks (DNN) have been engaged in reconstructing visual stimuli from human brain activity via functional magnetic resonance imaging (fMRI). In this paper, we reviewed the brain activity decoding models based on machine learning and deep learning algorithms. Specifically, we focused on current brain activity decoding models with high attention: variational auto-encoder (VAE), generative confrontation network (GAN), and the graph convolutional network (GCN). Furthermore, brain neural-activity-decoding-enabled fMRI-based BCI applications in mental and psychological disease treatment are presented to illustrate the positive correlation between brain decoding and BCI. Finally, existing challenges and future research directions are addressed.
Collapse
Affiliation(s)
- Bing Du
- School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing 100083, China; (B.D.); (X.C.)
| | - Xiaomu Cheng
- School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing 100083, China; (B.D.); (X.C.)
| | - Yiping Duan
- Department of Electronic Engineering, Tsinghua University, Beijing 100084, China;
| | - Huansheng Ning
- School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing 100083, China; (B.D.); (X.C.)
| |
Collapse
|
13
|
Kupers ER, Benson NC, Winawer J. A visual encoding model links magnetoencephalography signals to neural synchrony in human cortex. Neuroimage 2021; 245:118655. [PMID: 34687857 PMCID: PMC8788390 DOI: 10.1016/j.neuroimage.2021.118655] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2021] [Accepted: 10/11/2021] [Indexed: 01/23/2023] Open
Abstract
Synchronization of neuronal responses over large distances is hypothesized to be important for many cortical functions. However, no straightforward methods exist to estimate synchrony non-invasively in the living human brain. MEG and EEG measure the whole brain, but the sensors pool over large, overlapping cortical regions, obscuring the underlying neural synchrony. Here, we developed a model from stimulus to cortex to MEG sensors to disentangle neural synchrony from spatial pooling of the instrument. We find that synchrony across cortex has a surprisingly large and systematic effect on predicted MEG spatial topography. We then conducted visual MEG experiments and separated responses into stimulus-locked and broadband components. The stimulus-locked topography was similar to model predictions assuming synchronous neural sources, whereas the broadband topography was similar to model predictions assuming asynchronous sources. We infer that visual stimulation elicits two distinct types of neural responses, one highly synchronous and one largely asynchronous across cortex.
Collapse
Affiliation(s)
- Eline R Kupers
- Department of Psychology, New York University, New York, NY 10003, United States; Center for Neural Science, New York University, New York, NY 10003, United States; Department of Psychology, Stanford University, Stanford, CA 94305, United States.
| | - Noah C Benson
- Department of Psychology, New York University, New York, NY 10003, United States; Center for Neural Science, New York University, New York, NY 10003, United States; eSciences Institute, University of Washington, Seattle, WA 98195, United States
| | - Jonathan Winawer
- Department of Psychology, New York University, New York, NY 10003, United States; Center for Neural Science, New York University, New York, NY 10003, United States
| |
Collapse
|
14
|
Wang H, Huang L, Du C, Li D, Wang B, He H. Neural Encoding for Human Visual Cortex With Deep Neural Networks Learning “What” and “Where”. IEEE Trans Cogn Dev Syst 2021. [DOI: 10.1109/tcds.2020.3007761] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
15
|
Fuglsang SA, Madsen KH, Puonti O, Hjortkjær J, Siebner HR. Mapping cortico-subcortical sensitivity to 4 Hz amplitude modulation depth in human auditory system with functional MRI. Neuroimage 2021; 246:118745. [PMID: 34808364 DOI: 10.1016/j.neuroimage.2021.118745] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2021] [Revised: 11/17/2021] [Accepted: 11/18/2021] [Indexed: 10/19/2022] Open
Abstract
Temporal modulations in the envelope of acoustic waveforms at rates around 4 Hz constitute a strong acoustic cue in speech and other natural sounds. It is often assumed that the ascending auditory pathway is increasingly sensitive to slow amplitude modulation (AM), but sensitivity to AM is typically considered separately for individual stages of the auditory system. Here, we used blood oxygen level dependent (BOLD) fMRI in twenty human subjects (10 male) to measure sensitivity of regional neural activity in the auditory system to 4 Hz temporal modulations. Participants were exposed to AM noise stimuli varying parametrically in modulation depth to characterize modulation-depth effects on BOLD responses. A Bayesian hierarchical modeling approach was used to model potentially nonlinear relations between AM depth and group-level BOLD responses in auditory regions of interest (ROIs). Sound stimulation activated the auditory brainstem and cortex structures in single subjects. BOLD responses to noise exposure in core and belt auditory cortices scaled positively with modulation depth. This finding was corroborated by whole-brain cluster-level inference. Sensitivity to AM depth variations was particularly pronounced in the Heschl's gyrus but also found in higher-order auditory cortical regions. None of the sound-responsive subcortical auditory structures showed a BOLD response profile that reflected the parametric variation in AM depth. The results are compatible with the notion that early auditory cortical regions play a key role in processing low-rate modulation content of sounds in the human auditory system.
Collapse
Affiliation(s)
- Søren A Fuglsang
- Danish Research Centre for Magnetic Resonance, Centre for Functional and Diagnostic Imaging and Research, Copenhagen University Hospital Amager and Hvidovre, Hvidovre Denmark.
| | - Kristoffer H Madsen
- Danish Research Centre for Magnetic Resonance, Centre for Functional and Diagnostic Imaging and Research, Copenhagen University Hospital Amager and Hvidovre, Hvidovre Denmark; Department of Applied Mathematics and Computer Science, Technical University of Denmark, Kgs. Lyngby, Denmark
| | - Oula Puonti
- Danish Research Centre for Magnetic Resonance, Centre for Functional and Diagnostic Imaging and Research, Copenhagen University Hospital Amager and Hvidovre, Hvidovre Denmark; Department of Health Technology, Technical University of Denmark, Kgs. Lyngby, Denmark
| | - Jens Hjortkjær
- Danish Research Centre for Magnetic Resonance, Centre for Functional and Diagnostic Imaging and Research, Copenhagen University Hospital Amager and Hvidovre, Hvidovre Denmark; Department of Health Technology, Technical University of Denmark, Kgs. Lyngby, Denmark
| | - Hartwig R Siebner
- Danish Research Centre for Magnetic Resonance, Centre for Functional and Diagnostic Imaging and Research, Copenhagen University Hospital Amager and Hvidovre, Hvidovre Denmark; Department of Neurology, Copenhagen University Hospital Bispebjerg and Frederiksberg, Copenhagen, Denmark; Department of Clinical Medicine, Faculty of Medical and Health Sciences, University of Copenhagen, Copenhagen, Denmark
| |
Collapse
|
16
|
Divisive normalization unifies disparate response signatures throughout the human visual hierarchy. Proc Natl Acad Sci U S A 2021; 118:2108713118. [PMID: 34772812 PMCID: PMC8609633 DOI: 10.1073/pnas.2108713118] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/30/2021] [Indexed: 01/04/2023] Open
Abstract
A canonical neural computation is a mathematical operation applied by the brain in a wide variety of contexts and capable of explaining and unifying seemingly unrelated neural and perceptual phenomena. Here, we use a combination of state-of-the-art experiments (ultra-high-field functional MRI) and mathematical methods (population receptive field [pRF] modeling) to uniquely demonstrate the role of divisive normalization (DN) as the canonical neural computation underlying visuospatial responses throughout the human visual hierarchy. The DN pRF model provides a tool to investigate and interpret the computational processes underlying neural responses in human and animal recordings, but also in clinical and cognitive dimensions. Neural processing is hypothesized to apply the same mathematical operations in a variety of contexts, implementing so-called canonical neural computations. Divisive normalization (DN) is considered a prime candidate for a canonical computation. Here, we propose a population receptive field (pRF) model based on DN and evaluate it using ultra-high-field functional MRI (fMRI). The DN model parsimoniously captures seemingly disparate response signatures with a single computation, superseding existing pRF models in both performance and biological plausibility. We observe systematic variations in specific DN model parameters across the visual hierarchy and show how they relate to differences in response modulation and visuospatial information integration. The DN model delivers a unifying framework for visuospatial responses throughout the human visual hierarchy and provides insights into its underlying information-encoding computations. These findings extend the role of DN as a canonical computation to neuronal populations throughout the human visual hierarchy.
Collapse
|
17
|
Reconstruction of natural images from evoked brain activity with a dictionary-based invertible encoding procedure. Neurocomputing 2021. [DOI: 10.1016/j.neucom.2021.05.083] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
18
|
Hansen BC, Greene MR, Field DJ. Dynamic Electrode-to-Image (DETI) mapping reveals the human brain's spatiotemporal code of visual information. PLoS Comput Biol 2021; 17:e1009456. [PMID: 34570753 PMCID: PMC8496831 DOI: 10.1371/journal.pcbi.1009456] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2021] [Revised: 10/07/2021] [Accepted: 09/16/2021] [Indexed: 11/18/2022] Open
Abstract
A number of neuroimaging techniques have been employed to understand how visual information is transformed along the visual pathway. Although each technique has spatial and temporal limitations, they can each provide important insights into the visual code. While the BOLD signal of fMRI can be quite informative, the visual code is not static and this can be obscured by fMRI’s poor temporal resolution. In this study, we leveraged the high temporal resolution of EEG to develop an encoding technique based on the distribution of responses generated by a population of real-world scenes. This approach maps neural signals to each pixel within a given image and reveals location-specific transformations of the visual code, providing a spatiotemporal signature for the image at each electrode. Our analyses of the mapping results revealed that scenes undergo a series of nonuniform transformations that prioritize different spatial frequencies at different regions of scenes over time. This mapping technique offers a potential avenue for future studies to explore how dynamic feedforward and recurrent processes inform and refine high-level representations of our visual world. The visual information that we sample from our environment undergoes a series of neural modifications, with each modification state (or visual code) consisting of a unique distribution of responses across neurons along the visual pathway. However, current noninvasive neuroimaging techniques provide an account of that code that is coarse with respect to time or space. Here, we present dynamic electrode-to-image (DETI) mapping, an analysis technique that capitalizes on the high temporal resolution of EEG to map neural signals to each pixel within a given image to reveal location-specific modifications of the visual code. The DETI technique reveals maps of features that are associated with the neural signal at each pixel and at each time point. DETI mapping shows that real-world scenes undergo a series of nonuniform modifications over both space and time. Specifically, we find that the visual code varies in a location-specific manner, likely reflecting that neural processing prioritizes different features at different image locations over time. DETI mapping therefore offers a potential avenue for future studies to explore how each modification state informs and refines the conceptual meaning of our visual world.
Collapse
Affiliation(s)
- Bruce C. Hansen
- Colgate University, Department of Psychological & Brain Sciences, Neuroscience Program, Hamilton New York, United States of America
- * E-mail:
| | - Michelle R. Greene
- Bates College, Neuroscience Program, Lewiston, Maine, United States of America
| | - David J. Field
- Cornell University, Department of Psychology, Ithaca, New York, United States of America
| |
Collapse
|
19
|
Barnett MA, Aguirre GK, Brainard D. A quadratic model captures the human V1 response to variations in chromatic direction and contrast. eLife 2021; 10:65590. [PMID: 34342580 PMCID: PMC8452309 DOI: 10.7554/elife.65590] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2020] [Accepted: 07/27/2021] [Indexed: 11/13/2022] Open
Abstract
An important goal for vision science is to develop quantitative models of the representation of visual signals at post-receptoral sites. To this end, we develop the quadratic color model (QCM) and examine its ability to account for the BOLD fMRI response in human V1 to spatially-uniform, temporal chromatic modulations that systematically vary in chromatic direction and contrast. We find that the QCM explains the same, cross-validated variance as a conventional general linear model, with far fewer free parameters. The QCM generalizes to allow prediction of V1 responses to a large range of modulations. We replicate the results for each subject and find good agreement across both replications and subjects. We find that within the LM cone contrast plane, V1 is most sensitive to L-M contrast modulations and least sensitive to L+M contrast modulations. Within V1, we observe little to no change in chromatic sensitivity as a function of eccentricity.
Collapse
Affiliation(s)
- Michael A Barnett
- Department of Psychology, University of Pennsylvania, Philadelphia, United States
| | | | - David Brainard
- Neurology, University of Pennsylvania, Philadelphia, United States
| |
Collapse
|
20
|
A Visual Encoding Model Based on Contrastive Self-Supervised Learning for Human Brain Activity along the Ventral Visual Stream. Brain Sci 2021; 11:brainsci11081004. [PMID: 34439623 PMCID: PMC8391143 DOI: 10.3390/brainsci11081004] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2021] [Revised: 07/23/2021] [Accepted: 07/26/2021] [Indexed: 11/30/2022] Open
Abstract
Visual encoding models are important computational models for understanding how information is processed along the visual stream. Many improved visual encoding models have been developed from the perspective of the model architecture and the learning objective, but these are limited to the supervised learning method. From the view of unsupervised learning mechanisms, this paper utilized a pre-trained neural network to construct a visual encoding model based on contrastive self-supervised learning for the ventral visual stream measured by functional magnetic resonance imaging (fMRI). We first extracted features using the ResNet50 model pre-trained in contrastive self-supervised learning (ResNet50-CSL model), trained a linear regression model for each voxel, and finally calculated the prediction accuracy of different voxels. Compared with the ResNet50 model pre-trained in a supervised classification task, the ResNet50-CSL model achieved an equal or even relatively better encoding performance in multiple visual cortical areas. Moreover, the ResNet50-CSL model performs hierarchical representation of input visual stimuli, which is similar to the human visual cortex in its hierarchical information processing. Our experimental results suggest that the encoding model based on contrastive self-supervised learning is a strong computational model to compete with supervised models, and contrastive self-supervised learning proves an effective learning method to extract human brain-like representations.
Collapse
|
21
|
Abstract
The scientific study of reading has a rich history that spans disciplines from vision science to linguistics, psychology, cognitive neuroscience, neurology, and education. The study of reading can elucidate important general mechanisms in spatial vision, attentional control, object recognition, and perceptual learning, as well as the principles of plasticity and cortical topography. However, literacy also prompts the development of specific neural circuits to process a unique and artificial stimulus. In this review, we describe the sequence of operations that transforms visual features into language, how the key neural circuits are sculpted by experience during development, and what goes awry in children for whom learning to read is a struggle. Expected final online publication date for the Annual Review of Vision Science, Volume 7 is September 2021. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.
Collapse
Affiliation(s)
- Jason D Yeatman
- Graduate School of Education, Stanford University, Stanford, California 93405, USA; .,Division of Developmental-Behavioral Pediatrics, Stanford University School of Medicine, Stanford, California 94305, USA.,Department of Psychology, Stanford University, Stanford, California 94305, USA
| | - Alex L White
- Graduate School of Education, Stanford University, Stanford, California 93405, USA; .,Division of Developmental-Behavioral Pediatrics, Stanford University School of Medicine, Stanford, California 94305, USA.,Department of Neuroscience and Behavior, Barnard College, New York, New York 10027, USA
| |
Collapse
|
22
|
Das A, Ray S. Effect of Cross-Orientation Normalization on Different Neural Measures in Macaque Primary Visual Cortex. Cereb Cortex Commun 2021; 2:tgab009. [PMID: 34095837 PMCID: PMC8152940 DOI: 10.1093/texcom/tgab009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2020] [Revised: 01/26/2021] [Accepted: 01/27/2021] [Indexed: 11/14/2022] Open
Abstract
Divisive normalization is a canonical mechanism that can explain a variety of sensory phenomena. While normalization models have been used to explain spiking activity in response to different stimulus/behavioral conditions in multiple brain areas, it is unclear whether similar models can also explain modulation in population-level neural measures such as power at various frequencies in local field potentials (LFPs) or steady-state visually evoked potential (SSVEP) that is produced by flickering stimuli and popular in electroencephalogram studies. To address this, we manipulated normalization strength by presenting static as well as flickering orthogonal superimposed gratings (plaids) at varying contrasts to 2 female monkeys while recording multiunit activity (MUA) and LFP from the primary visual cortex and quantified the modulation in MUA, gamma (32-80 Hz), high-gamma (104-248 Hz) power, as well as SSVEP. Even under similar stimulus conditions, normalization strength was different for the 4 measures and increased as: spikes, high-gamma, SSVEP, and gamma. However, these results could be explained using a normalization model that was modified for population responses, by varying the tuned normalization parameter and semisaturation constant. Our results show that different neural measures can reflect the effect of stimulus normalization in different ways, which can be modeled by a simple normalization model.
Collapse
Affiliation(s)
- Aritra Das
- Centre for Neuroscience, Indian Institute of Science, Bangalore 560012, India
| | - Supratim Ray
- Centre for Neuroscience, Indian Institute of Science, Bangalore 560012, India
| |
Collapse
|
23
|
Abstract
The response to contrast is one of the most important functions of the macaque primary visual cortex, V1, but up to now there has not been an adequate theory for it. To fill this gap in our understanding of cortical function, we built and analyzed a new large-scale, biologically constrained model of the input layer, 4Cα, of macaque V1. We called the new model CSY2. We challenged CSY2 with a three-parameter family of visual stimuli that varied in contrast, orientation, and spatial frequency. CSY2 accurately simulated experimental data and made many new predictions. It accounted for 1) the shapes of firing-rate-versus-contrast functions, 2) orientation and spatial frequency tuning versus contrast, and 3) the approximate contrast-invariance of cortical activity maps. Post-analysis revealed that the mechanisms that were needed to produce the successful simulations of contrast response included strong recurrent excitation and inhibition that find dynamic equilibria across the cortical surface, dynamic feedback between L6 and L4, and synaptic dynamics like inhibitory synaptic depression.
Collapse
|
24
|
Wu H, Zheng N, Chen B. Feature-specific denoising of neural activity for natural image identification. IEEE Trans Cogn Dev Syst 2021. [DOI: 10.1109/tcds.2021.3062067] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
25
|
Differential impact of endogenous and exogenous attention on activity in human visual cortex. Sci Rep 2020; 10:21274. [PMID: 33277552 PMCID: PMC7718281 DOI: 10.1038/s41598-020-78172-x] [Citation(s) in RCA: 45] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2020] [Accepted: 11/09/2020] [Indexed: 01/27/2023] Open
Abstract
How do endogenous (voluntary) and exogenous (involuntary) attention modulate activity in visual cortex? Using ROI-based fMRI analysis, we measured fMRI activity for valid and invalid trials (target at cued/un-cued location, respectively), pre- or post-cueing endogenous or exogenous attention, while participants performed the same orientation discrimination task. We found stronger modulation in contralateral than ipsilateral visual regions, and higher activity in valid- than invalid-trials. For endogenous attention, modulation of stimulus-evoked activity due to a pre-cue increased along the visual hierarchy, but was constant due to a post-cue. For exogenous attention, modulation of stimulus-evoked activity due to a pre-cue was constant along the visual hierarchy, but was not modulated due to a post-cue. These findings reveal that endogenous and exogenous attention distinctly modulate activity in visuo-occipital areas during orienting and reorienting; endogenous attention facilitates both the encoding and the readout of visual information whereas exogenous attention only facilitates the encoding of information.
Collapse
|
26
|
The neural signature of numerosity by separating numerical and continuous magnitude extraction in visual cortex with frequency-tagged EEG. Proc Natl Acad Sci U S A 2020; 117:5726-5732. [PMID: 32123113 PMCID: PMC7084102 DOI: 10.1073/pnas.1917849117] [Citation(s) in RCA: 37] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023] Open
Abstract
The ability to handle approximate quantities, or number sense, has been recurrently linked to mathematical skills, although the nature of the mechanism allowing to extract numerical information (i.e., numerosity) from environmental stimuli is still debated. A set of objects is indeed not only characterized by its numerosity but also by other features, such as the summed area occupied by the elements, which often covary with numerosity. These intrinsic relations between numerosity and nonnumerical magnitudes led some authors to argue that numerosity is not independently processed but extracted through a weighting of continuous magnitudes. This view cannot be properly tested through classic behavioral and neuroimaging approaches due to these intrinsic correlations. The current study used a frequency-tagging EEG approach to separately measure responses to numerosity as well as to continuous magnitudes. We recorded occipital responses to numerosity, total area, and convex hull changes but not to density and dot size. We additionally applied a model predicting primary visual cortex responses to the set of stimuli. The model output was closely aligned with our electrophysiological data, since it predicted discrimination only for numerosity, total area, and convex hull. Our findings thus demonstrate that numerosity can be independently processed at an early stage in the visual cortex, even when completely isolated from other magnitude changes. The similar implicit discrimination for numerosity as for some continuous magnitudes, which correspond to basic visual percepts, shows that both can be extracted independently, hence substantiating the nature of numerosity as a primary feature of the visual scene.
Collapse
|
27
|
Bloem IM, Ling S. Normalization governs attentional modulation within human visual cortex. Nat Commun 2019; 10:5660. [PMID: 31827078 PMCID: PMC6906520 DOI: 10.1038/s41467-019-13597-1] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2018] [Accepted: 11/12/2019] [Indexed: 12/02/2022] Open
Abstract
Although attention is known to increase the gain of visuocortical responses, its underlying neural computations remain unclear. Here, we use fMRI to test the hypothesis that a neural population’s ability to be modulated by attention is dependent on divisive normalization. To do so, we leverage the feature-tuned properties of normalization and find that visuocortical responses to stimuli sharing features normalize each other more strongly. Comparing these normalization measures to measures of attentional modulation, we demonstrate that subpopulations which exhibit stronger normalization also exhibit larger attentional benefits. In a converging experiment, we reveal that attentional benefits are greatest when a subpopulation is forced into a state of stronger normalization. Taken together, these results suggest that the degree to which a subpopulation exhibits normalization plays a role in dictating its potential for attentional benefits. Attention is known to enhance relevant information in our environment, yet its underlying neural computations remain unclear. Here, the authors provide evidence that the degree to which a neural population can normalize itself results in greater potential for attentional benefits.
Collapse
Affiliation(s)
- Ilona M Bloem
- Department of Psychological and Brain Sciences, Boston University, Boston, MA, 02215, USA. .,Center for Systems Neuroscience, Boston University, Boston, MA, 02215, USA.
| | - Sam Ling
- Department of Psychological and Brain Sciences, Boston University, Boston, MA, 02215, USA.,Center for Systems Neuroscience, Boston University, Boston, MA, 02215, USA
| |
Collapse
|
28
|
Fritsche M, Lawrence SJD, de Lange FP. Temporal tuning of repetition suppression across the visual cortex. J Neurophysiol 2019; 123:224-233. [PMID: 31774368 DOI: 10.1152/jn.00582.2019] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
The visual system adapts to its recent history. A phenomenon related to this is repetition suppression (RS), a reduction in neural responses to repeated compared with nonrepeated visual input. An intriguing hypothesis is that the timescale over which RS occurs across the visual hierarchy is tuned to the temporal statistics of visual input features, which change rapidly in low-level areas but are more stable in higher level areas. Here, we tested this hypothesis by studying the influence of the temporal lag between successive visual stimuli on RS throughout the visual system using functional (f)MRI. Twelve human volunteers engaged in four fMRI sessions in which we characterized the blood oxygen level-dependent response to pairs of repeated and nonrepeated natural images with interstimulus intervals (ISI) ranging from 50 to 1,000 ms to quantify the temporal tuning of RS along the posterior-anterior axis of the visual system. As expected, RS was maximal for short ISIs and decayed with increasing ISI. Crucially, however, and against our hypothesis, RS decayed at a similar rate in early and late visual areas. This finding challenges the prevailing view that the timescale of RS increases along the posterior-anterior axis of the visual system and suggests that RS is not tuned to temporal input regularities.NEW & NOTEWORTHY Visual areas show reduced neural responses to repeated compared with nonrepeated visual input, a phenomenon termed repetition suppression (RS). Here we show that RS decays at a similar rate in low- and high-level visual areas, suggesting that the short-term decay of RS across the visual hierarchy is not tuned to temporal input regularities. This may limit the specificity with which the mechanisms underlying RS could optimize the processing of input features across the visual hierarchy.
Collapse
Affiliation(s)
- Matthias Fritsche
- Donders Institute for Brain, Cognition and Behaviour, Radboud University, Nijmegen, The Netherlands
| | - Samuel J D Lawrence
- Donders Institute for Brain, Cognition and Behaviour, Radboud University, Nijmegen, The Netherlands
| | - Floris P de Lange
- Donders Institute for Brain, Cognition and Behaviour, Radboud University, Nijmegen, The Netherlands
| |
Collapse
|
29
|
Hermes D, Petridou N, Kay KN, Winawer J. An image-computable model for the stimulus selectivity of gamma oscillations. eLife 2019; 8:e47035. [PMID: 31702552 PMCID: PMC6839904 DOI: 10.7554/elife.47035] [Citation(s) in RCA: 30] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2019] [Accepted: 10/24/2019] [Indexed: 11/13/2022] Open
Abstract
Gamma oscillations in visual cortex have been hypothesized to be critical for perception, cognition, and information transfer. However, observations of these oscillations in visual cortex vary widely; some studies report little to no stimulus-induced narrowband gamma oscillations, others report oscillations for only some stimuli, and yet others report large oscillations for most stimuli. To better understand this signal, we developed a model that predicts gamma responses for arbitrary images and validated this model on electrocorticography (ECoG) data from human visual cortex. The model computes variance across the outputs of spatially pooled orientation channels, and accurately predicts gamma amplitude across 86 images. Gamma responses were large for a small subset of stimuli, differing dramatically from fMRI and ECoG broadband (non-oscillatory) responses. We propose that gamma oscillations in visual cortex serve as a biomarker of gain control rather than being a fundamental mechanism for communicating visual information.
Collapse
Affiliation(s)
- Dora Hermes
- Department of Physiology and Biomedical EngineeringMayo ClinicRochesterUnited States
- Department of Neurology and NeurosurgeryUMC Utrecht Brain CenterUtrechtNetherlands
| | - Natalia Petridou
- Center for Image SciencesUniversity Medical Center UtrechtUtrechtNetherlands
| | - Kendrick N Kay
- Center for Magnetic Resonance Research (CMRR), Department of RadiologyUniversity of MinnesotaMinneapolisUnited States
| | - Jonathan Winawer
- Department of PsychologyNew York UniversityNew YorkUnited States
- Center for Neural ScienceNew York UniversityNew YorkUnited States
| |
Collapse
|
30
|
Hendriks AD, van der Kemp WJ, Luijten PR, Petridou N, Klomp DW. SNR optimized 31 P functional MRS to detect mitochondrial and extracellular pH change during visual stimulation. NMR IN BIOMEDICINE 2019; 32:e4137. [PMID: 31329342 PMCID: PMC6900119 DOI: 10.1002/nbm.4137] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/27/2018] [Revised: 06/11/2019] [Accepted: 06/12/2019] [Indexed: 05/04/2023]
Abstract
UNLABELLED Energy metabolism of the human visual cortex was investigated by performing 31 P functional MRS. INTRODUCTION The human brain is known to be the main glucose demanding organ of the human body and neuronal activity can increase this energy demand. In this study we investigate whether alterations in pH during activation of the brain can be observed with MRS, focusing on the mitochondrial inorganic phosphate (Pi) pool as potential marker of energy demand. METHODS Six participants were scanned with 16 consecutive 31 P-MRSI scans, which were divided in 4 blocks of 8:36 minutes of either rest or visual stimulation. Since the signals from the mitochondrial compartments of Pi are low, multiple approaches to achieve high SNR 31 P measurements were combined. This included: a close fitting 31 P RF coil, a 7 T-field strength, Ernst angle acquisitions and a stimulus with a large visual angle allowing large spectroscopy volumes containing activated tissue. RESULTS The targeted resonance downfield of the main Pi peak could be distinguished, indicating the high SNR of the 31 P spectra. The peak downfield of the main Pi peak is believed to be connected to mitochondrial performance. In addition, a BOLD effect in the PCr signal was observed as a signal increase of 2-3% during visual stimulation as compared to rest. When averaging data over multiple volunteers, a small subtle shift of about 0.1 ppm of the downfield Pi peak towards the main Pi peak could be observed in the first 4 minutes of visual stimulation, but no longer in the 4 to 8 minute scan window. Indications of a subtle shift during visual stimulation were found, but this effect remains small and should be further validated. CONCLUSION Overall, the downfield peak of Pi could be observed, revealing opportunities and considerations to measure specific acidity (pH) effects in the human visual cortex.
Collapse
Affiliation(s)
- Arjan D. Hendriks
- Department of RadiologyUniversity Medical Center UtrechtUtrechtthe Netherlands
| | | | - Peter R. Luijten
- Department of RadiologyUniversity Medical Center UtrechtUtrechtthe Netherlands
| | - Natalia Petridou
- Department of RadiologyUniversity Medical Center UtrechtUtrechtthe Netherlands
| | - Dennis W.J. Klomp
- Department of RadiologyUniversity Medical Center UtrechtUtrechtthe Netherlands
| |
Collapse
|
31
|
Turner BM, Palestro JJ, Miletić S, Forstmann BU. Advances in techniques for imposing reciprocity in brain-behavior relations. Neurosci Biobehav Rev 2019; 102:327-336. [PMID: 31128445 DOI: 10.1016/j.neubiorev.2019.04.018] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2018] [Revised: 03/18/2019] [Accepted: 04/25/2019] [Indexed: 01/01/2023]
Abstract
To better understand human behavior, the emerging field of model-based cognitive neuroscience seeks to anchor psychological theory to the biological substrate from which behavior originates: the brain. Despite complex dynamics, many researchers in this field have demonstrated that fluctuations in brain activity can be related to fluctuations in components of cognitive models, which instantiate psychological theories. In this review, we discuss a number of approaches for relating brain activity to cognitive models, and expand on a framework for imposing reciprocity in the inference of mental operations from the combination of brain and behavioral data.
Collapse
Affiliation(s)
- Brandon M Turner
- Department of Psychology, The Ohio State University, Columbus, OH, USA.
| | - James J Palestro
- Department of Psychology, The Ohio State University, Columbus, OH, USA.
| | - Steven Miletić
- Department of Psychology, University of Amsterdam, Amsterdam, Netherlands.
| | - Birte U Forstmann
- Department of Psychology, University of Amsterdam, Amsterdam, Netherlands.
| |
Collapse
|
32
|
Derivatives and inverse of cascaded linear+nonlinear neural models. PLoS One 2018; 13:e0201326. [PMID: 30321175 PMCID: PMC6188639 DOI: 10.1371/journal.pone.0201326] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2017] [Accepted: 07/11/2018] [Indexed: 11/20/2022] Open
Abstract
In vision science, cascades of Linear+Nonlinear transforms are very successful in modeling a number of perceptual experiences. However, the conventional literature is usually too focused on only describing the forward input-output transform. Instead, in this work we present the mathematics of such cascades beyond the forward transform, namely the Jacobian matrices and the inverse. The fundamental reason for this analytical treatment is that it offers useful analytical insight into the psychophysics, the physiology, and the function of the visual system. For instance, we show how the trends of the sensitivity (volume of the discrimination regions) and the adaptation of the receptive fields can be identified in the expression of the Jacobian w.r.t. the stimulus. This matrix also tells us which regions of the stimulus space are encoded more efficiently in multi-information terms. The Jacobian w.r.t. the parameters shows which aspects of the model have bigger impact in the response, and hence their relative relevance. The analytic inverse implies conditions for the response and model parameters to ensure appropriate decoding. From the experimental and applied perspective, (a) the Jacobian w.r.t. the stimulus is necessary in new experimental methods based on the synthesis of visual stimuli with interesting geometrical properties, (b) the Jacobian matrices w.r.t. the parameters are convenient to learn the model from classical experiments or alternative goal optimization, and (c) the inverse is a promising model-based alternative to blind machine-learning methods for neural decoding that do not include meaningful biological information. The theory is checked by building and testing a vision model that actually follows a modular Linear+Nonlinear program. Our illustrative derivable and invertible model consists of a cascade of modules that account for brightness, contrast, energy masking, and wavelet masking. To stress the generality of this modular setting we show examples where some of the canonical Divisive Normalization modules are substituted by equivalent modules such as the Wilson-Cowan interaction model (at the V1 cortex) or a tone-mapping model (at the retina).
Collapse
|
33
|
Kay KN. Principles for models of neural information processing. Neuroimage 2018; 180:101-109. [DOI: 10.1016/j.neuroimage.2017.08.016] [Citation(s) in RCA: 45] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2017] [Revised: 08/02/2017] [Accepted: 08/03/2017] [Indexed: 11/25/2022] Open
|
34
|
Baker DH, Wade AR. Evidence for an Optimal Algorithm Underlying Signal Combination in Human Visual Cortex. Cereb Cortex 2018; 27:254-264. [PMID: 28031176 PMCID: PMC5903417 DOI: 10.1093/cercor/bhw395] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2016] [Accepted: 12/05/2016] [Indexed: 12/24/2022] Open
Abstract
How does the cortex combine information from multiple sources? We tested several computational models against data from steady-state electroencephalography (EEG) experiments in humans, using periodic visual stimuli combined across either retinal location or eye-of-presentation. A model in which signals are raised to an exponent before being summed in both the numerator and the denominator of a gain control nonlinearity gave the best account of the data. This model also predicted the pattern of responses in a range of additional conditions accurately and with no free parameters, as well as predicting responses at harmonic and intermodulation frequencies between 1 and 30 Hz. We speculate that this model implements the optimal algorithm for combining multiple noisy inputs, in which responses are proportional to the weighted sum of both inputs. This suggests a novel purpose for cortical gain control: implementing optimal signal combination via mutual inhibition, perhaps explaining its ubiquity as a neural computation.
Collapse
Affiliation(s)
- Daniel H Baker
- Department of Psychology, University of York, Heslington, York YO10 5DD, UK
| | - Alex R Wade
- Department of Psychology, University of York, Heslington, York YO10 5DD, UK
| |
Collapse
|
35
|
A dual fast and slow feature interaction in biologically inspired visual recognition of human action. Appl Soft Comput 2018. [DOI: 10.1016/j.asoc.2017.10.021] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
|
36
|
Compressive Temporal Summation in Human Visual Cortex. J Neurosci 2017; 38:691-709. [PMID: 29192127 DOI: 10.1523/jneurosci.1724-17.2017] [Citation(s) in RCA: 57] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2017] [Revised: 10/23/2017] [Accepted: 11/17/2017] [Indexed: 01/23/2023] Open
Abstract
Combining sensory inputs over space and time is fundamental to vision. Population receptive field models have been successful in characterizing spatial encoding throughout the human visual pathways. A parallel question, how visual areas in the human brain process information distributed over time, has received less attention. One challenge is that the most widely used neuroimaging method, fMRI, has coarse temporal resolution compared with the time-scale of neural dynamics. Here, via carefully controlled temporally modulated stimuli, we show that information about temporal processing can be readily derived from fMRI signal amplitudes in male and female subjects. We find that all visual areas exhibit subadditive summation, whereby responses to longer stimuli are less than the linear prediction from briefer stimuli. We also find fMRI evidence that the neural response to two stimuli is reduced for brief interstimulus intervals (indicating adaptation). These effects are more pronounced in visual areas anterior to V1-V3. Finally, we develop a general model that shows how these effects can be captured with two simple operations: temporal summation followed by a compressive nonlinearity. This model operates for arbitrary temporal stimulation patterns and provides a simple and interpretable set of computations that can be used to characterize neural response properties across the visual hierarchy. Importantly, compressive temporal summation directly parallels earlier findings of compressive spatial summation in visual cortex describing responses to stimuli distributed across space. This indicates that, for space and time, cortex uses a similar processing strategy to achieve higher-level and increasingly invariant representations of the visual world.SIGNIFICANCE STATEMENT Combining sensory inputs over time is fundamental to seeing. Two important temporal phenomena are summation, the accumulation of sensory inputs over time, and adaptation, a response reduction for repeated or sustained stimuli. We investigated these phenomena in the human visual system using fMRI. We built predictive models that operate on arbitrary temporal patterns of stimulation using two simple computations: temporal summation followed by a compressive nonlinearity. Our new temporal compressive summation model captures (1) subadditive temporal summation, and (2) adaptation. We show that the model accounts for systematic differences in these phenomena across visual areas. Finally, we show that for space and time, the visual system uses a similar strategy to achieve increasingly invariant representations of the visual world.
Collapse
|
37
|
Hermes D, Nguyen M, Winawer J. Neuronal synchrony and the relation between the blood-oxygen-level dependent response and the local field potential. PLoS Biol 2017; 15:e2001461. [PMID: 28742093 PMCID: PMC5524566 DOI: 10.1371/journal.pbio.2001461] [Citation(s) in RCA: 56] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2016] [Accepted: 06/22/2017] [Indexed: 01/07/2023] Open
Abstract
The most widespread measures of human brain activity are the blood-oxygen-level dependent (BOLD) signal and surface field potential. Prior studies report a variety of relationships between these signals. To develop an understanding of how to interpret these signals and the relationship between them, we developed a model of (a) neuronal population responses and (b) transformations from neuronal responses into the functional magnetic resonance imaging (fMRI) BOLD signal and electrocorticographic (ECoG) field potential. Rather than seeking a transformation between the two measures directly, this approach interprets each measure with respect to the underlying neuronal population responses. This model accounts for the relationship between BOLD and ECoG data from human visual cortex in V1, V2, and V3, with the model predictions and data matching in three ways: across stimuli, the BOLD amplitude and ECoG broadband power were positively correlated, the BOLD amplitude and alpha power (8-13 Hz) were negatively correlated, and the BOLD amplitude and narrowband gamma power (30-80 Hz) were uncorrelated. The two measures provide complementary information about human brain activity, and we infer that features of the field potential that are uncorrelated with BOLD arise largely from changes in synchrony, rather than level, of neuronal activity.
Collapse
Affiliation(s)
- Dora Hermes
- Department of Psychology, New York University, New York, New York, United States of America
- Brain Center Rudolf Magnus, Department of Neurology & Neurosurgery, University Medical Center Utrecht, Utrecht, the Netherlands
- Department of Psychology, Stanford University, Stanford, California, United States of America
| | - Mai Nguyen
- Department of Psychology, Princeton University, Princeton, New Jersey, United States of America
| | - Jonathan Winawer
- Department of Psychology, New York University, New York, New York, United States of America
| |
Collapse
|
38
|
De Martino F, Yacoub E, Kemper V, Moerel M, Uludağ K, De Weerd P, Ugurbil K, Goebel R, Formisano E. The impact of ultra-high field MRI on cognitive and computational neuroimaging. Neuroimage 2017; 168:366-382. [PMID: 28396293 DOI: 10.1016/j.neuroimage.2017.03.060] [Citation(s) in RCA: 73] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2016] [Revised: 03/20/2017] [Accepted: 03/29/2017] [Indexed: 01/14/2023] Open
Abstract
The ability to measure functional brain responses non-invasively with ultra high field MRI (7 T and above) represents a unique opportunity in advancing our understanding of the human brain. Compared to lower fields (3 T and below), ultra high field MRI has an increased sensitivity, which can be used to acquire functional images with greater spatial resolution, and greater specificity of the blood oxygen level dependent (BOLD) signal to the underlying neuronal responses. Together, increased resolution and specificity enable investigating brain functions at a submillimeter scale, which so far could only be done with invasive techniques. At this mesoscopic spatial scale, perception, cognition and behavior can be probed at the level of fundamental units of neural computations, such as cortical columns, cortical layers, and subcortical nuclei. This represents a unique and distinctive advantage that differentiates ultra high from lower field imaging and that can foster a tighter link between fMRI and computational modeling of neural networks. So far, functional brain mapping at submillimeter scale has focused on the processing of sensory information and on well-known systems for which extensive information is available from invasive recordings in animals. It remains an open challenge to extend this methodology to uniquely human functions and, more generally, to systems for which animal models may be problematic. To succeed, the possibility to acquire high-resolution functional data with large spatial coverage, the availability of computational models of neural processing as well as accurate biophysical modeling of neurovascular coupling at mesoscopic scale all appear necessary.
Collapse
Affiliation(s)
- Federico De Martino
- Department of Cognitive Neurosciences, Faculty of Psychology and Neuroscience, Maastricht University, Oxfordlaan 55, 6229 ER Maastricht, The Netherlands; Center for Magnetic Resonance Research, Department of Radiology, University of Minnesota, 2021 sixth street SE, 55455 Minneapolis, MN, USA.
| | - Essa Yacoub
- Center for Magnetic Resonance Research, Department of Radiology, University of Minnesota, 2021 sixth street SE, 55455 Minneapolis, MN, USA
| | - Valentin Kemper
- Department of Cognitive Neurosciences, Faculty of Psychology and Neuroscience, Maastricht University, Oxfordlaan 55, 6229 ER Maastricht, The Netherlands
| | - Michelle Moerel
- Department of Cognitive Neurosciences, Faculty of Psychology and Neuroscience, Maastricht University, Oxfordlaan 55, 6229 ER Maastricht, The Netherlands; Maastricht Center for System Biology, Maastricht University, Universiteitssingel 60, 6229 ER Maastricht, The Netherlands
| | - Kâmil Uludağ
- Department of Cognitive Neurosciences, Faculty of Psychology and Neuroscience, Maastricht University, Oxfordlaan 55, 6229 ER Maastricht, The Netherlands
| | - Peter De Weerd
- Department of Cognitive Neurosciences, Faculty of Psychology and Neuroscience, Maastricht University, Oxfordlaan 55, 6229 ER Maastricht, The Netherlands
| | - Kamil Ugurbil
- Center for Magnetic Resonance Research, Department of Radiology, University of Minnesota, 2021 sixth street SE, 55455 Minneapolis, MN, USA
| | - Rainer Goebel
- Department of Cognitive Neurosciences, Faculty of Psychology and Neuroscience, Maastricht University, Oxfordlaan 55, 6229 ER Maastricht, The Netherlands
| | - Elia Formisano
- Department of Cognitive Neurosciences, Faculty of Psychology and Neuroscience, Maastricht University, Oxfordlaan 55, 6229 ER Maastricht, The Netherlands; Maastricht Center for System Biology, Maastricht University, Universiteitssingel 60, 6229 ER Maastricht, The Netherlands
| |
Collapse
|
39
|
Kay KN, Yeatman JD. Bottom-up and top-down computations in word- and face-selective cortex. eLife 2017; 6. [PMID: 28226243 PMCID: PMC5358981 DOI: 10.7554/elife.22341] [Citation(s) in RCA: 99] [Impact Index Per Article: 12.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2016] [Accepted: 02/19/2017] [Indexed: 11/13/2022] Open
Abstract
The ability to read a page of text or recognize a person's face depends on category-selective visual regions in ventral temporal cortex (VTC). To understand how these regions mediate word and face recognition, it is necessary to characterize how stimuli are represented and how this representation is used in the execution of a cognitive task. Here, we show that the response of a category-selective region in VTC can be computed as the degree to which the low-level properties of the stimulus match a category template. Moreover, we show that during execution of a task, the bottom-up representation is scaled by the intraparietal sulcus (IPS), and that the level of IPS engagement reflects the cognitive demands of the task. These results provide an account of neural processing in VTC in the form of a model that addresses both bottom-up and top-down effects and quantitatively predicts VTC responses.
Collapse
Affiliation(s)
- Kendrick N Kay
- Center for Magnetic Resonance Research, Department of Radiology, University of Minnesota, Minneapolis, United States
| | - Jason D Yeatman
- Institute for Learning and Brain Sciences, University of Washington, Seattle, United States.,Department of Speech and Hearing Sciences, University of Washington, Seattle, United States
| |
Collapse
|
40
|
Khaligh-Razavi SM, Henriksson L, Kay K, Kriegeskorte N. Fixed versus mixed RSA: Explaining visual representations by fixed and mixed feature sets from shallow and deep computational models. JOURNAL OF MATHEMATICAL PSYCHOLOGY 2017; 76:184-197. [PMID: 28298702 PMCID: PMC5341758 DOI: 10.1016/j.jmp.2016.10.007] [Citation(s) in RCA: 35] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
Studies of the primate visual system have begun to test a wide range of complex computational object-vision models. Realistic models have many parameters, which in practice cannot be fitted using the limited amounts of brain-activity data typically available. Task performance optimization (e.g. using backpropagation to train neural networks) provides major constraints for fitting parameters and discovering nonlinear representational features appropriate for the task (e.g. object classification). Model representations can be compared to brain representations in terms of the representational dissimilarities they predict for an image set. This method, called representational similarity analysis (RSA), enables us to test the representational feature space as is (fixed RSA) or to fit a linear transformation that mixes the nonlinear model features so as to best explain a cortical area's representational space (mixed RSA). Like voxel/population-receptive-field modelling, mixed RSA uses a training set (different stimuli) to fit one weight per model feature and response channel (voxels here), so as to best predict the response profile across images for each response channel. We analysed response patterns elicited by natural images, which were measured with functional magnetic resonance imaging (fMRI). We found that early visual areas were best accounted for by shallow models, such as a Gabor wavelet pyramid (GWP). The GWP model performed similarly with and without mixing, suggesting that the original features already approximated the representational space, obviating the need for mixing. However, a higher ventral-stream visual representation (lateral occipital region) was best explained by the higher layers of a deep convolutional network and mixing of its feature set was essential for this model to explain the representation. We suspect that mixing was essential because the convolutional network had been trained to discriminate a set of 1000 categories, whose frequencies in the training set did not match their frequencies in natural experience or their behavioural importance. The latter factors might determine the representational prominence of semantic dimensions in higher-level ventral-stream areas. Our results demonstrate the benefits of testing both the specific representational hypothesis expressed by a model's original feature space and the hypothesis space generated by linear transformations of that feature space.
Collapse
Affiliation(s)
- Seyed-Mahdi Khaligh-Razavi
- MRC Cognition and Brain Sciences Unit, Cambridge, UK
- Computer Science & Artificial intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Linda Henriksson
- MRC Cognition and Brain Sciences Unit, Cambridge, UK
- Department of Neuroscience and Biomedical Engineering, Aalto University, Espoo, Finland
| | - Kendrick Kay
- Department of Psychology, Washington University in St. Louis, St. Louis, MO, USA
| | | |
Collapse
|
41
|
Chen G, Taylor PA, Cox RW. Is the statistic value all we should care about in neuroimaging? Neuroimage 2016; 147:952-959. [PMID: 27729277 DOI: 10.1016/j.neuroimage.2016.09.066] [Citation(s) in RCA: 90] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2016] [Revised: 09/23/2016] [Accepted: 09/29/2016] [Indexed: 10/20/2022] Open
Abstract
Here we address an important issue that has been embedded within the neuroimaging community for a long time: the absence of effect estimates in results reporting in the literature. The statistic value itself, as a dimensionless measure, does not provide information on the biophysical interpretation of a study, and it certainly does not represent the whole picture of a study. Unfortunately, in contrast to standard practice in most scientific fields, effect (or amplitude) estimates are usually not provided in most results reporting in the current neuroimaging publications and presentations. Possible reasons underlying this general trend include (1) lack of general awareness, (2) software limitations, (3) inaccurate estimation of the BOLD response, and (4) poor modeling due to our relatively limited understanding of FMRI signal components. However, as we discuss here, such reporting damages the reliability and interpretability of the scientific findings themselves, and there is in fact no overwhelming reason for such a practice to persist. In order to promote meaningful interpretation, cross validation, reproducibility, meta and power analyses in neuroimaging, we strongly suggest that, as part of good scientific practice, effect estimates should be reported together with their corresponding statistic values. We provide several easily adaptable recommendations for facilitating this process.
Collapse
Affiliation(s)
- Gang Chen
- Scientific and Statistical Computing Core, National Institute of Mental Health, National Institutes of Health, Department of Health and Human Services, USA.
| | - Paul A Taylor
- Scientific and Statistical Computing Core, National Institute of Mental Health, National Institutes of Health, Department of Health and Human Services, USA
| | - Robert W Cox
- Scientific and Statistical Computing Core, National Institute of Mental Health, National Institutes of Health, Department of Health and Human Services, USA
| |
Collapse
|
42
|
Hummer A, Ritter M, Tik M, Ledolter AA, Woletz M, Holder GE, Dumoulin SO, Schmidt-Erfurth U, Windischberger C. Eyetracker-based gaze correction for robust mapping of population receptive fields. Neuroimage 2016; 142:211-224. [PMID: 27389789 DOI: 10.1016/j.neuroimage.2016.07.003] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2016] [Revised: 06/14/2016] [Accepted: 07/02/2016] [Indexed: 11/26/2022] Open
Abstract
Functional MRI enables the acquisition of a retinotopic map that relates regions of the visual field to neural populations in the visual cortex. During such a "population receptive field" (PRF) experiment, stable gaze fixation is of utmost importance in order to correctly link the presented stimulus patterns to stimulated retinal regions and the resulting Blood Oxygen Level Dependent (BOLD) response of the appropriate region within the visual cortex. A method is described that compensates for unstable gaze fixation by recording gaze position via an eyetracker and subsequently modifies the input stimulus underlying the PRF analysis according to the eyetracking measures. Here we show that PRF maps greatly improve when the method is applied to data acquired with either saccadic or smooth eye movements. We conclude that the technique presented herein is useful for studies involving subjects with unstable gaze fixation, particularly elderly patient populations.
Collapse
Affiliation(s)
- A Hummer
- MR Centre of Excellence, Center for Medical Physics and Biomedical Engineering, Medical University of Vienna, Währinger Gürtel 18-20, 1090 Vienna, Austria
| | - M Ritter
- Department of Ophthalmology and Optometry, Medical University of Vienna, Währinger Gürtel 18-20, 1090 Vienna, Austria
| | - M Tik
- MR Centre of Excellence, Center for Medical Physics and Biomedical Engineering, Medical University of Vienna, Währinger Gürtel 18-20, 1090 Vienna, Austria
| | - A A Ledolter
- Department of Ophthalmology and Optometry, Medical University of Vienna, Währinger Gürtel 18-20, 1090 Vienna, Austria
| | - M Woletz
- MR Centre of Excellence, Center for Medical Physics and Biomedical Engineering, Medical University of Vienna, Währinger Gürtel 18-20, 1090 Vienna, Austria
| | - G E Holder
- Institute of Ophthalmology, University College London, 11-43 Bath Street, London, EC1V 2PD, UK; Moorfields Eye Hospital, 162 City Road, London, EC1V 9EL, UK
| | - S O Dumoulin
- Experimental Psychology, Helmholtz Institute, Utrecht University, Heidelberglaan 1, 3584 CS Utrecht, The Netherlands; Spinoza Centre for Neuroimaging, Meibergdreef 75, 1105BK, Amsterdam, The Netherlands
| | - U Schmidt-Erfurth
- Department of Ophthalmology and Optometry, Medical University of Vienna, Währinger Gürtel 18-20, 1090 Vienna, Austria
| | - C Windischberger
- MR Centre of Excellence, Center for Medical Physics and Biomedical Engineering, Medical University of Vienna, Währinger Gürtel 18-20, 1090 Vienna, Austria.
| |
Collapse
|
43
|
Abstract
Naturalistic textures with an intermediate degree of statistical regularity can capture key structural features of natural images (Freeman and Simoncelli, 2011). V2 and later visual areas are sensitive to these features, while primary visual cortex is not (Freeman et al., 2013). Here we expand on this work by investigating a class of textures that have maximal formal regularity, the 17 crystallographic wallpaper groups (Fedorov, 1891). We used texture stimuli from four of the groups that differ in the maximum order of rotation symmetry they contain, and measured neural responses in human participants using functional MRI and high-density EEG. We found that cortical area V3 has a parametric representation of the rotation symmetries in the textures that is not present in either V1 or V2, the first discovery of a stimulus property that differentiates processing in V3 from that of lower-level areas. Parametric responses were also seen in higher-order ventral stream areas V4, VO1, and lateral occipital complex (LOC), but not in dorsal stream areas. The parametric response pattern was replicated in the EEG data, and source localization indicated that responses in V3 and V4 lead responses in LOC, which is consistent with a feedforward mechanism. Finally, we presented our stimuli to four well developed feedforward models and found that none of them were able to account for our results. Our results highlight structural regularity as an important stimulus dimension for distinguishing the early stages of visual processing, and suggest a previously unrecognized role for V3 in the visual form-processing hierarchy. Significance statement: Hierarchical processing is a fundamental organizing principle in visual neuroscience, with each successive processing stage being sensitive to increasingly complex stimulus properties. Here, we probe the encoding hierarchy in human visual cortex using a class of visual textures--wallpaper patterns--that are maximally regular. Through a combination of fMRI and EEG source imaging, we find specific responses to texture regularity that depend parametrically on the maximum order of rotation symmetry in the textures. These parametric responses are seen in several areas of the ventral visual processing stream, as well as in area V3, but not in V1 or V2. This is the first demonstration of a stimulus property that differentiates processing in V3 from that of lower-level visual areas.
Collapse
|
44
|
Naselaris T, Kay KN. Resolving Ambiguities of MVPA Using Explicit Models of Representation. Trends Cogn Sci 2016; 19:551-554. [PMID: 26412094 DOI: 10.1016/j.tics.2015.07.005] [Citation(s) in RCA: 75] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2015] [Revised: 07/14/2015] [Accepted: 07/20/2015] [Indexed: 11/19/2022]
Abstract
We advocate a shift in emphasis within cognitive neuroscience from multivariate pattern analysis (MVPA) to the design and testing of explicit models of neural representation. With such models, it becomes possible to identify the specific representations encoded in patterns of brain activity and to map them across the brain.
Collapse
|
45
|
Larsson J, Solomon SG, Kohn A. fMRI adaptation revisited. Cortex 2015; 80:154-60. [PMID: 26703375 DOI: 10.1016/j.cortex.2015.10.026] [Citation(s) in RCA: 53] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2015] [Accepted: 10/29/2015] [Indexed: 10/22/2022]
Abstract
Adaptation has been widely used in functional magnetic imaging (fMRI) studies to infer neuronal response properties in human cortex. fMRI adaptation has been criticized because of the complex relationship between fMRI adaptation effects and the multiple neuronal effects that could underlie them. Many of the longstanding concerns about fMRI adaptation have received empirical support from neurophysiological studies over the last decade. We review these studies here, and also consider neuroimaging studies that have investigated how fMRI adaptation effects are influenced by high-level perceptual processes. The results of these studies further emphasize the need to interpret fMRI adaptation results with caution, but they also provide helpful guidance for more accurate interpretation and better experimental design. In addition, we argue that rather than being used as a proxy for measurements of neuronal stimulus selectivity, fMRI adaptation may be most useful for studying population-level adaptation effects across cortical processing hierarchies.
Collapse
Affiliation(s)
- Jonas Larsson
- Department of Psychology, Royal Holloway, University of London, Egham, UK.
| | - Samuel G Solomon
- Department of Experimental Psychology, University College London, London, UK.
| | - Adam Kohn
- Dominick Purpura Department of Neuroscience, Albert Einstein College of Medicine, Bronx, NY, USA; Department of Ophthalmology and Visual Sciences, Albert Einstein College of Medicine, Bronx, NY, USA.
| |
Collapse
|
46
|
Vanni S, Sharifian F, Heikkinen H, Vigário R. Modeling fMRI signals can provide insights into neural processing in the cerebral cortex. J Neurophysiol 2015; 114:768-80. [PMID: 25972586 DOI: 10.1152/jn.00332.2014] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2014] [Accepted: 05/04/2015] [Indexed: 12/16/2022] Open
Abstract
Every stimulus or task activates multiple areas in the mammalian cortex. These distributed activations can be measured with functional magnetic resonance imaging (fMRI), which has the best spatial resolution among the noninvasive brain imaging methods. Unfortunately, the relationship between the fMRI activations and distributed cortical processing has remained unclear, both because the coupling between neural and fMRI activations has remained poorly understood and because fMRI voxels are too large to directly sense the local neural events. To get an idea of the local processing given the macroscopic data, we need models to simulate the neural activity and to provide output that can be compared with fMRI data. Such models can describe neural mechanisms as mathematical functions between input and output in a specific system, with little correspondence to physiological mechanisms. Alternatively, models can be biomimetic, including biological details with straightforward correspondence to experimental data. After careful balancing between complexity, computational efficiency, and realism, a biomimetic simulation should be able to provide insight into how biological structures or functions contribute to actual data processing as well as to promote theory-driven neuroscience experiments. This review analyzes the requirements for validating system-level computational models with fMRI. In particular, we study mesoscopic biomimetic models, which include a limited set of details from real-life networks and enable system-level simulations of neural mass action. In addition, we discuss how recent developments in neurophysiology and biophysics may significantly advance the modelling of fMRI signals.
Collapse
Affiliation(s)
- Simo Vanni
- Clinical Neurosciences, Neurology, University of Helsinki and Helsinki University Hospital, Helsinki, Finland;
| | - Fariba Sharifian
- Clinical Neurosciences, Neurology, University of Helsinki and Helsinki University Hospital, Helsinki, Finland; Department of Neuroscience and Biomedical Engineering, School of Science, Aalto University, Espoo, Finland; Advanced Magnetic Imaging Centre, Aalto Neuroimaging, School of Science, Aalto University, Espoo, Finland; and
| | - Hanna Heikkinen
- Department of Neuroscience and Biomedical Engineering, School of Science, Aalto University, Espoo, Finland; Advanced Magnetic Imaging Centre, Aalto Neuroimaging, School of Science, Aalto University, Espoo, Finland; and
| | - Ricardo Vigário
- Department Computer Science, School of Science, Aalto University, Espoo, Finland
| |
Collapse
|
47
|
Henriksson L, Khaligh-Razavi SM, Kay K, Kriegeskorte N. Visual representations are dominated by intrinsic fluctuations correlated between areas. Neuroimage 2015; 114:275-86. [PMID: 25896934 PMCID: PMC4503804 DOI: 10.1016/j.neuroimage.2015.04.026] [Citation(s) in RCA: 41] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2014] [Revised: 02/27/2015] [Accepted: 04/08/2015] [Indexed: 12/05/2022] Open
Abstract
Intrinsic cortical dynamics are thought to underlie trial-to-trial variability of visually evoked responses in animal models. Understanding their function in the context of sensory processing and representation is a major current challenge. Here we report that intrinsic cortical dynamics strongly affect the representational geometry of a brain region, as reflected in response-pattern dissimilarities, and exaggerate the similarity of representations between brain regions. We characterized the representations in several human visual areas by representational dissimilarity matrices (RDMs) constructed from fMRI response-patterns for natural image stimuli. The RDMs of different visual areas were highly similar when the response-patterns were estimated on the basis of the same trials (sharing intrinsic cortical dynamics), and quite distinct when patterns were estimated on the basis of separate trials (sharing only the stimulus-driven component). We show that the greater similarity of the representational geometries can be explained by coherent fluctuations of regional-mean activation within visual cortex, reflecting intrinsic dynamics. Using separate trials to study stimulus-driven representations revealed clearer distinctions between the representational geometries: a Gabor wavelet pyramid model explained representational geometry in visual areas V1–3 and a categorical animate–inanimate model in the object-responsive lateral occipital cortex.
Collapse
Affiliation(s)
- Linda Henriksson
- MRC Cognition and Brain Sciences Unit, Cambridge CB2 7EF, UK; Brain Research Unit, Department of Neuroscience and Biomedical Engineering, Aalto University, 02150 Espoo, Finland.
| | - Seyed-Mahdi Khaligh-Razavi
- MRC Cognition and Brain Sciences Unit, Cambridge CB2 7EF, UK; Computer Science & Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | - Kendrick Kay
- Department of Psychology, Washington University in St. Louis, St. Louis, MO 63130, USA
| | | |
Collapse
|
48
|
Wandell BA, Winawer J. Computational neuroimaging and population receptive fields. Trends Cogn Sci 2015; 19:349-57. [PMID: 25850730 DOI: 10.1016/j.tics.2015.03.009] [Citation(s) in RCA: 126] [Impact Index Per Article: 12.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2015] [Revised: 03/09/2015] [Accepted: 03/16/2015] [Indexed: 10/23/2022]
Abstract
Functional magnetic resonance imaging (fMRI) noninvasively measures human brain activity at millimeter resolution. Scientists use different approaches to take advantage of the remarkable opportunities presented by fMRI. Here, we describe progress using the computational neuroimaging approach in human visual cortex, which aims to build models that predict the neural responses from the stimulus and task. We focus on a particularly active area of research, the use of population receptive field (pRF) models to characterize human visual cortex responses to a range of stimuli, in a variety of tasks and different subject populations.
Collapse
Affiliation(s)
- Brian A Wandell
- Psychology Department and Neurosciences Institute, Stanford University, Stanford, CA, USA.
| | - Jonathan Winawer
- Psychology Department and Center for Neural Science, New York University, New York, NY, USA.
| |
Collapse
|
49
|
Abstract
The ventral surface of the human occipital lobe contains multiple retinotopic maps. The most posterior of these maps is considered a potential homolog of macaque V4, and referred to as human V4 ("hV4"). The location of the hV4 map, its retinotopic organization, its role in visual encoding, and the cortical areas it borders have been the subject of considerable investigation and debate over the last 25 years. We review the history of this map and adjacent maps in ventral occipital cortex, and consider the different hypotheses for how these ventral occipital maps are organized. Advances in neuroimaging, computational modeling, and characterization of the nearby anatomical landmarks and functional brain areas have improved our understanding of where human V4 is and what kind of visual representations it contains.
Collapse
Affiliation(s)
- Jonathan Winawer
- Department of Psychology and Center for Neural Science,New York University,New York,New York 10003
| | - Nathan Witthoft
- Department of Psychology,Stanford University,Stanford,California 94305
| |
Collapse
|
50
|
Horiguchi H, Wandell BA, Winawer J. A Predominantly Visual Subdivision of The Right Temporo-Parietal Junction (vTPJ). Cereb Cortex 2014; 26:639-646. [PMID: 25267856 DOI: 10.1093/cercor/bhu226] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
A multiplicity of sensory and cognitive functions has been attributed to the large cortical region at the temporo-parietal junction (TPJ). Using functional MRI, we report that a small region lateralized within the right TPJ responds robustly to certain simple visual stimuli ("vTPJ"). The vTPJ was found in all right hemispheres (n = 7), posterior to the auditory cortex. To manipulate stimuli and attention, subjects were presented with a mixture of visual and auditory stimuli in a concurrent block design in 2 experiments: (1) A simple visual stimulus (a grating pattern modulating in mean luminance) elicited robust responses in the vTPJ, whether or not the subject attended to vision and(2) a drifting low-contrast dartboard pattern of constant mean luminance evoked robust responses in the vTPJ when it was task-relevant (visual task), and smaller responses when it was not (auditory task). The results suggest a focal, visually responsive region within the right TPJ that is powerfully driven by certain visual stimuli (luminance fluctuations), and that can be driven by other visual stimuli when the subject is attending. The precise localization of this visually responsive region is helpful in segmenting the TPJ and to better understand its role in visual awareness and related disorders such as extinction and neglect.
Collapse
Affiliation(s)
- Hiroshi Horiguchi
- Department of Psychology.,Department of Ophthalmology, Jikei University, School of Medicine, Minato, Tokyo, Japan
| | - Brian A Wandell
- Department of Psychology.,Center for Cognitive and Neurobiological Imaging, Stanford University, Stanford, CA, USA
| | - Jonathan Winawer
- Department of Psychology and Center for Neural Science, New York University, New York, NY, USA
| |
Collapse
|