1
|
Gamal M, Eldawlatly S. High-level visual processing in the lateral geniculate nucleus revealed using goal-driven deep learning. J Neurosci Methods 2025; 418:110429. [PMID: 40122470 DOI: 10.1016/j.jneumeth.2025.110429] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2024] [Revised: 03/03/2025] [Accepted: 03/13/2025] [Indexed: 03/25/2025]
Abstract
BACKGROUND The Lateral Geniculate Nucleus (LGN) is an essential contributor to high-level visual processing despite being an early subcortical area in the visual system. Current LGN computational models focus on its basic properties, with less emphasis on its role in high-level vision. NEW METHOD We propose a high-level approach for encoding mouse LGN neural responses to natural scenes. This approach employs two deep neural networks (DNNs); namely VGG16 and ResNet50, as goal-driven models. We use these models as tools to better understand visual features encoded in the LGN. RESULTS Early layers of the DNNs represent the best LGN models. We also demonstrate that numerosity, as a high-level visual feature, is encoded, along with other visual features, in LGN neural activity. Results demonstrate that intermediate layers are better in representing numerosity compared to early layers. Early layers are better at predicting simple visual features, while intermediate layers are better at predicting more complex features. Finally, we show that an ensemble model of an early and an intermediate layer achieves high neural prediction accuracy and numerosity representation. COMPARISON WITH EXISTING METHOD(S) Our approach emphasizes the role of analyzing the inner workings of DNNs to demonstrate the representation of a high-level feature such as numerosity in the LGN, as opposed to the common belief about the simplicity of the LGN. CONCLUSIONS We demonstrate that goal-driven DNNs can be used as high-level vision models of the LGN for neural prediction and as an exploration tool to better understand the role of the LGN.
Collapse
Affiliation(s)
- Mai Gamal
- Computer Science and Engineering Department, German University in Cairo, Cairo 11835, Egypt.
| | - Seif Eldawlatly
- Computer and Systems Engineering Department, Ain Shams University, Cairo 11517, Egypt; Computer Science and Engineering Department, The American University in Cairo, Cairo 11835, Egypt.
| |
Collapse
|
2
|
An NM, Roh H, Kim S, Kim JH, Im M. Machine Learning Techniques for Simulating Human Psychophysical Testing of Low-Resolution Phosphene Face Images in Artificial Vision. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2025; 12:e2405789. [PMID: 39985243 PMCID: PMC12005743 DOI: 10.1002/advs.202405789] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/27/2024] [Revised: 01/18/2025] [Indexed: 02/24/2025]
Abstract
To evaluate the quality of artificial visual percepts generated by emerging methodologies, researchers often rely on labor-intensive and tedious human psychophysical experiments. These experiments necessitate repeated iterations upon any major/minor modifications in the hardware/software configurations. Here, the capacity of standard machine learning (ML) models is investigated to accurately replicate quaternary match-to-sample tasks using low-resolution facial images represented by arrays of phosphenes as input stimuli. Initially, the performance of the ML models trained to approximate innate human facial recognition abilities across a dataset comprising 3600 phosphene images of human faces is analyzed. Subsequently, due to the time constraints and the potential for subject fatigue, the psychophysical test is limited to presenting only 720 low-resolution phosphene images to 36 human subjects. Notably, the superior model adeptly mirrors the behavioral trend of human subjects, offering precise predictions for 8 out of 9 phosphene quality levels on the overlapping test queries. Subsequently, human recognition performances for untested phosphene images are predicted, streamlining the process and minimizing the need for additional psychophysical tests. The findings underscore the transformative potential of ML in reshaping the research paradigm of visual prosthetics, facilitating the expedited advancement of prostheses.
Collapse
Affiliation(s)
- Na Min An
- Brain Science InstituteKorea Institute of Science and Technology (KIST)Seoul02792Republic of Korea
- Present address:
Kim Jaechul Graduate School of AIKAISTSeoul02455Republic of Korea
| | - Hyeonhee Roh
- Brain Science InstituteKorea Institute of Science and Technology (KIST)Seoul02792Republic of Korea
| | - Sein Kim
- Brain Science InstituteKorea Institute of Science and Technology (KIST)Seoul02792Republic of Korea
| | - Jae Hun Kim
- Brain Science InstituteKorea Institute of Science and Technology (KIST)Seoul02792Republic of Korea
- Sensor System Research CenterAdvanced Materials and Systems Research DivisionKISTSeoul02792Republic of Korea
| | - Maesoon Im
- Brain Science InstituteKorea Institute of Science and Technology (KIST)Seoul02792Republic of Korea
- Division of Bio‐Medical Science and TechnologyUniversity of Science and Technology (UST)Seoul02792Republic of Korea
- KHU‐KIST Department of Converging Science and TechnologyKyung Hee UniversitySeoul02447Republic of Korea
| |
Collapse
|
3
|
Lorenzi E, Kobylkov D, Vallortigara G. Is there an innate sense of number in the brain? Cereb Cortex 2025; 35:bhaf004. [PMID: 39932126 DOI: 10.1093/cercor/bhaf004] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2024] [Revised: 11/07/2024] [Accepted: 01/09/2025] [Indexed: 05/08/2025] Open
Abstract
The approximate number system or «sense of number» is a crucial, presymbolic mechanism enabling animals to estimate quantities, which is essential for survival in various contexts (eg estimating numerosities of social companions, prey, predators, and so on). Behavioral studies indicate that a sense of number is widespread across vertebrates and invertebrates. Specific brain regions such as the intraparietal sulcus and prefrontal cortex in primates, or equivalent areas in birds and fish, are involved in numerical estimation, and their activity is modulated by the ratio of quantities. Data gathered across species strongly suggest similar evolutionary pressures for number estimation pointing to a likely common origin, at least across vertebrates. On the other hand, few studies have investigated the origins of the sense of number. Recent findings, however, have shown that numerosity-selective neurons exist in newborn animals, such as domestic chicks and zebrafish, supporting the hypothesis of an innateness of the approximate number system. Control-rearing experiments on visually naïve animals further support the notion that the sense of number is innate and does not need any specific instructive experience in order to be triggered.
Collapse
Affiliation(s)
- Elena Lorenzi
- Centre for Mind/Brain Sciences, University of Trento, Piazza della Manifattura 1, Rovereto, TN 30868, Italy
| | - Dmitry Kobylkov
- Centre for Mind/Brain Sciences, University of Trento, Piazza della Manifattura 1, Rovereto, TN 30868, Italy
| | - Giorgio Vallortigara
- Centre for Mind/Brain Sciences, University of Trento, Piazza della Manifattura 1, Rovereto, TN 30868, Italy
| |
Collapse
|
4
|
Abstract
The human brain possesses neural networks and mechanisms enabling the representation of numbers, basic arithmetic operations, and mathematical reasoning. Without the ability to represent numerical quantity and perform calculations, our scientifically and technically advanced culture would not exist. However, the origins of numerical abilities are grounded in an intuitive understanding of quantity deeply rooted in biology. Nevertheless, more advanced symbolic arithmetic skills require a cultural background with formal mathematical education. In the past two decades, cognitive neuroscience has seen significant progress in understanding the workings of the calculating brain through various methods and model systems. This review begins by exploring the mental and neuronal representations of nonsymbolic numerical quantity and then progresses to symbolic representations acquired in childhood. During arithmetic operations (addition, subtraction, multiplication, and division), these representations are processed and transformed according to arithmetic rules and principles, leveraging different mental strategies and types of arithmetic knowledge that can be dissociated in the brain. Although it was once believed that number processing and calculation originated from the language faculty, it is now evident that mathematical and linguistic abilities are primarily processed independently in the brain. Understanding how the healthy brain processes numerical information is crucial for gaining insights into debilitating numerical disorders, including acquired conditions like acalculia and learning-related calculation disorders such as developmental dyscalculia.
Collapse
Affiliation(s)
- Andreas Nieder
- Animal Physiology Unit, Institute of Neurobiology, University of Tübingen, Tübingen, Germany
| |
Collapse
|
5
|
Thompson JAF, Sheahan H, Dumbalska T, Sandbrink JD, Piazza M, Summerfield C. Zero-shot counting with a dual-stream neural network model. Neuron 2024; 112:4147-4158.e5. [PMID: 39488209 DOI: 10.1016/j.neuron.2024.10.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2024] [Revised: 08/06/2024] [Accepted: 10/06/2024] [Indexed: 11/04/2024]
Abstract
To understand a visual scene, observers need to both recognize objects and encode relational structure. For example, a scene comprising three apples requires the observer to encode concepts of "apple" and "three." In the primate brain, these functions rely on dual (ventral and dorsal) processing streams. Object recognition in primates has been successfully modeled with deep neural networks, but how scene structure (including numerosity) is encoded remains poorly understood. Here, we built a deep learning model, based on the dual-stream architecture of the primate brain, which is able to count items "zero-shot"-even if the objects themselves are unfamiliar. Our dual-stream network forms spatial response fields and lognormal number codes that resemble those observed in the macaque posterior parietal cortex. The dual-stream network also makes successful predictions about human counting behavior. Our results provide evidence for an enactive theory of the role of the posterior parietal cortex in visual scene understanding.
Collapse
Affiliation(s)
- Jessica A F Thompson
- Department of Experimental Psychology, University of Oxford, Oxford OX2 6GG, UK.
| | - Hannah Sheahan
- Department of Experimental Psychology, University of Oxford, Oxford OX2 6GG, UK
| | | | - Julian D Sandbrink
- Department of Experimental Psychology, University of Oxford, Oxford OX2 6GG, UK
| | - Manuela Piazza
- University of Trento, Department of Psychology and Cognitive Science, Trento 38068, Italy
| | | |
Collapse
|
6
|
Hou K, Zorzi M, Testolin A. Estimating the distribution of numerosity and non-numerical visual magnitudes in natural scenes using computer vision. PSYCHOLOGICAL RESEARCH 2024; 89:31. [PMID: 39625570 DOI: 10.1007/s00426-024-02064-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2024] [Accepted: 11/16/2024] [Indexed: 03/04/2025]
Abstract
Humans share with many animal species the ability to perceive and approximately represent the number of objects in visual scenes. This ability improves throughout childhood, suggesting that learning and development play a key role in shaping our number sense. This hypothesis is further supported by computational investigations based on deep learning, which have shown that numerosity perception can spontaneously emerge in neural networks that learn the statistical structure of images with a varying number of items. However, neural network models are usually trained using synthetic datasets that might not faithfully reflect the statistical structure of natural environments, and there is also growing interest in using more ecological visual stimuli to investigate numerosity perception in humans. In this work, we exploit recent advances in computer vision algorithms to design and implement an original pipeline that can be used to estimate the distribution of numerosity and non-numerical magnitudes in large-scale datasets containing thousands of real images depicting objects in daily life situations. We show that in natural visual scenes the frequency of appearance of different numerosities follows a power law distribution. Moreover, we show that the correlational structure for numerosity and continuous magnitudes is stable across datasets and scene types (homogeneous vs. heterogeneous object sets). We suggest that considering such "ecological" pattern of covariance is important to understand the influence of non-numerical visual cues on numerosity judgements.
Collapse
Affiliation(s)
- Kuinan Hou
- Department of General Psychology, University of Padova, Padua, Italy
| | - Marco Zorzi
- Department of General Psychology, University of Padova, Padua, Italy
- IRCCS San Camillo Hospital, Lido, VE, Italy
| | - Alberto Testolin
- Department of General Psychology, University of Padova, Padua, Italy.
- Department of Mathematics, University of Padova, Padua, Italy.
| |
Collapse
|
7
|
Kosinski M. Evaluating large language models in theory of mind tasks. Proc Natl Acad Sci U S A 2024; 121:e2405460121. [PMID: 39471222 PMCID: PMC11551352 DOI: 10.1073/pnas.2405460121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2024] [Accepted: 09/23/2024] [Indexed: 11/01/2024] Open
Abstract
Eleven large language models (LLMs) were assessed using 40 bespoke false-belief tasks, considered a gold standard in testing theory of mind (ToM) in humans. Each task included a false-belief scenario, three closely matched true-belief control scenarios, and the reversed versions of all four. An LLM had to solve all eight scenarios to solve a single task. Older models solved no tasks; Generative Pre-trained Transformer (GPT)-3-davinci-003 (from November 2022) and ChatGPT-3.5-turbo (from March 2023) solved 20% of the tasks; ChatGPT-4 (from June 2023) solved 75% of the tasks, matching the performance of 6-y-old children observed in past studies. We explore the potential interpretation of these results, including the intriguing possibility that ToM-like ability, previously considered unique to humans, may have emerged as an unintended by-product of LLMs' improving language skills. Regardless of how we interpret these outcomes, they signify the advent of more powerful and socially skilled AI-with profound positive and negative implications.
Collapse
Affiliation(s)
- Michal Kosinski
- Graduate School of Business, Stanford University, Stanford, CA94305
| |
Collapse
|
8
|
Croteau J, Fornaciai M, Huber DE, Park J. The divisive normalization model of visual number sense: model predictions and experimental confirmation. Cereb Cortex 2024; 34:bhae418. [PMID: 39441025 DOI: 10.1093/cercor/bhae418] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2024] [Revised: 09/29/2024] [Accepted: 10/02/2024] [Indexed: 10/25/2024] Open
Abstract
Our intuitive sense of number allows rapid estimation for the number of objects (numerosity) in a scene. How does the continuous nature of neural information processing create a discrete representation of number? A neurocomputational model with divisive normalization explains this process and existing data; however, a successful model should not only explain existing data but also generate novel predictions. Here, we experimentally test novel predictions of this model to evaluate its merit for explaining mechanisms of numerosity perception. We did so by consideration of the coherence illusion: the underestimation of number for arrays containing heterogeneous compared to homogeneous items. First, we established the existence of the coherence illusion for homogeneity manipulations of both area and orientation of items in an array. Second, despite the behavioral similarity, the divisive normalization model predicted that these two illusions should reflect activity in different stages of visual processing. Finally, visual evoked potentials from an electroencephalography experiment confirmed these predictions, showing that area and orientation coherence modulate brain responses at distinct latencies and topographies. These results demonstrate the utility of the divisive normalization model for explaining numerosity perception, according to which numerosity perception is a byproduct of canonical neurocomputations that exist throughout the visual pathway.
Collapse
Affiliation(s)
- Jenna Croteau
- Department of Psychological and Brain Sciences, University of Massachusetts Amherst, 135 Hicks Way, Amherst, MA 01003, United States
| | - Michele Fornaciai
- Institute for Research in Psychology (IPSY) and Institute of Neuroscience (IoNS), Université Catholique de Louvain, Place du Cardinal Mercier 10, Louvain-la-Neuve, 1348, Belgium
| | - David E Huber
- Department of Psychology and Neuroscience, University of Colorado Boulder, Muenzinger D244, 345 UCB, Boulder, CO 80309, United States
| | - Joonkoo Park
- Department of Psychological and Brain Sciences, University of Massachusetts Amherst, 135 Hicks Way, Amherst, MA 01003, United States
- Commonwealth Honors College, University of Massachusetts Amherst, 157 Commonwealth Avenue, Amherst, MA 01003, United States
| |
Collapse
|
9
|
Kumar S, Sumers TR, Yamakoshi T, Goldstein A, Hasson U, Norman KA, Griffiths TL, Hawkins RD, Nastase SA. Shared functional specialization in transformer-based language models and the human brain. Nat Commun 2024; 15:5523. [PMID: 38951520 PMCID: PMC11217339 DOI: 10.1038/s41467-024-49173-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2023] [Accepted: 05/24/2024] [Indexed: 07/03/2024] Open
Abstract
When processing language, the brain is thought to deploy specialized computations to construct meaning from complex linguistic structures. Recently, artificial neural networks based on the Transformer architecture have revolutionized the field of natural language processing. Transformers integrate contextual information across words via structured circuit computations. Prior work has focused on the internal representations ("embeddings") generated by these circuits. In this paper, we instead analyze the circuit computations directly: we deconstruct these computations into the functionally-specialized "transformations" that integrate contextual information across words. Using functional MRI data acquired while participants listened to naturalistic stories, we first verify that the transformations account for considerable variance in brain activity across the cortical language network. We then demonstrate that the emergent computations performed by individual, functionally-specialized "attention heads" differentially predict brain activity in specific cortical regions. These heads fall along gradients corresponding to different layers and context lengths in a low-dimensional cortical space.
Collapse
Affiliation(s)
- Sreejan Kumar
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ, 08540, USA.
| | - Theodore R Sumers
- Department of Computer Science, Princeton University, Princeton, NJ, 08540, USA.
| | - Takateru Yamakoshi
- Faculty of Medicine, The University of Tokyo, Bunkyo-ku, Tokyo, 113-0033, Japan
| | - Ariel Goldstein
- Department of Cognitive and Brain Sciences and Business School, Hebrew University, Jerusalem, 9190401, Israel
| | - Uri Hasson
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ, 08540, USA
- Department of Psychology, Princeton University, Princeton, NJ, 08540, USA
| | - Kenneth A Norman
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ, 08540, USA
- Department of Psychology, Princeton University, Princeton, NJ, 08540, USA
| | - Thomas L Griffiths
- Department of Computer Science, Princeton University, Princeton, NJ, 08540, USA
- Department of Psychology, Princeton University, Princeton, NJ, 08540, USA
| | - Robert D Hawkins
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ, 08540, USA
- Department of Psychology, Princeton University, Princeton, NJ, 08540, USA
| | - Samuel A Nastase
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ, 08540, USA.
| |
Collapse
|
10
|
Strock A, Mistry PK, Menon V. Digital twins for understanding mechanisms of learning disabilities: Personalized deep neural networks reveal impact of neuronal hyperexcitability. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.04.29.591409. [PMID: 38746231 PMCID: PMC11092492 DOI: 10.1101/2024.04.29.591409] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/16/2024]
Abstract
Learning disabilities affect a significant proportion of children worldwide, with far-reaching consequences for their academic, professional, and personal lives. Here we develop digital twins - biologically plausible personalized Deep Neural Networks (pDNNs) - to investigate the neurophysiological mechanisms underlying learning disabilities in children. Our pDNN reproduces behavioral and neural activity patterns observed in affected children, including lower performance accuracy, slower learning rates, neural hyper-excitability, and reduced neural differentiation of numerical problems. Crucially, pDNN models reveal aberrancies in the geometry of manifold structure, providing a comprehensive view of how neural excitability influences both learning performance and the internal structure of neural representations. Our findings not only advance knowledge of the neurophysiological underpinnings of learning differences but also open avenues for targeted, personalized strategies designed to bridge cognitive gaps in affected children. This work reveals the power of digital twins integrating AI and neuroscience to uncover mechanisms underlying neurodevelopmental disorders.
Collapse
Affiliation(s)
- Anthony Strock
- Department of Psychiatry & Behavioral Sciences, Stanford University School of Medicine, Stanford, CA 94305
| | - Percy K. Mistry
- Department of Psychiatry & Behavioral Sciences, Stanford University School of Medicine, Stanford, CA 94305
| | - Vinod Menon
- Department of Psychiatry & Behavioral Sciences, Stanford University School of Medicine, Stanford, CA 94305
- Department of Neurology & Neurological Sciences, Stanford University School of Medicine, Stanford, CA 94305
- Wu Tsai Neurosciences Institute, Stanford University School of Medicine, Stanford, CA 94305
| |
Collapse
|
11
|
Kondapaneni N, Perona P. A number sense as an emergent property of the manipulating brain. Sci Rep 2024; 14:6858. [PMID: 38514690 PMCID: PMC10958013 DOI: 10.1038/s41598-024-56828-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2023] [Accepted: 03/12/2024] [Indexed: 03/23/2024] Open
Abstract
The ability to understand and manipulate numbers and quantities emerges during childhood, but the mechanism through which humans acquire and develop this ability is still poorly understood. We explore this question through a model, assuming that the learner is able to pick up and place small objects from, and to, locations of its choosing, and will spontaneously engage in such undirected manipulation. We further assume that the learner's visual system will monitor the changing arrangements of objects in the scene and will learn to predict the effects of each action by comparing perception with a supervisory signal from the motor system. We model perception using standard deep networks for feature extraction and classification. Our main finding is that, from learning the task of action prediction, an unexpected image representation emerges exhibiting regularities that foreshadow the perception and representation of numbers and quantity. These include distinct categories for zero and the first few natural numbers, a strict ordering of the numbers, and a one-dimensional signal that correlates with numerical quantity. As a result, our model acquires the ability to estimate numerosity, i.e. the number of objects in the scene, as well as subitization, i.e. the ability to recognize at a glance the exact number of objects in small scenes. Remarkably, subitization and numerosity estimation extrapolate to scenes containing many objects, far beyond the three objects used during training. We conclude that important aspects of a facility with numbers and quantities may be learned with supervision from a simple pre-training task. Our observations suggest that cross-modal learning is a powerful learning mechanism that may be harnessed in artificial intelligence.
Collapse
|
12
|
Liu P, Bo K, Ding M, Fang R. Emergence of Emotion Selectivity in Deep Neural Networks Trained to Recognize Visual Objects. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.04.16.537079. [PMID: 37163104 PMCID: PMC10168209 DOI: 10.1101/2023.04.16.537079] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/11/2023]
Abstract
Recent neuroimaging studies have shown that the visual cortex plays an important role in representing the affective significance of visual input. The origin of these affect-specific visual representations is debated: they are intrinsic to the visual system versus they arise through reentry from frontal emotion processing structures such as the amygdala. We examined this problem by combining convolutional neural network (CNN) models of the human ventral visual cortex pre-trained on ImageNet with two datasets of affective images. Our results show that (1) in all layers of the CNN models, there were artificial neurons that responded consistently and selectively to neutral, pleasant, or unpleasant images and (2) lesioning these neurons by setting their output to 0 or enhancing these neurons by increasing their gain led to decreased or increased emotion recognition performance respectively. These results support the idea that the visual system may have the intrinsic ability to represent the affective significance of visual input and suggest that CNNs offer a fruitful platform for testing neuroscientific theories.
Collapse
Affiliation(s)
- Peng Liu
- J. Crayton Pruitt Family Department of Biomedical Engineering, Herbert Wertheim College of Engineering, University of Florida, Gainesville, FL, USA
- Department of Psychological and Brain Sciences, Dartmouth College, Hanover, NH, USA
| | - Ke Bo
- Department of Psychological and Brain Sciences, Dartmouth College, Hanover, NH, USA
| | - Mingzhou Ding
- J. Crayton Pruitt Family Department of Biomedical Engineering, Herbert Wertheim College of Engineering, University of Florida, Gainesville, FL, USA
| | - Ruogu Fang
- J. Crayton Pruitt Family Department of Biomedical Engineering, Herbert Wertheim College of Engineering, University of Florida, Gainesville, FL, USA
- Center for Cognitive Aging and Memory, McKnight Brain Institute, University of Florida, Gainesville, FL, USA
| |
Collapse
|
13
|
Liu P, Bo K, Ding M, Fang R. Emergence of Emotion Selectivity in Deep Neural Networks Trained to Recognize Visual Objects. PLoS Comput Biol 2024; 20:e1011943. [PMID: 38547053 PMCID: PMC10977720 DOI: 10.1371/journal.pcbi.1011943] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2023] [Accepted: 02/24/2024] [Indexed: 04/02/2024] Open
Abstract
Recent neuroimaging studies have shown that the visual cortex plays an important role in representing the affective significance of visual input. The origin of these affect-specific visual representations is debated: they are intrinsic to the visual system versus they arise through reentry from frontal emotion processing structures such as the amygdala. We examined this problem by combining convolutional neural network (CNN) models of the human ventral visual cortex pre-trained on ImageNet with two datasets of affective images. Our results show that in all layers of the CNN models, there were artificial neurons that responded consistently and selectively to neutral, pleasant, or unpleasant images and lesioning these neurons by setting their output to zero or enhancing these neurons by increasing their gain led to decreased or increased emotion recognition performance respectively. These results support the idea that the visual system may have the intrinsic ability to represent the affective significance of visual input and suggest that CNNs offer a fruitful platform for testing neuroscientific theories.
Collapse
Affiliation(s)
- Peng Liu
- J. Crayton Pruitt Family Department of Biomedical Engineering, Herbert Wertheim College of Engineering, University of Florida, Gainesville, Florida, United States of America
- Department of Psychological and Brain Sciences, Dartmouth College, Hanover, New Hampshire, United States of America
| | - Ke Bo
- Department of Psychological and Brain Sciences, Dartmouth College, Hanover, New Hampshire, United States of America
| | - Mingzhou Ding
- J. Crayton Pruitt Family Department of Biomedical Engineering, Herbert Wertheim College of Engineering, University of Florida, Gainesville, Florida, United States of America
| | - Ruogu Fang
- J. Crayton Pruitt Family Department of Biomedical Engineering, Herbert Wertheim College of Engineering, University of Florida, Gainesville, Florida, United States of America
- Center for Cognitive Aging and Memory, McKnight Brain Institute, University of Florida, Gainesville, Florida, United States of America
| |
Collapse
|
14
|
Liang T, Peng RC, Rong KL, Li JX, Ke Y, Yung WH. Disparate processing of numerosity and associated continuous magnitudes in rats. SCIENCE ADVANCES 2024; 10:eadj2566. [PMID: 38381814 PMCID: PMC10881051 DOI: 10.1126/sciadv.adj2566] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/15/2023] [Accepted: 01/18/2024] [Indexed: 02/23/2024]
Abstract
The studies of number sense in different species are severely hampered by the inevitable entanglement of non-numerical attributes inherent in nonsymbolic stimuli representing numerosity, resulting in contrasting theories of numerosity processing. Here, we developed an algorithm and associated analytical methods to generate stimuli that not only minimized the impact of non-numerical magnitudes in numerosity perception but also allowed their quantification. We trained number-naïve rats with these stimuli as sound pulses representing two or three numbers and demonstrated that their numerical discrimination ability mainly relied on numerosity. Also, studying the learning process revealed that rats used numerosity before using magnitudes for choices. This numerical processing could be impaired specifically by silencing the posterior parietal cortex. Furthermore, modeling this capacity by neural networks shed light on the separation of numerosity and magnitudes extraction. Our study helps dissect the relationship between magnitude and numerosity processing, and the above different findings together affirm the independent existence of innate number and magnitudes sense in rats.
Collapse
Affiliation(s)
- Tuo Liang
- School of Biomedical Sciences, Faculty of Medicine, The Chinese University of Hong Kong, Hong Kong, China
| | - Rong-Chao Peng
- School of Biomedical Sciences, Faculty of Medicine, The Chinese University of Hong Kong, Hong Kong, China
- School of Biomedical Engineering, Guangdong Medical University, Dongguan, Guangdong, China
| | - Kang-Lin Rong
- School of Biomedical Sciences, Faculty of Medicine, The Chinese University of Hong Kong, Hong Kong, China
| | - Jia-Xin Li
- School of Biomedical Sciences, Faculty of Medicine, The Chinese University of Hong Kong, Hong Kong, China
| | - Ya Ke
- School of Biomedical Sciences, Faculty of Medicine, The Chinese University of Hong Kong, Hong Kong, China
| | - Wing-Ho Yung
- School of Biomedical Sciences, Faculty of Medicine, The Chinese University of Hong Kong, Hong Kong, China
- Department of Neuroscience, City University of Hong Kong, Hong Kong, China
| |
Collapse
|
15
|
Noda K, Soda T, Yamashita Y. Emergence of number sense through the integration of multimodal information: developmental learning insights from neural network models. Front Neurosci 2024; 18:1330512. [PMID: 38298912 PMCID: PMC10828047 DOI: 10.3389/fnins.2024.1330512] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2023] [Accepted: 01/02/2024] [Indexed: 02/02/2024] Open
Abstract
Introduction Associating multimodal information is essential for human cognitive abilities including mathematical skills. Multimodal learning has also attracted attention in the field of machine learning, and it has been suggested that the acquisition of better latent representation plays an important role in enhancing task performance. This study aimed to explore the impact of multimodal learning on representation, and to understand the relationship between multimodal representation and the development of mathematical skills. Methods We employed a multimodal deep neural network as the computational model for multimodal associations in the brain. We compared the representations of numerical information, that is, handwritten digits and images containing a variable number of geometric figures learned through single- and multimodal methods. Next, we evaluated whether these representations were beneficial for downstream arithmetic tasks. Results Multimodal training produced better latent representation in terms of clustering quality, which is consistent with previous findings on multimodal learning in deep neural networks. Moreover, the representations learned using multimodal information exhibited superior performance in arithmetic tasks. Discussion Our novel findings experimentally demonstrate that changes in acquired latent representations through multimodal association learning are directly related to cognitive functions, including mathematical skills. This supports the possibility that multimodal learning using deep neural network models may offer novel insights into higher cognitive functions.
Collapse
Affiliation(s)
| | | | - Yuichi Yamashita
- Department of Information Medicine, National Institute of Neuroscience, National Center of Neurology and Psychiatry, Kodaira, Japan
| |
Collapse
|
16
|
Kim G, Kim DK, Jeong H. Spontaneous emergence of rudimentary music detectors in deep neural networks. Nat Commun 2024; 15:148. [PMID: 38168097 PMCID: PMC10761941 DOI: 10.1038/s41467-023-44516-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2021] [Accepted: 12/15/2023] [Indexed: 01/05/2024] Open
Abstract
Music exists in almost every society, has universal acoustic features, and is processed by distinct neural circuits in humans even with no experience of musical training. However, it remains unclear how these innate characteristics emerge and what functions they serve. Here, using an artificial deep neural network that models the auditory information processing of the brain, we show that units tuned to music can spontaneously emerge by learning natural sound detection, even without learning music. The music-selective units encoded the temporal structure of music in multiple timescales, following the population-level response characteristics observed in the brain. We found that the process of generalization is critical for the emergence of music-selectivity and that music-selectivity can work as a functional basis for the generalization of natural sound, thereby elucidating its origin. These findings suggest that evolutionary adaptation to process natural sounds can provide an initial blueprint for our sense of music.
Collapse
Affiliation(s)
- Gwangsu Kim
- Department of Physics, Korea Advanced Institute of Science and Technology, Daejeon, 34141, Korea
| | - Dong-Kyum Kim
- Department of Physics, Korea Advanced Institute of Science and Technology, Daejeon, 34141, Korea
| | - Hawoong Jeong
- Department of Physics, Korea Advanced Institute of Science and Technology, Daejeon, 34141, Korea.
- Center for Complex Systems, Korea Advanced Institute of Science and Technology, Daejeon, 34141, Korea.
| |
Collapse
|
17
|
Bonn CD, Odic D. Effects of spatial frequency cross-adaptation on the visual number sense. Atten Percept Psychophys 2024; 86:248-262. [PMID: 37872436 DOI: 10.3758/s13414-023-02798-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/18/2023] [Indexed: 10/25/2023]
Abstract
When observing a simple visual scene such as an array of dots, observers can easily and automatically extract their number. How does our visual system accomplish this? We investigate the role of specific spatial frequencies to the encoding of number through cross-adaptation. In two experiments, observers were peripherally adapted to six randomly generated sinusoidal gratings varying from relatively low-spatial frequency (M = 0.44 c/deg) to relatively high-spatial frequency (M = 5.88 c/deg). Subsequently, observers judged which side of the screen had a higher number of dots. We found a strong number-adaptation effect to low-spatial frequency gratings (i.e., participants significantly underestimated the number of dots on the adapted side) but a significantly reduced adaptation effect for high-spatial frequency gratings. Various control conditions demonstrate that these effects are not due to a generic response bias for the adapted side, nor moderated by dot size or spacing effects. In a third experiment, we observed no cross-adaptation for centrally presented gratings. Our results show that observers' peripheral number perception can be adapted even with stimuli lacking any numeric or segmented object information and that low spatial frequencies adapt peripheral number perception more than high ones. Together, our results are consistent with recent number perception models that suggest a key role for spatial frequency in the extraction of number from the visual signal (e.g., Paul, Ackooij, Ten Cate, & Harvey, 2022), but additionally suggest that some spatial frequencies - especially in the low range and in the periphery - may be weighted more by the visual system when estimating number. We argue that the cross-adaptation paradigm is also a useful methodology for discovering the primitives of visual number encoding.
Collapse
Affiliation(s)
- Cory D Bonn
- Strong Analytics, Department of Psychology, University of British Columbia, 330 N. Wabash, Chicago, IL, USA
- Centre for Cognitive Development, Department of Psychology, University of British Columbia, 2136 West Mall, Vancouver, BC, V6T 1Z4, Canada
| | - Darko Odic
- Centre for Cognitive Development, Department of Psychology, University of British Columbia, 2136 West Mall, Vancouver, BC, V6T 1Z4, Canada.
| |
Collapse
|
18
|
Ohki T, Kunii N, Chao ZC. Efficient, continual, and generalized learning in the brain - neural mechanism of Mental Schema 2.0. Rev Neurosci 2023; 34:839-868. [PMID: 36960579 DOI: 10.1515/revneuro-2022-0137] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2022] [Accepted: 02/26/2023] [Indexed: 03/25/2023]
Abstract
There has been tremendous progress in artificial neural networks (ANNs) over the past decade; however, the gap between ANNs and the biological brain as a learning device remains large. With the goal of closing this gap, this paper reviews learning mechanisms in the brain by focusing on three important issues in ANN research: efficiency, continuity, and generalization. We first discuss the method by which the brain utilizes a variety of self-organizing mechanisms to maximize learning efficiency, with a focus on the role of spontaneous activity of the brain in shaping synaptic connections to facilitate spatiotemporal learning and numerical processing. Then, we examined the neuronal mechanisms that enable lifelong continual learning, with a focus on memory replay during sleep and its implementation in brain-inspired ANNs. Finally, we explored the method by which the brain generalizes learned knowledge in new situations, particularly from the mathematical generalization perspective of topology. Besides a systematic comparison in learning mechanisms between the brain and ANNs, we propose "Mental Schema 2.0," a new computational property underlying the brain's unique learning ability that can be implemented in ANNs.
Collapse
Affiliation(s)
- Takefumi Ohki
- International Research Center for Neurointelligence (WPI-IRCN), The University of Tokyo Institutes for Advanced Study, The University of Tokyo, Tokyo 113-0033, Japan
| | - Naoto Kunii
- Department of Neurosurgery, The University of Tokyo, Tokyo 113-0033, Japan
| | - Zenas C Chao
- International Research Center for Neurointelligence (WPI-IRCN), The University of Tokyo Institutes for Advanced Study, The University of Tokyo, Tokyo 113-0033, Japan
| |
Collapse
|
19
|
Kirschhock ME, Nieder A. Numerical Representation for Action in Crows Obeys the Weber-Fechner Law. Psychol Sci 2023; 34:1322-1335. [PMID: 37883792 DOI: 10.1177/09567976231201624] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2023] Open
Abstract
The psychophysical laws governing the judgment of perceived numbers of objects or events, called the number sense, have been studied in detail. However, the behavioral principles of equally important numerical representations for action are largely unexplored in both humans and animals. We trained two male carrion crows (Corvus corone) to judge numerical values of instruction stimuli from one to five and to flexibly perform a matching number of pecks. Our quantitative analysis of the crows' number production performance shows the same behavioral regularities that have previously been demonstrated for the judgment of sensory numerosity, such as the numerical distance effect, the numerical magnitude effect, and the logarithmical compression of the number line. The presence of these psychophysical phenomena in crows producing number of pecks suggests a unified sensorimotor number representation system underlying the judgment of the number of external stimuli and internally generated actions.
Collapse
Affiliation(s)
| | - Andreas Nieder
- Animal Physiology Unit, Institute of Neurobiology, University of Tübingen
| |
Collapse
|
20
|
Verma BK, Sengupta R. Emergence of behavioral phenomena and adaptation effects in human numerosity decoder using recurrent neural networks. Sci Rep 2023; 13:19571. [PMID: 37949909 PMCID: PMC10638322 DOI: 10.1038/s41598-023-44535-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2023] [Accepted: 10/10/2023] [Indexed: 11/12/2023] Open
Abstract
Humans possess an innate ability to visually perceive numerosities, which refers to the cardinality of a set. Numerous studies indicate that the lateral intraparietal cortex (LIP) and other intraparietal sulcus (IPS) regions (region) of the brain contain the neurological substrates responsible for number processing. Existing computational models of number perception often focus on a limited range of numbers and fail to account for important behavioral characteristics like adaptation effects, despite simulating fundamental aspects such as size and distance effects. To address these limitations, our study develops (introduces) a novel computational model of number perception utilizing a network of neurons with self-excitatory and mutual inhibitory properties. Our approach assumes that the mean activation of the network at steady state can encode numerosity by exhibiting a monotonically increasing relationship with the input variable set size. By optimizing the total number of inhibition strengths required, we achieve coverage of the full range of numbers through three distinct intervals: 1 to 4, 5 to 17, and 21 to 50. Remarkably, this division aligns closely with the breakpoints in numerosity perception identified in behavioral studies. Furthermore, our study develops a method for decoding the mean activation into a continuous scale of numbers spanning from 1 to 50. Additionally, we propose a mechanism for dynamically selecting the inhibition strength based on current inputs, enabling the network to operate effectively across an extended (entire) range of numerosities. Our model not only sheds new light on the generation of diverse behavioral phenomena in the brain but also elucidates how continuous visual attributes and adaptation effects influence perceived numerosity.
Collapse
Affiliation(s)
- Bhavesh K Verma
- Indian Institute of Science Education and Research, Pune, 411008, India
| | - Rakesh Sengupta
- Center for Creative Cognition. SR University, Warangal, 506371, India.
| |
Collapse
|
21
|
Zhao H, Zhang Y, Han L, Qian W, Wang J, Wu H, Li J, Dai Y, Zhang Z, Bowen CR, Yang Y. Intelligent Recognition Using Ultralight Multifunctional Nano-Layered Carbon Aerogel Sensors with Human-Like Tactile Perception. NANO-MICRO LETTERS 2023; 16:11. [PMID: 37943399 PMCID: PMC10635924 DOI: 10.1007/s40820-023-01216-0] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/23/2023] [Accepted: 09/11/2023] [Indexed: 11/10/2023]
Abstract
Humans can perceive our complex world through multi-sensory fusion. Under limited visual conditions, people can sense a variety of tactile signals to identify objects accurately and rapidly. However, replicating this unique capability in robots remains a significant challenge. Here, we present a new form of ultralight multifunctional tactile nano-layered carbon aerogel sensor that provides pressure, temperature, material recognition and 3D location capabilities, which is combined with multimodal supervised learning algorithms for object recognition. The sensor exhibits human-like pressure (0.04-100 kPa) and temperature (21.5-66.2 °C) detection, millisecond response times (11 ms), a pressure sensitivity of 92.22 kPa-1 and triboelectric durability of over 6000 cycles. The devised algorithm has universality and can accommodate a range of application scenarios. The tactile system can identify common foods in a kitchen scene with 94.63% accuracy and explore the topographic and geomorphic features of a Mars scene with 100% accuracy. This sensing approach empowers robots with versatile tactile perception to advance future society toward heightened sensing, recognition and intelligence.
Collapse
Affiliation(s)
- Huiqi Zhao
- CAS Center for Excellence in Nanoscience, Beijing Key Laboratory of Micro-Nano Energy and Sensor, Beijing Institute of Nanoenergy and Nanosystems, Chinese Academy of Sciences, Beijing, 101400, People's Republic of China
- School of Nanoscience and Technology, University of Chinese Academy of Sciences, Beijing, 100049, People's Republic of China
| | - Yizheng Zhang
- Tencent Robotics X, Shenzhen, 518054, People's Republic of China
| | - Lei Han
- Tencent Robotics X, Shenzhen, 518054, People's Republic of China
| | - Weiqi Qian
- CAS Center for Excellence in Nanoscience, Beijing Key Laboratory of Micro-Nano Energy and Sensor, Beijing Institute of Nanoenergy and Nanosystems, Chinese Academy of Sciences, Beijing, 101400, People's Republic of China
- School of Nanoscience and Technology, University of Chinese Academy of Sciences, Beijing, 100049, People's Republic of China
| | - Jiabin Wang
- CAS Center for Excellence in Nanoscience, Beijing Key Laboratory of Micro-Nano Energy and Sensor, Beijing Institute of Nanoenergy and Nanosystems, Chinese Academy of Sciences, Beijing, 101400, People's Republic of China
- Center on Nanoenergy Research, School of Physical Science and Technology, Guangxi University, Nanning, 530004, People's Republic of China
| | - Heting Wu
- CAS Center for Excellence in Nanoscience, Beijing Key Laboratory of Micro-Nano Energy and Sensor, Beijing Institute of Nanoenergy and Nanosystems, Chinese Academy of Sciences, Beijing, 101400, People's Republic of China
| | - Jingchen Li
- Tencent Robotics X, Shenzhen, 518054, People's Republic of China
| | - Yuan Dai
- Tencent Robotics X, Shenzhen, 518054, People's Republic of China.
| | - Zhengyou Zhang
- Tencent Robotics X, Shenzhen, 518054, People's Republic of China
| | - Chris R Bowen
- Department of Mechanical Engineering, University of Bath, Bath, BA2 7AK, UK
| | - Ya Yang
- CAS Center for Excellence in Nanoscience, Beijing Key Laboratory of Micro-Nano Energy and Sensor, Beijing Institute of Nanoenergy and Nanosystems, Chinese Academy of Sciences, Beijing, 101400, People's Republic of China.
- School of Nanoscience and Technology, University of Chinese Academy of Sciences, Beijing, 100049, People's Republic of China.
- Center on Nanoenergy Research, School of Physical Science and Technology, Guangxi University, Nanning, 530004, People's Republic of China.
| |
Collapse
|
22
|
Hancock PA. Are humans still necessary? ERGONOMICS 2023; 66:1711-1718. [PMID: 37530394 DOI: 10.1080/00140139.2023.2236822] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/26/2023] [Accepted: 07/05/2023] [Indexed: 08/03/2023]
Abstract
Our long accepted and historically-persistent human narrative almost exclusively places us at the motivational centre of events. The wellspring of this anthropocentric fable arises from the unitary and bounded nature of personal consciousness. Such immediate conscious experience frames the heroic vision we have told to, and subsequently sold to ourselves. But need this centrality necessarily be a given? The following work challenges this, oft unquestioned, foundational assumption, especially in light of developments in automated, autonomous, and artificially-intelligent systems. For, in these latter technologies, human contributions are becoming ever more peripheral and arguably unnecessary. The removal of the human operator from the inner loops of momentary control has progressed to now an ever more remote function as some form of supervisory monitor. The natural progression of that line of evolution is the eventual excision of humans from access to any form of control loop at all. This may even include system maintenance and then, prospectively, even initial design. The present argument features a 'unit of analysis' provocation which explores the proposition that socially, and even ergonomically, the human individual no longer occupies priority or any degree of pre-eminent centrality. Rather, we are witnessing a transitional phase of development in which socio-technical collectives are evolving as the principle sources of what, may well be profoundly unhuman motivation. These developing proclivities occupy our landscape of technological innovations that daily act to magnify, rather than diminish, such progressive inhumanities. Where this leaves a science focused on work as a human-centred enterprise serves to occupy the culminating consideration of the present discourse.
Collapse
Affiliation(s)
- P A Hancock
- Department of Psychology, and Institute for Simulation and Training, University of Central Florida, Orlando, Florida, USA
| |
Collapse
|
23
|
Kobylkov D, Zanon M, Perrino M, Vallortigara G. Neural coding of numerousness. Biosystems 2023; 232:104999. [PMID: 37574182 DOI: 10.1016/j.biosystems.2023.104999] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2023] [Revised: 08/08/2023] [Accepted: 08/10/2023] [Indexed: 08/15/2023]
Abstract
Perception of numerousness, i.e. number of items in a set, is an important cognitive ability, which is present in several animal taxa. In spite of obvious differences in neuroanatomy, insects, fishes, reptiles, birds, and mammals all possess a "number sense". Furthermore, information regarding numbers can belong to different sensory modalities: animals can estimate a number of visual items, a number of tones, or a number of their own movements. Given both the heterogeneity of stimuli and of the brains processing these stimuli, it is hard to imagine that number cognition can be traced back to the same evolutionary conserved neural pathway. However, neurons that selectively respond to the number of stimuli have been described in higher-order integration brain centres both in primates and in birds, two evolutionary distant groups. Although most probably not of the same evolutionary origin, these number neurons share remarkable similarities in their response properties. Instead of homology, this similarity might result from computational advantages of the underlying coding mechanism. This means that one might expect numerousness information to undergo similar steps of neural processing even in evolutionary distant neural pathways. Following this logic, in this review we summarize our current knowledge of how numerousness is processed in the brain from sensory input to coding of abstract information in the higher-order integration centres. We also propose a list of key open questions that might promote future research on number cognition.
Collapse
Affiliation(s)
- Dmitry Kobylkov
- Centre for Mind/Brain Science, CIMeC, University of Trento, Rovereto, Italy
| | - Mirko Zanon
- Centre for Mind/Brain Science, CIMeC, University of Trento, Rovereto, Italy
| | - Matilde Perrino
- Centre for Mind/Brain Science, CIMeC, University of Trento, Rovereto, Italy
| | | |
Collapse
|
24
|
Chakravarthi R, Nordqvist A, Poncet M, Adamian N. Fundamental units of numerosity estimation. Cognition 2023; 239:105565. [PMID: 37487302 DOI: 10.1016/j.cognition.2023.105565] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2022] [Revised: 05/22/2023] [Accepted: 07/13/2023] [Indexed: 07/26/2023]
Abstract
Humans can approximately enumerate a large number of objects at a single glance. While several mechanisms have been proposed to account for this ability, the fundamental units over which they operate remain unclear. Previous studies have argued that estimation mechanisms act only on topologically distinct units or on units formed by spatial grouping cues such as proximity and connectivity, but not on units grouped by similarity. Over four experiments, we tested this claim by systematically assessing and demonstrating that similarity grouping leads to underestimation, just as spatial grouping does. Ungrouped objects with the same low-level properties as grouped objects did not cause underestimation. Further, the underestimation caused by spatial and similarity grouping was additive, suggesting that these grouping processes operate independently. These findings argue against the proposal that estimation mechanisms operate solely on topological units. Instead, we conclude that estimation processes act on representations constructed after Gestalt grouping principles, whether similarity based or spatial, have organised incoming visual input.
Collapse
Affiliation(s)
| | - Andy Nordqvist
- School of Psychology, University of Aberdeen, Aberdeen, United Kingdom.
| | - Marlene Poncet
- School of Psychology & Neuroscience, University of St Andrews, St Andrews, United Kingdom.
| | - Nika Adamian
- School of Psychology, University of Aberdeen, Aberdeen, United Kingdom.
| |
Collapse
|
25
|
Lee H, Choi W, Lee D, Paik SB. Comparison of visual quantities in untrained neural networks. Cell Rep 2023; 42:112900. [PMID: 37516959 DOI: 10.1016/j.celrep.2023.112900] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2023] [Revised: 05/25/2023] [Accepted: 07/13/2023] [Indexed: 08/01/2023] Open
Abstract
The ability to compare quantities of visual objects with two distinct measures, proportion and difference, is observed even in newborn animals. However, how this function originates in the brain, even before visual experience, remains unknown. Here, we propose a model in which neuronal tuning for quantity comparisons can arise spontaneously in completely untrained neural circuits. Using a biologically inspired model neural network, we find that single units selective to proportions and differences between visual quantities emerge in randomly initialized feedforward wirings and that they enable the network to perform quantity comparison tasks. Notably, we find that two distinct tunings to proportion and difference originate from a random summation of monotonic, nonlinear neural activities and that a slight difference in the nonlinear response function determines the type of measure. Our results suggest that visual quantity comparisons are primitive types of functions that can emerge spontaneously before learning in young brains.
Collapse
Affiliation(s)
- Hyeonsu Lee
- Department of Bio and Brain Engineering, Korea Advanced Institute of Science and Technology, Daejeon 34141, Republic of Korea
| | - Woochul Choi
- Department of Bio and Brain Engineering, Korea Advanced Institute of Science and Technology, Daejeon 34141, Republic of Korea
| | - Dongil Lee
- Department of Bio and Brain Engineering, Korea Advanced Institute of Science and Technology, Daejeon 34141, Republic of Korea
| | - Se-Bum Paik
- Department of Brain and Cognitive Sciences, Korea Advanced Institute of Science and Technology, Daejeon 34141, Republic of Korea.
| |
Collapse
|
26
|
Wencheng W, Ge Y, Zuo Z, Chen L, Qin X, Zuxiang L. Visual number sense for real-world scenes shared by deep neural networks and humans. Heliyon 2023; 9:e18517. [PMID: 37560656 PMCID: PMC10407052 DOI: 10.1016/j.heliyon.2023.e18517] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2023] [Revised: 07/17/2023] [Accepted: 07/19/2023] [Indexed: 08/11/2023] Open
Abstract
Recently, visual number sense has been identified from deep neural networks (DNNs). However, whether DNNs have the same capacity for real-world scenes, rather than the simple geometric figures that are often tested, is unclear. In this study, we explore the number perception of scenes using AlexNet and find that numerosity can be represented by the pattern of group activation of the category layer units. The global activation of these units increases with the number of objects in the scene, and the variations in their activation decrease accordingly. By decoding the numerosity from this pattern, we reveal that the embedding coefficient of a scene determines the likelihood of potential objects to contribute to numerical perception. This was demonstrated by the more optimized performance for pictures with relatively high embedding coefficients in both DNNs and humans. This study for the first time shows that a distinct feature in visual environments, revealed by DNNs, can modulate human perception, supported by a group-coding mechanism.
Collapse
Affiliation(s)
- Wu Wencheng
- AHU-IAI AI Joint Laboratory, Anhui University, Hefei, 230601, China
- Institute of Artificial Intelligence, Hefei Comprehensive National Science Center, Hefei, 230088, China
| | - Yingxi Ge
- State Key Laboratory of Brain and Cognitive Science, Institute of Biophysics, Chinese Academy of Sciences, 15 Datun Road, Beijing, 100101, China
- CAS Center for Excellence in Brain Science and Intelligence Technology, China
- University of Chinese Academy of Sciences, 19A Yuquan Road, Beijing, 100049, China
| | - Zhentao Zuo
- Institute of Artificial Intelligence, Hefei Comprehensive National Science Center, Hefei, 230088, China
- State Key Laboratory of Brain and Cognitive Science, Institute of Biophysics, Chinese Academy of Sciences, 15 Datun Road, Beijing, 100101, China
- CAS Center for Excellence in Brain Science and Intelligence Technology, China
- University of Chinese Academy of Sciences, 19A Yuquan Road, Beijing, 100049, China
| | - Lin Chen
- Institute of Artificial Intelligence, Hefei Comprehensive National Science Center, Hefei, 230088, China
- State Key Laboratory of Brain and Cognitive Science, Institute of Biophysics, Chinese Academy of Sciences, 15 Datun Road, Beijing, 100101, China
- CAS Center for Excellence in Brain Science and Intelligence Technology, China
- University of Chinese Academy of Sciences, 19A Yuquan Road, Beijing, 100049, China
| | - Xu Qin
- Key Laboratory of Intelligent Computing and Signal Processing of Ministry of Education, Hefei, 230601, China
- Anhui Provincial Key Laboratory of Multimodal Cognitive Computation, Anhui University, Hefei, 230601, China
- School of Computer Science and Technology, Anhui University, Hefei 230601, China
| | - Liu Zuxiang
- Institute of Artificial Intelligence, Hefei Comprehensive National Science Center, Hefei, 230088, China
- State Key Laboratory of Brain and Cognitive Science, Institute of Biophysics, Chinese Academy of Sciences, 15 Datun Road, Beijing, 100101, China
- CAS Center for Excellence in Brain Science and Intelligence Technology, China
- University of Chinese Academy of Sciences, 19A Yuquan Road, Beijing, 100049, China
| |
Collapse
|
27
|
Mistry PK, Strock A, Liu R, Young G, Menon V. Learning-induced reorganization of number neurons and emergence of numerical representations in a biologically inspired neural network. Nat Commun 2023; 14:3843. [PMID: 37386013 PMCID: PMC10310708 DOI: 10.1038/s41467-023-39548-5] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2022] [Accepted: 06/16/2023] [Indexed: 07/01/2023] Open
Abstract
Number sense, the ability to decipher quantity, forms the foundation for mathematical cognition. How number sense emerges with learning is, however, not known. Here we use a biologically-inspired neural architecture comprising cortical layers V1, V2, V3, and intraparietal sulcus (IPS) to investigate how neural representations change with numerosity training. Learning dramatically reorganized neuronal tuning properties at both the single unit and population levels, resulting in the emergence of sharply-tuned representations of numerosity in the IPS layer. Ablation analysis revealed that spontaneous number neurons observed prior to learning were not critical to formation of number representations post-learning. Crucially, multidimensional scaling of population responses revealed the emergence of absolute and relative magnitude representations of quantity, including mid-point anchoring. These learnt representations may underlie changes from logarithmic to cyclic and linear mental number lines that are characteristic of number sense development in humans. Our findings elucidate mechanisms by which learning builds novel representations supporting number sense.
Collapse
Affiliation(s)
- Percy K Mistry
- Department of Psychiatry & Behavioral Sciences, Stanford University School of Medicine, Stanford, CA, 94304, USA.
| | - Anthony Strock
- Department of Psychiatry & Behavioral Sciences, Stanford University School of Medicine, Stanford, CA, 94304, USA
| | - Ruizhe Liu
- Department of Psychiatry & Behavioral Sciences, Stanford University School of Medicine, Stanford, CA, 94304, USA
| | - Griffin Young
- Department of Psychiatry & Behavioral Sciences, Stanford University School of Medicine, Stanford, CA, 94304, USA
| | - Vinod Menon
- Department of Psychiatry & Behavioral Sciences, Stanford University School of Medicine, Stanford, CA, 94304, USA.
- Department of Neurology & Neurological Sciences, Stanford University School of Medicine, Stanford, CA, 94304, USA.
- Wu Tsai Stanford Neuroscience Institute, Stanford University School of Medicine, Stanford, CA, 94304, USA.
- Graduate School of Education, Stanford University, Stanford, CA, 94304, USA.
- Stanford Institute for Human-Centered AI, Stanford University, Stanford, CA, 94304, USA.
| |
Collapse
|
28
|
Wagener L, Nieder A. Categorical representation of abstract spatial magnitudes in the executive telencephalon of crows. Curr Biol 2023; 33:2151-2162.e5. [PMID: 37137309 DOI: 10.1016/j.cub.2023.04.013] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2022] [Revised: 04/03/2023] [Accepted: 04/07/2023] [Indexed: 05/05/2023]
Abstract
The ability to group abstract continuous magnitudes into meaningful categories is cognitively demanding but key to intelligent behavior. To explore its neuronal mechanisms, we trained carrion crows to categorize lines of variable lengths into arbitrary "short" and "long" categories. Single-neuron activity in the nidopallium caudolaterale (NCL) of behaving crows reflected the learned length categories of visual stimuli. The length categories could be reliably decoded from neuronal population activity to predict the crows' conceptual decisions. NCL activity changed with learning when a crow was retrained with the same stimuli assigned to more categories with new boundaries ("short", "medium," and "long"). Categorical neuronal representations emerged dynamically so that sensory length information at the beginning of the trial was transformed into behaviorally relevant categorical representations shortly before the crows' decision making. Our data show malleable categorization capabilities for abstract spatial magnitudes mediated by the flexible networks of the crow NCL.
Collapse
Affiliation(s)
- Lysann Wagener
- Animal Physiology Unit, Institute of Neurobiology, University of Tübingen, 72076 Tübingen, Germany
| | - Andreas Nieder
- Animal Physiology Unit, Institute of Neurobiology, University of Tübingen, 72076 Tübingen, Germany.
| |
Collapse
|
29
|
Nakai T, Nishimoto S. Artificial neural network modelling of the neural population code underlying mathematical operations. Neuroimage 2023; 270:119980. [PMID: 36848969 DOI: 10.1016/j.neuroimage.2023.119980] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2022] [Revised: 02/10/2023] [Accepted: 02/23/2023] [Indexed: 02/28/2023] Open
Abstract
Mathematical operations have long been regarded as a sparse, symbolic process in neuroimaging studies. In contrast, advances in artificial neural networks (ANN) have enabled extracting distributed representations of mathematical operations. Recent neuroimaging studies have compared distributed representations of the visual, auditory and language domains in ANNs and biological neural networks (BNNs). However, such a relationship has not yet been examined in mathematics. Here we hypothesise that ANN-based distributed representations can explain brain activity patterns of symbolic mathematical operations. We used the fMRI data of a series of mathematical problems with nine different combinations of operators to construct voxel-wise encoding/decoding models using both sparse operator and latent ANN features. Representational similarity analysis demonstrated shared representations between ANN and BNN, an effect particularly evident in the intraparietal sulcus. Feature-brain similarity (FBS) analysis served to reconstruct a sparse representation of mathematical operations based on distributed ANN features in each cortical voxel. Such reconstruction was more efficient when using features from deeper ANN layers. Moreover, latent ANN features allowed the decoding of novel operators not used during model training from brain activity. The current study provides novel insights into the neural code underlying mathematical thought.
Collapse
Affiliation(s)
- Tomoya Nakai
- Center for Information and Neural Networks, National Institute of Information and Communications Technology, Suita, Japan; Lyon Neuroscience Research Center (CRNL), INSERM U1028 - CNRS UMR5292, University of Lyon, Bron, France.
| | - Shinji Nishimoto
- Center for Information and Neural Networks, National Institute of Information and Communications Technology, Suita, Japan; Graduate School of Frontier Biosciences, Osaka University, Suita, Japan; Graduate School of Medicine, Osaka University, Suita, Japan
| |
Collapse
|
30
|
Kanwisher N, Khosla M, Dobs K. Using artificial neural networks to ask 'why' questions of minds and brains. Trends Neurosci 2023; 46:240-254. [PMID: 36658072 DOI: 10.1016/j.tins.2022.12.008] [Citation(s) in RCA: 20] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2022] [Revised: 11/29/2022] [Accepted: 12/22/2022] [Indexed: 01/19/2023]
Abstract
Neuroscientists have long characterized the properties and functions of the nervous system, and are increasingly succeeding in answering how brains perform the tasks they do. But the question 'why' brains work the way they do is asked less often. The new ability to optimize artificial neural networks (ANNs) for performance on human-like tasks now enables us to approach these 'why' questions by asking when the properties of networks optimized for a given task mirror the behavioral and neural characteristics of humans performing the same task. Here we highlight the recent success of this strategy in explaining why the visual and auditory systems work the way they do, at both behavioral and neural levels.
Collapse
Affiliation(s)
- Nancy Kanwisher
- Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA, USA; McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Meenakshi Khosla
- Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA, USA; McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Katharina Dobs
- Department of Psychology, Justus Liebig University Giessen, Giessen, Germany; Center for Mind, Brain and Behavior (CMBB), University of Marburg and Justus Liebig University, Giessen, Germany.
| |
Collapse
|
31
|
Yago Malo J, Cicchini GM, Morrone MC, Chiofalo ML. Quantum spin models for numerosity perception. PLoS One 2023; 18:e0284610. [PMID: 37098002 PMCID: PMC10128973 DOI: 10.1371/journal.pone.0284610] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2022] [Accepted: 04/04/2023] [Indexed: 04/26/2023] Open
Abstract
Humans share with animals, both vertebrates and invertebrates, the capacity to sense the number of items in their environment already at birth. The pervasiveness of this skill across the animal kingdom suggests that it should emerge in very simple populations of neurons. Current modelling literature, however, has struggled to provide a simple architecture carrying out this task, with most proposals suggesting the emergence of number sense in multi-layered complex neural networks, and typically requiring supervised learning; while simple accumulator models fail to predict Weber's Law, a common trait of human and animal numerosity processing. We present a simple quantum spin model with all-to-all connectivity, where numerosity is encoded in the spectrum after stimulation with a number of transient signals occurring in a random or orderly temporal sequence. We use a paradigmatic simulational approach borrowed from the theory and methods of open quantum systems out of equilibrium, as a possible way to describe information processing in neural systems. Our method is able to capture many of the perceptual characteristics of numerosity in such systems. The frequency components of the magnetization spectra at harmonics of the system's tunneling frequency increase with the number of stimuli presented. The amplitude decoding of each spectrum, performed with an ideal-observer model, reveals that the system follows Weber's law. This contrasts with the well-known failure to reproduce Weber's law with linear system or accumulators models.
Collapse
Affiliation(s)
- Jorge Yago Malo
- Department of Physics "Enrico Fermi" and INFN, University of Pisa, Pisa, Italy
| | | | - Maria Concetta Morrone
- Department of Translational Research and of New Surgical and Medical Technologies, University of Pisa and PisaVisionLab, Pisa, Italy
| | | |
Collapse
|
32
|
Dasgupta S, Hattori D, Navlakha S. A neural theory for counting memories. Nat Commun 2022; 13:5961. [PMID: 36217003 PMCID: PMC9551066 DOI: 10.1038/s41467-022-33577-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2022] [Accepted: 09/22/2022] [Indexed: 11/09/2022] Open
Abstract
Keeping track of the number of times different stimuli have been experienced is a critical computation for behavior. Here, we propose a theoretical two-layer neural circuit that stores counts of stimulus occurrence frequencies. This circuit implements a data structure, called a count sketch, that is commonly used in computer science to maintain item frequencies in streaming data. Our first model implements a count sketch using Hebbian synapses and outputs stimulus-specific frequencies. Our second model uses anti-Hebbian plasticity and only tracks frequencies within four count categories ("1-2-3-many"), which trades-off the number of categories that need to be distinguished with the potential ethological value of those categories. We show how both models can robustly track stimulus occurrence frequencies, thus expanding the traditional novelty-familiarity memory axis from binary to discrete with more than two possible values. Finally, we show that an implementation of the "1-2-3-many" count sketch exists in the insect mushroom body.
Collapse
Affiliation(s)
- Sanjoy Dasgupta
- Computer Science and Engineering Department, University of California San Diego, La Jolla, CA, 92037, USA
| | - Daisuke Hattori
- Department of Physiology, UT Southwestern Medical Center, Dallas, TX, 75390, USA.
| | - Saket Navlakha
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 11724, USA.
| |
Collapse
|
33
|
Hofstetter S, Dumoulin SO. Assessing the ecological validity of numerosity-selective neuronal populations with real-world natural scenes. iScience 2022; 25:105267. [PMID: 36274951 PMCID: PMC9579010 DOI: 10.1016/j.isci.2022.105267] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2022] [Revised: 07/18/2022] [Accepted: 09/26/2022] [Indexed: 11/21/2022] Open
Abstract
Animals and humans are able to quickly and effortlessly estimate the number of items in a set: their numerosity. Numerosity perception is thought to be critical to behavior, from feeding to escaping predators to human mathematical cognition. Virtually, all scientific studies on numerosity mechanisms use well controlled but artificial stimuli to isolate the numerosity dimension from other physical quantities. Here, we probed the ecological validity of these artificial stimuli and evaluate whether an important component in numerosity processing, the numerosity-selective neural populations, also respond to numerosity of items in real-world natural scenes. Using 7T MRI and natural images from a wide range of categories, we provide evidence that the numerosity-tuned neuronal populations show numerosity-selective responses when viewing images from a real-world natural scene. Our findings strengthen the role of numerosity-selective neurons in numerosity perception and provide an important link to their function in numerosity perception in real-world settings.
Collapse
Affiliation(s)
- Shir Hofstetter
- The Spinoza Centre for Neuroimaging, Amsterdam, the Netherlands,Department of Computational Cognitive Neuroscience and Neuroimaging, Netherlands Institute for Neuroscience, Amsterdam, the Netherlands,Corresponding author
| | - Serge O. Dumoulin
- The Spinoza Centre for Neuroimaging, Amsterdam, the Netherlands,Department of Computational Cognitive Neuroscience and Neuroimaging, Netherlands Institute for Neuroscience, Amsterdam, the Netherlands,Department of Experimental Psychology, Helmholtz Institute, Utrecht University, Utrecht, the Netherlands,Department of Experimental and Applied Psychology, VU University, Amsterdam, the Netherlands,Corresponding author
| |
Collapse
|
34
|
Janini D, Hamblin C, Deza A, Konkle T. General object-based features account for letter perception. PLoS Comput Biol 2022; 18:e1010522. [PMID: 36155642 PMCID: PMC9536565 DOI: 10.1371/journal.pcbi.1010522] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2021] [Revised: 10/06/2022] [Accepted: 08/29/2022] [Indexed: 11/30/2022] Open
Abstract
After years of experience, humans become experts at perceiving letters. Is this visual capacity attained by learning specialized letter features, or by reusing general visual features previously learned in service of object categorization? To explore this question, we first measured the perceptual similarity of letters in two behavioral tasks, visual search and letter categorization. Then, we trained deep convolutional neural networks on either 26-way letter categorization or 1000-way object categorization, as a way to operationalize possible specialized letter features and general object-based features, respectively. We found that the general object-based features more robustly correlated with the perceptual similarity of letters. We then operationalized additional forms of experience-dependent letter specialization by altering object-trained networks with varied forms of letter training; however, none of these forms of letter specialization improved the match to human behavior. Thus, our findings reveal that it is not necessary to appeal to specialized letter representations to account for perceptual similarity of letters. Instead, we argue that it is more likely that the perception of letters depends on domain-general visual features. For over a century, scientists have conducted behavioral experiments to investigate how the visual system recognizes letters, but it has proven difficult to propose a model of the feature space underlying this capacity. Here we leveraged recent advances in machine learning to model a wide variety of features ranging from specialized letter features to general object-based features. Across two large-scale behavioral experiments we find that general object-based features account well for letter perception, and that adding letter specialization did not improve the correspondence to human behavior. It is plausible that the ability to recognize letters largely relies on general visual features unaltered by letter learning.
Collapse
Affiliation(s)
- Daniel Janini
- Department of Psychology, Harvard University, Cambridge, Massachusetts, United States of America
- * E-mail:
| | - Chris Hamblin
- Department of Psychology, Harvard University, Cambridge, Massachusetts, United States of America
| | - Arturo Deza
- Department of Psychology, Harvard University, Cambridge, Massachusetts, United States of America
- Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| | - Talia Konkle
- Department of Psychology, Harvard University, Cambridge, Massachusetts, United States of America
| |
Collapse
|
35
|
Zhou Y, Chen H, Wang Y. Recognition ability of untrained neural networks to symbolic numbers. Front Neuroinform 2022; 16:973010. [PMID: 36213547 PMCID: PMC9534535 DOI: 10.3389/fninf.2022.973010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2022] [Accepted: 09/05/2022] [Indexed: 11/13/2022] Open
Abstract
Although animals can learn to use abstract numbers to represent the number of items, whether untrained animals could distinguish between different abstract numbers is not clear. A two-layer spiking neural network with lateral inhibition was built from the perspective of biological interpretability. The network connection weight was set randomly without adjustment. On the basis of this model, experiments were carried out on the symbolic number dataset MNIST and non-symbolic numerosity dataset. Results showed that the model has abilities to distinguish symbolic numbers. However, compared with number sense, tuning curves of symbolic numbers could not reproduce size and distance effects. The preference distribution also could not show high distribution characteristics at both ends and low distribution characteristics in the middle. More than half of the network units prefer the symbolic numbers 0 and 5. The average goodness-of-fit of the Gaussian fitting of tuning curves increases with the increase in abscissa non-linearity. These results revealed that the concept of human symbolic number is trained on the basis of number sense.
Collapse
|
36
|
The malleable impact of non-numeric features in visual number perception. Acta Psychol (Amst) 2022; 230:103737. [PMID: 36095870 DOI: 10.1016/j.actpsy.2022.103737] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2022] [Revised: 07/20/2022] [Accepted: 09/03/2022] [Indexed: 11/21/2022] Open
Abstract
Non-numeric stimulus features frequently influence observers' number judgments: when judging the number of items in a display, we will often (mis)perceive the set with a larger cumulative surface area as more numerous. These "congruency effects" are often used as evidence for how vision extracts numeric information and have been invoked in arguments surrounding whether non-numeric cues (e.g., cumulative area, density, etc.) are combined for number perception. We test whether congruency effects for one such cue - cumulative area - provide evidence that it is necessarily used and integrated in number perception, or if its influence on number is malleable. In Experiment 1, we replicate and extend prior work showing that the presence of feedback eliminates congruency effects between number and cumulative area, suggesting that the role of cumulative area in number perception is malleable rather than obligatory. In Experiment 2, we test whether this malleable influence is because of use of prior experiences about how number naturalistically correlates with cumulative area, or the result of response competition, with number and cumulative area actively competing for the same behavioral decision. We preserve cumulative area as a visual cue but eliminate response competition with number by replacing one side of the dot array with its corresponding Hindu-Arabic numeral. Independent of the presence or absence of feedback, we do not observe congruency effects in Experiment 2. These experiments suggest that cumulative area is not necessarily integrated in number perception nor a reflection of a rational use of naturalistic correlations, but rather congruency effects between cumulative area and number emerge as a consequence of response competition. Our findings help to elucidate the mechanism through which non-numeric cues and number interact, and provide an explanation for why congruency effects are only sometimes observed across studies.
Collapse
|
37
|
Abstract
Numerosity, that is, the number of items in a set, is a significant aspect in the perception of the environment. Behavioral and in silico experiments suggest that number sense belongs to a core knowledge system and can be present already at birth. However, neurons sensitive to the number of visual items have been so far described only in the brain of adult animals. Therefore, it remained unknown to what extent their selectivity would depend on visual learning and experience. We found number neurons in the caudal nidopallium (a higher associative area functionally similar to the mammalian prefrontal cortex) of very young, numerically naïve domestic chicks. This result suggests that numerosity perception is possibly an inborn feature of the vertebrate brain. Numerical cognition is ubiquitous in the animal kingdom. Domestic chicks are a widely used developmental model for studying numerical cognition. Soon after hatching, chicks can perform sophisticated numerical tasks. Nevertheless, the neural basis of their numerical abilities has remained unknown. Here, we describe number neurons in the caudal nidopallium (functionally equivalent to the mammalian prefrontal cortex) of young domestic chicks. Number neurons that we found in young chicks showed remarkable similarities to those in the prefrontal cortex and caudal nidopallium of adult animals. Thus, our results suggest that numerosity perception based on number neurons might be an inborn feature of the vertebrate brain.
Collapse
|
38
|
Cai Y, Hofstetter S, Harvey BM, Dumoulin SO. Attention drives human numerosity-selective responses. Cell Rep 2022; 39:111005. [PMID: 35767956 DOI: 10.1016/j.celrep.2022.111005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2021] [Revised: 04/18/2022] [Accepted: 06/03/2022] [Indexed: 11/03/2022] Open
Abstract
Numerosity, the set size of a group of items, helps guide behavior and decisions. Previous studies have shown that neural populations respond selectively to numerosities. How numerosity is extracted from the visual scene is a longstanding debate, often contrasting low-level visual with high-level cognitive processes. Here, we investigate how attention influences numerosity-selective responses. The stimuli consisted of black and white dots within the same display. Participants' attention was focused on either black or white dots, while we systematically changed the numerosity of black, white, and total dots. Using 7 T fMRI, we show that the numerosity-tuned neural populations respond only when attention is focused on their preferred numerosity, irrespective of the unattended or total numerosities. Without attention, responses to preferred numerosity are suppressed. Unlike traditional effects of attention in the visual cortex, where attention enhances already existing responses, these results suggest that attention is required to drive numerosity-selective responses.
Collapse
Affiliation(s)
- Yuxuan Cai
- Spinoza Centre for Neuroimaging, Meibergdreef 75, 1105BK Amsterdam, the Netherlands; Computational Cognitive Neuroscience and Neuroimaging, Netherlands Institute for Neuroscience, Amsterdam, the Netherlands; Experimental and Applied Psychology, Vrije University Amsterdam, Amsterdam, the Netherlands.
| | - Shir Hofstetter
- Spinoza Centre for Neuroimaging, Meibergdreef 75, 1105BK Amsterdam, the Netherlands; Computational Cognitive Neuroscience and Neuroimaging, Netherlands Institute for Neuroscience, Amsterdam, the Netherlands
| | - Ben M Harvey
- Experimental Psychology, Helmholtz Institute, Utrecht University, Utrecht, the Netherlands
| | - Serge O Dumoulin
- Spinoza Centre for Neuroimaging, Meibergdreef 75, 1105BK Amsterdam, the Netherlands; Computational Cognitive Neuroscience and Neuroimaging, Netherlands Institute for Neuroscience, Amsterdam, the Netherlands; Experimental and Applied Psychology, Vrije University Amsterdam, Amsterdam, the Netherlands; Experimental Psychology, Helmholtz Institute, Utrecht University, Utrecht, the Netherlands.
| |
Collapse
|
39
|
Zhou Y, Chen H, Wang Y. Role of Lateral Inhibition on Visual Number Sense. Front Comput Neurosci 2022; 16:810448. [PMID: 35795083 PMCID: PMC9252291 DOI: 10.3389/fncom.2022.810448] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2021] [Accepted: 01/11/2022] [Indexed: 11/13/2022] Open
Abstract
Newborn animals, such as 4-month-old infants, 4-day-old chicks, and 1-day-old guppies, exhibit sensitivity to an approximate number of items in the visual array. These findings are often interpreted as evidence for an innate "number sense." However, number sense is typically investigated using explicit behavioral tasks, which require a form of calibration (e.g., habituation or reward-based training) in experimental studies. Therefore, the generation of number sense may be the result of calibration. We built a number-sense neural network model on the basis of lateral inhibition to explore whether animals demonstrate an innate "number sense" and determine important factors affecting this competence. The proposed model can reproduce size and distance effects of output responses of number-selective neurons when network connection weights are set randomly without an adjustment. Results showed that number sense can be produced under the influence of lateral inhibition, which is one of the fundamental mechanisms of the nervous system, and independent of learning.
Collapse
Affiliation(s)
| | - Huanwen Chen
- The School of Automation, Central South University, Changsha, China
| | - Yijun Wang
- The School of Automation, Central South University, Changsha, China
| |
Collapse
|
40
|
Zhou L, Yang A, Meng M, Zhou K. Emerged human-like facial expression representation in a deep convolutional neural network. SCIENCE ADVANCES 2022; 8:eabj4383. [PMID: 35319988 PMCID: PMC8942361 DOI: 10.1126/sciadv.abj4383] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/12/2021] [Accepted: 02/02/2022] [Indexed: 06/14/2023]
Abstract
Recent studies found that the deep convolutional neural networks (DCNNs) trained to recognize facial identities spontaneously learned features that support facial expression recognition, and vice versa. Here, we showed that the self-emerged expression-selective units in a VGG-Face trained for facial identification were tuned to distinct basic expressions and, importantly, exhibited hallmarks of human expression recognition (i.e., facial expression confusion and categorical perception). We then investigated whether the emergence of expression-selective units is attributed to either face-specific experience or domain-general processing by conducting the same analysis on a VGG-16 trained for object classification and an untrained VGG-Face without any visual experience, both having the identical architecture with the pretrained VGG-Face. Although similar expression-selective units were found in both DCNNs, they did not exhibit reliable human-like characteristics of facial expression perception. Together, these findings revealed the necessity of domain-specific visual experience of face identity for the development of facial expression perception, highlighting the contribution of nurture to form human-like facial expression perception.
Collapse
Affiliation(s)
- Liqin Zhou
- Beijing Key Laboratory of Applied Experimental Psychology, Faculty of Psychology, Beijing Normal University, Beijing 100875, China
| | - Anmin Yang
- Beijing Key Laboratory of Applied Experimental Psychology, Faculty of Psychology, Beijing Normal University, Beijing 100875, China
| | - Ming Meng
- Philosophy and Social Science Laboratory of Reading and Development in Children and Adolescents (South China Normal University), Ministry of Education, Guangzhou 510631, China
- Guangdong Key Laboratory of Mental Health and Cognitive Science, School of Psychology, South China Normal University, Guangzhou 510631, China
| | - Ke Zhou
- Beijing Key Laboratory of Applied Experimental Psychology, Faculty of Psychology, Beijing Normal University, Beijing 100875, China
| |
Collapse
|
41
|
Dobs K, Martinez J, Kell AJE, Kanwisher N. Brain-like functional specialization emerges spontaneously in deep neural networks. SCIENCE ADVANCES 2022; 8:eabl8913. [PMID: 35294241 PMCID: PMC8926347 DOI: 10.1126/sciadv.abl8913] [Citation(s) in RCA: 56] [Impact Index Per Article: 18.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/11/2021] [Accepted: 01/21/2022] [Indexed: 05/10/2023]
Abstract
The human brain contains multiple regions with distinct, often highly specialized functions, from recognizing faces to understanding language to thinking about what others are thinking. However, it remains unclear why the cortex exhibits this high degree of functional specialization in the first place. Here, we consider the case of face perception using artificial neural networks to test the hypothesis that functional segregation of face recognition in the brain reflects a computational optimization for the broader problem of visual recognition of faces and other visual categories. We find that networks trained on object recognition perform poorly on face recognition and vice versa and that networks optimized for both tasks spontaneously segregate themselves into separate systems for faces and objects. We then show functional segregation to varying degrees for other visual categories, revealing a widespread tendency for optimization (without built-in task-specific inductive biases) to lead to functional specialization in machines and, we conjecture, also brains.
Collapse
Affiliation(s)
- Katharina Dobs
- Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA, USA
- McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, MA, USA
- Department of Psychology, Justus Liebig University Giessen, Giessen, Germany
| | - Julio Martinez
- Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA, USA
- McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, MA, USA
- Department of Psychology, Stanford University, Stanford, CA, USA
| | | | - Nancy Kanwisher
- Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA, USA
- McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, MA, USA
| |
Collapse
|
42
|
Paul JM, van Ackooij M, Ten Cate TC, Harvey BM. Numerosity tuning in human association cortices and local image contrast representations in early visual cortex. Nat Commun 2022; 13:1340. [PMID: 35292648 PMCID: PMC8924234 DOI: 10.1038/s41467-022-29030-z] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2021] [Accepted: 02/21/2022] [Indexed: 01/31/2023] Open
Abstract
Human early visual cortex response amplitudes monotonically increase with numerosity (object number), regardless of object size and spacing. However, numerosity is typically considered a high-level visual or cognitive feature, while early visual responses follow image contrast in the spatial frequency domain. We find that, at fixed contrast, aggregate Fourier power (at all orientations and spatial frequencies) follows numerosity closely but nonlinearly with little effect of object size, spacing or shape. This would allow straightforward numerosity estimation from spatial frequency domain image representations. Using 7T fMRI, we show monotonic responses originate in primary visual cortex (V1) at the stimulus's retinotopic location. Responses here and in neural network models follow aggregate Fourier power more closely than numerosity. Truly numerosity tuned responses emerge after lateral occipital cortex and are independent of retinotopic location. We propose numerosity's straightforward perception and neural responses may result from the pervasive spatial frequency analyses of early visual processing.
Collapse
Affiliation(s)
- Jacob M Paul
- Experimental Psychology, Helmholtz Institute, Utrecht University, Heidelberglaan 1, Utrecht, 3584 CS, Netherlands.
- Melbourne School of Psychological Sciences, University of Melbourne, Redmond Barry Building, Parkville, 3010, Victoria, Australia.
| | - Martijn van Ackooij
- Experimental Psychology, Helmholtz Institute, Utrecht University, Heidelberglaan 1, Utrecht, 3584 CS, Netherlands
| | - Tuomas C Ten Cate
- Experimental Psychology, Helmholtz Institute, Utrecht University, Heidelberglaan 1, Utrecht, 3584 CS, Netherlands
| | - Ben M Harvey
- Experimental Psychology, Helmholtz Institute, Utrecht University, Heidelberglaan 1, Utrecht, 3584 CS, Netherlands
| |
Collapse
|
43
|
Zhang D, Zhou L, Yang A, Li S, Chang C, Liu J, Zhou K. A connectome-based neuromarker of nonverbal number acuity and arithmetic skills. Cereb Cortex 2022; 33:881-894. [PMID: 35254408 PMCID: PMC9890459 DOI: 10.1093/cercor/bhac108] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2021] [Revised: 02/15/2022] [Accepted: 02/16/2022] [Indexed: 02/04/2023] Open
Abstract
The approximate number system (ANS) is vital for survival and reproduction in animals and is crucial for constructing abstract mathematical abilities in humans. Most previous neuroimaging studies focused on identifying discrete brain regions responsible for the ANS and characterizing their functions in numerosity perception. However, a neuromarker to characterize an individual's ANS acuity is lacking, especially one based on whole-brain functional connectivity (FC). Here, based on the resting-state functional magnetic resonance imaging (rs-fMRI) data obtained from a large sample, we identified a distributed brain network (i.e. a numerosity network) using a connectome-based predictive modeling (CPM) analysis. The summed FC strength within the numerosity network reliably predicted individual differences in ANS acuity regarding behavior, as measured using a nonsymbolic number-comparison task. Furthermore, in an independent dataset of the Human Connectome Project (HCP), we found that the summed FC strength within the numerosity network also specifically predicted individual differences in arithmetic skills, but not domain-general cognitive abilities. Therefore, our findings revealed that the identified numerosity network could serve as an applicable neuroimaging-based biomarker of nonverbal number acuity and arithmetic skills.
Collapse
Affiliation(s)
- Dai Zhang
- Beijing Key Laboratory of Applied Experimental Psychology, School of Psychology, Beijing Normal University, No. 19, Xinjiekouwai Street, Haidian District, Beijing 100875, China,Guangdong Key Laboratory for Biomedical Measurements and Ultrasound Imaging, School of Biomedical Engineering, Shenzhen University Health Science Center, No. 1066, Xueyuan Street, Nanshan District, Shenzhen 518060, China
| | - Liqin Zhou
- Beijing Key Laboratory of Applied Experimental Psychology, School of Psychology, Beijing Normal University, No. 19, Xinjiekouwai Street, Haidian District, Beijing 100875, China
| | - Anmin Yang
- Beijing Key Laboratory of Applied Experimental Psychology, School of Psychology, Beijing Normal University, No. 19, Xinjiekouwai Street, Haidian District, Beijing 100875, China
| | - Shanshan Li
- Beijing Key Laboratory of Applied Experimental Psychology, School of Psychology, Beijing Normal University, No. 19, Xinjiekouwai Street, Haidian District, Beijing 100875, China
| | - Chunqi Chang
- Guangdong Key Laboratory for Biomedical Measurements and Ultrasound Imaging, School of Biomedical Engineering, Shenzhen University Health Science Center, No. 1066, Xueyuan Street, Nanshan District, Shenzhen 518060, China
| | - Jia Liu
- Department of Psychology & Tsinghua Laboratory of Brain and Intelligence, Tsinghua University, No. 30, Shuangqing Street, Haidian District, Beijing 100084, China
| | - Ke Zhou
- Corresponding author: Beijing Key Laboratory of Applied Experimental Psychology, School of Psychology, Beijing Normal University, No. 19, Xinjiekouwai Street, Haidian District, Beijing 100875, China.
| |
Collapse
|
44
|
|
45
|
Fu W, Dolfi S, Decarli G, Spironelli C, Zorzi M. Electrophysiological Signatures of Numerosity Encoding in a Delayed Match-to-Sample Task. Front Hum Neurosci 2022; 15:750582. [PMID: 35058763 PMCID: PMC8764258 DOI: 10.3389/fnhum.2021.750582] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2021] [Accepted: 11/29/2021] [Indexed: 11/13/2022] Open
Abstract
The number of elements in a small set of items is appraised in a fast and exact manner, a phenomenon called subitizing. In contrast, humans provide imprecise responses when comparing larger numerosities, with decreasing precision as the number of elements increases. Estimation is thought to rely on a dedicated system for the approximate representation of numerosity. While previous behavioral and neuroimaging studies associate subitizing to a domain-general system related to object tracking and identification, the nature of small numerosity processing is still debated. We investigated the neural processing of numerosity across subitizing and estimation ranges by examining electrophysiological activity during the memory retention period in a delayed numerical match-to-sample task. We also assessed potential differences in the neural signature of numerical magnitude in a fully non-symbolic or cross-format comparison. In line with behavioral performance, we observed modulation of parietal-occipital neural activity as a function of numerosity that differed in two ranges, with distinctive neural signatures of small numerosities showing clear similarities with those observed in visuospatial working memory tasks. We also found differences in neural activity related to numerical information in anticipation of single vs. cross-format comparison, suggesting a top-down modulation of numerical processing. Finally, behavioral results revealed enhanced performance in the mixed-format conditions and a significant correlation between task performance and symbolic mathematical skills. Overall, we provide evidence for distinct mechanisms related to small and large numerosity and differences in numerical encoding based on task demands.
Collapse
Affiliation(s)
- Wanlu Fu
- Department of General Psychology, University of Padova, Padua, Italy
| | - Serena Dolfi
- Department of Developmental Psychology and Socialisation, University of Padova, Padua, Italy
| | - Gisella Decarli
- Department of General Psychology, University of Padova, Padua, Italy
| | - Chiara Spironelli
- Department of General Psychology, University of Padova, Padua, Italy
- Padova Neuroscience Center (PNC), University of Padova, Padua, Italy
| | - Marco Zorzi
- Department of General Psychology, University of Padova, Padua, Italy
- IRCCS San Camillo Hospital, Venice, Italy
- *Correspondence: Marco Zorzi,
| |
Collapse
|
46
|
Gupta SK, Zhang M, Wu CC, Wolfe JM, Kreiman G. Visual Search Asymmetry: Deep Nets and Humans Share Similar Inherent Biases. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 2021; 34:6946-6959. [PMID: 36062138 PMCID: PMC9436507] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Visual search is a ubiquitous and often challenging daily task, exemplified by looking for the car keys at home or a friend in a crowd. An intriguing property of some classical search tasks is an asymmetry such that finding a target A among distractors B can be easier than finding B among A. To elucidate the mechanisms responsible for asymmetry in visual search, we propose a computational model that takes a target and a search image as inputs and produces a sequence of eye movements until the target is found. The model integrates eccentricity-dependent visual recognition with target-dependent top-down cues. We compared the model against human behavior in six paradigmatic search tasks that show asymmetry in humans. Without prior exposure to the stimuli or task-specific training, the model provides a plausible mechanism for search asymmetry. We hypothesized that the polarity of search asymmetry arises from experience with the natural environment. We tested this hypothesis by training the model on augmented versions of ImageNet where the biases of natural images were either removed or reversed. The polarity of search asymmetry disappeared or was altered depending on the training protocol. This study highlights how classical perceptual properties can emerge in neural network models, without the need for task-specific training, but rather as a consequence of the statistical properties of the developmental diet fed to the model. All source code and data are publicly available at https://github.com/kreimanlab/VisualSearchAsymmetry.
Collapse
Affiliation(s)
| | - Mengmi Zhang
- Children's Hospital, Harvard Medical School
- Center for Brains, Minds and Machines
| | - Chia-Chien Wu
- Brigham and Women's Hospital, Harvard Medical School
| | | | - Gabriel Kreiman
- Children's Hospital, Harvard Medical School
- Center for Brains, Minds and Machines
| |
Collapse
|
47
|
Nasr K, Nieder A. Spontaneous representation of numerosity zero in a deep neural network for visual object recognition. iScience 2021; 24:103301. [PMID: 34765921 PMCID: PMC8571726 DOI: 10.1016/j.isci.2021.103301] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2021] [Revised: 09/16/2021] [Accepted: 10/14/2021] [Indexed: 12/01/2022] Open
Abstract
Conceiving "nothing" as a numerical value zero is considered a sophisticated numerical capability that humans share with cognitively advanced animals. We demonstrate that representation of zero spontaneously emerges in a deep learning neural network without any number training. As a signature of numerical quantity representation, and similar to real neurons from animals, numerosity zero network units show maximum activity to empty sets and a gradual decrease in activity with increasing countable numerosities. This indicates that the network spontaneously ordered numerosity zero as the smallest numerical value along the number line. Removal of empty-set network units caused specific deficits in the network's judgment of numerosity zero, thus reflecting these units' functional relevance. These findings suggest that processing visual information is sufficient for a visual number sense that includes zero to emerge and explains why cognitively advanced animals with whom we share a nonverbal number system exhibit rudiments of numerosity zero.
Collapse
Affiliation(s)
- Khaled Nasr
- Animal Physiology Unit, Institute of Neurobiology, Auf der Morgenstelle 28, University of Tübingen, 72076 Tübingen, Germany
| | - Andreas Nieder
- Animal Physiology Unit, Institute of Neurobiology, Auf der Morgenstelle 28, University of Tübingen, 72076 Tübingen, Germany
| |
Collapse
|
48
|
Tsouli A, Harvey BM, Hofstetter S, Cai Y, van der Smagt MJ, Te Pas SF, Dumoulin SO. The role of neural tuning in quantity perception. Trends Cogn Sci 2021; 26:11-24. [PMID: 34702662 DOI: 10.1016/j.tics.2021.10.004] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2021] [Revised: 10/01/2021] [Accepted: 10/04/2021] [Indexed: 11/16/2022]
Abstract
Perception of quantities, such as numerosity, timing, and size, is essential for behavior and cognition. Accumulating evidence demonstrates neurons processing quantities are tuned, that is, have a preferred quantity amount, not only for numerosity, but also other quantity dimensions and sensory modalities. We argue that quantity-tuned neurons are fundamental to understanding quantity perception. We illustrate how the properties of quantity-tuned neurons can underlie a range of perceptual phenomena. Furthermore, quantity-tuned neurons are organized in distinct but overlapping topographic maps. We suggest that this overlap in tuning provides the neural basis for perceptual interactions between different quantities, without the need for a common neural representational code.
Collapse
Affiliation(s)
- Andromachi Tsouli
- Department of Experimental Psychology, Helmholtz Institute, Utrecht University, Utrecht, The Netherlands
| | - Ben M Harvey
- Department of Experimental Psychology, Helmholtz Institute, Utrecht University, Utrecht, The Netherlands
| | - Shir Hofstetter
- The Spinoza Centre for Neuroimaging, Amsterdam, The Netherlands
| | - Yuxuan Cai
- The Spinoza Centre for Neuroimaging, Amsterdam, The Netherlands; Department of Experimental and Applied Psychology, VU University, Amsterdam, The Netherlands
| | - Maarten J van der Smagt
- Department of Experimental Psychology, Helmholtz Institute, Utrecht University, Utrecht, The Netherlands
| | - Susan F Te Pas
- Department of Experimental Psychology, Helmholtz Institute, Utrecht University, Utrecht, The Netherlands
| | - Serge O Dumoulin
- Department of Experimental Psychology, Helmholtz Institute, Utrecht University, Utrecht, The Netherlands; The Spinoza Centre for Neuroimaging, Amsterdam, The Netherlands; Department of Experimental and Applied Psychology, VU University, Amsterdam, The Netherlands; Netherlands Institute for Neuroscience, Royal Netherlands Academy of Sciences, Amsterdam, The Netherlands.
| |
Collapse
|
49
|
Creatore C, Sabathiel S, Solstad T. Learning exact enumeration and approximate estimation in deep neural network models. Cognition 2021; 215:104815. [PMID: 34182145 DOI: 10.1016/j.cognition.2021.104815] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2020] [Revised: 06/12/2021] [Accepted: 06/15/2021] [Indexed: 01/08/2023]
Abstract
A system for approximate number discrimination has been shown to arise in at least two types of hierarchical neural network models-a generative Deep Belief Network (DBN) and a Hierarchical Convolutional Neural Network (HCNN) trained to classify natural objects. Here, we investigate whether the same two network architectures can learn to recognise exact numerosity. A clear difference in performance could be traced to the specificity of the unit responses that emerged in the last hidden layer of each network. In the DBN, the emergence of a layer of monotonic 'summation units' was sufficient to produce classification behaviour consistent with the behavioural signature of the approximate number system. In the HCNN, a layer of units uniquely tuned to the transition between particular numerosities effectively encoded a thermometer-like 'numerosity code' that ensured near-perfect classification accuracy. The results support the notion that parallel pattern-recognition mechanisms may give rise to exact and approximate number concepts, both of which may contribute to the learning of symbolic numbers and arithmetic.
Collapse
Affiliation(s)
- Celestino Creatore
- Department of Teacher Education, Faculty of Social and Educational Sciences, NTNU-Norwegian University of Science and Technology, Norway.
| | - Silvester Sabathiel
- Department of Teacher Education, Faculty of Social and Educational Sciences, NTNU-Norwegian University of Science and Technology, Norway; Department of Computer Science, Faculty of Information Technology and Electrical Engineering, NTNU-Norwegian University of Science and Technology, Norway.
| | - Trygve Solstad
- Department of Teacher Education, Faculty of Social and Educational Sciences, NTNU-Norwegian University of Science and Technology, Norway.
| |
Collapse
|
50
|
Abstract
Enumerating objects in the environment (i.e., “number sense”) is crucial for survival in many animal species, and foundational for the construction of more abstract and complex mathematical knowledge in humans. Perhaps surprisingly, deep convolutional neural networks (DCNNs) spontaneously emerge a similar number sense even without any explicit training for numerosity estimation. However, little is known about how the number sense emerges, and the extent to which it is comparable with human number sense. Here, we examined whether the numerosity underestimation effect, a phenomenon indicating that numerosity perception acts upon the perceptual number rather than the physical number, can be observed in DCNNs. In a typical DCNN, AlexNet, we found that number-selective units at late layers operated on the perceptual number, like humans do. More importantly, this perceptual number sense did not emerge abruptly, rather developed progressively along the hierarchy in the DCNN, shifting from the physical number sense at early layers to perceptual number sense at late layers. Our finding hence provides important implications for the neural implementation of number sense in the human brain and advocates future research to determine whether the representation of numerosity also develops gradually along the human visual stream from physical number to perceptual number.
Collapse
|