1
|
Puebla G, Bowers JS. Visual reasoning in object-centric deep neural networks: A comparative cognition approach. Neural Netw 2025; 189:107582. [PMID: 40409010 DOI: 10.1016/j.neunet.2025.107582] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2024] [Revised: 03/28/2025] [Accepted: 05/03/2025] [Indexed: 05/25/2025]
Abstract
Achieving visual reasoning is a long-term goal of artificial intelligence. In the last decade, several studies have applied deep neural networks (DNNs) to the task of learning visual relations from images, with modest results in terms of generalization of the relations learned. However, in recent years, object-centric representation learning has been put forward as a way to achieve visual reasoning within the deep learning framework. Object-centric models attempt to model input scenes as compositions of objects and relations between them. To this end, these models use several kinds of attention mechanisms to segregate the individual objects in a scene from the background and from other objects. In this work we tested relation learning and generalization in several object-centric models, as well as a ResNet-50 baseline. In contrast to previous research, which has focused heavily in the same-different task in order to asses relational reasoning in DNNs, we use a set of tasks - with varying degrees of complexity - derived from the comparative cognition literature. Our results show that object-centric models are able to segregate the different objects in a scene, even in many out-of-distribution cases. In our simpler tasks, this improves their capacity to learn and generalize visual relations in comparison to the ResNet-50 baseline. However, object-centric models still struggle in our more difficult tasks and conditions. We conclude that abstract visual reasoning remains an open challenge for DNNs, including object-centric models.
Collapse
Affiliation(s)
- Guillermo Puebla
- Facultad de Administración y Economía, Universidad de Tarapacá, Arica 1000000, Chile.
| | - Jeffrey S Bowers
- School of Psychological Science, University of Bristol, 12a Priory Road, Bristol BS8 1TU, UK
| |
Collapse
|
2
|
Zhang EQ, Shi ER, Pleyer M. Category Learning as a Cognitive Foundation of Language Evolution. WILEY INTERDISCIPLINARY REVIEWS. COGNITIVE SCIENCE 2025; 16:e70007. [PMID: 40411358 DOI: 10.1002/wcs.70007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/21/2024] [Revised: 02/24/2025] [Accepted: 05/13/2025] [Indexed: 05/26/2025]
Abstract
Category learning gives rise to category formation, which is a crucial ability in human cognition. Category learning is also one of the required learning abilities in language development. Understanding the evolution of category learning thus can shed light on the evolution of human cognition and language. The current paper emphasizes its foundational role in language evolution by reviewing behavioral and neurological studies on category learning across species. In doing so, we first review studies on the critical role of category learning in learning sounds, words, and grammatical patterns of language. Next, from a comparative perspective, we review studies on category learning conducted on different species of nonhuman animals, including invertebrates and vertebrates, suggesting that category learning displays evolutionary continuity. Then, from a neurological perspective, we focus on the prefrontal cortex and the basal ganglia. Reviewing the involvement of these structures in vertebrates and the proposed homologous brain structure to the basal ganglia in invertebrates in category learning, as well as in language processing in humans, suggests that the neural basis of category learning likely has an ancient origin dating back to invertebrates. With evidence from both behavioral and neurological levels in both nonhuman animals and humans, we conclude that category learning lays a crucial cognitive foundation for language evolution.
Collapse
Affiliation(s)
- Elizabeth Qing Zhang
- School of Linguistic Sciences and Arts, Jiangsu Normal University, Xuzhou, China
| | - Edward Ruoyang Shi
- Department of Translation and Language Sciences, University Pompeu Fabra, Barcelona, Spain
| | - Michael Pleyer
- Center for Language Evolution Studies, Nicolaus Copernicus University in Toruń, Toruń, Poland
| |
Collapse
|
3
|
Martinovic J. Acquisition of colour categories through learning: Differences between hue and lightness. Cognition 2024; 242:105657. [PMID: 37980878 DOI: 10.1016/j.cognition.2023.105657] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2023] [Revised: 09/20/2023] [Accepted: 10/30/2023] [Indexed: 11/21/2023]
Abstract
Colour categories are acquired through learning, but the nature of this process is not fully understood. Some category distinctions are defined by hue (e.g. red/purple) but other by lightness (red/pink). The aim of this study was to investigate if the acquisition of key information for making accurate cross-boundary discriminations poses different challenges for hue-defined as opposed to lightness-defined boundaries. To answer this question, hue- and lightness-learners were trained on a novel category boundary within the GREEN region of colour space. After training, hue- and lightness-learners as well as untrained controls performed delayed same-different discrimination for lightness and hue pairs. In addition to discrimination data, errors during learning and category-labelling strategies were examined. Errors during learning distributed non-uniformly and in accordance with the Bezold-Brücke effect, which accounts for darker colours at the green-blue boundary appearing greener and lighter colours appearing bluer. Only hue-learners showed discrimination improvements due to category boundary acquisition. Thus, acquisition is more efficient for hue-category compared to lightness-category boundaries. Almost all learners reported using category-labelling strategies, with hue-learners almost exclusively using 'green'/'blue' and lightness learners using a wider range of labels, most often 'light'/'dark'. Thus, labels play an important role in colour category learning and such labelling does not conform to everyday naming: here, the label 'blue' is used for exemplars that would normally be named 'green'. In conclusion, labelling serves the purpose of highlighting key information that differentiates exemplars across the category boundary, and basic colour terms may be particularly effective in facilitating such attentional guidance.
Collapse
Affiliation(s)
- Jasna Martinovic
- Department of Psychology, School of Philosophy, Psychology and Linguistics, University of Edinburgh, 7 George Square, EH8 9JZ Edinburgh, Scotland, UK.
| |
Collapse
|
4
|
Bianchi I, Burro R. The Perception of Similarity, Difference and Opposition. J Intell 2023; 11:172. [PMID: 37754901 PMCID: PMC10532253 DOI: 10.3390/jintelligence11090172] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2023] [Revised: 08/10/2023] [Accepted: 08/15/2023] [Indexed: 09/28/2023] Open
Abstract
After considering the pervasiveness of same/different relationships in Psychology and the experimental evidence of their perceptual foundation in Psychophysics and Infant and Comparative Psychology, this paper develops its main argument. Similarity and diversity do not complete the panorama since opposition constitutes a third relationship which is distinct from the other two. There is evidence of this in the previous literature investigating the perceptual basis of opposition and in the results of the two new studies presented in this paper. In these studies, the participants were asked to indicate to what extent pairs of simple bi-dimensional figures appeared to be similar, different or opposite to each other. A rating task was used in Study 1 and a pair comparison task was used in Study 2. Three main results consistently emerged: Firstly, opposition is distinct from similarity and difference which, conversely, are in a strictly inverse relationship. Secondly, opposition is specifically linked to something which points in an allocentrically opposite direction. Thirdly, alterations to the shape of an object are usually associated with the perception of diversity rather than opposition. The implications of a shift from a dyadic (same/different) to a triadic (similar/different/opposite) paradigm are discussed in the final section.
Collapse
Affiliation(s)
- Ivana Bianchi
- Department of Humanities, University of Macerata, 62100 Macerata, Italy
| | - Roberto Burro
- Department of Human Sciences, University of Verona, 37129 Verona, Italy;
| |
Collapse
|
5
|
Coldren J. Conditions under which college students cease learning. Front Psychol 2023; 14:1116853. [PMID: 37151351 PMCID: PMC10157072 DOI: 10.3389/fpsyg.2023.1116853] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2022] [Accepted: 03/30/2023] [Indexed: 05/09/2023] Open
Abstract
Introduction Effective learning involves the acquisition of information toward a goal and cessation upon reaching that goal. Whereas the process of learning acquisition is well understood, comparatively little is known about how or when learning ceases under naturalistic, open-ended learning conditions in which the criterion for performance is not specified. Ideally, learning should cease once there is no progress toward the goal, although this has never been directly tested in human learners. The present set of experiments explored the conditions under which college students stopped attempting to learn a series of inductive perceptual discrimination problems. Methods Each problem varied by whether it was solvable and had a criterion for success. The first problem was solvable and involved a pre-determined criterion. The second problem was solvable, but with no criterion for ending the problem so that learners eventually achieved a highly accurate level of performance (overlearning). The third problem was unsolvable as the correct answer varied randomly across features. Measures included the number of trials attempted and the outcome of each problem. Results and Discussion Results revealed that college students rarely ceased learning in the overlearning or unsolvable problems even though there was no possibility for further progress. Learning cessation increased only by manipulating time demands for completion or reducing the opportunity for reinforcement. These results suggest that human learners show laudable, but inefficient and unproductive, attempts to master problems they should cease.
Collapse
Affiliation(s)
- Jeffrey Coldren
- Department of Psychological Sciences and Counseling, Youngstown State University, Youngstown, OH, United States
| |
Collapse
|
6
|
Felsche E, Stevens P, Völter CJ, Buchsbaum D, Seed AM. Evidence for abstract representations in children but not capuchin monkeys. Cogn Psychol 2023; 140:101530. [PMID: 36495840 DOI: 10.1016/j.cogpsych.2022.101530] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2021] [Revised: 10/02/2022] [Accepted: 11/23/2022] [Indexed: 12/12/2022]
Abstract
The use of abstract higher-level knowledge (also called overhypotheses) allows humans to learn quickly from sparse data and make predictions in new situations. Previous research has suggested that humans may be the only species capable of abstract knowledge formation, but this remains controversial. There is also mixed evidence for when this ability emerges over human development. Kemp et al. (2007) proposed a computational model of how overhypotheses could be learned from sparse examples. We provide the first direct test of this model: an ecologically valid paradigm for testing two species, capuchin monkeys (Sapajus spp.) and 4- to 5-year-old human children. We presented participants with sampled evidence from different containers which suggested that all containers held items of uniform type (type condition) or of uniform size (size condition). Subsequently, we presented two new test containers and an example item from each: a small, high-valued item and a large but low-valued item. Participants could then choose from which test container they would like to receive the next sample - the optimal choice was the container that yielded a large item in the size condition or a high-valued item in the type condition. We compared performance to a priori predictions made by models with and without the capacity to learn overhypotheses. Children's choices were consistent with the model predictions and thus suggest an ability for abstract knowledge formation in the preschool years, whereas monkeys performed at chance level.
Collapse
Affiliation(s)
- Elisa Felsche
- School of Psychology and Neuroscience, University of St Andrews, Scotland; Department of Comparative Cultural Psychology, Max Planck Institute for Evolutionary Anthropology, Germany.
| | | | - Christoph J Völter
- Messerli Research Institute, University of Veterinary Medicine Vienna, Medical University of Vienna and University of Vienna, Austria
| | - Daphna Buchsbaum
- Department of Cognitive, Linguistic and Psychological Sciences, Brown University, USA
| | - Amanda M Seed
- School of Psychology and Neuroscience, University of St Andrews, Scotland
| |
Collapse
|
7
|
Martin L, Marie J, Brun M, de Hevia MD, Streri A, Izard V. Abstract representations of small sets in newborns. Cognition 2022; 226:105184. [PMID: 35671541 PMCID: PMC9289748 DOI: 10.1016/j.cognition.2022.105184] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2021] [Revised: 03/22/2022] [Accepted: 05/26/2022] [Indexed: 11/21/2022]
Abstract
From the very first days of life, newborns are not tied to represent narrow, modality- and object-specific aspects of their environment. Rather, they sometimes react to abstract properties shared by stimuli of very different nature, such as approximate numerosity or magnitude. As of now, however, there is no evidence that newborns possess abstract representations that apply to small sets: in particular, while newborns can match large approximate numerosities across senses, this ability does not extend to small numerosities. In two experiments, we presented newborn infants (N = 64, age 17 to 98 h) with patterned sets AB or ABB simultaneously in the auditory and visual modalities. Auditory patterns were presented as periodic sequences of sounds (AB: triangle-drum-triangle-drum-triangle-drum …; ABB: triangle-drum-drum-triangle-drum-drum-triangle-drum-drum …), and visual patterns as arrays of 2 or 3 shapes (AB: circle-diamond; ABB: circle-diamond-diamond). In both experiments, we found that participants reacted and looked longer when the patterns matched across the auditory and visual modalities – provided that the first stimulus they received was congruent. These findings uncover the existence of yet another type of abstract representations at birth, applying to small sets. As such, they bolster the hypothesis that newborns are endowed with the capacity to represent their environment in broad strokes, in terms of its most abstract properties. This capacity for abstraction could later serve as a scaffold for infants to learn about the particular entities surrounding them. Newborns were presented with auditory and visual patterns (AB vs. ABB). Participants reacted when the patterns presented were congruent across modalities. Newborns possess abstract representations applying to small sets. These representations may encode numerosity and/or repetitions.
Collapse
Affiliation(s)
- Lucie Martin
- Université Paris Cité, CNRS, Integrative Neuroscience and Cognition Center, F-75006 Paris, France
| | - Julien Marie
- Université Paris Cité, CNRS, Integrative Neuroscience and Cognition Center, F-75006 Paris, France
| | - Mélanie Brun
- Université Paris Cité, CNRS, Integrative Neuroscience and Cognition Center, F-75006 Paris, France
| | - Maria Dolores de Hevia
- Université Paris Cité, CNRS, Integrative Neuroscience and Cognition Center, F-75006 Paris, France
| | - Arlette Streri
- Université Paris Cité, CNRS, Integrative Neuroscience and Cognition Center, F-75006 Paris, France
| | - Véronique Izard
- Université Paris Cité, CNRS, Integrative Neuroscience and Cognition Center, F-75006 Paris, France.
| |
Collapse
|
8
|
Biscione V, Bowers JS. Learning online visual invariances for novel objects via supervised and self-supervised training. Neural Netw 2022; 150:222-236. [DOI: 10.1016/j.neunet.2022.02.017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2021] [Revised: 01/14/2022] [Accepted: 02/23/2022] [Indexed: 10/18/2022]
|