1
|
Phillips PJ, White D. The state of modelling face processing in humans with deep learning. Br J Psychol 2025. [PMID: 40364689 DOI: 10.1111/bjop.12794] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2024] [Accepted: 04/20/2025] [Indexed: 05/15/2025]
Abstract
Deep learning models trained for facial recognition now surpass the highest performing human participants. Recent evidence suggests that they also model some qualitative aspects of face processing in humans. This review compares the current understanding of deep learning models with psychological models of the face processing system. Psychological models consist of two components that operate on the information encoded when people perceive a face, which we refer to here as 'face codes'. The first component, the core system, extracts face codes from retinal input that encode invariant and changeable properties. The second component, the extended system, links face codes to personal information about a person and their social context. Studies of face codes in existing deep learning models reveal some surprising results. For example, face codes in networks designed for identity recognition also encode expression information, which contrasts with psychological models that separate invariant and changeable properties. Deep learning can also be used to implement candidate models of the face processing system, for example to compare alternative cognitive architectures and codes that might support interchange between core and extended face processing systems. We conclude by summarizing seven key lessons from this research and outlining three open questions for future study.
Collapse
Affiliation(s)
| | - David White
- School of Psychology, UNSW Sydney, Sydney, New South Wales, Australia
| |
Collapse
|
2
|
Ianni GR, Vázquez Y, Rouse AG, Schieber MH, Prut Y, Freiwald WA. Facial gestures are enacted via a cortical hierarchy of dynamic and stable codes. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.03.03.641159. [PMID: 40161717 PMCID: PMC11952350 DOI: 10.1101/2025.03.03.641159] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/02/2025]
Abstract
Successful communication requires the generation and perception of a shared set of signals. Facial gestures are one fundamental set of communicative behaviors in primates, generated through the dynamic arrangement of dozens of fine muscles. While much progress has been made uncovering the neural mechanisms of face perception, little is known about those controlling facial gesture production. Commensurate with the importance of facial gestures in daily social life, anatomical work has shown that facial muscles are under direct control from multiple cortical regions, including primary and premotor in lateral frontal cortex, and cingulate in medial frontal cortex. Furthermore, neuropsychological evidence from focal lesion patients has suggested that lateral cortex controls voluntary movements, and medial emotional expressions. Here we show that lateral and medial cortical face motor regions encode both types of gestures. They do so through unique temporal activity patterns, distinguishable well-prior to movement onset. During gesture production, cortical regions encoded facial kinematics in a context-dependent manner. Our results show how cortical regions projecting in parallel downstream, but each situated at a different level of a posterior-anterior hierarchy form a continuum of gesture coding from dynamic to temporally stable, in order to produce context-related, coherent motor outputs during social communication.
Collapse
|
3
|
Amita H, Koyano KW, Kunimatsu J. Neuronal Mechanisms Underlying Face Recognition in Non-human Primates. JAPANESE PSYCHOLOGICAL RESEARCH 2024; 66:416-442. [PMID: 39611029 PMCID: PMC11601097 DOI: 10.1111/jpr.12530] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2023] [Accepted: 03/29/2024] [Indexed: 11/30/2024]
Abstract
Humans and primates rely on visual face recognition for social interactions. Damage to specific brain areas causes prosopagnosia, a condition characterized by the inability to recognize familiar faces, indicating the presence of specialized brain areas for face processing. A breakthrough finding came from a non-human primate (NHP) study conducted in the early 2000s; it was the first to identify multiple face processing areas in the temporal lobe, termed face patches. Subsequent studies have demonstrated the unique role of each face patch in the structural analysis of faces. More recent studies have expanded these findings by exploring the role of face patch networks in social and memory functions and the importance of early face exposure in the development of the system. In this review, we discuss the neuronal mechanisms responsible for analyzing facial features, categorizing faces, and associating faces with memory and social contexts within both the cerebral cortex and subcortical areas. Use of NHPs in neuropsychological and neurophysiological studies can highlight the mechanistic aspects of the neuronal circuit underlying face recognition at both the single-neuron and whole-brain network levels.
Collapse
|
4
|
Gainotti G. Human Recognition: The Utilization of Face, Voice, Name and Interactions-An Extended Editorial. Brain Sci 2024; 14:345. [PMID: 38671996 PMCID: PMC11048321 DOI: 10.3390/brainsci14040345] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2024] [Accepted: 03/21/2024] [Indexed: 04/28/2024] Open
Abstract
The many stimulating contributions to this Special Issue of Brain Science focused on some basic issues of particular interest in current research, with emphasis on human recognition using faces, voices, and names [...].
Collapse
Affiliation(s)
- Guido Gainotti
- Institute of Neurology, Università Cattolica del Sacro Cuore, Fondazione Policlinico A. Gemelli, Istituto di Ricovero e Cura a Carattere Scientifico, 00168 Rome, Italy
| |
Collapse
|
5
|
Sharma KK, Diltz MA, Lincoln T, Albuquerque ER, Romanski LM. Neuronal Population Encoding of Identity in Primate Prefrontal Cortex. J Neurosci 2024; 44:e0703232023. [PMID: 37963766 PMCID: PMC10860606 DOI: 10.1523/jneurosci.0703-23.2023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2023] [Revised: 08/22/2023] [Accepted: 10/10/2023] [Indexed: 11/16/2023] Open
Abstract
The ventrolateral prefrontal cortex (VLPFC) shows robust activation during the perception of faces and voices. However, little is known about what categorical features of social stimuli drive neural activity in this region. Since perception of identity and expression are critical social functions, we examined whether neural responses to naturalistic stimuli were driven by these two categorical features in the prefrontal cortex. We recorded single neurons in the VLPFC, while two male rhesus macaques (Macaca mulatta) viewed short audiovisual videos of unfamiliar conspecifics making expressions of aggressive, affiliative, and neutral valence. Of the 285 neurons responsive to the audiovisual stimuli, 111 neurons had a main effect (two-way ANOVA) of identity, expression, or their interaction in their stimulus-related firing rates; however, decoding of expression and identity using single-unit firing rates rendered poor accuracy. Interestingly, when decoding from pseudo-populations of recorded neurons, the accuracy for both expression and identity increased with population size, suggesting that the population transmitted information relevant to both variables. Principal components analysis of mean population activity across time revealed that population responses to the same identity followed similar trajectories in the response space, facilitating segregation from other identities. Our results suggest that identity is a critical feature of social stimuli that dictates the structure of population activity in the VLPFC, during the perception of vocalizations and their corresponding facial expressions. These findings enhance our understanding of the role of the VLPFC in social behavior.
Collapse
Affiliation(s)
- K K Sharma
- Department of Neuroscience, School of Medicine and Dentistry, University of Rochester, Rochester, New York 14620
| | - M A Diltz
- Department of Neuroscience, School of Medicine and Dentistry, University of Rochester, Rochester, New York 14620
| | - T Lincoln
- Department of Neuroscience, School of Medicine and Dentistry, University of Rochester, Rochester, New York 14620
| | - E R Albuquerque
- Department of Neuroscience, School of Medicine and Dentistry, University of Rochester, Rochester, New York 14620
| | - L M Romanski
- Department of Neuroscience, School of Medicine and Dentistry, University of Rochester, Rochester, New York 14620
| |
Collapse
|
6
|
Raman R, Bognár A, Nejad GG, Taubert N, Giese M, Vogels R. Bodies in motion: Unraveling the distinct roles of motion and shape in dynamic body responses in the temporal cortex. Cell Rep 2023; 42:113438. [PMID: 37995183 PMCID: PMC10783614 DOI: 10.1016/j.celrep.2023.113438] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2023] [Revised: 09/26/2023] [Accepted: 10/26/2023] [Indexed: 11/25/2023] Open
Abstract
The temporal cortex represents social stimuli, including bodies. We examine and compare the contributions of dynamic and static features to the single-unit responses to moving monkey bodies in and between a patch in the anterior dorsal bank of the superior temporal sulcus (dorsal patch [DP]) and patches in the anterior inferotemporal cortex (ventral patch [VP]), using fMRI guidance in macaques. The response to dynamics varies within both regions, being higher in DP. The dynamic body selectivity of VP neurons correlates with static features derived from convolutional neural networks and motion. DP neurons' dynamic body selectivity is not predicted by static features but is dominated by motion. Whereas these data support the dominance of motion in the newly proposed "dynamic social perception" stream, they challenge the traditional view that distinguishes DP and VP processing in terms of motion versus static features, underscoring the role of inferotemporal neurons in representing body dynamics.
Collapse
Affiliation(s)
- Rajani Raman
- Department of Neurosciences, KU Leuven, 3000 Leuven, Belgium; Leuven Brain Institute, KU Leuven, 3000 Leuven, Belgium
| | - Anna Bognár
- Department of Neurosciences, KU Leuven, 3000 Leuven, Belgium; Leuven Brain Institute, KU Leuven, 3000 Leuven, Belgium
| | - Ghazaleh Ghamkhari Nejad
- Department of Neurosciences, KU Leuven, 3000 Leuven, Belgium; Leuven Brain Institute, KU Leuven, 3000 Leuven, Belgium
| | - Nick Taubert
- Hertie Institute for Clinical Brain Research and Center for Integrative Neuroscience, University Clinic Tuebingen, 72074 Tuebingen, Germany
| | - Martin Giese
- Hertie Institute for Clinical Brain Research and Center for Integrative Neuroscience, University Clinic Tuebingen, 72074 Tuebingen, Germany
| | - Rufin Vogels
- Department of Neurosciences, KU Leuven, 3000 Leuven, Belgium; Leuven Brain Institute, KU Leuven, 3000 Leuven, Belgium.
| |
Collapse
|
7
|
Romanski LM, Sharma KK. Multisensory interactions of face and vocal information during perception and memory in ventrolateral prefrontal cortex. Philos Trans R Soc Lond B Biol Sci 2023; 378:20220343. [PMID: 37545305 PMCID: PMC10404928 DOI: 10.1098/rstb.2022.0343] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2023] [Accepted: 03/21/2023] [Indexed: 08/08/2023] Open
Abstract
The ventral frontal lobe is a critical node in the circuit that underlies communication, a multisensory process where sensory features of faces and vocalizations come together. The neural basis of face and vocal integration is a topic of great importance since the integration of multiple sensory signals is essential for the decisions that govern our social interactions. Investigations have shown that the macaque ventrolateral prefrontal cortex (VLPFC), a proposed homologue of the human inferior frontal gyrus, is involved in the processing, integration and remembering of audiovisual signals. Single neurons in VLPFC encode and integrate species-specific faces and corresponding vocalizations. During working memory, VLPFC neurons maintain face and vocal information online and exhibit selective activity for face and vocal stimuli. Population analyses indicate that identity, a critical feature of social stimuli, is encoded by VLPFC neurons and dictates the structure of dynamic population activity in the VLPFC during the perception of vocalizations and their corresponding facial expressions. These studies suggest that VLPFC may play a primary role in integrating face and vocal stimuli with contextual information, in order to support decision making during social communication. This article is part of the theme issue 'Decision and control processes in multisensory perception'.
Collapse
Affiliation(s)
- Lizabeth M. Romanski
- Department of Neuroscience, University of Rochester School of Medicine, Rochester, NY 14642, USA
| | - Keshov K. Sharma
- Department of Neuroscience, University of Rochester School of Medicine, Rochester, NY 14642, USA
| |
Collapse
|
8
|
Chong I, Ramezanpour H, Thier P. Causal Manipulation of Gaze-Following in the Macaque Temporal Cortex. Prog Neurobiol 2023; 226:102466. [PMID: 37211234 DOI: 10.1016/j.pneurobio.2023.102466] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2022] [Revised: 05/09/2023] [Accepted: 05/17/2023] [Indexed: 05/23/2023]
Abstract
Gaze-following, the ability to shift one's own attention to places or objects others are looking at, is essential for social interactions. Single unit recordings from the monkey cortex and neuroimaging work on the human and monkey brain suggest that a distinct region in the temporal cortex, the gaze-following patch (GFP), underpins this ability. Since previous studies of the GFP have relied on correlational techniques, it remains unclear whether gaze-following related activity in the GFP indicates a causal role rather than being just a reverberation of behaviorally relevant information produced elsewhere. To answer this question, we applied focal electrical and pharmacological perturbation to the GFP. Both approaches, when applied to the GFP, disrupted gaze-following if the monkeys had been instructed to follow gaze, along with the ability to suppress it if vetoed by the context. Hence the GFP is necessary for gaze-following as well as its cognitive control.
Collapse
Affiliation(s)
- Ian Chong
- Cognitive Neurology Laboratory, Hertie Institute for Clinical Brain Research, University of Tübingen, Tübingen, Germany.
| | - Hamidreza Ramezanpour
- Cognitive Neurology Laboratory, Hertie Institute for Clinical Brain Research, University of Tübingen, Tübingen, Germany; Centre for Vision Research, York University, Toronto, Ontario, Canada
| | - Peter Thier
- Cognitive Neurology Laboratory, Hertie Institute for Clinical Brain Research, University of Tübingen, Tübingen, Germany; Werner Reichardt Centre for Integrative Neuroscience, University of Tübingen, Tübingen, Germany.
| |
Collapse
|
9
|
Bognár A, Raman R, Taubert N, Zafirova Y, Li B, Giese M, De Gelder B, Vogels R. The contribution of dynamics to macaque body and face patch responses. Neuroimage 2023; 269:119907. [PMID: 36717042 PMCID: PMC9986793 DOI: 10.1016/j.neuroimage.2023.119907] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2022] [Revised: 12/20/2022] [Accepted: 01/26/2023] [Indexed: 01/29/2023] Open
Abstract
Previous functional imaging studies demonstrated body-selective patches in the primate visual temporal cortex, comparing activations to static bodies and static images of other categories. However, the use of static instead of dynamic displays of moving bodies may have underestimated the extent of the body patch network. Indeed, body dynamics provide information about action and emotion and may be processed in patches not activated by static images. Thus, to map with fMRI the full extent of the macaque body patch system in the visual temporal cortex, we employed dynamic displays of natural-acting monkey bodies, dynamic monkey faces, objects, and scrambled versions of these videos, all presented during fixation. We found nine body patches in the visual temporal cortex, starting posteriorly in the superior temporal sulcus (STS) and ending anteriorly in the temporal pole. Unlike for static images, body patches were present consistently in both the lower and upper banks of the STS. Overall, body patches showed a higher activation by dynamic displays than by matched static images, which, for identical stimulus displays, was less the case for the neighboring face patches. These data provide the groundwork for future single-unit recording studies to reveal the spatiotemporal features the neurons of these body patches encode. These fMRI findings suggest that dynamics have a stronger contribution to population responses in body than face patches.
Collapse
Affiliation(s)
- A Bognár
- Deparment of Neurosciences, KU Leuven, Leuven, Belgium; Leuven Brain Institute, KU Leuven, Leuven, Belgium
| | - R Raman
- Deparment of Neurosciences, KU Leuven, Leuven, Belgium; Leuven Brain Institute, KU Leuven, Leuven, Belgium
| | - N Taubert
- Department of Cognitive Neurology, University of Tuebingen, Tuebingen, Germany
| | - Y Zafirova
- Deparment of Neurosciences, KU Leuven, Leuven, Belgium; Leuven Brain Institute, KU Leuven, Leuven, Belgium
| | - B Li
- Department of Cognitive Neuroscience, Maastricht University, Maastricht, the Netherlands
| | - M Giese
- Department of Cognitive Neurology, University of Tuebingen, Tuebingen, Germany
| | - B De Gelder
- Department of Cognitive Neuroscience, Maastricht University, Maastricht, the Netherlands; Department of Computer Science, University College London, London, UK
| | - R Vogels
- Deparment of Neurosciences, KU Leuven, Leuven, Belgium; Leuven Brain Institute, KU Leuven, Leuven, Belgium.
| |
Collapse
|
10
|
Mundy P. Research on social attention in autism and the challenges of the research domain criteria (RDoC) framework. Autism Res 2023; 16:697-712. [PMID: 36932883 DOI: 10.1002/aur.2910] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2022] [Accepted: 02/22/2023] [Indexed: 03/19/2023]
Abstract
The fuzzy nature of categories of psychopathology, such as autism, leads to significant research challenges. Alternatively, focusing research on the study of a common set of important and well-defined psychological constructs across psychiatric conditions may make the fundamental etiological processes of psychopathology easier to discern and treat (Cuthbert, 2022). The development of the research domain criteria (RDoC) framework is designed to guide this new research approach (Insel et al., 2010). However, progress in research may be expected to continually refine and reorganize the understanding of the specifics of these mental processes (Cuthbert & Insel, 2013). Moreover, knowledge gleaned from the study of both normative and atypical development can be mutually informative in the evolution of our understanding of these fundamental processes. A case in point is the study of social attention. This Autism 101 commentary provides an educational summary of research over the last few decades indicates that social attention is major construct in the study of human social-cognitive development, autism and other forms of psychopathology. The commentary also describes how this research can inform the Social Process dimension of the RDoC framework.
Collapse
Affiliation(s)
- Peter Mundy
- School of Education, Department of Psychiatry and the MIND Institute, University of California at Davis, Davis, California, USA
| |
Collapse
|
11
|
Yang Z, Freiwald WA. Encoding of dynamic facial information in the middle dorsal face area. Proc Natl Acad Sci U S A 2023; 120:e2212735120. [PMID: 36787369 PMCID: PMC9974491 DOI: 10.1073/pnas.2212735120] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2022] [Accepted: 01/04/2023] [Indexed: 02/15/2023] Open
Abstract
Faces in motion reveal a plethora of information through visual dynamics. Faces can move in complex patterns while transforming facial shape, e.g., during the generation of different emotional expressions. While motion and shape processing have been studied extensively in separate research enterprises, much less is known about their conjunction during biological motion. Here, we took advantage of the discovery in brain-imaging studies of an area in the dorsal portion of the macaque monkey superior temporal sulcus (STS), the middle dorsal face area (MD), with selectivity for naturalistic face motion. To gain mechanistic insights into the coding of facial motion, we recorded single-unit activity from MD, testing whether and how MD cells encode face motion. The MD population was highly sensitive to naturalistic facial motion and facial shape. Some MD cells responded only to the conjunction of facial shape and motion, others were selective for facial shape even without movement, and yet others were suppressed by facial motion. We found that this heterogeneous MD population transforms face motion into a higher dimensional activity space, a representation that would allow for high sensitivity to relevant small-scale movements. Indeed, we show that many MD cells carry such sensitivity for eye movements. We further found that MD cells encode motion of head, mouth, and eyes in a separable manner, requiring the use of multiple reference frames. Thus, MD is a bona fide face-motion area that uses highly heterogeneous cell populations to create codes capturing even complex facial motion trajectories.
Collapse
Affiliation(s)
- Zetian Yang
- Laboratory of Neural Systems, The Rockefeller University, New York, NY10065
| | - Winrich A. Freiwald
- Laboratory of Neural Systems, The Rockefeller University, New York, NY10065
- The Center for Brains, Minds and Machines, Cambridge, MA02139
| |
Collapse
|
12
|
Schwartz E, O’Nell K, Saxe R, Anzellotti S. Challenging the Classical View: Recognition of Identity and Expression as Integrated Processes. Brain Sci 2023; 13:296. [PMID: 36831839 PMCID: PMC9954353 DOI: 10.3390/brainsci13020296] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2022] [Revised: 02/01/2023] [Accepted: 02/02/2023] [Indexed: 02/12/2023] Open
Abstract
Recent neuroimaging evidence challenges the classical view that face identity and facial expression are processed by segregated neural pathways, showing that information about identity and expression are encoded within common brain regions. This article tests the hypothesis that integrated representations of identity and expression arise spontaneously within deep neural networks. A subset of the CelebA dataset is used to train a deep convolutional neural network (DCNN) to label face identity (chance = 0.06%, accuracy = 26.5%), and the FER2013 dataset is used to train a DCNN to label facial expression (chance = 14.2%, accuracy = 63.5%). The identity-trained and expression-trained networks each successfully transfer to labeling both face identity and facial expression on the Karolinska Directed Emotional Faces dataset. This study demonstrates that DCNNs trained to recognize face identity and DCNNs trained to recognize facial expression spontaneously develop representations of facial expression and face identity, respectively. Furthermore, a congruence coefficient analysis reveals that features distinguishing between identities and features distinguishing between expressions become increasingly orthogonal from layer to layer, suggesting that deep neural networks disentangle representational subspaces corresponding to different sources.
Collapse
Affiliation(s)
- Emily Schwartz
- Department of Psychology and Neuroscience, Boston College, Boston, MA 02467, USA
| | - Kathryn O’Nell
- Department of Psychological and Brain Sciences, Dartmouth College, Hanover, NH 03755, USA
| | - Rebecca Saxe
- Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | - Stefano Anzellotti
- Department of Psychology and Neuroscience, Boston College, Boston, MA 02467, USA
| |
Collapse
|
13
|
Representational structure of fMRI/EEG responses to dynamic facial expressions. Neuroimage 2022; 263:119631. [PMID: 36113736 DOI: 10.1016/j.neuroimage.2022.119631] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2021] [Revised: 09/09/2022] [Accepted: 09/12/2022] [Indexed: 11/23/2022] Open
Abstract
Face perception provides an excellent example of how the brain processes nuanced visual differences and transforms them into behaviourally useful representations of identities and emotional expressions. While a body of literature has looked into the spatial and temporal neural processing of facial expressions, few studies have used a dimensionally varying set of stimuli containing subtle perceptual changes. In the current study, we used 48 short videos varying dimensionally in their intensity and category (happy, angry, surprised) of expression. We measured both fMRI and EEG responses to these video clips and compared the neural response patterns to the predictions of models based on image features and models derived from behavioural ratings of the stimuli. In fMRI, the inferior frontal gyrus face area (IFG-FA) carried information related only to the intensity of the expression, independent of image-based models. The superior temporal sulcus (STS), inferior temporal (IT) and lateral occipital (LO) areas contained information about both expression category and intensity. In the EEG, the coding of expression category and low-level image features were most pronounced at around 400 ms. The expression intensity model did not, however, correlate significantly at any EEG timepoint. Our results show a specific role for IFG-FA in the coding of expressions and suggest that it contains image and category invariant representations of expression intensity.
Collapse
|
14
|
Sliwa J, Mallet M, Christiaens M, Takahashi DY. Neural basis of multi-sensory communication in primates. ETHOL ECOL EVOL 2022. [DOI: 10.1080/03949370.2021.2024266] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
Affiliation(s)
- Julia Sliwa
- Paris Brain Institute–Institut du Cerveau, Inserm, CNRS, APHP, Hôpital Pitié-Salpêtrière, Sorbonne Université, Paris, France
| | - Marion Mallet
- Paris Brain Institute–Institut du Cerveau, Inserm, CNRS, APHP, Hôpital Pitié-Salpêtrière, Sorbonne Université, Paris, France
| | - Maëlle Christiaens
- Paris Brain Institute–Institut du Cerveau, Inserm, CNRS, APHP, Hôpital Pitié-Salpêtrière, Sorbonne Université, Paris, France
| | | |
Collapse
|
15
|
Joint encoding of facial identity, orientation, gaze, and expression in the middle dorsal face area. Proc Natl Acad Sci U S A 2021; 118:2108283118. [PMID: 34385326 DOI: 10.1073/pnas.2108283118] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023] Open
Abstract
The last two decades have established that a network of face-selective areas in the temporal lobe of macaque monkeys supports the visual processing of faces. Each area within the network contains a large fraction of face-selective cells. And each area encodes facial identity and head orientation differently. A recent brain-imaging study discovered an area outside of this network selective for naturalistic facial motion, the middle dorsal (MD) face area. This finding offers the opportunity to determine whether coding principles revealed inside the core network would generalize to face areas outside the core network. We investigated the encoding of static faces and objects, facial identity, and head orientation, dimensions which had been studied in multiple areas of the core face-processing network before, as well as facial expressions and gaze. We found that MD populations form a face-selective cluster with a degree of selectivity comparable to that of areas in the core face-processing network. MD encodes facial identity robustly across changes in head orientation and expression, it encodes head orientation robustly against changes in identity and expression, and it encodes expression robustly across changes in identity and head orientation. These three dimensions are encoded in a separable manner. Furthermore, MD also encodes the direction of gaze in addition to head orientation. Thus, MD encodes both structural properties (identity) and changeable ones (expression and gaze) and thus provides information about another animal's direction of attention (head orientation and gaze). MD contains a heterogeneous population of cells that establish a multidimensional code for faces.
Collapse
|