1
|
Hollis J, Humphreys GW, Allen PM. Intermediate, Wholistic Shape Representation in Object Recognition: A Pre-Attentive Stage of Processing? Front Hum Neurosci 2021; 15:761174. [PMID: 35002652 PMCID: PMC8735852 DOI: 10.3389/fnhum.2021.761174] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2021] [Accepted: 11/18/2021] [Indexed: 11/13/2022] Open
Abstract
Evidence is presented for intermediate, wholistic visual representations of objects and non-objects that are computed online and independent of visual attention. Short-term visual priming was examined between visually similar shapes, with targets either falling at the (valid) location cued by primes or at another (invalid) location. Object decision latencies were facilitated when the overall shapes of the stimuli were similar irrespective of whether the location of the prime was valid or invalid, with the effects being equally large for object and non-object targets. In addition, the effects were based on the overall outlines of the stimuli and low spatial frequency components, not on local parts. In conclusion, wholistic shape representations based on outline form, are rapidly computed online during object recognition. Moreover, activation of common wholistic shape representations prime the processing of subsequent objects and non-objects irrespective of whether they appear at attended or unattended locations. Rapid derivation of wholistic form provides a key intermediate stage of object recognition.
Collapse
Affiliation(s)
- Jarrod Hollis
- Vision and Hearing Sciences Research Centre, Anglia Ruskin University, Cambridge, United Kingdom
| | - Glyn W. Humphreys
- Department of Experimental Psychology, University of Oxford, Oxford, United Kingdom
| | - Peter M. Allen
- Vision and Hearing Sciences Research Centre, Anglia Ruskin University, Cambridge, United Kingdom
| |
Collapse
|
2
|
Byosiere SE, Feng LC, Chouinard PA, Howell TJ, Bennett PC. Relational concept learning in domestic dogs: Performance on a two-choice size discrimination task generalises to novel stimuli. Behav Processes 2017; 145:93-101. [PMID: 29056526 DOI: 10.1016/j.beproc.2017.10.009] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2017] [Revised: 10/16/2017] [Accepted: 10/18/2017] [Indexed: 11/29/2022]
Abstract
One central issue in the study of animal cognition concerns conceptual behaviour, where an organism categorises objects, events, and relationships so as to transfer previously learned rules to novel contexts. In this study, we investigated whether or not dogs demonstrate conceptual behaviour in the form of simple relational class concept learning. A two-choice visual discrimination task was used to assess if dogs are capable of simple relational class concept learning by generalising the same rule (i.e. circle is larger or smaller than) to various novel shapes. Eight purebred Lagotto Romagnolos were included in the study. The results demonstrated that they were capable of generalising a previously learned size discrimination rule to novel stimuli; however, there were differences in dog's generalization capabilities across certain shapes. Considering their unique relationship with humans, and their immediacy in everyday life, a better understanding of conceptual behaviour and generalising abilities in domestic dogs may have implications for training and management methods, as well as contributing to comparative psychology and applied ethology.
Collapse
Affiliation(s)
| | - Lynna C Feng
- School of Psychology and Public Health, La Trobe University, Victoria, Australia
| | - Philippe A Chouinard
- School of Psychology and Public Health, La Trobe University, Victoria, Australia
| | - Tiffani J Howell
- School of Psychology and Public Health, La Trobe University, Victoria, Australia
| | - Pauleen C Bennett
- School of Psychology and Public Health, La Trobe University, Victoria, Australia
| |
Collapse
|
3
|
Lescroart MD, Biederman I. Cortical Representation of Medial Axis Structure. Cereb Cortex 2012; 23:629-37. [DOI: 10.1093/cercor/bhs046] [Citation(s) in RCA: 51] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
|
4
|
Biederman I, Cooper EE. Translational and reflectional priming invariance: a retrospective. Perception 2010; 38:809-17. [PMID: 19806959 DOI: 10.1068/pmkbie] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
Abstract
Biederman and Cooper (1991a) showed that the presentation of a briefly presented image of an object at one position in the visual field facilitated its identification, as assessed by naming speed and accuracy, several minutes later. The facilitation was unaffected by a translation or a reflection of the stimulus. A component of this priming was visual rather than basic-level conceptual or lexical in that there was less facilitation for an object with the same name (and basic-level class) but a different shape. The invariance of priming to view variables has stood up well over the years and appears to be a general phenomenon--as long as the original structural description can be readily resolved--in that it has also been observed for variations in size and orientation in depth. Although priming was unaffected by a change in position, we documented that there was explicit memory for the position (and orientation and size) of the stimulus. The existence of two forms of representation from the identical stimulus presentation--one invariant and the other dependent on view variables--poses a challenge as to what can be concluded about view invariance from single-unit activity.
Collapse
Affiliation(s)
- Irving Biederman
- Department of Psychology, University of Southern California, Los Angeles 90089-2520, USA.
| | | |
Collapse
|
5
|
Biederman I, Yue X, Davidoff J. Representation of shape in individuals from a culture with minimal exposure to regular, simple artifacts: sensitivity to nonaccidental versus metric properties. Psychol Sci 2009; 20:1437-42. [PMID: 19883490 DOI: 10.1111/j.1467-9280.2009.02465.x] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022] Open
Abstract
Many of the phenomena underlying shape recognition can be derived from the greater sensitivity to nonaccidental properties of an image (e.g., whether a contour is straight or curved), which are invariant to orientation in depth, than to the metric properties of an image (e.g., a contour's degree of curvature), which can vary with orientation. What enables this sensitivity? One explanation is that it derives from people's immersion in a manufactured world in which simple, regular shapes distinguished by nonaccidental properties abound (e.g., a can, a brick), and toddlers are encouraged to play with toy shape sorters. This report provides evidence against this explanation. The Himba, a seminomadic people living in a remote region of northwestern Namibia where there is little exposure to regular, simple artifacts, were virtually identical to Western observers in their greater sensitivity to nonaccidental properties than to metric properties of simple shapes.
Collapse
Affiliation(s)
- Irving Biederman
- Department of Psychology/Neuroscience, University of Southern California, Los Angeles, CA 90089-2520, USA.
| | | | | |
Collapse
|
6
|
Xu X, Yue X, Lescroart MD, Biederman I, Kim JG. Adaptation in the fusiform face area (FFA): image or person? Vision Res 2009; 49:2800-7. [PMID: 19712692 DOI: 10.1016/j.visres.2009.08.021] [Citation(s) in RCA: 42] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2009] [Revised: 08/19/2009] [Accepted: 08/19/2009] [Indexed: 10/20/2022]
Abstract
Viewing a sequence of faces of two different people results in a greater Blood Oxygenation Level Dependent (BOLD) response in FFA compared to a sequence of identical faces. Changes in identity, however, necessarily involve changes in the image. Is the release from adaptation a result of a change in face identity, per se, or could it be an effect that would arise from any change in the image of a face? Subjects viewed a sequence of two faces that could be of the same or different person, and in the same or different orientation in depth. Critically, the physical similarity of view changes of the same person was scaled, by Gabor-jet differences, to be equivalent to that produced by an identity change. Both person and orientation changes produced equivalent releases from adaptation in FFA (relative to identical faces) suggesting that FFA is sensitive to the physical similarity of faces rather than to the individuals depicted in the images.
Collapse
Affiliation(s)
- Xiaokun Xu
- Department of Psychology, University of Southern California, Los Angeles, CA 90089-2520, USA.
| | | | | | | | | |
Collapse
|
7
|
Kim JG, Biederman I, Lescroart MD, Hayworth KJ. Adaptation to objects in the lateral occipital complex (LOC): shape or semantics? Vision Res 2009; 49:2297-305. [PMID: 19577590 DOI: 10.1016/j.visres.2009.06.020] [Citation(s) in RCA: 48] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2008] [Revised: 06/19/2009] [Accepted: 06/26/2009] [Indexed: 10/20/2022]
Abstract
A change in the basic-level class when viewing a sequence of two objects produces a large release from adaptation in LOC compared to when the images are identical. Is this due to a change in semantics or shape? In an fMRI-adaptation experiment, subjects viewed a sequence of two objects and judged whether the stimuli were identical in shape. Different-shaped stimuli could be from the same or different basic-level classes, where the physical similarities of the pairs in the two conditions were equated by a model of simple cell similarity. BOLD responses in LOC for the two conditions were equivalent, and higher than that of the identical condition, indicating that LOC is sensitive to shape rather than to basic-level semantics.
Collapse
Affiliation(s)
- Jiye G Kim
- University of Southern California, Department of Psychology, 3641 Watt Way, Los Angeles, CA 90089, USA.
| | | | | | | |
Collapse
|
8
|
Hole GJ, George PA, Eaves K, Rasek A. Effects of geometric distortions on face-recognition performance. Perception 2003; 31:1221-40. [PMID: 12430949 DOI: 10.1068/p3252] [Citation(s) in RCA: 102] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
The importance of 'configural' processing for face recognition is now well established, but it remains unclear precisely what it entails. Through four experiments we attempted to clarify the nature of configural processing by investigating the effects of various affine transformations on the recognition of familiar faces. Experiment 1 showed that recognition was markedly impaired by inversion of faces, somewhat impaired by shearing or horizontally stretching them, but unaffected by vertical stretching of faces to twice their normal height. In experiment 2 we investigated vertical and horizontal stretching in more detail, and found no effects of either transformation. Two further experiments were performed to determine whether participants were recognising stretched faces by using configural information. Experiment 3 showed that nonglobal vertical stretching of faces (stretching either the top or the bottom half while leaving the remainder undistorted) impaired recognition, implying that configural information from the stretched part of the face was influencing the process of recognition--ie that configural processing involves global facial properties. In experiment 4 we examined the effects of Gaussian blurring on recognition of undistorted and vertically stretched faces. Faces remained recognisable even when they were both stretched and blurred, implying that participants were basing their judgments on configural information from these stimuli, rather than resorting to some strategy based on local featural details. The tolerance of spatial distortions in human face recognition suggests that the configural information used as a basis for face recognition is unlikely to involve information about the absolute position of facial features relative to each other, at least not in any simple way.
Collapse
Affiliation(s)
- Graham J Hole
- School of Cognitive and Computing Sciences, University of Sussex, Falmer, Brighton, UK
| | | | | | | |
Collapse
|
9
|
Fiser J, Biederman I. Invariance of long-term visual priming to scale, reflection, translation, and hemisphere. Vision Res 2001; 41:221-34. [PMID: 11163856 DOI: 10.1016/s0042-6989(00)00234-0] [Citation(s) in RCA: 41] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
The representation of shape mediating visual object priming was investigated. In two blocks of trials, subjects named images of common objects presented for 185 ms that were bandpass filtered, either at high (10 cpd) or at low (2 cpd) center frequency with a 1.5 octave bandwidth, and positioned either 5 degrees right or left of fixation. The second presentation of an image of a given object type could be filtered at the same or different band, be shown at the same or translated (and mirror reflected) position, and be the same exemplar as that in the first block or a same-name different-shaped exemplar (e.g. a different kind of chair). Second block reaction times (RTs) and error rates were markedly lower than they were on the first block, which, in the context of prior results, was indicative of strong priming. A change of exemplar in the second block resulted in a significant cost in RTs and error rates, indicating that a portion of the priming was visual and not just verbal or basic-level conceptual. However, a change in the spatial frequency (SF) content of the image had no effect on priming despite the dramatic difference it made in appearance of the objects. This invariance to SF changes was also preserved with centrally presented images in a second experiment. Priming was also invariant to a change in left-right position (and mirror orientation) of the image. The invariance over translation of such a large magnitude suggests that the locus of the representation mediating the priming is beyond an area that would be homologous to posterior TEO in the monkey. We conclude that this representation is insensitive to low level image variations (e.g. SF, precise position or orientation of features) that do not alter the basic part-structure of the object. Finally, recognition performance was unaffected by whether low or high bandpassed images were presented either in the left or right visual field, giving no support to the hypothesis of hemispheric differences in processing low and high spatial frequencies.
Collapse
Affiliation(s)
- J Fiser
- Department of Brain and Cognitive Sciences, Center for Visual Sciences, University of Rochester, Rochester, NY 14627, USA.
| | | |
Collapse
|
10
|
|
11
|
Abstract
Evidence from neurophysiological and psychological studies is coming together to shed light on how we represent and recognize objects. This review describes evidence supporting two major hypotheses: the first is that objects are represented in a mosaic-like form in which objects are encoded by combinations of complex, reusable features, rather than two-dimensional templates, or three-dimensional models. The second hypothesis is that transform-invariant representations of objects are learnt through experience, and that this learning is affected by the temporal sequence in which different views of the objects are seen, as well as by their physical appearance.
Collapse
|
12
|
Treisman AM, Kanwisher NG. Perceiving visually presented objects: recognition, awareness, and modularity. Curr Opin Neurobiol 1998; 8:218-26. [PMID: 9635205 DOI: 10.1016/s0959-4388(98)80143-8] [Citation(s) in RCA: 114] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
Object perception may involve seeing, recognition, preparation of actions, and emotional responses--functions that human brain imaging and neuropsychology suggest are localized separately. Perhaps because of this specialization, object perception is remarkably rapid and efficient. Representations of componential structure and interpolation from view-dependent images both play a part in object recognition. Unattended objects may be implicitly registered, but recent experiments suggest that attention is required to bind features, to represent three-dimensional structure, and to mediate awareness.
Collapse
Affiliation(s)
- A M Treisman
- Department of Psychology, Princeton University, New Jersey 08544-1010, USA.
| | | |
Collapse
|
13
|
Biederman I, Kalocsai P. Neurocomputational bases of object and face recognition. Philos Trans R Soc Lond B Biol Sci 1997; 352:1203-19. [PMID: 9304687 PMCID: PMC1692012 DOI: 10.1098/rstb.1997.0103] [Citation(s) in RCA: 151] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023] Open
Abstract
A number of behavioural phenomena distinguish the recognition of faces and objects, even when members of a set of objects are highly similar. Because faces have the same parts in approximately the same relations, individuation of faces typically requires specification of the metric variation in a holistic and integral representation of the facial surface. The direct mapping of a hypercolumn-like pattern of activation onto a representation layer that preserves relative spatial filter values in a two-dimensional (2D) coordinate space, as proposed by C. von der Malsburg and his associates, may account for many of the phenomena associated with face recognition. An additional refinement, in which each column of filters (termed a 'jet') is centred on a particular facial feature (or fiducial point), allows selectivity of the input into the holistic representation to avoid incorporation of occluding or nearby surfaces. The initial hypercolumn representation also characterizes the first stage of object perception, but the image variation for objects at a given location in a 2D coordinate space may be too great to yield sufficient predictability directly from the output of spatial kernels. Consequently, objects can be represented by a structural description specifying qualitative (typically, non-accidental) characterizations of an object's parts, the attributes of the parts, and the relations among the parts, largely based on orientation and depth discontinuities (as shown by Hummel & Biederman). A series of experiments on the name priming or physical matching of complementary images (in the Fourier domain) of objects and faces documents that whereas face recognition is strongly dependent on the original spatial filter values, evidence from object recognition indicates strong invariance to these values, even when distinguishing among objects that are as similar as faces.
Collapse
Affiliation(s)
- I Biederman
- University of Southern California, Department of Psychology and Neuroscience Program, Los Angeles 90089-2520, USA
| | | |
Collapse
|