1
|
Gur M. Seeing on the fly: Physiological and behavioral evidence show that space-to-space representation and processing enable fast and efficient performance by the visual system. J Vis 2024; 24:11. [PMID: 39392446 PMCID: PMC11472890 DOI: 10.1167/jov.24.11.11] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2024] [Accepted: 08/16/2024] [Indexed: 10/12/2024] Open
Abstract
When we view the world, our eyes saccade quickly between points of interest. Even when fixating a target our eyes are not completely at rest but execute small fixational eye movements (FEMs). That vision is not blurred despite this ever-present jitter has seemingly motivated an increasingly popular theory denying the reliance of the visual system on pure spatial processing in favor of a space-to-time mechanism generated by the eye drifting across the image. Accordingly, FEMs are not detrimental but rather essential to good visibility. However, the space-to-time theory is incompatible with physiological data showing that all information is conveyed by the short neural volleys generated when the eyes land on a target, and with our faithful perception of briefly displayed objects, during which time FEMs have no effect. Another difficulty in rejecting the idea of image representation by the locations and nature of responding cells in favor of a timecode, is that somewhere, somehow, this code must be decoded into a parallel spatial one when reaching perception. Thus, in addition to the implausibility of generating meaningful responses during retinal drift, the space-to-time hypothesis calls for replacing efficient point-to-point parallel transmission with a cumbersome, delayed, space-to-time-to-space process. A novel physiological framework is presented here wherein the ability of the visual system to quickly process information is mediated by the short, powerful neural volleys generated by the landing saccades. These volleys are necessary and sufficient for normal perception without FEMs contribution. This mechanism enables our excellent perception of brief stimuli and explains that visibility is not blurred by FEMs because they do not generate useful information.
Collapse
Affiliation(s)
- Moshe Gur
- Department of Biomedical Engineering, Technion-Israel Institute of Technology Haifa, Israel
| |
Collapse
|
2
|
Burchfield T, Greene E. Evaluating spatiotemporal integration of shape cues. PLoS One 2020; 15:e0224530. [PMID: 32433696 PMCID: PMC7239445 DOI: 10.1371/journal.pone.0224530] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2019] [Accepted: 05/02/2020] [Indexed: 11/19/2022] Open
Abstract
Prior work has shown that humans can successfully identify letters that are constructed with a sparse array of dots, wherein the dot pattern reflects the strokes that would normally be used to fashion a given letter. In the present work the dots were briefly displayed, one at a time in sequence, varying the spatial order in which they were shown. A forward sequence was spatially ordered as though one were passing a stroke across the dots to connect them. Experiments compared this baseline condition to the following three conditions: a) the dot sequence was spatially ordered, but in the reverse direction from how letter strokes might normally be written; b) the dots in each stroke of the letter were displayed in a random order; c) the sequence of displayed dots were chosen for display from any location in the letter. Significant differences were found between the baseline condition and all three of the comparison conditions, with letter recognition being far worse for the random conditions than for conditions that provided consistent spatial ordering of dot sequences. These findings show that spatial order is critical for integration of shape cues that have been sequentially displayed.
Collapse
Affiliation(s)
- Taylor Burchfield
- Laboratory for Neurometric Research, Department of Psychology, University of Southern California, Los Angeles, California, United States of America
| | - Ernest Greene
- Laboratory for Neurometric Research, Department of Psychology, University of Southern California, Los Angeles, California, United States of America
- * E-mail:
| |
Collapse
|
3
|
Greene E. Comparing methods for scaling shape similarity. AIMS Neurosci 2019; 6:54-59. [PMID: 32341968 PMCID: PMC7179340 DOI: 10.3934/neuroscience.2019.2.54] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2019] [Accepted: 04/24/2019] [Indexed: 11/18/2022] Open
Affiliation(s)
- Ernest Greene
- Laboratory for Neurometric Research, Department of Psychology, University of Southern California, Los Angeles, California, USA
| |
Collapse
|
4
|
Greene E. New encoding concepts for shape recognition are needed. AIMS Neurosci 2018; 5:162-178. [PMID: 32341959 PMCID: PMC7179345 DOI: 10.3934/neuroscience.2018.3.162] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2018] [Accepted: 02/26/2018] [Indexed: 11/18/2022] Open
Abstract
Models designed to explain how shapes are perceived and stored by the nervous system commonly emphasize encoding of contour features, especially orientation, curvature, and linear extent. A number of experiments from my laboratory provide evidence that contours deliver a multitude of location markers, and shapes can be identified when relatively few of the markers are displayed. The emphasis on filtering for orientation and other contour features has directed attention away from full and effective examination of how the location information is registered and used for summarizing shapes. Neural network (connectionist) models try to deal with location information by modifying linkage among neuronal populations through training trials. Connections that are initially diffuse and not useful in achieving recognition get eliminated or changed in strength, resulting in selective response to a given shape. But results from my laboratory, reviewed here, demonstrate that unknown shapes that are displayed only once can be identified using a matching task. These findings show that our visual system can immediately encode shape information with no requirement for training trials. This encoding might be accomplished by neuronal circuits in the retina.
Collapse
Affiliation(s)
- Ernest Greene
- Laboratory for Neurometric Research, Department of Psychology, University of Southern California, Los Angeles, California, USA
| |
Collapse
|
5
|
Nordberg H, Hautus MJ, Greene E. Visual encoding of partial unknown shape boundaries. AIMS Neurosci 2018; 5:132-147. [PMID: 32341957 PMCID: PMC7181889 DOI: 10.3934/neuroscience.2018.2.132] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2018] [Accepted: 05/10/2018] [Indexed: 12/21/2022] Open
Abstract
Prior research has found that known shapes and letters can be recognized from a sparse sampling of dots that mark locations on their boundaries. Further, unknown shapes that are displayed only once can be identified by a matching protocol, and here also, above-chance performance requires very few boundary markers. The present work examines whether partial boundaries can be identified under similar low-information conditions. Several experiments were conducted that used a match-recognition task, with initial display of a target shape followed quickly by a comparison shape. The comparison shape was either derived from the target shape or was based on a different shape, and the respondent was asked for a matching judgment, i.e., did it "match" the target shape. Stimulus treatments included establishing how density affected the probability of a correct decision, followed by assessment of how much positioning of boundary dots affected this probability. Results indicate that correct judgments were possible when partial boundaries were displayed with a sparse sampling of dots. We argue for a process that quickly registers the locations of boundary markers and distills that information into a shape summary that can be used to identify the shape even when only a portion of the boundary is represented.
Collapse
Affiliation(s)
- Hannah Nordberg
- Department of Psychology, University of Southern California, Los Angeles, California USA
| | - Michael J Hautus
- The School of Psychology, University of Auckland, Auckland New Zealand, California USA
| | - Ernest Greene
- Department of Psychology, University of Southern California, Los Angeles, California USA
| |
Collapse
|
6
|
Greene E, Hautus MJ. Evaluating persistence of shape information using a matching protocol. AIMS Neurosci 2018; 5:81-96. [PMID: 32341953 PMCID: PMC7181892 DOI: 10.3934/neuroscience.2018.1.81] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2017] [Accepted: 01/09/2018] [Indexed: 12/02/2022] Open
Abstract
Many laboratories have studied persistence of shape information, the goal being to better understand how the visual system mediates recognition of objects. Most have asked for recognition of known shapes, e.g., letters of the alphabet, or recall from an array. Recognition of known shapes requires access to long-term memory, so it is not possible to know whether the experiment is assessing short-term encoding and working memory mechanisms, or has encountered limitations on retrieval from memory stores. Here we have used an inventory of unknown shapes, wherein a string of discrete dots forms the boundary of each shape. Each was displayed as a target only once to a given respondent, with recognition being tested using a matching task. Analysis based on signal detection theory was used to provide an unbiased estimate of the probability of correct decisions about whether comparison shapes matched target shapes. Four experiments were conducted, which found the following: a) Shapes were identified with a high probability of being correct with dot densities ranging from 20% to 4%. Performance dropped only about 10% across this density range. b) Shape identification levels remained very high with up to 500 milliseconds of target and comparison shape separation. c) With one-at-a-time display of target dots, varying the total time for a given display, the proportion of correct decisions dropped only about 10% even with a total display time of 500 milliseconds. d) With display of two complementary target subsets, also varying the total time of each display, there was a dramatic decline of proportion correct that reached chance levels by 500 milliseconds. The greater rate of decline for the two-pulse condition may be due to a mechanism that registers when the number of dots is sufficient to create a shape summary. Once a summary is produced, the temporal window that allows shape information to be added may be more limited.
Collapse
Affiliation(s)
- Ernest Greene
- Department of Psychology, University of Southern California, Los Angeles, USA
| | - Michael J Hautus
- School of Psychology, University of Auckland, Auckland, New Zealand
| |
Collapse
|
7
|
Gur M. Space reconstruction by primary visual cortex activity: a parallel, non-computational mechanism of object representation. Trends Neurosci 2015; 38:207-16. [DOI: 10.1016/j.tins.2015.02.005] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2015] [Revised: 02/12/2015] [Accepted: 02/13/2015] [Indexed: 11/27/2022]
|
8
|
Greene E, Ogden RT. Shapes displayed with durations in the microsecond range do not obey Bloch's law of temporal summation. Iperception 2013; 4:429-36. [PMID: 24349700 PMCID: PMC3859558 DOI: 10.1068/i0602] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2013] [Revised: 08/02/2013] [Indexed: 10/30/2022] Open
Abstract
Shape patterns were displayed with simultaneous brief flashes from a light-emitting diode array. Flash durations in the microsecond range and luminous intensities were adjusted to vary the degree of successful shape recognition. Four experiments were conducted to test whether Bloch's law would apply in this task. Bloch's law holds that for very brief flashes the perceptual threshold is determined by the total number of photons being delivered, i.e., there is reciprocity of intensity and duration. The present results did not find that effectiveness of flashes was based on the total quantity of photons, as predicted by Bloch's law. Additionally, the evidence points to a visual mechanism that has ultra-high temporal precision that either registers the rate of photon flux or the duration of flashes.
Collapse
Affiliation(s)
- Ernest Greene
- Laboratory for Neurometric Research, Department of Psychology, University of Southern California, Los Angeles, CA, USA; e-mail:
| | - R Todd Ogden
- Department of Biostatistics, Columbia University, New York, NY, USA; e-mail:
| |
Collapse
|
9
|
Greene E, Ogden RT. Evaluating the contribution of shape attributes to recognition using the minimal transient discrete cue protocol. Behav Brain Funct 2012; 8:53. [PMID: 23146718 PMCID: PMC3537607 DOI: 10.1186/1744-9081-8-53] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2012] [Accepted: 11/06/2012] [Indexed: 11/10/2022] Open
Abstract
Subjects were tested for their ability to identify objects that were represented by an array of dots that marked the major contours, usually only the outer boundary. Each dot was briefly flashed to make its position known, and a major variable was the time interval that was required to flash all the dots for a given shape. Recognition declined as the total time for display of the dot inventory was increased. Each shape was shown to a given subject only once and it was either recognized -- named - or not. Although the recorded response was binary, a large number of subjects was tested, which made it possible to derive regression functions and thus specify an intercept and slope for each shape. Shapes differed substantially in their slopes, which is likely due to the amount of redundant information provided by neighboring dots. Indices of shape attributes were also derived, specifically Attneave's indices of complexity, mean curvature, inflection count, and symmetry. Three of the four shape attributes were significantly related to intercept and slope levels, but none made a substantial contribution. This suggests that these attributes are not essential properties that define shapes and allow for recognition.
Collapse
Affiliation(s)
- Ernest Greene
- Department of Psychology, Laboratory for Neurometric Research, University of Southern California, Los Angeles, CA 90089-1061, USA.
| | | |
Collapse
|
10
|
Early category-specific cortical activation revealed by visual stimulus inversion. PLoS One 2008; 3:e3503. [PMID: 18946504 PMCID: PMC2566817 DOI: 10.1371/journal.pone.0003503] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2008] [Accepted: 09/12/2008] [Indexed: 11/19/2022] Open
Abstract
Visual categorization may already start within the first 100-ms after stimulus onset, in contrast with the long-held view that during this early stage all complex stimuli are processed equally and that category-specific cortical activation occurs only at later stages. The neural basis of this proposed early stage of high-level analysis is however poorly understood. To address this question we used magnetoencephalography and anatomically-constrained distributed source modeling to monitor brain activity with millisecond-resolution while subjects performed an orientation task on the upright and upside-down presented images of three different stimulus categories: faces, houses and bodies. Significant inversion effects were found for all three stimulus categories between 70-100-ms after picture onset with a highly category-specific cortical distribution. Differential responses between upright and inverted faces were found in well-established face-selective areas of the inferior occipital cortex and right fusiform gyrus. In addition, early category-specific inversion effects were found well beyond visual areas. Our results provide the first direct evidence that category-specific processing in high-level category-sensitive cortical areas already takes place within the first 100-ms of visual processing, significantly earlier than previously thought, and suggests the existence of fast category-specific neocortical routes in the human brain.
Collapse
|
11
|
Beer AL, Heckel AH, Greenlee MW. A motion illusion reveals mechanisms of perceptual stabilization. PLoS One 2008; 3:e2741. [PMID: 18648651 PMCID: PMC2453321 DOI: 10.1371/journal.pone.0002741] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2008] [Accepted: 06/20/2008] [Indexed: 11/19/2022] Open
Abstract
Visual illusions are valuable tools for the scientific examination of the mechanisms underlying perception. In the peripheral drift illusion special drift patterns appear to move although they are static. During fixation small involuntary eye movements generate retinal image slips which need to be suppressed for stable perception. Here we show that the peripheral drift illusion reveals the mechanisms of perceptual stabilization associated with these micromovements. In a series of experiments we found that illusory motion was only observed in the peripheral visual field. The strength of illusory motion varied with the degree of micromovements. However, drift patterns presented in the central (but not the peripheral) visual field modulated the strength of illusory peripheral motion. Moreover, although central drift patterns were not perceived as moving, they elicited illusory motion of neutral peripheral patterns. Central drift patterns modulated illusory peripheral motion even when micromovements remained constant. Interestingly, perceptual stabilization was only affected by static drift patterns, but not by real motion signals. Our findings suggest that perceptual instabilities caused by fixational eye movements are corrected by a mechanism that relies on visual rather than extraretinal (proprioceptive or motor) signals, and that drift patterns systematically bias this compensatory mechanism. These mechanisms may be revealed by utilizing static visual patterns that give rise to the peripheral drift illusion, but remain undetected with other patterns. Accordingly, the peripheral drift illusion is of unique value for examining processes of perceptual stabilization.
Collapse
Affiliation(s)
- Anton L Beer
- Institut für Psychologie, Universität Regensburg, Regensburg, Germany.
| | | | | |
Collapse
|
12
|
Greene E. Additional evidence that contour attributes are not essential cues for object recognition. Behav Brain Funct 2008; 4:26. [PMID: 18593469 PMCID: PMC2467424 DOI: 10.1186/1744-9081-4-26] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2007] [Accepted: 07/01/2008] [Indexed: 11/12/2022] Open
Abstract
It is believed that certain contour attributes, specifically orientation, curvature and linear extent, provide essential cues for object (shape) recognition. The present experiment examined this hypothesis by comparing stimulus conditions that differentially provided such cues. A spaced array of dots was used to mark the outside boundary of namable objects, and subsets were chosen that contained either contiguous strings of dots or randomly positioned dots. These subsets were briefly and successively displayed using an MTDC information persistence paradigm. Across the major range of temporal separation of the subsets, it was found that contiguity of boundary dots did not provide more effective shape recognition cues. This is at odds with the concept that encoding and recognition of shapes is predicated on the encoding of contour attributes such as orientation, curvature and linear extent.
Collapse
Affiliation(s)
- Ernest Greene
- Laboratory for Neurometric Research, Department of Psychology, University of Southern California, USA.
| |
Collapse
|