151
|
Simulating the cortical 3D visuomotor transformation of reach depth. PLoS One 2012; 7:e41241. [PMID: 22815979 PMCID: PMC3397995 DOI: 10.1371/journal.pone.0041241] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2011] [Accepted: 06/22/2012] [Indexed: 11/22/2022] Open
Abstract
We effortlessly perform reach movements to objects in different directions and depths. However, how networks of cortical neurons compute reach depth from binocular visual inputs remains largely unknown. To bridge the gap between behavior and neurophysiology, we trained a feed-forward artificial neural network to uncover potential mechanisms that might underlie the 3D transformation of reach depth. Our physiologically-inspired 4-layer network receives distributed 3D visual inputs (1st layer) along with eye, head and vergence signals. The desired motor plan was coded in a population (3rd layer) that we read out (4th layer) using an optimal linear estimator. After training, our network was able to reproduce all known single-unit recording evidence on depth coding in the parietal cortex. Network analyses predict the presence of eye/head and vergence changes of depth tuning, pointing towards a gain-modulation mechanism of depth transformation. In addition, reach depth was computed directly from eye-centered (relative) visual distances, without explicit absolute depth coding. We suggest that these effects should be observable in parietal and pre-motor areas.
Collapse
|
152
|
Vidal-Naquet M, Gepshtein S. Spatially invariant computations in stereoscopic vision. Front Comput Neurosci 2012; 6:47. [PMID: 22811665 PMCID: PMC3397313 DOI: 10.3389/fncom.2012.00047] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2011] [Accepted: 06/26/2012] [Indexed: 11/13/2022] Open
Abstract
PERCEPTION OF STEREOSCOPIC DEPTH REQUIRES THAT VISUAL SYSTEMS SOLVE A CORRESPONDENCE PROBLEM: find parts of the left-eye view of the visual scene that correspond to parts of the right-eye view. The standard model of binocular matching implies that similarity of left and right images is computed by inter-ocular correlation. But the left and right images of the same object are normally distorted relative to one another by the binocular projection, in particular when slanted surfaces are viewed from close distance. Correlation often fails to detect correct correspondences between such image parts. We investigate a measure of inter-ocular similarity that takes advantage of spatially invariant computations similar to the computations performed by complex cells in biological visual systems. This measure tolerates distortions of corresponding image parts and yields excellent performance over a much larger range of surface slants than the standard model. The results suggest that, rather than serving as disparity detectors, multiple binocular complex cells take part in the computation of inter-ocular similarity, and that visual systems are likely to postpone commitment to particular binocular disparities until later stages in the visual process.
Collapse
|
153
|
PAUWELS KARL, VAN HULLE MARCM. HEAD-CENTRIC DISPARITY AND EPIPOLAR GEOMETRY ESTIMATION FROM A POPULATION OF BINOCULAR ENERGY NEURONS. Int J Neural Syst 2012; 22:1250007. [DOI: 10.1142/s0129065712500074] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
We present a hybrid neural network architecture that supports the estimation of binocular disparity in a cyclopean, head-centric coordinate system without explicitly establishing retinal correspondences. Instead the responses of binocular energy neurons are gain-modulated by oculomotor signals. The network can handle the full six degrees of freedom of binocular gaze and operates directly on image pairs of possibly varying contrast. Furthermore, we show that in the absence of an oculomotor signal the same architecture is capable of estimating the epipolar geometry directly from the population response. The increased complexity of the scenarios considered in this work provides an important step towards the application of computational models centered on gain modulation mechanisms in real-world robotic applications. The proposed network is shown to outperform a standard computer vision technique on a disparity estimation task involving real-world stereo images.
Collapse
Affiliation(s)
- KARL PAUWELS
- Computer Architecture and Technology Department, University of Granada, Calle Periodista Daniel Saucedo, s/n, 18071 Granada, Spain
| | - MARC M. VAN HULLE
- Laboratorium voor Neuro-en Psychofysiologie, K.U. Leuven, Campus Gasthuisberg, O&N II Herestraat 49–Bus 1021, 3000 Leuven, Belgium
| |
Collapse
|
154
|
Ninomiya T, Sanada TM, Ohzawa I. Contributions of excitation and suppression in shaping spatial frequency selectivity of V1 neurons as revealed by binocular measurements. J Neurophysiol 2012; 107:2220-31. [DOI: 10.1152/jn.00832.2010] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
Neurons in the early visual cortex are generally highly sensitive to stimuli presented to the two eyes. However, the majority of studies on spatial and temporal aspects of neural responses were based on monocular measurements. To study neurons under more natural, i.e., binocular, conditions, we presented sinusoidal gratings of a variety of spatial frequencies (SF) dichoptically in rapid sequential flashes and analyzed the data using a binocular reverse correlation technique for neurons in cat area 17. The resulting set of data represents a frequency-domain binocular receptive field from which detailed selectivities, both monocular and binocular, could be obtained. Consistent with previous studies, the responses could generally be explained by linear summation of inputs from the two eyes. Suppressive responses were also observed and were delayed typically by 5–15 ms relative to excitatory responses. However, we have found more diverse nature of suppressive responses than those reported previously. The optimal suppressive frequency could be either higher or lower than that of the excitatory responses. The bandwidth of SF tuning of the suppressive responses was usually broader than that of the excitatory responses. Cells with lower optimal SFs for suppression tended to show high optimal SFs and sharp tuning curves. The dynamic shift of optimal SF from low to high SF was accompanied by suppression with earlier onset and higher peak SF or later onset and lower peak SF than excitation. These results suggest that the suppression plays an essential role in generating the temporal dynamics of SF selectivity.
Collapse
Affiliation(s)
| | - Takahisa M. Sanada
- Graduate School of Engineering Science, Osaka University, Machikaneyama, Toyonaka
| | - Izumi Ohzawa
- Graduate School of Frontier Biosciences and
- Graduate School of Engineering Science, Osaka University, Machikaneyama, Toyonaka
- Core Research for Evolutional Science and Technology (CREST), Japan Science and Technology Agency, Tokyo; and
- Center for Information and Neural Networks (CiNet), Yamadaoka, Suita, Osaka, Japan
| |
Collapse
|
155
|
Chinellato E, Grzyb BJ, del Pobil AP. Pose estimation through cue integration: a neuroscience-inspired approach. IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS. PART B, CYBERNETICS : A PUBLICATION OF THE IEEE SYSTEMS, MAN, AND CYBERNETICS SOCIETY 2012; 42:530-538. [PMID: 22027389 DOI: 10.1109/tsmcb.2011.2168952] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
The aim of this paper is to improve the skills of robotic systems in their interaction with nearby objects. The basic idea is to enhance visual estimation of objects in the world through the merging of different visual estimators of the same stimuli. A neuroscience-inspired model of stereoptic and perspective orientation estimators, merged according to different criteria, is implemented on a robotic setup and tested in different conditions. Experimental results suggest that the integration of multiple monocular and binocular cues can make robot sensory systems more reliable and versatile. The same results, compared with simulations and data from human studies, show that the model is able to reproduce some well-recognized neuropsychological effects.
Collapse
Affiliation(s)
- Eris Chinellato
- Robotic Intelligence Laboratory, Jaume I University, 12071 Castellón de la Plana, Spain.
| | | | | |
Collapse
|
156
|
Cottereau BR, McKee SP, Norcia AM. Bridging the gap: global disparity processing in the human visual cortex. J Neurophysiol 2012; 107:2421-9. [PMID: 22323636 DOI: 10.1152/jn.01051.2011] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
The human stereoscopic system is remarkable in its ability to utilize widely separated features as references to support fine depth discrimination. In a search for possible neural substrates of this ability, we recorded high-density EEG and used a distributed inverse technique to estimate population-level disparity responses in five regions of interest (ROIs): V1, V3A, hMT+, V4, and lateral occipital complex (LOC). The stimulus was a central modulating disk surrounded by a correlated "reference" annulus presented in the fixation plane. We varied a gap separating the disk from the annulus parametrically from 0 to 5.5° as a test of long-range disparity integration. In the V1, LOC, and hMT+ ROIs, the responses with gaps >0.5° were equal to those obtained in a control condition where the surround was composed of uncorrelated noise (no reference). By contrast, in the V4 and V3A ROIs, responses with gaps as large as 5.5° were still significantly higher than the control. As a test of the spatial distribution of the disparity reference information, we manipulated the properties of the stimulus by placing noise between the center and the surround or throughout the surround. The V3A ROI was particularly sensitive to disparity noise between the center and annulus regions, suggesting an important contribution of disparity edge detectors in this ROI.
Collapse
Affiliation(s)
- Benoit R Cottereau
- Department of Psychology, Jordan Hall, Bldg. 01-420, Stanford University, 450 Serra Mall, Stanford, CA 94305, USA.
| | | | | |
Collapse
|
157
|
Theys T, Srivastava S, van Loon J, Goffin J, Janssen P. Selectivity for three-dimensional contours and surfaces in the anterior intraparietal area. J Neurophysiol 2012; 107:995-1008. [DOI: 10.1152/jn.00248.2011] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
The macaque anterior intraparietal area (AIP) is crucial for visually guided grasping. AIP neurons respond during the visual presentation of real-world objects and encode the depth profile of disparity-defined curved surfaces. We investigated the neural representation of curved surfaces in AIP using a stimulus-reduction approach. The stimuli consisted of three-dimensional (3-D) shapes curved along the horizontal axis, the vertical axis, or both the horizontal and the vertical axes of the shape. The depth profile was defined solely by binocular disparity that varied along either the boundary or the surface of the shape or along both the boundary and the surface of the shape. The majority of AIP neurons were selective for curved boundaries along the horizontal or the vertical axis, and neural selectivity emerged at short latencies. Stimuli in which disparity varied only along the surface of the shape (with zero disparity on the boundaries) evoked selectivity in a smaller proportion of AIP neurons and at considerably longer latencies. AIP neurons were not selective for 3-D surfaces composed of anticorrelated disparities. Thus the neural selectivity for object depth profile in AIP is present when only the boundary is curved in depth, but not for disparity in anticorrelated stereograms.
Collapse
Affiliation(s)
- Tom Theys
- Laboratorium voor Neuro- en Psychofysiologie and
- Afdeling Experimentele Neurochirurgie en Neuroanatomie, Katholieke Universiteit Leuven, Leuven, Belgium
| | | | - Johannes van Loon
- Afdeling Experimentele Neurochirurgie en Neuroanatomie, Katholieke Universiteit Leuven, Leuven, Belgium
| | - Jan Goffin
- Afdeling Experimentele Neurochirurgie en Neuroanatomie, Katholieke Universiteit Leuven, Leuven, Belgium
| | | |
Collapse
|
158
|
Abstract
One of the main functions of vision is to estimate the 3D shape of objects in our environment. Many different visual cues, such as stereopsis, motion parallax, and shading, are thought to be involved. One important cue that remains poorly understood comes from surface texture markings. When a textured surface is slanted in 3D relative to the observer, the surface patterns appear compressed in the retinal image, providing potentially important information about 3D shape. What is not known, however, is how the brain actually measures this information from the retinal image. Here, we explain how the key information could be extracted by populations of cells tuned to different orientations and spatial frequencies, like those found in the primary visual cortex. To test this theory, we created stimuli that selectively stimulate such cell populations, by "smearing" (filtering) images of 2D random noise into specific oriented patterns. We find that the resulting patterns appear vividly 3D, and that increasing the strength of the orientation signals progressively increases the sense of 3D shape, even though the filtering we apply is physically inconsistent with what would occur with a real object. This finding suggests we have isolated key mechanisms used by the brain to estimate shape from texture. Crucially, we also find that adapting the visual system's orientation detectors to orthogonal patterns causes unoriented random noise to look like a specific 3D shape. Together these findings demonstrate a crucial role of orientation detectors in the perception of 3D shape.
Collapse
|
159
|
Dieter KC, Tadin D. Understanding attentional modulation of binocular rivalry: a framework based on biased competition. Front Hum Neurosci 2011; 5:155. [PMID: 22144958 PMCID: PMC3228993 DOI: 10.3389/fnhum.2011.00155] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2011] [Accepted: 11/14/2011] [Indexed: 11/25/2022] Open
Abstract
Starting from early scientific explorations of binocular rivalry, researchers have wondered about the degree to which an observer can exert voluntary attentional control over rivalry dynamics. The answer to this question would not only reveal the extent to which we may determine our own conscious visual experience, but also advance our understanding of the neural mechanisms underlying binocular rivalry. Classic studies, intriguingly, reached contradictory conclusions, ranging from an absence of attentional control, as advocated by Breese, to nearly complete control of rivalry dynamics, as reported by Helmholtz. Recent investigations have revisited this question, but the results have continued to echo the conflicting findings of earlier studies, seemingly precluding a comprehensive understanding of attentional effects on rivalry. Here, we review both classic and modern studies, and propose a unifying framework derived from the biased competition theory of attention. The key assumption of this theory is that the nature of stimulus conflict determines the limits of attentional modulation. For example, a condition in which unresolved stimulus conflict transpires through many levels of visual processing should be very susceptible to attentional control. When applied to binocular rivalry, this framework predicts strong attentional modulations under conditions of unresolved stimulus conflict (e.g., initial selection) and conditions where conflict is resolved at higher levels of visual processing (e.g., stimulus rivalry). Additionally, the efficacy of attentional control over rivalry can be increased by utilization of demanding, behaviorally relevant tasks, and likely through perceptual training paradigms. We show that this framework can help facilitate the understanding and synthesis of a diverse set of results on attentional control over rivalry, and we propose several directions for future research on this interesting topic.
Collapse
Affiliation(s)
- Kevin Conrad Dieter
- Department of Brain and Cognitive Sciences, University of Rochester Rochester, NY, USA
| | | |
Collapse
|
160
|
Crawford JD, Henriques DYP, Medendorp WP. Three-dimensional transformations for goal-directed action. Annu Rev Neurosci 2011; 34:309-31. [PMID: 21456958 DOI: 10.1146/annurev-neuro-061010-113749] [Citation(s) in RCA: 124] [Impact Index Per Article: 8.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Much of the central nervous system is involved in visuomotor transformations for goal-directed gaze and reach movements. These transformations are often described in terms of stimulus location, gaze fixation, and reach endpoints, as viewed through the lens of translational geometry. Here, we argue that the intrinsic (primarily rotational) 3-D geometry of the eye-head-reach systems determines the spatial relationship between extrinsic goals and effector commands, and therefore the required transformations. This approach provides a common theoretical framework for understanding both gaze and reach control. Combined with an assessment of the behavioral, neurophysiological, imaging, and neuropsychological literature, this framework leads us to conclude that (a) the internal representation and updating of visual goals are dominated by gaze-centered mechanisms, but (b) these representations must then be transformed as a function of eye and head orientation signals into effector-specific 3-D movement commands.
Collapse
Affiliation(s)
- J Douglas Crawford
- York Centre for Vision Research, Canadian Action and Perception Network, and Departments of Psychology, Toronto, Ontario, Canada, M3J 1P3.
| | | | | |
Collapse
|
161
|
Liu Y, Cormack LK, Bovik AC. Statistical modeling of 3-D natural scenes with application to Bayesian stereopsis. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2011; 20:2515-2530. [PMID: 21342845 DOI: 10.1109/tip.2011.2118223] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/30/2023]
Abstract
We studied the empirical distributions of luminance, range and disparity wavelet coefficients using a coregistered database of luminance and range images. The marginal distributions of range and disparity are observed to have high peaks and heavy tails, similar to the well-known properties of luminance wavelet coefficients. However, we found that the kurtosis of range and disparity coefficients is significantly larger than that of luminance coefficients. We used generalized Gaussian models to fit the empirical marginal distributions. We found that the marginal distribution of luminance coefficients have a shape parameter p between 0.6 and 0.8, while range and disparity coefficients have much smaller parameters p < 0.32, corresponding to a much higher peak. We also examined the conditional distributions of luminance, range and disparity coefficients. The magnitudes of luminance and range (disparity) coefficients show a clear positive correlation, which means, at a location with larger luminance variation, there is a higher probability of a larger range (disparity) variation. We also used generalized Gaussians to model the conditional distributions of luminance and range (disparity) coefficients. The values of the two shape parameters (p,s) reflect the observed luminance-range (disparity) dependency. As an example of the usefulness of luminance statistics conditioned on range statistics, we modified a well-known Bayesian stereo ranging algorithm using our natural scene statistics models, which improved its performance.
Collapse
Affiliation(s)
- Yang Liu
- Center for Perceptual Systems and the Department of Electrical and Computer Engineering, University of Texas at Austin, Austin, TX 78712, USA.
| | | | | |
Collapse
|
162
|
Visuomotor coordination is different for different directions in three-dimensional space. J Neurosci 2011; 31:7857-66. [PMID: 21613499 DOI: 10.1523/jneurosci.0486-11.2011] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
In most visuomotor tasks in which subjects have to reach to visual targets or move the hand along a particular trajectory, eye movements have been shown to lead hand movements. Because the dynamics of vergence eye movements is different from that of smooth pursuit and saccades, we have investigated the lead time of gaze relative to the hand for the depth component (vergence) and in the frontal plane (smooth pursuit and saccades) in a tracking task and in a tracing task in which human subjects were instructed to move the finger along a 3D path. For tracking, gaze leads finger position on average by 28 ± 6 ms (mean ± SE) for the components in the frontal plane but lags finger position by 95 ± 39 ms for the depth dimension. For tracing, gaze leads finger position by 151 ± 36 ms for the depth dimension. For the frontal plane, the mean lead time of gaze relative to the hand is 287 ± 13 ms. However, we found that the lead time in the frontal plane was inversely related to the tangential velocity of finger. This inverse relation for movements in the frontal plane could be explained by assuming that gaze leads the finger by a constant distance of ∼ 2.6 cm (range of 1.5-3.6 cm across subjects).
Collapse
|
163
|
Anzai A, Chowdhury SA, DeAngelis GC. Coding of stereoscopic depth information in visual areas V3 and V3A. J Neurosci 2011; 31:10270-82. [PMID: 21753004 PMCID: PMC3143190 DOI: 10.1523/jneurosci.5956-10.2011] [Citation(s) in RCA: 69] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2010] [Revised: 05/23/2011] [Accepted: 05/26/2011] [Indexed: 11/21/2022] Open
Abstract
The process of stereoscopic depth perception is thought to begin with the analysis of absolute binocular disparity, the difference in position of corresponding features in the left and right eye images with respect to the points of fixation. Our sensitivity to depth, however, is greater when depth judgments are based on relative disparity, the difference between two absolute disparities, compared to when they are based on absolute disparity. Therefore, the visual system is thought to compute relative disparities for fine depth discrimination. Functional magnetic resonance imaging studies in humans and monkeys have suggested that visual areas V3 and V3A may be specialized for stereoscopic depth processing based on relative disparities. In this study, we measured absolute and relative disparity-tuning of neurons in V3 and V3A of alert fixating monkeys, and we compared their basic tuning properties with those published previously for other visual areas. We found that neurons in V3 and V3A predominantly encode absolute, not relative, disparities. We also found that basic parameters of disparity-tuning in V3 and V3A are similar to those from other extrastriate visual areas. Finally, by comparing single-unit activity with multi-unit activity measured at the same recording site, we demonstrate that neurons with similar disparity selectivity are clustered in both V3 and V3A. We conclude that areas V3 and V3A are not particularly specialized for processing stereoscopic depth information compared to other early visual areas, at least with respect to the tuning properties that we have examined.
Collapse
Affiliation(s)
- Akiyuki Anzai
- Department of Brain and Cognitive Sciences, Center for Visual Science, University of Rochester, Rochester, New York 14627, and
| | - Syed A. Chowdhury
- Department of Neurobiology, Washington University School of Medicine, St. Louis, Missouri 63110
| | - Gregory C. DeAngelis
- Department of Brain and Cognitive Sciences, Center for Visual Science, University of Rochester, Rochester, New York 14627, and
| |
Collapse
|
164
|
Abstract
It is well-established that some aspects of perception and action can be understood as probabilistic inferences over underlying probability distributions. In some situations, it would be advantageous for the nervous system to sample interpretations from a probability distribution rather than commit to a particular interpretation. In this study, we asked whether visual percepts correspond to samples from the probability distribution over image interpretations, a form of sampling that we refer to as Bayesian sampling. To test this idea, we manipulated pairs of sensory cues in a bistable display consisting of two superimposed moving drifting gratings, and we asked subjects to report their perceived changes in depth ordering. We report that the fractions of dominance of each percept follow the multiplicative rule predicted by Bayesian sampling. Furthermore, we show that attractor neural networks can sample probability distributions if input currents add linearly and encode probability distributions with probabilistic population codes.
Collapse
|
165
|
Grossberg S, Srinivasan K, Yazdanbakhsh A. On the road to invariant object recognition: how cortical area V2 transforms absolute to relative disparity during 3D vision. Neural Netw 2011; 24:686-92. [PMID: 21507610 DOI: 10.1016/j.neunet.2011.03.021] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2010] [Revised: 03/17/2011] [Accepted: 03/21/2011] [Indexed: 11/18/2022]
Abstract
Invariant recognition of objects depends on a hierarchy of cortical stages that build invariance gradually. Binocular disparity computations are a key part of this transformation. Cortical area V1 computes absolute disparity, which is the horizontal difference in retinal location of an image in the left and right foveas. Many cells in cortical area V2 compute relative disparity, which is the difference in absolute disparity of two visible features. Relative, but not absolute, disparity is invariant under both a disparity change across a scene and vergence eye movements. A neural network model is introduced which predicts that shunting lateral inhibition of disparity-sensitive layer 4 cells in V2 causes a peak shift in cell responses that transforms absolute disparity from V1 into relative disparity in V2. This inhibitory circuit has previously been implicated in contrast gain control, divisive normalization, selection of perceptual groupings, and attentional focusing. The model hereby links relative disparity to other visual functions and thereby suggests new ways to test its mechanistic basis. Other brain circuits are reviewed wherein lateral inhibition causes a peak shift that influences behavioral responses.
Collapse
Affiliation(s)
- Stephen Grossberg
- Center for Adaptive Systems, Department of Cognitive and Neural Systems, Boston University, 677 Beacon Street, Boston, MA 02215, USA.
| | | | | |
Collapse
|
166
|
Schiller PH, Slocum WM, Jao B, Weiner VS. The integration of disparity, shading and motion parallax cues for depth perception in humans and monkeys. Brain Res 2011; 1377:67-77. [PMID: 21219887 PMCID: PMC3047464 DOI: 10.1016/j.brainres.2011.01.003] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2010] [Revised: 12/28/2010] [Accepted: 01/02/2011] [Indexed: 11/17/2022]
Abstract
A visual stimulus display was created that enabled us to examine how effectively the three depth cues of disparity, motion parallax and shading can be integrated in humans and monkeys. The display was designed to allow us to present these three depth cues separately and in various combinations. Depth was processed most effectively and most rapidly when all three cues were presented together indicating that these separate cues are integrated at yet unknown sites in the brain. Testing in humans and monkeys yielded similar results suggesting that monkeys are a good animal model for the study of the underlying neural mechanisms of depth perception.
Collapse
Affiliation(s)
- Peter H Schiller
- Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA 02139, USA.
| | | | | | | |
Collapse
|
167
|
Westheimer G. Path dissociation of visual signals entering the cortex: disparity, contour orientation and position. Vision Res 2011; 51:1058-63. [PMID: 21356227 DOI: 10.1016/j.visres.2011.02.020] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2010] [Revised: 02/19/2011] [Accepted: 02/22/2011] [Indexed: 10/18/2022]
Abstract
Whether position and orientation shifts induced by monocular context also act as a disparity for purposes of stereoscopy was investigated experimentally in order to examine the extent to which lateral spatial localization and stereoscopic depth share circuitry. A monocular tilt illusion in a line does not lead to a commensurate depth tilt of that line in binocular view, nor does a position shift in a bisection task caused by a gap within monocular dynamic random noise produce the commensurate depth displacement. Interocular transfer of monocularly-induced shifts, which might explain such findings, was eliminated as a factor. The results can therefore be interpreted as indicators of channeling and ordering of spatial signals paths in the visual cortex and imply that two-dimensional contextual interactions operate at a processing level beyond where disparity has already been extracted.
Collapse
Affiliation(s)
- Gerald Westheimer
- Division of Neurobiology, University of California, 144 Life Sciences Addition, Berkeley, CA 94720-3200, USA.
| |
Collapse
|
168
|
Cottereau BR, McKee SP, Ales JM, Norcia AM. Disparity-tuned population responses from human visual cortex. J Neurosci 2011; 31:954-65. [PMID: 21248120 PMCID: PMC3298090 DOI: 10.1523/jneurosci.3795-10.2011] [Citation(s) in RCA: 62] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2010] [Revised: 09/21/2010] [Accepted: 10/26/2010] [Indexed: 11/21/2022] Open
Abstract
We used source imaging of visual evoked potentials to measure neural population responses over a wide range of horizontal disparities (0.5-64 arcmin). The stimulus was a central disk that moved back and forth across the fixation plane at 2 Hz, surrounded either by binocularly uncorrelated dots (disparity noise) or by correlated dots presented in the fixation plane. Both disk and surround were composed of dynamic random dots to remove coherent monocular information. Disparity tuning was measured in five visual regions of interest (ROIs) [V1, human middle temporal area (hMT+), V4, lateral occipital complex (LOC), and V3A], defined in separate functional magnetic resonance imaging scans. The disparity tuning functions peaked between 2 and 16 arcmin for both types of surround in each ROI. Disparity tuning in the V1 ROI was unaffected by the type of surround, but surround correlation altered both the amplitude and phase of the disparity responses in the other ROIs. Response amplitude increased when the disk was in front of the surround in the V3A and LOC ROIs, indicating that these areas encode figure-ground relationships and object convexity. The correlated surround produced a consistent phase lag at the second harmonic in the hMT+ and V4 ROIs without a change in amplitude, while in the V3A ROI, both phase and amplitude effects were observed. Sensitivity to disparity context is thus widespread in visual cortex, but the dynamics of these contextual interactions differ across regions.
Collapse
Affiliation(s)
- Benoit R Cottereau
- The Smith-Kettlewell Eye Research Institute, San Francisco, CA 94115, USA.
| | | | | | | |
Collapse
|
169
|
Abstract
In natural images, visual objects are typically occluded by other objects. A remarkable ability of our visual system is to complete occluded objects effortlessly and see whole, uninterrupted objects. How object completion is implemented in the visual system is still largely unknown. In this study, using a backward masking paradigm, we combined psychophysics and functional magnetic resonance imaging to investigate the temporal evolvement of face completion at different levels of the visual processing hierarchy. Human subjects were presented with two kinds of stimuli that were designed to elicit or not elicit the percept of a completed face, although they were physically very similar. By contrasting subjects' behavioral and blood oxygenation level-dependent (BOLD) responses to completed and noncompleted faces, we measured the psychophysical time course of the face completion and its underlying cortical dynamics. We found that face completion manifested its effect between 50 and 250 ms after stimulus onset. Relative to noncompleted faces, completed faces induced weaker BOLD response at early processing phases in retinotopic visual areas V1 and V2 and stronger BOLD response at late processing phases in occipital face area and fusiform face area. Attending away from the stimuli largely abolished these effects. These findings suggest that face completion consists of two synergetic phases: early suppression in lower visual areas and late enhancement in higher visual areas; moreover, attention is necessary to these neural events.
Collapse
|
170
|
Complex cells in the cat striate cortex have multiple disparity detectors in the three-dimensional binocular receptive fields. J Neurosci 2010; 30:13826-37. [PMID: 20943923 DOI: 10.1523/jneurosci.1135-10.2010] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
Along the visual pathway, neurons generally become more specialized for signaling a limited subset of stimulus attributes and become more invariant to changes in the stimulus position within the receptive fields (RFs). One of the likely mechanisms underlying such invariance appears to be pooling of detectors located at different positions. Does such spatial pooling occur for disparity-selective neurons in primary visual cortex? To examine whether the three-dimensional (3D) binocular RFs are constructed by pooling detectors for binocular disparity, we investigated binocular interactions in the 3D space for neurons in the cat striate cortex. Approximately one-third of complex cells showed the spatial pooling of disparity detectors to a significant degree, whereas the majority of simple cells did not. The degree of spatial pooling of disparity detectors along the preferred orientation axis was generally larger than that along the axis orthogonal to the orientation axis. We then reconstructed 3D binocular RFs in their complete form and examined their structures. Disparity tuning curves were compared across positions along the orientation axis in the RFs. A small population of cells appeared to show a gradual shift of the preferred disparity along this axis, indicating that they can potentially signal inclination in the 3D space. However, the majority of cells exhibited a position-invariant disparity tuning. Finally, disparity tuning curves were examined for all oblique angles in addition to horizontal and vertical. Tunings were broadest along the orientation axis as the disparity energy model predicts.
Collapse
|
171
|
Nefs HT, O'Hare L, Harris JM. Two independent mechanisms for motion-in-depth perception: evidence from individual differences. Front Psychol 2010; 1:155. [PMID: 21833221 PMCID: PMC3153770 DOI: 10.3389/fpsyg.2010.00155] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2010] [Accepted: 08/30/2010] [Indexed: 12/05/2022] Open
Abstract
Our forward-facing eyes allow us the advantage of binocular visual information: using the tiny differences between right and left eye views to learn about depth and location in three dimensions. Our visual systems also contain specialized mechanisms to detect motion-in-depth from binocular vision, but the nature of these mechanisms remains controversial. Binocular motion-in-depth perception could theoretically be based on first detecting binocular disparity and then monitoring how it changes over time. The alternative is to monitor the motion in the right and left eye separately and then compare these motion signals. Here we used an individual differences approach to test whether the two sources of information are processed via dissociated mechanisms, and to measure the relative importance of those mechanisms. Our results suggest the existence of two distinct mechanisms, each contributing to the perception of motion-in-depth in most observers. Additionally, for the first time, we demonstrate the relative prevalence of the two mechanisms within a normal population. In general, visual systems appear to rely mostly on the mechanism sensitive to changing binocular disparity, but perception of motion-in-depth is augmented by the presence of a less sensitive mechanism that uses interocular velocity differences. Occasionally, we find observers with the opposite pattern of sensitivity. More generally this work showcases the power of the individual differences approach in studying the functional organization of cognitive systems.
Collapse
Affiliation(s)
- Harold T Nefs
- Vision Lab, The School of Psychology, University of St Andrews, St Andrews, Fife Scotland, UK
| | | | | |
Collapse
|
172
|
Gantz L, Bedell HE. Transfer of perceptual learning of depth discrimination between local and global stereograms. Vision Res 2010; 50:1891-9. [PMID: 20600234 PMCID: PMC2922406 DOI: 10.1016/j.visres.2010.06.011] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2009] [Revised: 06/21/2010] [Accepted: 06/21/2010] [Indexed: 10/19/2022]
Abstract
Several previous studies reported differences when stereothresholds are assessed with local-contour stereograms vs. complex random-dot stereograms (RDSs). Dissimilar thresholds may be due to differences in the properties of the stereograms (e.g. spatial frequency content, contrast, inter-element separation, area) or to different underlying processing mechanisms. This study examined the transfer of perceptual learning of depth discrimination between local and global RDSs with similar properties, and vice versa. If global and local stereograms are processed by separate neural mechanisms, then the magnitude and rate of training for the two types of stimuli are likely to differ, and the transfer of training from one stimulus type to the other should be minimal. Based on previous results, we chose RDSs with element densities of 0.17% and 28.3% to serve as the local and global stereograms, respectively. Fourteen inexperienced subjects with normal binocular vision were randomly assigned to either a local- or global- RDS training group. Stereothresholds for both stimulus types were measured before and after 7700 training trials distributed over 10 sessions. Stereothresholds for the trained condition improve for approximately 3000 trials, by an average of 0.36+/-0.08 for local and 0.29+/-0.10 for global RDSs, and level off thereafter. Neither the rate nor the magnitude of improvement differ statistically between the local- and global-training groups. Further, no significant difference exists in the amount of improvement on the trained vs. the untrained targets for either training group. These results are consistent with the operation of a single mechanism to process both local and global stereograms.
Collapse
Affiliation(s)
- Liat Gantz
- College of Optometry, University of Houston, 505 J. Davis Armistead Building, Houston, TX 77204-2020, USA.
| | | |
Collapse
|
173
|
Minini L, Parker AJ, Bridge H. Neural modulation by binocular disparity greatest in human dorsal visual stream. J Neurophysiol 2010; 104:169-78. [PMID: 20445027 PMCID: PMC2904223 DOI: 10.1152/jn.00790.2009] [Citation(s) in RCA: 50] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2009] [Accepted: 04/30/2010] [Indexed: 11/22/2022] Open
Abstract
Although cortical activation to binocular disparity can be demonstrated throughout occipital and parietal cortices, the relative contributions to depth perception made by different human cortical areas have not been established. To investigate whether different regions are optimized for specific disparity ranges, we have measured the responses of occipital and parietal areas to different magnitudes of binocular disparity. Using stimuli consisting of sinusoidal depth modulations, we measured cortical activation when the stimuli were located at pedestal disparities of 0, 0.1, 0.35, and 0.7 degrees from fixation. Across all areas, occipital and parietal, there was an increase in BOLD signal with increasing pedestal disparity, compared with a plane at zero disparity. However, the greatest modulation of response by the different pedestals was found in the dorsal visual areas and the parietal areas. These differences contrast with the response to the zero disparity plane, compared with fixation, which is greatest in the early visual areas, smaller in the ventral and dorsal visual areas, and absent in parietal areas. Using the simultaneously acquired psychophysical data we also measured a greater response to correct than to incorrect trials, an effect that increased with rising pedestal disparity and was greatest in dorsal visual and parietal areas. These results illustrate that the dorsal stream, along both its occipital and parietal branches, can reliably discriminate a large range of disparities.
Collapse
Affiliation(s)
- Loredana Minini
- Department of Physiology, Anatomy and Genetics, John Radcliffe Hospital, University of Oxford, Oxford, UK.
| | | | | |
Collapse
|
174
|
Abstract
Markov random field (MRF) and belief propagation have given birth to stereo vision algorithms with top performance. This article explores their biological plausibility. First, an MRF model guided by physiological and psychophysical facts was designed. Typically an MRF-based stereo vision algorithm employs a likelihood function that reflects the local similarity of two regions and a potential function that models the continuity constraint. In our model, the likelihood function is constructed on the basis of the disparity energy model because complex cells are considered as front-end disparity encoders in the visual pathway. Our likelihood function is also relevant to several psychological findings. The potential function in our model is constrained by the psychological finding that the strength of the cooperative interaction minimizing relative disparity decreases as the separation between stimuli increases. Our model is tested on three kinds of stereo images. In simulations on images with repetitive patterns, we demonstrate that our model could account for the human depth percepts that were previously explained by the second-order mechanism. In simulations on random dot stereograms and natural scene images, we demonstrate that false matches introduced by the disparity energy model can be reliably removed using our model. A comparison with the coarse-to-fine model shows that our model is able to compute the absolute disparity of small objects with larger relative disparity. We also relate our model to several physiological findings. The hypothesized neurons of the model are selective for absolute disparity and have facilitative extra receptive field. There are plenty of such neurons in the visual cortex. In conclusion, we think that stereopsis can be implemented by neural networks resembling MRF.
Collapse
Affiliation(s)
- Yansheng Ming
- National Laboratory of Pattern Recognition, Institute of Automation Chinese Academy of Sciences, Beijing, PRC.
| | | |
Collapse
|
175
|
Cooperative and competitive interactions facilitate stereo computations in macaque primary visual cortex. J Neurosci 2010; 29:15780-95. [PMID: 20016094 DOI: 10.1523/jneurosci.2305-09.2009] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
Inferring depth from binocular disparities is a difficult problem for the visual system because local features in the left- and right-eye images must be matched correctly to solve this "stereo correspondence problem." Cortical architecture and computational studies suggest that lateral interactions among neurons could help resolve local uncertainty about disparity encoded in individual neurons by incorporating contextual constraints. We found that correlated activity among pairs of neurons in primary visual cortex depended both on disparity-tuning relationships and the stimuli displayed within the receptive fields of the neurons. Nearby pairs of neurons with distinct disparity tuning exhibited a decrease in spike correlation at competing disparities soon after response onset. Distant neuronal pairs of similar disparity tuning exhibited an increase in spike correlation at mutually preferred disparities. The observed correlated activity and response dynamics suggests that local competitive and distant cooperative interactions improve disparity tuning of individual neurons over time. Such interactions could represent a neural substrate for the principal constraints underlying cooperative stereo algorithms.
Collapse
|
176
|
Assessment of depth perception using psychophysical thresholds and stereoscopically evoked brain activity. Doc Ophthalmol 2009; 119:209-16. [DOI: 10.1007/s10633-009-9202-9] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2009] [Accepted: 10/09/2009] [Indexed: 10/20/2022]
|
177
|
Chinellato E, Del Pobil AP. The neuroscience of vision-based grasping: a functional review for computational modeling and bio-inspired robotics. J Integr Neurosci 2009; 8:223-54. [PMID: 19618488 DOI: 10.1142/s0219635209002137] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2009] [Revised: 05/12/2009] [Indexed: 11/18/2022] Open
Abstract
The topic of vision-based grasping is being widely studied in humans and in other primates using various techniques and with different goals. The fundamental related findings are reviewed in this paper, with the aim of providing researchers from different fields, including intelligent robotics and neural computation, a comprehensive but accessible view on the subject. A detailed description of the principal sensorimotor processes and the brain areas involved is provided following a functional perspective, in order to make this survey especially useful for computational modeling and bio-inspired robotic applications.
Collapse
Affiliation(s)
- Eris Chinellato
- Robotic Intelligence Lab, Jaume I University, Campus Riu Sec, 12071, Castellón de la Plana, Spain.
| | | |
Collapse
|
178
|
Abstract
Anatomical superposition of the cortical projections from the overlapping visual fields of the two eyes does not make it obvious how the disposition of objects in the third dimension is encoded. Hubel and Wiesel's demonstration that units in the primary visual cortex of the mammal respond preferentially to elongated contours of specific orientation encouraged the inquiry into whether binocular disparity might not similarly be represented as an attribute interdigitated within the orderly progression of position. When this was found to indeed be the case, this entrained a brisk research activity into the disparity of receptive fields of single units in the primary visual cortex and the influence on their response of the three-dimensional locations of outside world stimuli. That cells' preferred orientations covered the whole gamut whereas space perception required only horizontal disparity was an apparent paradox that needed resolution. A connection with an observer's stereoscopic performance was made by the discovery that cells in the primate primary visual cortex display good tuning to the disparity in random-dot stereograms. But a wide gap still remains between the properties of these cortical units and human stereo thresholds in simple target configurations, let alone depth judgments in which perceptual and cognitive factors enter. When the neural circuits in the primary visual cortex that are involved in processing depth are eventually traced in detail they will also need to have properties that allow for the plasticity in learning and experience.
Collapse
Affiliation(s)
- Gerald Westheimer
- Division of Neurobiology, University of California, Berkeley, CA 94720-3200, USA.
| |
Collapse
|
179
|
Chai YC, Farell B. From disparity to depth: how to make a grating and a plaid appear in the same depth plane. J Vis 2009; 9:3.1-19. [PMID: 19810784 DOI: 10.1167/9.10.3] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
Abstract
Even though binocular disparity is a very well-studied cue to depth, the function relating disparity and perceived depth has been characterized only for the case of horizontal disparities. We sought to determine the general relationship between disparity and depth for a particular set of stimuli. The horizontal disparity direction is a special case, albeit an especially important one. Non-horizontal disparities arise from a number of sources under natural viewing condition. Moreover, they are implicit in patterns that are one-dimensional, such as gratings, lines, and edges, and in one-dimensional components of two-dimensional patterns, where a stereo matching direction is not well-defined. What function describes perceived depth in these cases? To find out, we measured the phase disparities that produced depth matches between a reference stimulus and a test stimulus. The reference stimulus was two-dimensional, a plaid; the test stimulus was one-dimensional, a grating. We find that horizontal disparity is no more important than other disparity directions in determining depth matches between these two stimuli. As a result, a grating and a plaid appear equal in depth when their horizontal disparities are, in general, unequal. Depth matches are well predicted by a simple disparity vector calculation; they survive changes in component parameters that conserve these vector quantities. The disparity vector rule also describes how the disparities of 1-D components might contribute to the perceived depth of 2-D stimuli.
Collapse
Affiliation(s)
- Yu-Chin Chai
- Institute for Sensory Research, Syracuse University, Syracuse, NY, USA.
| | | |
Collapse
|
180
|
Joly O, Vanduffel W, Orban GA. The monkey ventral premotor cortex processes 3D shape from disparity. Neuroimage 2009; 47:262-72. [DOI: 10.1016/j.neuroimage.2009.04.043] [Citation(s) in RCA: 50] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2008] [Revised: 04/07/2009] [Accepted: 04/08/2009] [Indexed: 10/20/2022] Open
|
181
|
Disparity- and velocity-based signals for three-dimensional motion perception in human MT+. Nat Neurosci 2009; 12:1050-5. [PMID: 19578382 DOI: 10.1038/nn.2343] [Citation(s) in RCA: 77] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2009] [Accepted: 04/27/2009] [Indexed: 11/08/2022]
Abstract
How does the primate visual system encode three-dimensional motion? The macaque middle temporal area (MT) and the human MT complex (MT+) have well-established sensitivity to two-dimensional frontoparallel motion and static disparity. However, evidence for sensitivity to three-dimensional motion has remained elusive. We found that human MT+ encodes two binocular cues to three-dimensional motion: changing disparities over time and interocular comparisons of retinal velocities. By varying important properties of moving dot displays, we distinguished these three-dimensional motion signals from their constituents, instantaneous binocular disparity and monocular retinal motion. An adaptation experiment confirmed direction selectivity for three-dimensional motion. Our results indicate that MT+ carries critical binocular signals for three-dimensional motion processing, revealing an important and previously overlooked role for this well-studied brain area.
Collapse
|
182
|
Knaut LA, Subramanian SK, McFadyen BJ, Bourbonnais D, Levin MF. Kinematics of pointing movements made in a virtual versus a physical 3-dimensional environment in healthy and stroke subjects. Arch Phys Med Rehabil 2009; 90:793-802. [PMID: 19406299 DOI: 10.1016/j.apmr.2008.10.030] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2008] [Accepted: 10/31/2008] [Indexed: 11/17/2022]
Abstract
OBJECTIVE To compare kinematics of 3-dimensional pointing movements performed in a virtual environment (VE) displayed through a head-mounted display with those made in a physical environment. DESIGN Observational study of movement in poststroke and healthy subjects. SETTING Motion analysis laboratory. PARTICIPANTS Adults (n=15; 4 women; 59+/-15.4y) with chronic poststroke hemiparesis were recruited. Participants had moderate upper-limb impairment with Chedoke-McMaster Arm Scores ranging from 3 to 6 out of 7. Twelve healthy subjects (6 women; 53.3+/-17.1y) were recruited from the community. INTERVENTIONS Not applicable. MAIN OUTCOME MEASURES Arm and trunk kinematics were recorded in similar virtual and physical environments with an Optotrak System (6 markers; 100Hz; 5s). Subjects pointed as quickly and as accurately as possible to 6 targets (12 trials/target in a randomized sequence) placed in arm workspace areas requiring different arm movement patterns and levels of difficulty. Movements were analyzed in terms of performance outcome measures (endpoint precision, trajectory, peak velocity) and arm and trunk movement patterns (elbow and shoulder ranges of motion, elbow/shoulder coordination, trunk displacement, rotation). RESULTS For healthy subjects, precision and trajectory straightness were higher in VE when pointing to contralateral targets, and movements were slower for all targets in VE. Stroke participants made less accurate and more curved movements in VE and used less trunk displacement. Elbow/shoulder coordination differed when pointing to the lower ipsilateral target. There were no group-by-environment interactions. CONCLUSIONS Movements in both environments were sufficiently similar to consider VE a valid environment for clinical interventions and motor control studies.
Collapse
Affiliation(s)
- Luiz A Knaut
- School of Rehabilitation, University of Montreal, Montreal, QC, Canada
| | | | | | | | | |
Collapse
|
183
|
Heesy CP. Seeing in stereo: The ecology and evolution of primate binocular vision and stereopsis. Evol Anthropol 2009. [DOI: 10.1002/evan.20195] [Citation(s) in RCA: 50] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
|
184
|
Vlaskamp BNS, Filippini HR, Banks MS. Image-size differences worsen stereopsis independent of eye position. J Vis 2009; 9:17.1-13. [PMID: 19271927 PMCID: PMC2935694 DOI: 10.1167/9.2.17] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2008] [Accepted: 11/20/2008] [Indexed: 11/24/2022] Open
Abstract
With the eyes in forward gaze, stereo performance worsens when one eye's image is larger than the other's. Near, eccentric objects naturally create retinal images of different sizes. Does this mean that stereopsis exhibits deficits for such stimuli? Or does the visual system compensate for the predictable image-size differences? To answer this, we measured discrimination of a disparity-defined shape for different relative image sizes. We did so for different gaze directions, some compatible with the image-size difference and some not. Magnifications of 10-15% caused a clear worsening of stereo performance. The worsening was determined only by relative image size and not by eye position. This shows that no neural compensation for image-size differences accompanies eye-position changes, at least prior to disparity estimation. We also found that a local cross-correlation model for disparity estimation performs like humans in the same task, suggesting that the decrease in stereo performance due to image-size differences is a byproduct of the disparity-estimation method. Finally, we looked for compensation in an observer who has constantly different image sizes due to differing eye lengths. She performed best when the presented images were roughly the same size, indicating that she has compensated for the persistent image-size difference.
Collapse
|
185
|
Filippini HR, Banks MS. Limits of stereopsis explained by local cross-correlation. J Vis 2009; 9:8.1-18. [PMID: 19271878 PMCID: PMC2940423 DOI: 10.1167/9.1.8] [Citation(s) in RCA: 46] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2008] [Accepted: 09/19/2008] [Indexed: 11/24/2022] Open
Abstract
Human stereopsis has two well-known constraints: the disparity-gradient limit, which is the inability to perceive depth when the change in disparity within a region is too large, and the limit of stereoresolution, which is the inability to perceive spatial variations in disparity that occur at too fine a spatial scale. We propose that both limitations can be understood as byproducts of estimating disparity by cross-correlating the two eyes' images, the fundamental computation underlying the disparity-energy model. To test this proposal, we constructed a local cross-correlation model with biologically motivated properties. We then compared model and human behaviors in the same psychophysical tasks. The model and humans behaved quite similarly: they both exhibited a disparity-gradient limit and had similar stereoresolution thresholds. Performance was affected similarly by changes in a variety of stimulus parameters. By modeling the effects of stimulus blur and of using different sizes of image patches, we found evidence that the smallest neural mechanism humans use to estimate disparity is 3-6 arcmin in diameter. We conclude that the disparity-gradient limit and stereoresolution are indeed byproducts of using local cross-correlation to estimate disparity.
Collapse
Affiliation(s)
- Heather R. Filippini
- UCSF and UCB Joint Graduate Group in Bioengineering, University of California at Berkeley, Berkeley, CA, USA
| | - Martin S. Banks
- School of Optometry, Psychology, Wills Neuroscience, University of California at Berkeley, Berkeley, CA, USA
| |
Collapse
|
186
|
Lambooij M, IJsselsteijn W, Fortuin M, Heynderickx I. Visual Discomfort and Visual Fatigue of Stereoscopic Displays: A Review. J Imaging Sci Technol 2009. [DOI: 10.2352/j.imagingsci.technol.2009.53.3.030201] [Citation(s) in RCA: 707] [Impact Index Per Article: 44.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/01/2022]
|
187
|
Abstract
In this issue of Neuron, Chowdhury and DeAngelis report that training monkeys to perform a fine depth discrimination abolishes the contribution of signals from area MT to the execution of a different, coarse depth discrimination. This result calls into question the principle of associating particular visual areas with particular visual functions, by showing that such associations are modifiable by experience.
Collapse
Affiliation(s)
- Pascal Wallisch
- Center for Neural Science, New York University, New York, NY 10003, USA
| | | |
Collapse
|
188
|
Affiliation(s)
- Carlos R Ponce
- Department of Neurobiology, Harvard Medical School, 220 Longwood Avenue, Boston, Massachusetts 02115, USA
| | | |
Collapse
|
189
|
Tanabe S, Cumming BG. Mechanisms underlying the transformation of disparity signals from V1 to V2 in the macaque. J Neurosci 2008; 28:11304-14. [PMID: 18971472 PMCID: PMC2601577 DOI: 10.1523/jneurosci.3477-08.2008] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2008] [Revised: 08/29/2008] [Accepted: 09/18/2008] [Indexed: 11/21/2022] Open
Abstract
Stereo vision relies on cortical signals that encode binocular disparity. In V1, the disparity energy model explains many features of binocular interaction, but it overestimates the responses to anticorrelated images. Combining the outputs of two, or more, energy model-like subunits [two-subunit (2SU) model] can resolve this discrepancy and provides an alternative explanation for disparity signals previously thought to indicate phase disparity between the receptive fields (RFs) of each eye. The 2SU model naturally explains how "near/far" (odd-symmetric) tuning becomes dominant in extrastriate cortex. To compare the energy and the 2SU models, we used a broadband compound grating and applied a common interocular phase difference to all spatial frequency components (a stimulus phase disparity), combined with a common spatial displacement (a stimulus position disparity). This produces binocular images that never occur in natural viewing, for which the 2SU model and the energy model make distinctively different predictions. Responses of neurons recorded from both V1 and V2 of awake rhesus macaques systematically deviated from the predictions of the energy model, in accordance with the 2SU model. These deviations correlated with the symmetry of the tuning curve, indicating that the 2SU mechanism is exploited to produce odd symmetry. Nonetheless, individual subunits also contain RF phase disparity that contributes to odd symmetry. The results suggest that neurons in V2 probably inherit phase disparity signals from V1 neurons, but systematically combine input from V1 neurons with different position disparities, in a way that elaborates odd-symmetric tuning and extends the range of disparities encoded by single neurons.
Collapse
Affiliation(s)
- Seiji Tanabe
- Laboratory of Sensorimotor Research, National Eye Institute-National Institutes of Health, Bethesda, Maryland 20892-4435, USA.
| | | |
Collapse
|
190
|
Chowdhury SA, DeAngelis GC. Fine discrimination training alters the causal contribution of macaque area MT to depth perception. Neuron 2008; 60:367-77. [PMID: 18957227 PMCID: PMC2642520 DOI: 10.1016/j.neuron.2008.08.023] [Citation(s) in RCA: 98] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2007] [Revised: 05/18/2008] [Accepted: 08/08/2008] [Indexed: 11/19/2022]
Abstract
When a new perceptual task is learned, plasticity occurs in the brain to mediate improvements in performance with training. How do these changes affect the neural substrates of previously learned tasks? We addressed this question by examining the effect of fine discrimination training on the causal contribution of area MT to coarse depth discrimination. When monkeys are trained to discriminate between two coarse absolute disparities (near versus far) embedded in noise, reversible inactivation of area MT devastates performance. In contrast, after animals are trained to discriminate fine differences in relative disparity, MT inactivation no longer impairs coarse depth discrimination. This effect does not result from changes in the disparity tuning of MT neurons, suggesting plasticity in the flow of disparity signals to decision circuitry. These findings show that the contribution of particular brain area to task performance can change dramatically as a result of learning new tasks.
Collapse
Affiliation(s)
- Syed A Chowdhury
- Department of Anatomy and Neurobiology, Washington University School of Medicine, St Louis, MO 63110, USA
| | | |
Collapse
|
191
|
Bridge H, Cumming BG. Representation of binocular surfaces by cortical neurons. Curr Opin Neurobiol 2008; 18:425-30. [PMID: 18809495 DOI: 10.1016/j.conb.2008.09.003] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2008] [Revised: 09/09/2008] [Accepted: 09/11/2008] [Indexed: 10/21/2022]
Abstract
Useful representations of the three-dimensional (3D) world go beyond assigning depth to individual points, building maps of surfaces and shapes. Studies in a wide range of extrastriate cortical areas have shown that single neurons show selective responses to 3D surfaces. The extent to which this advances the representation beyond that provided by the earliest binocular signals requires careful evaluation. We conclude that current data are not sufficient to identify distinctive contributions from different cortical areas to the binocular representation of 3D surfaces.
Collapse
Affiliation(s)
- Holly Bridge
- FMRIB Centre, Department of Clinical Neurology, University of Oxford, UK
| | | |
Collapse
|
192
|
Hansard M, Horaud R. Cyclopean geometry of binocular vision. JOURNAL OF THE OPTICAL SOCIETY OF AMERICA. A, OPTICS, IMAGE SCIENCE, AND VISION 2008; 25:2357-2369. [PMID: 18758564 DOI: 10.1364/josaa.25.002357] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/26/2023]
Abstract
The geometry of binocular projection is analyzed in relation to the primate visual system. An oculomotor parameterization that includes the classical vergence and version angles is defined. It is shown that the epipolar geometry of the system is constrained by binocular coordination of the eyes. A local model of the scene is adopted in which depth is measured relative to a plane containing the fixation point. These constructions lead to an explicit parameterization of the binocular disparity field involving the gaze angles as well as the scene structure. The representation of visual direction and depth is discussed with reference to the relevant psychophysical and neurophysiological literature.
Collapse
|
193
|
Maruko I, Zhang B, Tao X, Tong J, Smith EL, Chino YM. Postnatal development of disparity sensitivity in visual area 2 (v2) of macaque monkeys. J Neurophysiol 2008; 100:2486-95. [PMID: 18753321 DOI: 10.1152/jn.90397.2008] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
Macaque monkeys do not reliably discriminate binocular depth cues until about 8 wk of age. The neural factors that limit the development of fine depth perception in primates are not known. In adults, binocular depth perception critically depends on detection of relative binocular disparities and the earliest site in the primate visual brain where a substantial proportion of neurons are capable of discriminating relative disparity is visual area 2 (V2). We examined the disparity sensitivity of V2 neurons during the first 8 wk of life in infant monkeys and compared the responses of V2 neurons to those of V1 neurons. We found that the magnitude of response modulation in V2 and V1 neurons as a function of interocular spatial phase disparity was adult-like as early as 2 wk of age. However, the optimal spatial frequency and binocular response rate of these disparity sensitive neurons were more than an octave lower in 2- and 4-wk-old infants than in adults. Consequently, despite the lower variability of neuronal firing in V2 and V1 neurons of infant monkeys, the ability of these neurons to discriminate fine disparity differences was significantly reduced compared with adults. This reduction in disparity sensitivity of V2 and V1 neurons is likely to limit binocular depth perception during the first several weeks of a monkey's life.
Collapse
Affiliation(s)
- I Maruko
- College of Optometry, Univ. of Houston, 505 J. Davis Armistead Bldg., Houston, TX 77204-2020, USA
| | | | | | | | | | | |
Collapse
|
194
|
Priebe NJ. The relationship between subthreshold and suprathreshold ocular dominance in cat primary visual cortex. J Neurosci 2008; 28:8553-9. [PMID: 18716214 PMCID: PMC2651840 DOI: 10.1523/jneurosci.2182-08.2008] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2008] [Revised: 07/11/2008] [Accepted: 07/12/2008] [Indexed: 11/21/2022] Open
Abstract
Primary visual cortex (V1) is the site at which right and left eye pathways are first integrated, leading to a representation of the visual world in depth. The ocular dominance (OD) of individual cortical neurons varies and may be changed by altering visual experience during the developmental critical period. Estimates of OD, commonly used to quantify the right and left eye synaptic inputs, have previously been based on spike rate. Membrane potential (V(m)), however, is more closely related to the synaptic inputs onto neurons and should therefore more closely reflect the degree of input from the two eyes. To determine the relationship between OD based on membrane potential and on spike rate, intracellular recordings were made from visual cortical neurons. OD based on spike rate was systematically more monocular than OD based on membrane potential. The discrepancy between membrane-potential OD and spike-rate OD may be accounted for by a simple model of V(m)-to-spike-rate transformation.
Collapse
Affiliation(s)
- Nicholas J Priebe
- Section of Neurobiology, School of Biological Sciences, University of Texas at Austin, Austin, Texas 78705, USA.
| |
Collapse
|
195
|
Chen G, Lu HD, Roe AW. A map for horizontal disparity in monkey V2. Neuron 2008; 58:442-50. [PMID: 18466753 PMCID: PMC2441920 DOI: 10.1016/j.neuron.2008.02.032] [Citation(s) in RCA: 73] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2007] [Revised: 12/26/2007] [Accepted: 02/08/2008] [Indexed: 11/17/2022]
Abstract
The perception of visual depth is determined by integration of spatial disparities of inputs from the two eyes. Single cells in visual cortex of monkeys are known to respond to specific binocular disparities; however, little is known about their functional organization. We now show, using intrinsic signal optical imaging and single-unit physiology, that, in the thick stripe compartments of the second visual area (V2), there is a clustered organization of Near cells and Far cells, and moreover, there are topographic maps for Near to Far disparities within V2. Our findings suggest that maps for visual disparity are calculated in V2, and demonstrate parallels in functional organization between the thin, pale, and thick stripes of V2.
Collapse
Affiliation(s)
- Gang Chen
- Department of Psychology, Vanderbilt University, Nashville, TN 37203, USA
| | | | | |
Collapse
|
196
|
Van Pelt S, Medendorp WP. Updating Target Distance Across Eye Movements in Depth. J Neurophysiol 2008; 99:2281-90. [DOI: 10.1152/jn.01281.2007] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
We tested between two coding mechanisms that the brain may use to retain distance information about a target for a reaching movement across vergence eye movements. If the brain was to encode a retinal disparity representation (retinal model), i.e., target depth relative to the plane of fixation, each vergence eye movement would require an active update of this representation to preserve depth constancy. Alternatively, if the brain was to store an egocentric distance representation of the target by integrating retinal disparity and vergence signals at the moment of target presentation, this representation should remain stable across subsequent vergence shifts (nonretinal model). We tested between these schemes by measuring errors of human reaching movements ( n = 14 subjects) to remembered targets, briefly presented before a vergence eye movement. For comparison, we also tested their directional accuracy across version eye movements. With intervening vergence shifts, the memory-guided reaches showed an error pattern that was based on the new eye position and on the depth of the remembered target relative to that position. This suggests that target depth is recomputed after the gaze shift, supporting the retinal model. Our results also confirm earlier literature showing retinal updating of target direction. Furthermore, regression analyses revealed updating gains close to one for both target depth and direction, suggesting that the errors arise after the updating stage during the subsequent reference frame transformations that are involved in reaching.
Collapse
|
197
|
Abstract
The extrastriate cortex of primates encompasses a substantial portion of the cerebral cortex and is devoted to the higher order processing of visual signals and their dispatch to other parts of the brain. A first step towards the understanding of the function of this cortical tissue is a description of the selectivities of the various neuronal populations for higher order aspects of the image. These selectivities present in the various extrastriate areas support many diverse representations of the scene before the subject. The list of the known selectivities includes that for pattern direction and speed gradients in middle temporal/V5 area; for heading in medial superior temporal visual area, dorsal part; for orientation of nonluminance contours in V2 and V4; for curved boundary fragments in V4 and shape parts in infero-temporal area (IT); and for curvature and orientation in depth from disparity in IT and CIP. The most common putative mechanism for generating such emergent selectivity is the pattern of excitatory and inhibitory linear inputs from the afferent area combined with nonlinear mechanisms in the afferent and receiving area.
Collapse
Affiliation(s)
- Guy A Orban
- Laboratorium voor Neuro- en Psychofysiologie, K. U. Leuven Medical School, Leuven, Belgium.
| |
Collapse
|
198
|
Chowdhury SA, Christiansen DL, Morgan ML, DeAngelis GC. Effect of vertical disparities on depth representation in macaque monkeys: MT physiology and behavior. J Neurophysiol 2008; 99:876-87. [PMID: 18077666 PMCID: PMC2386435 DOI: 10.1152/jn.00732.2007] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
Horizontal binocular disparities provide information about the distance of objects relative to the point of ocular fixation and must be combined with an estimate of viewing distance to recover the egocentric distance of an object. Vergence angle and the gradient of vertical disparities across the visual field are thought to provide independent sources of viewing distance information based on human behavioral studies. Although the effect of vergence angle on horizontal disparity selectivity in early visual cortex has been examined (with mixed results), the effect of the vertical disparity field has not been explored. We manipulated the vertical disparities in a large random-dot stimulus to simulate different viewing distances, and we examined the effect of this manipulation on both the responses of neurons in the middle temporal (MT) area and on the psychophysical performance of the animal in a curvature discrimination task. We report here that alterations to the vertical disparity field have no effect on the horizontal disparity tuning of MT neurons. However, the same manipulation strongly and systematically biases the monkey's judgments of curvature, consistent with previous human studies. We conclude that monkeys, like humans, make use of the vertical disparity field to estimate viewing distance, but that the physiological mechanisms for this effect occur either downstream of MT or in a different pathway.
Collapse
Affiliation(s)
| | | | - Michael L. Morgan
- Department of Anatomy and Neurobiology, Washington University School of Medicine, Saint Louis MO, 63110
| | - Gregory C. DeAngelis
- Department of Anatomy and Neurobiology, Washington University School of Medicine, Saint Louis MO, 63110
| |
Collapse
|
199
|
Ponce CR, Lomber SG, Born RT. Integrating motion and depth via parallel pathways. Nat Neurosci 2008; 11:216-23. [PMID: 18193039 PMCID: PMC2377395 DOI: 10.1038/nn2039] [Citation(s) in RCA: 76] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2007] [Accepted: 12/13/2007] [Indexed: 11/08/2022]
Abstract
Processing of visual information is both parallel and hierarchical, with each visual area richly interconnected with other visual areas. An example of the parallel architecture of the primate visual system is the existence of two principal pathways providing input to the middle temporal visual area (MT): namely, a direct projection from striate cortex (V1), and a set of indirect projections that also originate in V1 but then relay through V2 and V3. Here we have reversibly inactivated the indirect pathways while recording from MT neurons and measuring eye movements in alert monkeys, a procedure that has enabled us to assess whether the two different input pathways are redundant or whether they carry different kinds of information. We find that this inactivation causes a disproportionate degradation of binocular disparity tuning relative to direction tuning in MT neurons, suggesting that the indirect pathways are important in the recovery of depth in three-dimensional scenes.
Collapse
Affiliation(s)
- Carlos R Ponce
- Harvard-MIT Division of Health Sciences and Technology, Harvard Medical School, 260 Longwood Avenue, Boston, Massachusetts 02115, USA.
| | | | | |
Collapse
|
200
|
Haefner RM, Cumming BG. Adaptation to natural binocular disparities in primate V1 explained by a generalized energy model. Neuron 2008; 57:147-58. [PMID: 18184571 PMCID: PMC2344156 DOI: 10.1016/j.neuron.2007.10.042] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2007] [Revised: 09/24/2007] [Accepted: 10/31/2007] [Indexed: 11/18/2022]
Abstract
Sensory processing in the brain is thought to have evolved to encode naturally occurring stimuli efficiently. We report an adaptation in binocular cortical neurons that reflects the tight constraints imposed by the geometry of 3D vision. We show that the widely used binocular energy model predicts that neurons dedicate part of their dynamic range to impossible combinations of left and right images. Approximately 42% of the neurons we record from V1 of awake monkeys behave in this way (a powerful confirmation of the model), while about 58% deviate from the model in a manner that concentrates more of their dynamic range on stimuli that obey the constraints of binocular geometry. We propose a simple extension of the energy model, using multiple subunits, that explains the adaptation we observe, as well as other properties of binocular neurons that have been hard to account for, such as the response to anti-correlated stereograms.
Collapse
Affiliation(s)
- Ralf M Haefner
- Laboratory for Sensorimotor Research, National Eye Institute/NIH, 49 Convent Drive, Building 49/2A50, Bethesda MD 20892, USA.
| | | |
Collapse
|