1
|
Farell B. What's special about horizontal disparity. J Vis 2023; 23:4. [PMID: 37930689 PMCID: PMC10629538 DOI: 10.1167/jov.23.13.4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2023] [Accepted: 09/13/2023] [Indexed: 11/07/2023] Open
Abstract
Horizontal disparity has been recognized as the primary signal driving stereoscopic depth since the invention of the stereoscope in the 1830s. It has a unique status in our understanding of binocular vision. The direction of offset of the eyes gives the disparities of corresponding image point locations across the two retinas a strong horizontal bias. Beyond the retina, other factors give shape to the effective disparity direction used by visual mechanisms. The influence of orientation is examined here. I argue that horizontal disparity is an inflection point along a continuum of effective directions, and its role in stereo vision can be reinterpreted. The pointwise geometric justification for its special status neglects the oriented structural elements of spatial vision, its physiological support is equivocal, and psychophysical support of its special status may partially reflect biased stimulus sampling. The literature shows that horizontal disparity plays no particular role in the processing of one-dimensional stimuli, a reflection of the stereo aperture problem. The resulting depth is non-veridical, even non-transitive. Although one-dimensional components contribute to the stereo depth of visual objects generally, two-dimensional stimuli appear not to inherit the aperture problem. However, a look at the two-dimensional stimuli that predominate in experimental studies shows regularities in orientation that give a new perspective on horizontal disparity.
Collapse
Affiliation(s)
- Bart Farell
- Institute for Sensory Research, Department of Biomedical and Chemical Engineering, Syracuse University, Syracuse, NY, USA
| |
Collapse
|
2
|
Sheliga BM, Chen KJ, Fitzgibbon EJ, Miles FA. Short-latency disparity vergence in humans: evidence for early spatial filtering. Ann N Y Acad Sci 2006; 1039:252-9. [PMID: 15826979 PMCID: PMC1369053 DOI: 10.1196/annals.1325.024] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
Our study was concerned with the disparity detectors underlying the initial disparity vergence responses (DVRs) that are elicited at ultrashort latencies by binocular disparities applied to large images. DVRs were elicited in humans by applying horizontal disparity to vertical square-wave gratings lacking the fundamental (termed here, the "missing fundamental"). In the frequency domain, a pure square wave is composed of odd harmonics--first, third, fifth, seventh, etc.--such that the third, fifth, seventh, etc., have amplitudes that are one-third, one-fifth, one-seventh, etc., that of the first, and the missing fundamental lacks the first harmonic. The patterns seen by the two eyes have a phase difference of one-quarter wavelength, so the disparity of the features and 4n + 1 harmonics (where n = integer) has one sign (crossed or uncrossed), whereas the 4n - 1 harmonics--including the strongest Fourier component (the third harmonic)--has the opposite sign (uncrossed or crossed): spatial aliasing. The earliest DVRs, recorded with the search-coil technique, had minimum latencies of 70 to 80 ms and were generally in the direction of the third harmonic, that is, uncrossed disparities resulted in convergent eye movements. In other experiments on the DVRs, one eye saw a missing fundamental and the other saw a pure sine wave with the contrast and wavelength of the third harmonic but differing in phase by one-quarter wavelength. This resulted in short-latency vergence in accordance with matching of the third harmonic. These data all indicate the importance of the Fourier components, consistent with early spatial filtering prior to binocular matching.
Collapse
Affiliation(s)
- B M Sheliga
- Laboratory of Sensorimotor Research, National Institutes of Health, Building 49 Room 2A50, 49 Convent Drive, Bethesda MD 20892-4435, USA.
| | | | | | | |
Collapse
|
3
|
Sheliga BM, FitzGibbon EJ, Miles FA. Short-latency disparity vergence eye movements: a response to disparity energy. Vision Res 2006; 46:3723-40. [PMID: 16765403 PMCID: PMC2440632 DOI: 10.1016/j.visres.2006.04.020] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2006] [Revised: 04/17/2006] [Accepted: 04/27/2006] [Indexed: 10/24/2022]
Abstract
Vergence eye movements were elicited in human subjects by applying disparities to square-wave gratings lacking the fundamental ("missing fundamental", mf). Using a dichoptic arrangement, subjects viewed gratings that were identical at the two eyes except for a phase difference of 1/4 wavelength so that, based on the nearest-neighbor matches, the features and the 4n+1 harmonics (5th, 9th, etc.) all had binocular disparities of one sign, whereas the 4n-1 harmonics (3rd, 7th, etc.) all had disparities of the opposite sign. Further, the amplitude of the ith harmonic was proportional to 1/i. Using the electromagnetic search coil technique to record the positions of both eyes indicated that the earliest vergence eye movements elicited by these disparity stimuli had ultra-short latencies (minimum, <65 ms) and were always in the direction of the most prominent harmonic, the 3rd, but their magnitudes fell short of those elicited when the same disparities were applied to pure sinusoids whose spatial frequency and contrast matched those of the 3rd harmonic. This shortfall was evident in both the horizontal vergence responses recorded with vertical grating stimuli and the vertical vergence responses recorded with horizontal grating stimuli. When the next most prominent harmonic, the 5th, was removed from the mf stimulus (creating the "mf-5" stimulus) the vertical vergence responses showed almost no shortfall-indicating that it had been almost entirely due to that 5th harmonic-but the horizontal vergence responses still showed a small shortfall, at least with higher contrast stimuli. This small shortfall might represent a very minor contribution from higher harmonics and/or distortion products and/or a feature-based mechanism. We conclude that the earliest disparity vergence responses-especially vertical-were strongly dependent on the major Fourier components of the binocular images, consistent with early spatial filtering of the monocular visual inputs prior to their binocular combination as in the disparity-energy model of complex cells in striate cortex [Ohzawa, I., DeAngelis, G. C., & Freeman, R. D. (1990). Stereoscopic depth discrimination in the visual cortex: neurons ideally suited as disparity detectors. Science, 249, 1037-1041].
Collapse
Affiliation(s)
- B M Sheliga
- Laboratory of Sensorimotor Research, National Eye Institute, National Institutes of Health, Building 49, Bethesda, MD 20892-4435, USA.
| | | | | |
Collapse
|
4
|
Abstract
One can measure the disparities between two retinal images in several different ways. Experiments were conducted to identify the measure that is invariant at the threshold for detecting the disparity of two-dimensional patterns. The patterns used were stereo plaids, which permit a partial dissociation between the disparity of the pattern and the disparities of its one-dimensional components. For plaids with near-horizontal disparities, thresholds are limited by a disparity phase shift equal to the threshold phase shift for single gratings. For non-horizontal disparities, thresholds are elevated, yet are still phase-limited. In no disparity direction are thresholds for detecting disparity determined by the spatial extent of the plaids' disparity. Effects of the number and the orientation of components with task-relevant disparities indicate that plaid thresholds are limited by the disparity of the plaid's one-dimensional components. No evidence was found that these components form any higher-order pattern that can be used in detecting disparity. Oblique and near-vertical disparities generate elevated thresholds at a stage beyond component disparity detection. This second stage combines component disparities, which are ambiguous about depth, into pattern disparities capable of supporting veridical depth perception.
Collapse
Affiliation(s)
- Bart Farell
- Institute for Sensory Research, Syracuse University, 621 Skytop Road, Syracuse, NY 13244-5290, USA.
| |
Collapse
|
5
|
Halpern DL, Wilson HR, Blake R. Stereopsis from interocular spatial frequency differences is not robust. Vision Res 1996; 36:2263-70. [PMID: 8776491 DOI: 10.1016/0042-6989(95)00293-6] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Abstract
Based on data obtained using one-dimensional noise patterns, Tyler & Sutter (1979). (Vision Research, 19, 859-865) concluded that stereoscopic tilt can result from an interocular spatial frequency difference in the absence of consistent horizontal disparity. We tested stereopsis using two-dimensional random-dot patterns that were bandpass filtered to contain 1.0 octave bands of spatial frequency with means that differed between the two eyes. With vertical, one-dimensional stimuli we replicated the results of Tyler and Sutter. However, stereoscopic tilt was not perceived based on spatial frequency differences alone when the monocular images contained as little as a +/- 14 deg range of orientation variation. In addition, model simulations demonstrate that the modest steroscopic performance produced by interocular spatial frequency differences in one-dimensional noise patterns are predicted by random disparity correlations at the pattern edges. The observations lead to the conclusion that stereopsis from frequency differences in the absence of pointwise disparity correlations does not reflect a special processing capability of human vision but is an artifact associated with one-dimensional stimuli. As such, it plays no role in steroscopic analysis of the natural environment.
Collapse
Affiliation(s)
- D L Halpern
- Department of Psychology, Brandeis University, Waltham, MA 02154, USA
| | | | | |
Collapse
|
6
|
Chapter 3 Stereoscopic depth perception. Perception 1996. [DOI: 10.1016/s1874-5822(96)80006-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
|
7
|
Abstract
Spatial frequency selectivity has been incorporated into various theories of stereo matching, along with spatial scale interactions operating from coarse-to-fine spatial scales. We concentrate here on the role of fine scale information in the stereo matching process and show that fine scale information is capable of disambiguating matches made at coarser scales. An ambiguous coarse scale stimulus was created by presenting a low frequency (2 c/deg) sine wave in anti-phase to the two eyes, whose endpoints betrayed no information about which way the sine waves should be matched. It could be seen with crossed or uncrossed disparity equally validly and at chance from trial to trial. To this was added a fine scale (8 c/deg) filtered random dot stimulus specifying unambiguously a certain disparity. Observers judged the apparent depth of the two stimuli as the disparity of the fine scale stimulus was varied. The sine wave was usually perceived to have the same sign disparity as the fine scale stimulus. Depth matching with the two superimposed stimuli confirmed that the coarse scale stimulus was actually disambiguated, and seen with disparities equal to half its spatial period. The results suggest the operation of a cross-spatial scale matching disambiguation process, which can operate in a fine-to-coarse fashion.
Collapse
Affiliation(s)
- H S Smallman
- Department of Psychology, University of California, San Diego, La Jolla 92093-0109, USA
| |
Collapse
|
8
|
Abstract
Apparent motion (AM) was studied using the missing-fundamental square-wave grating, displaced discretely over time. At inter-stimulus intervals (ISIs) greater than about 40 msec, AM was seen in the direction of displacement of the visible features of the pattern, while at shorter ISIs AM was seen in the reversed direction, following the displacement of the third harmonic spatial frequency component. This confirms (a) that the "long-range", feature-based process can bridge much greater time-gaps than the "short-range" motion sensors and (b) that the missing-fundamental pattern is a particularly useful tool for teasing the two processes apart.
Collapse
|
9
|
Abstract
Recent anatomical and physiological studies of the visual pathway suggest the existence of at least three parallel processing streams in the lateral geniculate/primary cortex structure--a magno/interblob stream for motion and transient information; a parvo/interblob stream for high spatial frequency, static information; and a parvo/blob stream for chromatic and low spatial frequency information. How does this functional typology relate to the processing for stereoscopic depth? Human stereopsis may be viewed as consisting of three distinct types of disparity processing: coarse, local stereopsis suitable for stereomovement processing by the magno/interblob stream; fine, global stereopsis suitable for the processing of complex random-dot stereograms by the parvo/interblob stream; and simple, protostereopsis for processing size differences between the two eyes by the parvo/blob stream. Extensive psychophysical evidence supports the identification of these three disparity processes with the three processing streams.
Collapse
Affiliation(s)
- C W Tyler
- Smith-Kettlewell Eye Research Institute, San Francisco, CA 94115
| |
Collapse
|
10
|
Abstract
Models of stereopsis generally assume that binocular correspondence is achieved through alignment of luminance edges in the two eyes. Yet the stimulus properties which constitute edge information for stereopsis have not been defined. Three experiments explored the nature of these stimulus properties. The first two experiments tested whether local luminance gradient and the relative phase of spatial components supply information about the position of edges which influences stereosensitivity. In Expt 1, stereothresholds were reduced with increased spatial frequency or contrast of sinusoidal luminance gratings, but no simple relationship between target luminance gradient and stereosensitivity was found. In Expt 2, stereothresholds were equivalent for targets having identical spatial frequency components, but differing in maximum luminance gradient and the relative spatial phase of their components. In addition, stereothresholds were lower for the target having the higher contrast in pairs of unequal-contrast targets having equal maximum luminance gradients. These results suggest that the properties of luminance gradient and relative spatial phase do not influence stereosensitivity independently of spatial frequency and contrast. Experiment 3 directly tested whether stereosensitivity depends on edge information whose disparity is detected independently at different spatial scales. Stereothresholds for IF + 5F compound targets were found to be equivalent to thresholds obtained separately with the more sensitive of the two components. Taken together with a compressive nonlinearity in the relationship between contrast and stereothreshold obtained by others (Halpern and Blake, 1989; Legge and Gu, 1989) and replicated in Expt 1, the results of Expt 3 indicate that, whatever the exact nature of the luminance discontinuity information utilized in disparity detection, it is processed independently at different spatial scales.
Collapse
Affiliation(s)
- T Heckmann
- School of Optometry, University of California, Berkeley 94720
| | | |
Collapse
|
11
|
Schor C, Heckmann T. Interocular differences in contrast and spatial frequency: effects on stereopsis and fusion. Vision Res 1989; 29:837-47. [PMID: 2623826 DOI: 10.1016/0042-6989(89)90095-3] [Citation(s) in RCA: 85] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
Anisometropia produces interocular differences in contrast and spatial frequency. The influence of these two parameters on Panum's fusional limit (PFL) and stereoscopic depth thresholds was investigated with sinusoidal gratings and one-dimensional band-pass-limited targets. Vertical fusion limits were unaffected by large interocular differences in contrast (40-10%) at two spatial frequencies (0.8 and 1.6 cpd). However, when tested with a low spatial frequency (0.8 cpd), stereothresholds increased 150% with an interocular difference in contrast as small as 50-25%. Stereoacuity was reduced less by differential contrast when tested with higher spatial frequencies (3.2 cpd). When tested with low spatial frequencies the stereothreshold was elevated more by reducing the contrast of one image than by equal contrast reductions of both ocular images. Stereothresholds appear to be elevated by binocular suppression evoked by interocular differences in contrast. Vertical as well as horizontal fusion limits decreased with increasing interocular size difference. Horizontal fusion limits fell off more gradually with increasing size difference than did vertical fusion limits, particularly at higher spatial frequencies (2.4 cpd). Similarly, stereothresholds increased with increasing interocular size differences. Changes in the fusion limit and stereothreshold that occur with interocular size differences are predicted from positional disparities between edge features rather than from differences in spatial frequency.
Collapse
Affiliation(s)
- C Schor
- University of California, School of Optometry, Berkeley 94720
| | | |
Collapse
|
12
|
Baro JA, Levinson E. Apparent motion can be perceived between patterns with dissimilar spatial frequencies. Vision Res 1988; 28:1311-3. [PMID: 3256148 DOI: 10.1016/0042-6989(88)90062-4] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
Many recent models of movement processing in the human visual system predict that the perception of apparent motion requires stimuli that are similar in spatial frequency. The data presented here provide an example of the perception of apparent motion between patterns with nonoverlapping harmonic content. When patterns presented in alternate frames are dissimilar, motion can be perceived as long as the velocity is not too high.
Collapse
Affiliation(s)
- J A Baro
- Department of Psychology, Washington University, St Louis, MO 63130
| | | |
Collapse
|
13
|
Shipley T. Field processes in stereovision. A description of stereopsis appropriate to ophthalmology and visual perception. Doc Ophthalmol 1987; 66:95-170. [PMID: 3319467 DOI: 10.1007/bf00140453] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]
Abstract
There is, as yet, no satisfactory theory of stereopsis, despite the fact that our overt knowledge of "solid seeing" is now about 150 years old, and that contributions to our understanding come today from many fields: ophthalmology, psychology, psychophysics, neurophysiology, computer modelling, and optical-TV display technology. We review herein, and demonstrate for the reader whenever possible, certain key perceptual properties of the stereoscopic event of which any general theory must take account: vector stereoscopy and the neural grid, depth in empty visual fields, the relationship between stereoscopic and cognitive contours, stereoscopic contour formation in the presence of blur (thus, at low levels of central visual acuity), the phenomenon of cortical locking and of neural grid evocation in the presence of either peripheral or central rivalry, certain unusual ranges of figural mismatch and the concept of the horopter in relation to modern single cell electroneurophysiology in animals and to the constancy of visual directions. Some comments are also made on the concept of disparity processing by single cortical neurons, together with a short discussion of the implications of certain views of the genetics of stereovision for the perception of novel random texture sine-wave stereograms. We conclude that any theory pertinent to ophthalmology and visual science must combine the global concepts of cortical integration, the neural lock and the neural grid, herein introduced, with the more classical concepts of particulate or local binocular cortical correspondence. Certain preliminary steps in this direction are presented.
Collapse
Affiliation(s)
- T Shipley
- McKnight Vision Research Center, Bascom Palmer Eye Institute, Department of Ophthalmology, University of Miami School of Medicine, FL 33101
| |
Collapse
|
14
|
Abstract
A controversy still exists concerning whether the tilt created with interocular spatial frequency disparity arises from a computation of spatial frequency differences or from cumulative positional disparity. In a first experiment, we examined the influence of positional disparity on tilt created with frequency disparity, reasoning that if tilt were computed from spatial frequency differences, the perceived angle should remain unaltered since adding a positional disparity does not change the harmonic content of the stimulus. The results indicated that positional disparity weakened perceived tilt. In a second experiment, we tested the idea that tilt results from the calculation of increasing positional disparity across the display, arguing if local matches of features in the two eyes are made in computing tilt, then the solution to binocular correspondence may be less ambiguous if the same number of cycles was displayed for both spatial frequencies. Perceived tilt increased when the number of cycles was equal, although the angle of tilt still decreased with positional disparity. In Experiment 3, we further reduced potential sources of ambiguity for the binocular matching process by employing D10s (the tenth derivative of a Gaussian) instead of grating patterns. Positional disparity exerted essentially no influence on the perceived angle of tilt of the D10s. Taken together, the results of these experiments suggest that tilt from frequency disparity can be explained solely on the basis of positional disparity.
Collapse
Affiliation(s)
- D L Halpern
- Department of Psychology, Northwestern University, Evanston, IL 60201
| | | | | |
Collapse
|
15
|
Abstract
The range of spatial tuning for channels that process static and dynamic disparities was investigated in the central visual field by measuring stereoscopic thresholds as a function of the difference in size of spatially filtered bar-like patterns presented to the two eyes. Spatial tuning functions were revealed by an elevation of stereothreshold as the difference between the widths of bar patterns increased. Functions tuned to low spatial frequencies (0.075-2 c/deg) were classified as transient since their stereosensitivity was greater for dynamic (1 Hz) than static disparities. Functions tuned to high spatial frequencies (2.4-19 c/deg) were classified as sustained since their stereosensitivity was equal for dynamic and static disparities. When equal width patterns were presented to the two eyes, stereothreshold increased with spatial periods greater than 0.4 deg according to a constant 6 deg phase disparity. This size-disparity correlation suggests that large disparities are processed by spatial filters tuned to disparities proportional to their receptive field dimensions.
Collapse
|
16
|
Richards W, Foley JM. Spatial bandwidth of channels for slant estimated from complex gratings. JOURNAL OF THE OPTICAL SOCIETY OF AMERICA 1981; 71:274-9. [PMID: 7218071 DOI: 10.1364/josa.71.000274] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/24/2023]
Abstract
Vertical sinusoidal gratings of slightly differing spatial frequencies presented to each eye lead to the perception of a slanted plane. If a complex pattern is created by presenting two different frequencies to one eye, while the other eye views an increment of one of those frequencies and a decrement of the other, then the stimulus is equivalent to two superimposed vertical planes slanted in opposite directions. Yet the observer generally sees only a single surface at a slant intermediate to the possible extremes. As the relative spatial-frequency content of the two opposing planes is varied, the observed slant is a measure of the strengths of the connecting interactions of the underlying stereomechanisms. From these data the bandwidth of the monocular input channels for the binocular slant mechanism can be estimated and is found to be about two octaves.
Collapse
|
17
|
|
18
|
|