scispace - formally typeset
Search or ask a question

Showing papers on "Monocular vision published in 1987"


Journal ArticleDOI
01 Jan 1987-Nature
TL;DR: A simple algorithm for computing the three-dimensional structure of a scene from a correlated pair of perspective projections is described here, when the spatial relationship between the two projections is unknown.
Abstract: A simple algorithm for computing the three-dimensional structure of a scene from a correlated pair of perspective projections is described here, when the spatial relationship between the two projections is unknown. This problem is relevant not only to photographic surveying1 but also to binocular vision2, where the non-visual information available to the observer about the orientation and focal length of each eye is much less accurate than the optical information supplied by the retinal images themselves. The problem also arises in monocular perception of motion3, where the two projections represent views which are separated in time as well as space. As Marr and Poggio4 have noted, the fusing of two images to produce a three-dimensional percept involves two distinct processes: the establishment of a 1:1 correspondence between image points in the two views—the ‘correspondence problem’—and the use of the associated disparities for determining the distances of visible elements in the scene. I shall assume that the correspondence problem has been solved; the problem of reconstructing the scene then reduces to that of finding the relative orientation of the two viewpoints.

2,671 citations


Journal ArticleDOI
TL;DR: The hypothesis was tested that the coupling of parallax shifts between objects depicted on a monitor screen around a fixation point with the head movements of an observer viewing this screen monocularly around a point coinciding with the fixation point is sufficient to create a convincing depth impression and to enable the observer to make reliable estimations of depth.
Abstract: The hypothesis was tested that the coupling of parallax shifts between objects depicted on a monitor screen around a fixation point with the head movements of an observer viewing this screen monocularly around a point coinciding with the fixation point is sufficient to create a convincing depth impression and to enable the observer to make reliable estimations of depth. The estimates were based on monocular vision and involved the aligning of wedges. The investigation consisted of two analogous experiments carried out simultaneously, one on depth estimations virtually in the screen and one on depth estimations virtually in front of the screen. In each experiment three conditions were compared: an active condition in which the coupling of parallax shifts and observer's head movements operated, a passive condition in which it did not, and a real-life set-up to measure the maximum reliability in depth estimation. The hypothesis is confirmed: in the active condition the variances in the alignments are significantly smaller than in the passive condition and approach those in the real-life set-up. This holds not only for estimates in the screen but also for estimates in front of the screen, that is, we can make a thing apparently leap out of the screen towards the observer. Results are interpreted against the background of the debate between the direct and the indirect theories of perception.

42 citations


Journal ArticleDOI
TL;DR: The experiments demonstrate that in orthographic projection the visual system can compute from local surface orientation a depth quantity that is commensurate with the relative depth derived from stereo disparity, and suggests that, at least in the near field, the zero value for relative depth lies at the same absolute depth as the stereo horopter.
Abstract: It is generally expected that depth (distance) is the internal representational primitive that corresponds to much of the perception of 3D. We tested this assumption in monocular surface stimuli that are devoid of distance information (due to orthographic projection and the chosen surface shape, with perspective projection used as a control) and yet are vividly three-dimensional. Slant judgments were found to be in close correspondence with the actual geometric slant of the stimuli; the spatial orientation of the surfaces was perceived accurately. The apparent depth in these stimuli was then tested by superimposing a stereo depth probe over the monocular surface. In both the perspective and orthographic projection the gradient of perceived depth, measured by matching the apparent depth of the stereo probe with that of the monocular surface at a series of locations, was substantial. The experiments demonstrate that in orthographic projection the visual system can compute from local surface orientation a depth quantity that is commensurate with the relative depth derived from stereo disparity. The depth data suggests that, at least in the near field, the zero value for relative depth lies at the same absolute depth as the stereo horopter (locus of zero stereo disparity). Relative to this zero value, the depth-from-slant computation seems to provide an estimate of distance information that is independent of the absolute distance to the surface.

41 citations


Journal ArticleDOI
R. A. Clement1
TL;DR: The mathematical analysis of binocular vision introduced by Helmholtz is applied and it is shown that matching the images from the left and right eyes along radial directions is an alternative to matching images along the horizontal direction only.
Abstract: The mathematical analysis of binocular vision introduced by Helmholtz is applied to the problem of the use of disparity information to position a stimulus in depth. It is shown that matching the images from the left and right eyes along radial directions is an alternative to matching images along the horizontal direction only.

5 citations


Journal ArticleDOI
TL;DR: Using monocular and dynamic random dot correlogram stimuli, sequential visual evoked potentials changes were demonstrated in 2 patients following cerebral blindness, providing evidence for the vulnerability of postsynaptic cortical mechanisms of human binocular vision.

4 citations


01 Jan 1987
TL;DR: The functional structure of the telerobotic vision system used in the Langley Research Center's (LaRC) Intelligent Systems Research Laboratory (ISRL) and discusses two monovision techniques for accomplishing the vision subtasks are described.
Abstract: The primary task of the vision sensor in a telerobotic system is to provide information about the position of the system's effector relative to objects of interest in its environment. The subtasks required to perform the primary task include image segmentation, object recognition, and object location and orientation in some coordinate system. The accomplishment of the vision task requires the appropriate processing tools and the system methodology to effectively apply the tools to the subtasks. The functional structure of the telerobotic vision system used in the Langley Research Center's Intelligent Systems Research Laboratory is discussed as well as two monovision techniques for accomplishing the vision subtasks.

3 citations


Patent
05 Jan 1987
TL;DR: In this article, a mask resting on the experimenter's face, covering the eyes and nose, is used to simulate the cosmic energy conveyed by the coloured chips of identical colour on each half-screen.
Abstract: The invention takes the form of a mask resting on the experimenter's face, covering the eyes and nose. The apparatus enables removable, coloured, translucent screens (1) to be visualised in mono- or binocular vision by means of a removable internal partition (4). The portable model permits only monocular vision produced as a result of the partition (1). Respiration using the apparatus is carried out via orifices (2) pierced in the mask; one or more loudspeakers (3), integral with the apparatus, enable vibratory energy complementary to the coloured area contained in the apparatus to be introduced. The front of the mask of the portable model possesses a slide channel enabling two opaque half-screens (2), perforated to incorporate translucent coloured chips (3) whose colours are suited to the desired objective, to be brought into position. The half-screens (2), which are movable in respect of their spacing, permit visual superposition of chips of identical colour on each half- screen by binocular mental construction of the phenomenon of vision. This device, bringing about recentering of the eyes, enables the cosmic energy conveyed by the coloured chips (3) to be absorbed symmetrically by the energy centre of each eye.

3 citations


Journal ArticleDOI
TL;DR: Rod-cone interaction at moderate background levels when both test and concentric disk-background stimuli enter the same eye (monocular condition) but not when they enter different eyes (dichoptic condition) was studied in this article.

3 citations