Journal Article•DOI•

Model of human visual-motion sensing

01 Feb 1985 - Journal of the Optical Society of America A: Optics, Image Science, and Vision (Optical Society of America) - Vol. 2, Iss. 2, pp. 322-341
TL;DR: A model of how humans sense the velocity of moving images, using a set of spatial-frequency-tuned, direction-selective linear sensors, agrees qualitatively with human perception.
Abstract: We propose a model of how humans sense the velocity of moving images. The model exploits constraints provided by human psychophysics, notably that motion-sensing elements appear tuned for two-dimensional spatial frequency, and by the frequency spectrum of a moving image, namely, that its support lies in the plane in which the temporal frequency equals the dot product of the spatial frequency and the image velocity. The first stage of the model is a set of spatial-frequency-tuned, direction-selective linear sensors. The temporal frequency of the response of each sensor is shown to encode the component of the image velocity in the sensor direction. At the second stage, these components are resolved in order to measure the velocity of image motion at each of a number of spatial locations and spatial frequencies. The model has been applied to several illustrative examples, including apparent motion, coherent gratings, and natural image sequences. The model agrees qualitatively with human perception.
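
The second-stage computation described in the abstract, resolving the component velocities reported by direction-selective sensors into a single two-dimensional image velocity, can be illustrated with a small numerical sketch. A sensor tuned to unit direction n_i measures the component v · n_i of the image velocity v, so stacking these constraints and solving them in the least-squares sense recovers v. The sensor directions, units, and least-squares solver below are illustrative assumptions, not the paper's exact resolution scheme.

```python
import numpy as np

def resolve_velocity(directions_deg, component_speeds):
    """Recover a 2-D image velocity from the component speeds measured
    along several sensor directions (illustrative least-squares sketch).

    A sensor tuned to unit direction n_i reports s_i = v . n_i, the
    component of the image velocity in its own direction; stacking the
    constraints gives N v = s, solved here in the least-squares sense.
    """
    theta = np.deg2rad(np.asarray(directions_deg, dtype=float))
    N = np.column_stack([np.cos(theta), np.sin(theta)])   # sensor unit directions
    s = np.asarray(component_speeds, dtype=float)
    v, *_ = np.linalg.lstsq(N, s, rcond=None)
    return v                                              # (vx, vy)

# Example: true velocity (3, 1) with sensors at 0, 60, and 120 degrees
v_true = np.array([3.0, 1.0])
dirs = [0.0, 60.0, 120.0]
speeds = [v_true @ np.array([np.cos(np.deg2rad(d)), np.sin(np.deg2rad(d))]) for d in dirs]
print(resolve_velocity(dirs, speeds))                     # ~ [3. 1.]
```

The same dot-product relationship underlies the frequency-domain constraint quoted in the abstract: for a translating image, a sensor's temporal frequency is (up to sign) its spatial frequency times the velocity component along its preferred direction.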
Citations
Journal Article•DOI•
TL;DR: An empirical comparison of widely cited optical flow techniques, concentrating on the accuracy, reliability, and density of the velocity measurements, shows that performance can differ significantly among the techniques the authors implemented.
Abstract: While different optical flow techniques continue to appear, there has been a lack of quantitative evaluation of existing methods. For a common set of real and synthetic image sequences, we report the results of a number of regularly cited optical flow techniques, including instances of differential, matching, energy-based, and phase-based methods. Our comparisons are primarily empirical, and concentrate on the accuracy, reliability, and density of the velocity measurements; they show that performance can differ significantly among the techniques we implemented.
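
Accuracy in this kind of comparison is commonly reported as an angular error between estimated and true flow vectors embedded as 3-D directions (u, v, 1). The sketch below assumes dense flow fields stored as NumPy arrays and illustrates that style of metric; it is not a reproduction of the authors' evaluation code.

```python
import numpy as np

def angular_error_deg(u_est, v_est, u_true, v_true):
    """Mean angular error (degrees) between estimated and true flow fields.

    Each flow vector (u, v) is embedded as the 3-D direction (u, v, 1) and
    normalized; the reported error is the angle between the two directions,
    which penalizes both direction and magnitude errors without diverging
    for small velocities.
    """
    est = np.stack([u_est, v_est, np.ones_like(u_est)], axis=-1)
    tru = np.stack([u_true, v_true, np.ones_like(u_true)], axis=-1)
    est = est / np.linalg.norm(est, axis=-1, keepdims=True)
    tru = tru / np.linalg.norm(tru, axis=-1, keepdims=True)
    cos = np.clip(np.sum(est * tru, axis=-1), -1.0, 1.0)
    return float(np.degrees(np.arccos(cos)).mean())

# Example: a constant true flow of (1, 0) pixels/frame versus a biased estimate
u_t = np.ones((64, 64)); v_t = np.zeros((64, 64))
u_e = 0.9 * u_t;          v_e = 0.1 * np.ones_like(v_t)
print(angular_error_deg(u_e, v_e, u_t, v_t))   # a few degrees of mean error
```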

4,771 citations


Cites methods from "Model of human visual-motion sensing"

  • ...These are also called frequency-based methods owing to the design of velocity-tuned filters in the Fourier domain [1, 23, 49, 59]....


Book•
01 Jan 2001
TL;DR: This text introduces the basic mathematical and computational methods of theoretical neuroscience and presents applications in a variety of areas including vision, sensory-motor integration, development, learning, and memory.
Abstract: Theoretical neuroscience provides a quantitative basis for describing what nervous systems do, determining how they function, and uncovering the general principles by which they operate. This text introduces the basic mathematical and computational methods of theoretical neuroscience and presents applications in a variety of areas including vision, sensory-motor integration, development, learning, and memory. The book is divided into three parts. Part I discusses the relationship between sensory stimuli and neural responses, focusing on the representation of information by the spiking activity of neurons. Part II discusses the modeling of neurons and neural circuits on the basis of cellular and synaptic biophysics. Part III analyzes the role of plasticity in development and learning. An appendix covers the mathematical methods used, and exercises are available on the book's Web site.

3,441 citations

Journal Article•DOI•
TL;DR: An interpretation of image texture as a region code, or carrier of region information, is emphasized and examples are given of both types of texture processing using a variety of real and synthetic textures.
Abstract: A computational approach for analyzing visible textures is described. Textures are modeled as irradiance patterns containing a limited range of spatial frequencies, where mutually distinct textures differ significantly in their dominant characterizing frequencies. By encoding images into multiple narrow spatial frequency and orientation channels, the slowly varying channel envelopes (amplitude and phase) are used to segregate textural regions of different spatial frequency, orientation, or phase characteristics. Thus, an interpretation of image texture as a region code, or carrier of region information, is emphasized. The channel filters used, known as the two-dimensional Gabor functions, are useful for these purposes in several senses: they have tunable orientation and radial frequency bandwidths and tunable center frequencies, and they optimally achieve joint resolution in space and in spatial frequency. By comparing the channel amplitude responses, one can detect boundaries between textures. Locating large variations in the channel phase responses allows discontinuities in the texture phase to be detected. Examples are given of both types of texture processing using a variety of real and synthetic textures.
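
The channel computation described above can be sketched with a single complex 2-D Gabor filter: the modulus of the filtered image is the slowly varying amplitude envelope used to segregate textures with different dominant frequencies. The filter parameters, the FFT-based convolution, and the synthetic two-texture image below are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

def gabor_amplitude(image, freq, theta, sigma):
    """Amplitude envelope of one complex 2-D Gabor channel (illustrative).

    freq  : radial centre frequency in cycles/pixel
    theta : channel orientation in radians
    sigma : width of the Gaussian envelope in pixels
    """
    h, w = image.shape
    y, x = np.mgrid[-(h // 2):(h - h // 2), -(w // 2):(w - w // 2)]
    u = x * np.cos(theta) + y * np.sin(theta)            # coordinate along the carrier
    gabor = np.exp(-(x**2 + y**2) / (2 * sigma**2)) * np.exp(2j * np.pi * freq * u)
    # Circular convolution via the FFT; the modulus of the complex response
    # is the slowly varying amplitude envelope of the channel.
    response = np.fft.ifft2(np.fft.fft2(image) * np.fft.fft2(np.fft.ifftshift(gabor)))
    return np.abs(response)

# Example: two abutting synthetic textures with different dominant frequencies
h, w = 128, 128
x = np.arange(w)
left, right = np.sin(2 * np.pi * 0.10 * x), np.sin(2 * np.pi * 0.30 * x)
img = np.tile(np.concatenate([left[: w // 2], right[w // 2:]]), (h, 1))
env = gabor_amplitude(img, freq=0.10, theta=0.0, sigma=8.0)
print(env[:, : w // 2].mean(), env[:, w // 2:].mean())   # the 0.10 c/px region dominates
```

Comparing such amplitude maps across channels localizes texture boundaries; the phase of the complex response supports the second, phase-discontinuity style of segmentation mentioned in the abstract.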

1,582 citations

Journal Article•DOI•
TL;DR: Evaluates the proposal that the cortical and subcortical pathways are continuous, so that distinct channels of information arising in the retina remain segregated up to the highest levels of visual cortex; this proposal has far-reaching implications for understanding the functional organization of the visual system.
Abstract: The visual system, like all sensory systems, contains parallel pathways (see Stone 1983). Recently, much emphasis has been placed on the relationship between two subcortical and two cortical pathways. It has been suggested that the cortical and subcortical pathways are continuous, so that distinct channels of information that arise in the retina remain segregated up to the highest levels of visual cortex. According to this view, the visual system comprises two largely independent subsystems that mediate different classes of visual behaviors. In this paper, we evaluate this proposal, which has far-reaching implications for our understanding of the functional organization of the visual system. The subcortical projection from the retina to cerebral cortex is strongly dominated by the two pathways (M and P pathways) that are relayed by the magnocellular and parvocellular subdivisions of the lateral geniculate nucleus (LGN) (see Shapley & Perry 1986). The importance of these pathways is demonstrated by the fact that they include about 90% of the axons that leave the retinas (Silveira & Perry 1991) and that little vision survives when both pathways are destroyed (Schiller et al 1990a). The P and M pathways maintain their sharp anatomical segregation through the termination of the LGN projection in layer 4C of V1 (striate cortex). The complex network of connections in primate extrastriate visual cortex...

1,580 citations


Cites background from "Model of human visual-motion sensing"

  • ...Lesion studies suggest that the most fundamental specialization of these two pathways may be the ability to transmit different regions of the "window of visibility" (Watson & Ahumada 1985), i.e. the range of temporal and spatial frequencies that can be seen....


Journal Article•DOI•
TL;DR: The computation of optical flow is investigated in this survey: widely known methods for estimating optical flow are classified and examined by scrutinizing the hypotheses and assumptions they use.
Abstract: Two-dimensional image motion is the projection of the three-dimensional motion of objects, relative to a visual sensor, onto its image plane. Sequences of time-ordered images allow the estimation of projected two-dimensional image motion as either instantaneous image velocities or discrete image displacements. These are usually called the optical flow field or the image velocity field. Provided that optical flow is a reliable approximation to two-dimensional image motion, it may then be used to recover the three-dimensional motion of the visual sensor (to within a scale factor) and the three-dimensional surface structure (shape or relative depth) through assumptions concerning the structure of the optical flow field, the three-dimensional environment, and the motion of the sensor. Optical flow may also be used to perform motion detection, object segmentation, time-to-collision and focus of expansion calculations, motion-compensated encoding, and stereo disparity measurement. We investigate the computation of optical flow in this survey: widely known methods for estimating optical flow are classified and examined by scrutinizing the hypotheses and assumptions they use. The survey concludes with a discussion of current research issues.
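
As a concrete instance of the differential methods covered by such surveys, the sketch below estimates a single local velocity from the brightness-constancy constraint Ix·u + Iy·v + It = 0, solved over a small patch in the least-squares (Lucas-Kanade style) sense. The gradient operators, patch size, and test pattern are illustrative choices rather than anything prescribed by this survey.

```python
import numpy as np

def local_flow(frame0, frame1):
    """Least-squares velocity estimate for one small image patch.

    Differential methods assume brightness constancy, giving one linear
    constraint Ix*u + Iy*v + It = 0 per pixel; stacking the constraints
    over the patch and solving in the least-squares sense yields (u, v).
    """
    Ix = np.gradient(frame0, axis=1)                     # spatial derivatives
    Iy = np.gradient(frame0, axis=0)
    It = frame1 - frame0                                 # temporal derivative (one frame step)
    A = np.column_stack([Ix.ravel(), Iy.ravel()])
    b = -It.ravel()
    (u, v), *_ = np.linalg.lstsq(A, b, rcond=None)
    return u, v

# Example: a sinusoidal patch translating by (1, 0) pixels per frame
yy, xx = np.mgrid[0:32, 0:32]
f0 = np.sin(2 * np.pi * 0.05 * xx) + np.sin(2 * np.pi * 0.05 * yy)
f1 = np.sin(2 * np.pi * 0.05 * (xx - 1.0)) + np.sin(2 * np.pi * 0.05 * yy)
print(local_flow(f0, f1))                                # approximately (1.0, 0.0)
```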

1,317 citations

References
Journal Article•DOI•
TL;DR: A technique for image encoding in which local operators of many scales but identical shape serve as the basis functions; the resulting code tends to enhance salient image features and is well suited for many image analysis tasks as well as for image compression.
Abstract: We describe a technique for image encoding in which local operators of many scales but identical shape serve as the basis functions. The representation differs from established techniques in that the code elements are localized in spatial frequency as well as in space. Pixel-to-pixel correlations are first removed by subtracting a low-pass filtered copy of the image from the image itself. The result is a net data compression since the difference, or error, image has low variance and entropy, and the low-pass filtered image may be represented at reduced sample density. Further data compression is achieved by quantizing the difference image. These steps are then repeated to compress the low-pass image. Iteration of the process at appropriately expanded scales generates a pyramid data structure. The encoding process is equivalent to sampling the image with Laplacian operators of many scales. Thus, the code tends to enhance salient image features. A further advantage of the present code is that it is well suited for many image analysis tasks as well as for image compression. Fast algorithms are described for coding and decoding.
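
The encoding loop described in the abstract (subtract a low-pass copy, keep the low-variance difference image, then recurse on the subsampled low-pass image) can be sketched directly; the separable [1 4 6 4 1]/16 kernel and the number of levels below are illustrative choices, not necessarily the paper's exact generating kernel.

```python
import numpy as np

def _blur_downsample(img):
    """Low-pass filter with a separable [1 4 6 4 1]/16 kernel, then keep every other sample."""
    k = np.array([1.0, 4.0, 6.0, 4.0, 1.0]) / 16.0
    blurred = np.apply_along_axis(lambda r: np.convolve(r, k, mode="same"), 1, img)
    blurred = np.apply_along_axis(lambda c: np.convolve(c, k, mode="same"), 0, blurred)
    return blurred, blurred[::2, ::2]

def laplacian_pyramid(img, levels=4):
    """Build a pyramid in which each level stores the image minus its low-pass copy."""
    pyramid, current = [], img.astype(float)
    for _ in range(levels):
        lowpass, reduced = _blur_downsample(current)
        pyramid.append(current - lowpass)    # band-pass "error" image: low variance and entropy
        current = reduced                    # recurse on the subsampled low-pass image
    pyramid.append(current)                  # coarse low-pass residue at the top of the pyramid
    return pyramid

# Example: the difference images concentrate energy near the edge and have small variance
img = np.zeros((64, 64)); img[:, 32:] = 1.0  # a step edge
for i, level in enumerate(laplacian_pyramid(img)):
    print(f"level {i}: shape {level.shape}, variance {level.var():.4f}")
```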

6,975 citations

Book•
01 Jan 1950

3,843 citations

Journal Article•DOI•
TL;DR: The contrast thresholds of a variety of grating patterns have been measured over a wide range of spatial frequencies; the results can be explained by independent, linearly operating mechanisms within the nervous system, each selectively sensitive to a limited range of spatial frequencies.
Abstract:
1. The contrast thresholds of a variety of grating patterns have been measured over a wide range of spatial frequencies.
2. Contrast thresholds for the detection of gratings whose luminance profiles are sine, square, rectangular or saw-tooth waves can be simply related using Fourier theory.
3. Over a wide range of spatial frequencies the contrast threshold of a grating is determined only by the amplitude of the fundamental Fourier component of its wave form.
4. Gratings of complex wave form cannot be distinguished from sine-wave gratings until their contrast has been raised to a level at which the higher harmonic components reach their independent threshold.
5. These findings can be explained by the existence within the nervous system of linearly operating independent mechanisms selectively sensitive to limited ranges of spatial frequencies.
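
Finding 3 has a simple quantitative consequence that can be checked numerically: a square-wave grating of contrast m has a fundamental Fourier component of amplitude 4m/π, so if detection depends only on the fundamental, the square wave should reach threshold at a contrast about π/4 ≈ 0.785 of the matching sine-wave threshold. The FFT bookkeeping below is an illustrative check of that arithmetic, not an analysis taken from the paper.

```python
import numpy as np

def fundamental_amplitude(waveform):
    """Amplitude of the fundamental Fourier component of a waveform sampled
    over exactly one cycle."""
    spectrum = np.fft.rfft(waveform) / len(waveform)
    return 2 * np.abs(spectrum[1])           # one cycle per record, so bin 1 is the fundamental

n, contrast = 1024, 0.01                     # one cycle sampled at 1024 points
phase = 2 * np.pi * np.arange(n) / n
sine = contrast * np.sin(phase)
square = contrast * np.sign(np.sin(phase))

a_sine, a_square = fundamental_amplitude(sine), fundamental_amplitude(square)
print(a_sine, a_square, a_square / a_sine)   # ~ contrast, 4*contrast/pi, 4/pi = 1.27
# If threshold is set by the fundamental alone (finding 3), the square-wave grating
# becomes detectable at a contrast lower by a factor of pi/4 = 0.785 than the sine wave.
```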

3,073 citations

Journal Article•DOI•
The Perception of the Visual World

2,250 citations

Book•
01 Jan 1979
TL;DR: This book uses the methodology of artificial intelligence to investigate the phenomena of visual motion perception: how the visual system constructs descriptions of the environment in terms of objects, their three-dimensional shape, and their motion through space, on the basis of the changing image that reaches the eye.
Abstract: This book uses the methodology of artificial intelligence to investigate the phenomena of visual motion perception: how the visual system constructs descriptions of the environment in terms of objects, their three-dimensional shape, and their motion through space, on the basis of the changing image that reaches the eye. The author has analyzed the computations performed in the course of visual motion analysis. Workable schemes able to perform certain tasks performed by the visual system have been constructed and used as vehicles for investigating the problems faced by the visual system and its methods for solving them. Two major problems are treated: first, the correspondence problem, which concerns the identification of image elements that represent the same object at different times, thereby maintaining the perceptual identity of the object in motion or in change. The second problem is the three-dimensional interpretation of the changing image once a correspondence has been established. The author's computational approach to visual theory makes the work unique, and it should be of interest to psychologists working in visual perception and readers interested in cognitive studies in general, as well as computer scientists interested in machine vision, theoretical neurophysiologists, and philosophers of science.
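
The correspondence problem described above can be illustrated with a toy minimal-mapping matcher: pair the dots of one frame with the dots of the next so that the total displacement is as small as possible. The brute-force search below is purely illustrative and is not Ullman's actual algorithm or affinity measure.

```python
from itertools import permutations
import math

def minimal_mapping(frame1, frame2):
    """Brute-force minimal-total-displacement matching between two dot frames.

    Each frame is a list of (x, y) dot positions of equal length.  The result
    pairs every dot in frame1 with a distinct dot in frame2 so that the summed
    displacement is minimized (feasible only for a handful of dots).
    """
    best_cost, best_pairs = math.inf, None
    for perm in permutations(range(len(frame2))):
        cost = sum(math.dist(frame1[i], frame2[j]) for i, j in enumerate(perm))
        if cost < best_cost:
            best_cost = cost
            best_pairs = [(frame1[i], frame2[j]) for i, j in enumerate(perm)]
    return best_pairs, best_cost

# Example: three dots translating roughly one unit to the right
f1 = [(0.0, 0.0), (2.0, 0.0), (1.0, 2.0)]
f2 = [(1.1, 0.0), (3.0, 0.1), (2.0, 2.0)]
pairs, cost = minimal_mapping(f1, f2)
print(pairs)    # each dot is matched to its translated counterpart
print(cost)     # total displacement of the minimal mapping
```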

2,070 citations