Open Access Book

Early processing of visual information

TLDR
It is argued that "non-attentive" vision is in practice implemented by grouping operations and first order discriminations acting on the primal sketch, and that higher-level knowledge should influence the control of, rather than interfere with, the data-processing taking place lower down.
Abstract
An introduction is given to a theory of early visual information processing. The theory has been implemented, and examples are given of images at various stages of analysis. It is argued that the first step of consequence is to compute a primitive but rich description of the grey-level changes present in an image. The description is expressed in a vocabulary of kinds of intensity change (EDGE, SHADING-EDGE, EXTENDED-EDGE, LINE, BLOB etc.). Modifying parameters are bound to the elements in the description, specifying their POSITION, ORIENTATION, TERMINATION points, CONTRAST, SIZE and FUZZINESS. This description is obtained from the intensity array by fixed techniques, and it is called the primal sketch. For most images, the primal sketch is large and unwieldy. The second important step in visual information processing is to group its contents in a way that is appropriate for later recognition. From our ability to interpret drawings with little semantic content, one may infer the presence in our perceptual equipment of symbolic processes that can define "place-tokens" in an image in various ways, and can group them according to certain rules. Homomorphic techniques fail to account for many of these grouping phenomena, whose explanations require mechanisms of construction rather than mechanisms of detection. The necessary grouping of elements in the primal sketch may be achieved by a mechanism that has available the processes inferred from above, together with the ability to select items by first order discriminations acting on the elements' parameters. Only occasionally do these mechanisms use downward-flowing information about the contents of the particular image being processed. It is argued that "non-attentive" vision is in practice implemented by these grouping operations and first order discriminations acting on the primal sketch. The class of computations so obtained differs slightly from the class of second order operations on the intensity array. 
The extraction of a form from the primal sketch using these techniques amounts to the separation of figure from ground. It is concluded that most of the separation can be carried out by using techniques that do not depend upon the particular image in question. Therefore, figure-ground separation can normally precede the description of the shape of the extracted form. Up to this point, higher-level knowledge and purpose are brought to bear on only a few of the decisions taken during the processing. This relegates the widespread use of downward-flowing information to a later stage than is found in current machine-vision programs, and implies that such knowledge should influence the control of, rather than interfering with, the actual data-processing that is taking place lower down.
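The vocabulary and the selection step described in the abstract can be illustrated with a minimal sketch. It is not the implementation the abstract refers to. The `Primitive` class, its field types, the `select` and `orientation_near` helpers, and the sample values are all assumptions made for illustration, and TERMINATION points are omitted for brevity. The sketch only shows what a "first order discrimination acting on the elements' parameters" could look like once a primal sketch already exists.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass(frozen=True)
class Primitive:
    """One element of a hypothetical primal sketch (field names are assumed)."""
    kind: str                      # e.g. "EDGE", "LINE", "BLOB"
    position: Tuple[float, float]  # (x, y) in image coordinates
    orientation: float             # degrees, treated as an axis in [0, 180)
    contrast: float
    size: float
    fuzziness: float


def select(sketch: List[Primitive],
           predicate: Callable[[Primitive], bool]) -> List[Primitive]:
    """Keep the elements whose own parameters satisfy the predicate."""
    return [p for p in sketch if predicate(p)]


def orientation_near(target: float,
                     tolerance: float) -> Callable[[Primitive], bool]:
    """Build a predicate that tests one parameter (orientation) per element."""
    def test(p: Primitive) -> bool:
        diff = abs(p.orientation - target) % 180.0
        # Orientations are axes, so 175 degrees is only 5 degrees from 0.
        return min(diff, 180.0 - diff) <= tolerance
    return test


# Made-up sample data; a real primal sketch would be computed from an image.
sketch = [
    Primitive("EDGE", (10.0, 12.0), 5.0, 0.8, 4.0, 0.1),
    Primitive("LINE", (11.0, 30.0), 175.0, 0.6, 9.0, 0.2),
    Primitive("BLOB", (40.0, 41.0), 90.0, 0.3, 2.0, 0.5),
]

near_horizontal = select(sketch, orientation_near(0.0, 10.0))
```

Here the predicate inspects each element in isolation, which is the sense in which the discrimination is first order; grouping the selected elements into larger forms would be a separate step.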

Citations
Book Chapter

The quantized geometry of visual space: the coherent computation of depth, form, and lightness

TL;DR: A theory is presented of how global visual interactions between depth, length, lightness, and form percepts can occur and suggests how quantized activity patterns which reflect these visual properties can coherently fill-in, or complete, visually ambiguous regions starting with visually informative data features.
Journal Article

Multi-Scale Blur Estimation and Edge Type Classification for Scene Analysis

TL;DR: Signatures, in this work, are multi-scale representations of local gray-level information tied to places in gray scale images where regional differences are locally maximal, and theory on apparent widths, absence/presence of edges in pulse edge pairs is developed.
Journal Article

A comprehensive review of past and present vision-based techniques for gait recognition

TL;DR: In this paper, the authors survey current techniques of gait recognition and modelling with the environment in which the research was conducted and discuss the issues arising from deriving gait data, such as perspective and occlusion effects, together with the associated computer vision challenges of reliable tracking of human movement.
Journal Article

Reference frames and shape perception

TL;DR: Four experiments showing that orientational transformations which aligned different axes with the vertical severely disrupted the matching of shapes with an ambiguous model axis are interpreted in favor of a computational approach to vision in which shapes are internally represented by description relative to a perceptual reference frame.
References
Journal Article

Receptive fields, binocular interaction and functional architecture in the cat's visual cortex

TL;DR: This method is used to examine receptive fields of a more complex type and to make additional observations on binocular interaction and this approach is necessary in order to understand the behaviour of individual cells, but it fails to deal with the problem of the relationship of one cell to its neighbours.
Journal Article

Simple memory: a theory for archicortex.

TL;DR: It is shown that rather general numerical constraints roughly determine the dimensions of memorizing models for the mammalian brain, and from these is derived a general model for archicortex.
Journal Article

Edge and Curve Detection for Visual Scene Analysis

TL;DR: Simple sets of parallel operations are described which can be used to detect texture edges, "spots," and "streaks" in digitized pictures and it is shown that a composite output is constructed in which edges between differently textured regions are detected, and isolated objects are also detected, but the objects composing the textures are ignored.
Journal Article

The visual cortex as a spatial frequency analyser

TL;DR: Unitary responses to sinusoidal gratings either moving or alternating in phase have been investigated in the optic tract, lateral geniculate body and visual cortex of the cat as a function of the spatial frequency, position of the grating with respect to the cell receptive field and grating contrast.