scispace - formally typeset
Search or ask a question

Showing papers in "Journal of Vision in 2002"


Journal Article•DOI•
TL;DR: A framework is developed that transforms biological motion into a representation allowing for analysis using linear methods from statistics and pattern recognition, and reveals that the dynamic part of the motion contains more information about gender than motion-mediated structural cues.
Abstract: Biological motion contains information about the identity of an agent as well as about his or her actions, intentions, and emotions. The human visual system is highly sensitive to biological motion and capable of extracting socially relevant information from it. Here we investigate the question of how such information is encoded in biological motion patterns and how such information can be retrieved. A framework is developed that transforms biological motion into a representation allowing for analysis using linear methods from statistics and pattern recognition. Using gender classification as an example, simple classifiers are constructed and compared to psychophysical data from human observers. The analysis reveals that the dynamic part of the motion contains more information about gender than motion-mediated structural cues. The proposed framework can be used not only for analysis of biological motion but also to synthesize new motion patterns. A simple motion modeler is presented that can be used to visualize and exaggerate the differences in male and female walking patterns.

866 citations


Journal Article•DOI•
TL;DR: This study conducted experiments in which human participants trace perceived contours in natural images, and employed this novel methodology to investigate the inferential power of three classical Gestalt cues for contour grouping: proximity, good continuation, and luminance similarity.
Abstract: Although numerous studies have measured the strength of visual grouping cues for controlled psychophysical stimuli, little is known about the statistical utility of these various cues for natural images. In this study, we conducted experiments in which human participants trace perceived contours in natural images. These contours are automatically mapped to sequences of discrete tangent elements detected in the image. By examining relational properties between pairs of successive tangents on these traced curves, and between randomly selected pairs of tangents, we are able to estimate the likelihood distributions required to construct an optimal Bayesian model for contour grouping. We employed this novel methodology to investigate the inferential power of three classical Gestalt cues for contour grouping: proximity, good continuation, and luminance similarity. The study yielded a number of important results: (1) these cues, when appropriately defined, are approximately uncorrelated, suggesting a simple factorial model for statistical inference; (2) moderate image-to-image variation of the statistics indicates the utility of general probabilistic models for perceptual organization; (3) these cues differ greatly in their inferential power, proximity being by far the most powerful; and (4) statistical modeling of the proximity cue indicates a scale-invariant power law in close agreement with prior psychophysics.

333 citations


Journal Article•DOI•
TL;DR: A peripheral cue improved observers' abilities to indicate which side of a Landolt-square target had a gap whether or not a local postmask was used and with both central- and spread-neutral cues, indicating that transient attention can produce signal enhancement through finer spatial resolution.
Abstract: Visual attention can increase spatial resolution even when it leads to a decrease in performance. Whether this effect is mediated by reduction of external noise or by signal enhancement is an unsettled question. Although we previously demonstrated that attention can improve speed and accuracy in an acuity task, those experiments made use of a local postmask, which could be considered a source of external noise. In this work, a peripheral cue improved observers' abilities to indicate which side of a Landolt-square target had a gap whether or not a local postmask was used and with both central- and spread-neutral cues. In addition, we documented the presence of visual field inhomogeneities in a resolution task. Given that these experiments presented the target alone with no external noise added (i.e., without distracters or masks), our results indicate that transient attention enhanced the quality of the stimulus representation. Furthermore, because performance in the Landolt-square task indexes resolution, this attentional benefit indicates that transient attention can produce signal enhancement through finer spatial resolution.

277 citations


Journal Article•DOI•
TL;DR: The results suggest that inhibitory spatial interactions in peripheral crowding extend over larger distances than in the fovea for targets of the same size.
Abstract: Peripheral vision is characterized by reduced spatial resolution and inhibitory spatial interactions that extend over long distances. This work had three goals. (1) We considered whether the extensive crowding in peripheral vision is a consequence of a shift in the spatial scale of analysis. To test this, using a large range of target sizes and spatial frequencies, we measured the extent of crowding for targets that were limited in their spatial frequency content. (2) We considered whether crowding in peripheral vision can be explained on the basis of contrast masking by remote flanks. To test this hypothesis, we measured and compared crowding in a direction-identification experiment with masking by remote flanks in a detection experiment. In each of the experiments, our targets and flanks were composed of Gabor features, thus allowing us to control the feature contrast, spatial frequency, and orientation. (3) We examined the relationship between the suppressive and facilitatory interactions in peripheral contrast detection and crowding. Our results show that unlike the normal fovea (Levi, Klein, & Hariharan, 2002), peripheral crowding is not scale invariant nor is it attributable to simple contrast masking. Rather, our results suggest that inhibitory spatial interactions in peripheral crowding extend over larger distances than in the fovea for targets of the same size. In peripheral vision, the critical distance for crowding is approximately 0.1 times the target eccentricity. Observers can easily detect the features that compose our targets (Gabor patches) under conditions where crowding is strong. Thus, we speculate that peripheral crowding occurs because the target and flanks are combined or pooled at a second stage, following the stage of feature extraction. In peripheral vision, this pooling takes place over a large distance.

239 citations


Journal Article•DOI•
TL;DR: In this article, the authors compared perceptual learning in 16 psychophysical studies, ranging from low-level spatial frequency and orientation discrimination tasks to high-level object and face recognition tasks, and found that the amount of learning varies widely between different tasks.
Abstract: We compared perceptual learning in 16 psychophysical studies, ranging from low-level spatial frequency and orientation discrimination tasks to high-level object and face-recognition tasks. All studies examined learning over at least four sessions and were carried out foveally or using free fixation. Comparison of learning effects across this wide range of tasks demonstrates that the amount of learning varies widely between different tasks. A variety of factors seems to affect learning, including the number of perceptual dimensions relevant to the task, external noise, familiarity, and task complexity.

236 citations


Journal Article•DOI•
TL;DR: Direct imaging with the Rochester adaptive optics ophthalmoscope used to directly image the tuning properties of individual cones in living human eyes found that cone disarray is very small, accounting for less than 1% of the breadth of the tuning function of an ensemble of cones.
Abstract: The tuning properties of the ensemble of cone photoreceptors is due to the tuning properties of individual cones convolved with the disarray in pointing direction between the cones. We used direct imaging with the Rochester adaptive optics ophthalmoscope to directly image these properties in individual cones in living human eyes. We found that cone disarray is very small, accounting for less than 1% of the breadth of the tuning function of an ensemble of cones. The implication is that the optical fiber properties of an ensemble of cones mimic the tuning properties of a single cone.

203 citations


Journal Article•DOI•
Albert J. Ahumada1•
TL;DR: For the linear discrimination of two stimuli in white Gaussian noise in the presence of internal noise, a method is described for estimating linear classification weights from the sum of noise images segregated by stimulus and response.
Abstract: For the linear discrimination of two stimuli in white Gaussian noise in the presence of internal noise, a method is described for estimating linear classification weights from the sum of noise images segregated by stimulus and response. The recommended method for combining the two response images for the same stimulus is to difference the average images. Weights are derived for combining images over stimuli and observers. Methods for estimating the level of internal noise are described with emphasis on the case of repeated presentations of the same noise sample. Simple tests for particular hypotheses about the weights are shown based on observer agreement with a noiseless version of the hypothesis.

192 citations


Journal Article•DOI•

167 citations


Journal Article•DOI•
TL;DR: A difference in the magnitude of the classification images is found, supporting the idea that visual attention changes the weighting of information at the cued and uncued location, but does not change the quality of processing at each individual location.
Abstract: In the Posner cueing paradigm, observers' performance in detecting a target is typically better in trials in which the target is present at the cued location than in trials in which the target appears at the uncued location. This effect can be explained in terms of a Bayesian observer where visual attention simply weights the information differently at the cued (attended) and uncued (unattended) locations without a change in the quality of processing at each location. Alternatively, it could also be explained in terms of visual attention changing the shape of the perceptual filter at the cued location. In this study, we use the classification image technique to compare the human perceptual filters at the cued and uncued locations in a contrast discrimination task. We did not find statistically significant differences between the shapes of the inferred perceptual filters across the two locations, nor did the observed differences account for the measured cueing effects in human observers. Instead, we found a difference in the magnitude of the classification images, supporting the idea that visual attention changes the weighting of information at the cued and uncued location, but does not change the quality of processing at each individual location.

161 citations


Journal Article•DOI•
TL;DR: The technique provides an estimate of the conventional first-order receptive field of a neuron, similar to what can be obtained with other reverse-correlation schemes, and demonstrates for the first time that it might be feasible to investigate the receptive field structure of visual neurons from their responses to natural image sequences.
Abstract: Probing the visual system with the ensemble of signals that occur in the natural environment may reveal aspects of processing that are not evident in the neural responses to artificial stimulus sets, such as conventional bars and sinusoidal gratings. However, unsolved is the question of how to use complex natural stimulation, many aspects of which the experimenter cannot completely specify, to study neural processing. Here a method is presented to investigate the structure of a neuron's receptive field based on its response to movie clips and other stimulus ensembles. As a particular case, the technique provides an estimate of the conventional first-order receptive field of a neuron, similar to what can be obtained with other reverse-correlation schemes. This is demonstrated experimentally and with computer simulations. Our analysis also revealed that the receptive fields of both simple and complex cells had regions where image boundaries, independent of their contrast sign, would enhance or suppress the cell's response. In some cases, these signals were tuned for the orientation of the boundary. This demonstrates for the first time that it might be feasible to investigate the receptive field structure of visual neurons from their responses to natural image sequences.

153 citations


Journal Article•DOI•
TL;DR: Foveal crowding is proportional to feature size over the more than 50-fold range of target sizes that the authors examined and is scale invariant, showing it is the size of the envelope rather than the carrier that determines the extent of crowding in the fovea.
Abstract: Spatial interactions are a critical and ubiquitous feature of spatial vision. These interactions may be inhibitory (reducing sensitivity as occurs in crowding) or facilitatory (enhancing sensitivity). In this work, we had four goals. 1. To test the hypothesis that foveal crowding depends on target size by measuring the extent of crowding for novel targets that were limited in their spatial frequency content. We used a large range of target sizes and spatial frequencies. 2. To assess whether the critical spatial frequency model (Hess, Dakin, & Kapoor, 2000) provides a general model for foveal crowding. To test this model, we measured crowding for a direction-identification task that did not require judging the orientation of the gap. 3. To test the hypothesis that foveal crowding is simply contrast masking by remote flanks we measured and compared crowding in a direction-identification experiment with masking by remote flanks in a detection experiment. In each of the experiments, our targets and flanks were composed of Gabor features, thus allowing us to control the feature contrast, spatial frequency, and orientation. 4. To assess the relationship between suppressive and facilitatory spatial interactions in foveal vision. Our results show that (1) foveal crowding is proportional to feature size over the more than 50-fold range of target sizes that we examined. Over this large range, foveal crowding is scale invariant. Our results also show it is the size of the envelope (SD) rather than the carrier (SF) that determines the extent of crowding in the fovea. 2. Crowding that occurs in the direction-identification task is quite similar to crowding where orientation information is available. Thus we conclude that the critical spatial frequency model does not provide a general explanation for foveal crowding. 3. Threshold elevation for crowding is similar to threshold elevation for masking as predicted by our test-pedestal model. Thus we conclude that foveal crowding is simple contrast masking. 4. Based on our comparison of threshold changes in crowding and masking, we conclude that in foveal vision, the suppressive spatial interactions due to nearby flanks are similar in the two tasks. However, the facilitatory interactions are quite different. In the crowding task, we find very little evidence for facilitation by flankers, whereas in the detection task, we find strong facilitation. We suggest that facilitation of detection by remote flanks may be, at least in part, a consequence of uncertainty reduction.

Journal Article•DOI•
TL;DR: This work considers estimation and statistical hypothesis testing on classification images obtained from the two-alternative forced-choice experimental paradigm, and describes an estimation procedure for obtaining classification images from observer data.
Abstract: We consider estimation and statistical hypothesis testing on classification images obtained from the two-alternative forced-choice experimental paradigm. We begin with a probabilistic model of task performance for simple forced-choice detection and discrimination tasks. Particular attention is paid to general linear filter models because these models lead to a direct interpretation of the classification image as an estimate of the filter weights. We then describe an estimation procedure for obtaining classification images from observer data. A number of statistical tests are presented for testing various hypotheses from classification images based on some more compact set of features derived from them. As an example of how the methods we describe can be used, we present a case study investigating detection of a Gaussian bump profile.


Journal Article•DOI•
TL;DR: The optimal weighted sum of noise fields for calculating classification images in several experimental designs is derived, and the signal-to-noise ratio (SNR) of the resulting classification images is derived.
Abstract: In signal detection theory, an observer's responses are often modeled as being based on a decision variable obtained by cross-correlating the stimulus with a template, possibly after corruption by external and internal noise. The response classification method estimates an observer's template by measuring the influence of each pixel of external noise on the observer's responses. A map that shows the influence of each pixel is called a classification image. Other authors have shown how to calculate classification images from external noise fields, but the optimal calculation has never been determined, and the quality of the resulting classification images has never been evaluated. Here we derive the optimal weighted sum of noise fields for calculating classification images in several experimental designs, and we derive the signal-to-noise ratio (SNR) of the resulting classification images. Using the expressions for the SNR, we show how to choose experimental parameters, such as the observer's performance level and the external noise power, to obtain classification images with a high SNR. We discuss two-alternative identification experiments in which the stimulus is presented at one or more contrast levels, in which each stimulus is presented twice so that we can estimate the power of the internal noise from the consistency of the observer's responses, and in which the observer rates the confidence of his responses. We illustrate these methods in a series of contrast increment detection experiments.

Journal Article•DOI•
TL;DR: This work confirmed the findings of earlier imaging studies that neuronal events associated with binocular rivalry occur as early as V1 and extended those earlier findings by demonstrating robust neural suppression during Binocular rivalry regardless of the stimulus complexity of the rivaling targets.
Abstract: During binocular rivalry, one of two incompatible monocular stimuli is erased from perceptual awareness for seconds at a time. To examine whether this "rivalry suppression" occurs in V1, we measured functional magnetic resonance imaging activity during binocular rivalry and compared it with those in the two reference conditions: one representing complete suppression and the other representing no suppression. We found that the amplitude of V1 activity during rivalry fell midway between those in the two reference conditions; the amount of V1 activity associated with the nondominant pattern was reduced by 48% to 77% during rivalry. The same pattern of results was obtained with meaningful rival targets (i.e., a human face and a house). In this work, using a different experimental protocol, we confirmed the findings of earlier imaging studies that neuronal events associated with binocular rivalry occur as early as V1. Furthermore, our findings extend those earlier findings by demonstrating robust neural suppression during binocular rivalry regardless of the stimulus complexity of the rivaling targets.

Journal Article•DOI•
TL;DR: A maximum-likelihood approach to quantifying the significance of differences between estimates of template is described, which suggests that observers use optimally tuned detection templates for orientation discrimination.
Abstract: When performance is limited by stochastically defined masks, (psychophysical) reverse correlation has proven to be an especially efficient tool for estimating the templates used by detection and discrimination mechanisms. Here I describe a maximum-likelihood approach to quantifying the significance of differences between estimates of template. Four methodologically related experiments illustrate the versatility of reverse correlation. Experiment 1 shows significant differences between the templates used by different observers when detecting a bright Gaussian blob. The results of Experiment 2 are consistent with observers not using information about the phase of a parafoveal wavelet when detecting it. Experiments 3 and 4 reveal not only the templates used by detection mechanisms but also aspects of their response functions. Both results are consistent with a sensory threshold. Experiment 3 shows that 2-alternative forced-choice detection errors are caused when the target's effective contrast is reduced, not when the mask looks more like the expected target+mask than the actual target+mask. Experiment 4 suggests that observers use optimally tuned detection templates for orientation discrimination.

Journal Article•DOI•
TL;DR: A particular experimental approach employed in the study of depth and shape that is useful in determining whether hypothetical illuminant cues are actually used in color vision is described.
Abstract: This work briefly describes a model for illuminant estimation based on combination of candidate illuminant cues. Many of the research issues concerning cue combination in depth and shape perception translate well to the study of surface color perception. I describe and illustrate a particular experimental approach (perturbation analysis) employed in the study of depth and shape that is useful in determining whether hypothetical illuminant cues are actually used in color vision.

Journal Article•DOI•
TL;DR: Evidence for independent changes in chromatic and achromatic visual processing throughout the life span is reviewed with particular emphasis on electrophysiological results.
Abstract: It has been argued that the development and aging of the different achromatic and chromatic visual pathways may proceed independently. We review here the evidence for such independent changes with particular emphasis on electrophysiological results. Changes in chromatic and achromatic visual processing throughout the life span were studied using visual evoked potentials (VEPs). VEPs were recorded in response to the presentation of patterns designed to preferentially stimulate achromatic and S-(L+M) and (L-M) chromatic mechanisms. Recordings were made in subjects aged 1 week to 90+ years. Longitudinal measurements were obtained from several infants and cross-sectional measurements were obtained from infants and older subjects. Responses to achromatic reversing patterns at low spatial frequencies appeared early and changed rapidly. Latencies of the achromatic reversal response decreased to mature values within the first 12-15 weeks of life. Responses to chromatic pattern onsets, however, appeared later (L-M: 4 weeks; S: 6-8 weeks) and changed continuously throughout the first year of life. Chromatic waveforms from 1 year to puberty appeared inverted relative to the adult waveform. The waveforms did not appear adultlike until about 12-14 years of age. The latencies of the major negative component of the adult response reached a minimum around 17-18 years of age. Throughout the remainder of the life span, VEP latencies steadily increased and amplitudes slightly decreased. Latencies of responses to chromatic pattern onsets increased more rapidly than latencies to moderate contrast achromatic pattern reversals.

Journal Article•DOI•
TL;DR: Visual evoked potentials are used to measure the time course of neural events associated with the extraction of symmetry in random dot fields and results are consistent with the hypothesis that the symmetry property is extracted by processing in extrastriate cortex.
Abstract: Symmetry is a highly salient feature of animals, plants, and the constructed environment. Although the perceptual phenomenology of symmetry processing is well understood, little is known about the underlying neural mechanisms. Here we use visual evoked potentials to measure the time course of neural events associated with the extraction of symmetry in random dot fields. We presented sparse random dot patterns that were symmetric about both the vertical and horizontal axes. Symmetric patterns were alternated with random patterns of the same density every 500 msec, using new exemplars of symmetric and random patterns on each image update. Random/random exchanges were used as a control. The response to updates of random patterns was multiphasic, consisting of P65, N90, P110, N140 and P220 peaks. The response to symmetric/random sequences was indistinguishable from that for random/random sequences up to about 220 msec, after which the response to symmetric patterns became relatively more negative. Symmetry in random dot patterns thus appears to be extracted after an initial response phase that is indifferent to configuration. These results are consistent with the hypothesis (Lee, Mumford, Romero, & Lamme, 1998; Tyler & Baseler, 1998) that the symmetry property is extracted by processing in extrastriate cortex. Keywords: evoked potentials, shape, form

Journal Article•DOI•
TL;DR: Central spatial precuing effects in 16 conditions suggest that spatial attention mostly excludes external noise at the target location; the presence of external noise and/or signal stimuli in non-target regions has little effect on spatial performance when location uncertainty is eliminated by report cues.
Abstract: To investigate the nature of external noise exclusion, we compared central spatial precuing effects in 16 conditions that varied the amount of external noise, the number of signal stimuli, the number of locations masked by external noise, and the number and style of frames surrounding potential target locations. In the absence of external noise, precuing produced only marginal performance improvements in a small number of display conditions. In the presence of high external noise, precuing improved task performance in all the display conditions. The magnitude of these spatial attention effects, as gauged by contrast threshold reduction, is nearly constant across all the display conditions. This suggests that spatial attention mostly excludes external noise at the target location; the presence of external noise and/or signal stimuli in non-target regions has little effect on spatial performance when location uncertainty is eliminated by report cues. However, the presence of other potential locations for the target is critical, because if target location is known in advance, attention can be focused on that location with or without a cue.

Journal Article•DOI•
TL;DR: Evidence for psychophysical facilitation of contrast detection by cross surround stimuli (orthogonal to the target orientation) that may provide insights into both the neurophysiology and psychophysics of cross surround modulation is reported.
Abstract: Neurophysiological and psychophysical evidence indicates that neuronal surround modulation at cross-orientation (orthogonal to the preferred orientation of the classical receptive field) plays a key role in intermediate-level visual tasks, such as textural segregation and perceptual pop-out. What is missing is a psychophysical description of cross surround modulation at the spatial filter level in low-level vision. Moreover, neurophysiological evidence for how cross surround modulation is expressed at the neuronal level has been inconsistent. Here we report evidence for psychophysical facilitation of contrast detection by cross surround stimuli (orthogonal to the target orientation) that may provide insights into both the neurophysiology and psychophysics of cross surround modulation. We found that cross surround facilitation is a surround-contrast dependent effect mainly evident at low surround contrasts, and is narrowly tuned to spatial frequency and broadly tuned to orientation. To understand whether cross surround facilitation results from low-level processing of signal-to-noise enhancement or is due to uncertainty reduction at a higher-level decision stage, we (1) studied cross surround facilitation with an equivalent noise protocol, (2) estimated the changes in the slope of the psychometric function and the uncertainty parameter, M, and (3) measured cross surround effects at the dipper of the TvC function. The converging evidence suggests that cross surround facilitation of contrast detection is mainly a result of low-level signal-to-noise enhancement, and is little affected by uncertainty change.

Journal Article•DOI•
TL;DR: Johnson, Hawken, and Shapley as discussed by the authors used electrophysiological data to reconcile the results from different techniques and conclude that V1 contains relatively large numbers of color-opponent neurons.
Abstract: The signals in visual cortex that ultimately give rise to color perception remain poorly understood. Controversy has particularly surrounded one aspect of color's encoding in the visual system-opponent processing in primary visual cortex. Early single-unit studies suggested that V1 contains relatively few color-opponent neurons. Neuroimaging measurements, however, have suggested that such neurons might be relatively numerous. Here, we reconcile these apparently discrepant results and conclude that V1 contains relatively large numbers of color-opponent neurons. We first review results from each method and find that most neuroimaging studies provide evidence of substantial color opponency in V1, and that despite apparent controversy, most single-unit studies agree that relatively large numbers of V1 neurons show some sort of color opponency. To reconcile the results from different techniques more formally, we used electrophysiological data to predict the outcomes of neuroimaging experiments. We simulated the expected fMRI response in V1 to spatial patterns of different color, based on the neurons' properties, as reported in Johnson, Hawken, and Shapley, (2001). The simulated responses to stimuli used in Engel, Zhang, and Wandell, (1997) agree well with the actually observed fMRI results. The model identifies several factors that led to the apparent discrepancy between techniques, and makes testable predictions about how these factors influence the magnitude of color-opponent signals. fMRI and single-unit data converge to show that large numbers of color-opponent neurons exist in V1.

Journal Article•DOI•
Kevin R. Brooks1•
TL;DR: There are two separate, specialised mechanisms for encoding stereomotion: one through IOVD, the other through CD; and at a disparity pedestal both the IovD and the CD cues have a significant influence.
Abstract: Two experiments are presented assessing the contributions of the rate of change of disparity (CD) and interocular velocity difference (IOVD) cues to stereomotion speed perception. Using a two-interval forced-choice paradigm, the perceived speed of directly approaching and receding stereomotion and of monocular lateral motion in random dot stereogram (RDS) targets was measured. Prior adaptation using dysjunctively moving random dot stimuli induced a velocity aftereffect (VAE). The degree of interocular correlation in the adapting images was manipulated to assess the effectiveness of each cue. While correlated adaptation involved a conventional RDS stimulus, containing both IOVD and CD cues, uncorrelated adaptation featured an independent dot array in each monocular half-image, and hence lacked a coherent disparity signal. Adaptation produced a larger VAE for stereomotion than for monocular lateral motion, implying effects at neural sites beyond that of binocular combination. For motion passing through the horopter, correlated and uncorrelated adaptation stimuli produced equivalent stereomotion VAEs. The possibility that these results were due to the adaptation of a CD mechanism through random matches in the uncorrelated stimulus was discounted in a control experiment. Here both simultaneous and sequential adaptation of left and right eyes produced similar stereomotion VAEs. Motion at uncrossed disparities was also affected by both correlated and uncorrelated adaptation stimuli, but showed a significantly greater VAE in response to the former. These results show that (1) there are two separate, specialised mechanisms for encoding stereomotion: one through IOVD, the other through CD; (2) the IOVD cue dominates the perception of stereomotion speed for stimuli passing through the horopter; and (3) at a disparity pedestal both the IOVD and the CD cues have a significant influence.

Journal Article•DOI•
TL;DR: A dual-masking paradigm is used to study how contrast discrimination is influenced by the presence of adjacent stimuli differing in orientation to detect a vertical Gabor target superimposed on a horizontal Gabor pedestal.
Abstract: We used a dual-masking paradigm to study how contrast discrimination is influenced by the presence of adjacent stimuli differing in orientation. The task of the observer was to detect a vertical Gabor target superimposed on a vertical Gabor pedestal in the presence of flankers. The Gabor flankers had orientations ranging from 0 degrees (parallel to the target) to 90 degrees (orthogonal). The flankers had two different facilitatory effects: (a) Threshold facilitation. The flankers facilitated target detection at low pedestal contrasts. This facilitation was narrowly tuned to flanker orientation. (b) Pedestal enhancement. The flankers at high contrast enhanced the masking effectiveness of the pedestal. This pedestal enhancement changed little with flanker orientation. We fitted the data with a sensitivity modulation model in which the flanker effects were implemented as multiplicative factors modulating the sensitivity of the target mechanism to both excitatory and inhibitory inputs. The model parameters showed that, (a) pedestal enhancement occurs when flanker facilitation to the pedestal is greater than to the target; (b) while the sensitivity modulation was tuned sharply with flanker orientation, the ratio between the excitatory and the inhibitory factors remained constant. The explanation of the flanker orientation effect requires the both the values of each factor and the ratio between them.

Journal Article•DOI•
TL;DR: Although the function relating thresholds to set size had a slope consistent with both the Signed-Max and the Summation models, the shape of individual psychometric functions was in the most crucial conditions better predicted by the signed-Max model, which chooses the largest tilt while keeping track of the direction of tilt.
Abstract: Search performance for a target tilted in a known direction among vertical distractors is well explained by signal detection theory models. Typically these models use a maximum-of-outputs rule (Max rule) to predict search performance. The Max rule bases its decision on the largest response from a set of independent noisy detectors. When the target is tilted in either direction from the reference orientation and the task is to identify the sign of tilt, the loss of performance with set size is much greater than predicted by the Max rule. Here we varied the target tilt and measured psychometric functions for identifying the direction of tilt from vertical. Measurements were made at different set sizes in the presence of various levels of orientation jitter. The orientation jitter was set at multiples of the estimated internal noise, which was invariant across set sizes and measurement techniques. We then compared the data to the predictions of two models: a Summation model that integrates both signal and noise from local detectors and a Signed-Max model that first picks the maxima on both sides of vertical and then chooses the value with the highest absolute deviation from the reference. Although the function relating thresholds to set size had a slope consistent with both the Signed-Max and the Summation models, the shape of individual psychometric functions was in the most crucial conditions better predicted by the Signed-Max model, which chooses the largest tilt while keeping track of the direction of tilt.

Journal Article•DOI•
TL;DR: It is shown that for a large class of static nonlinearities, one can obtain analytical expressions for the estimates and the proposed technique is shown to converge much faster than the currently used linear-reconstruction method.
Abstract: A statistical model used extensively in vision research consists of a cascade of a linear operator followed by a static (mem- oryless) nonlinearity. Common applications include the measurement of simple-cell receptive fields in primary visual cortex and the modeling of human performance in various psychophysical tasks. It is well known that the front-end linear filter of the model can readily be recovered, up to a multiplicative constant, using reverse-correlation techniques. However, a full identification of the model also requires an estimation of the output nonlinearity. Here, we show that for a large class of stat- ic nonlinearities, one can obtain analytical expressions for the estimates. The technique works with both Gaussian and binary noise stimuli. The applicability of the method in physiology and psychophysics is demonstrated. Finally, the proposed technique is shown to converge much faster than the currently used linear-reconstruction method.

Journal Article•DOI•
TL;DR: It is suggested that the primacy of spatial information for both pursuit and saccades reflects the importance of spatial attention in selecting the stimulus target for both eye movements.
Abstract: Previous studies have examined the facilitative effects of prior spatial information on target selection for saccadic eye movements. More recently, studies have shown that prior spatial information also influences target selection for smooth pursuit. However, direct comparisons of the effects of prior information on target selection for pursuit and saccades have not been made. To this end, we provided different classes of prior information and measured their effects on target selection for pursuit and saccades. In Experiment 1, we assessed the relative effects of spatial cues (indicating the target stimulus' initial location) and color cues (indicating the target stimulus' color) on eye movement latencies. In Experiment 2, we assessed the effects of motion cues (indicating the target stimulus' direction of motion) in addition to spatial cues. For both pursuit and saccades, we found that spatial cues reduced eye movement latencies more than color cues (Experiment 1). Spatial cues also reduced eye movement latencies more than motion cues (Experiment 2), even for pursuit, despite the fact that stimulus motion is essential for the generation of pursuit eye movements. These results indicate that both pursuit and saccades are affected to a greater degree by spatial information than motion or color information. We suggest that the primacy of spatial information for both pursuit and saccades reflects the importance of spatial attention in selecting the stimulus target for both eye movements.

Journal Article•DOI•
TL;DR: It was found that observers perceived only one grid orientation when the two specified orientations were similar, but more interestingly, when the monocular--and binocular-specified orientation were rather different, observers experienced perceptual bi-stability (they were able to select either a perspective--or a disparity-dominated percept).
Abstract: We examined how much depth we perceive when viewing a depiction of a slanted plane in which binocular disparity and monocular perspective provide different slant information. We exposed observers to a grid stimulus in which the monocular--and binocular-specified grid orientations were varied independently across stimulus presentations. The grids were slanted about the vertical axis and observers estimated the slant relative to the frontal plane. We were particularly interested in the metrical aspects of perceived slant for a broad spectrum of possible combinations of disparity--and perspective-specified slants. We found that observers perceived only one grid orientation when the two specified orientations were similar. More interestingly, when the monocular--and binocular-specified orientations were rather different, observers experienced perceptual bi-stability (they were able to select either a perspective--or a disparity-dominated percept).

Journal Article•DOI•
TL;DR: A new method using one-dimensional sums of sinusoids as both test stimuli (discrete frequency patterns [DFP]) and as noise is introduced to study and compare the templates used to detect a target and to discriminate the target's position in central and parafoveal vision.
Abstract: Classification images provide an important new method for learning about which parts of the stimulus are used to make perceptual decisions and provide a new tool for measuring the template an observer uses to accomplish a task. Here we introduce a new method using one-dimensional sums of sinusoids as both test stimuli (discrete frequency patterns [DFP]) and as noise. We use this method to study and compare the templates used to detect a target and to discriminate the target's position in central and parafoveal vision. Our results show that, unsurprisingly, the classification images for detection in both foveal and parafoveal vision resemble the DFP test stimulus, but are considerably broader in spatial frequency tuning than the ideal observer. In contrast, the classification images for foveal position discrimination are not ideal, and depend on the size of the position offset. Over a range of offsets from close to threshold to about 90 arc sec, our observers appear to use a peak strategy (responding to the location of the peak of the luminance profile of the target plus noise). Position acuity is much less acute in the parafovea, and this is reflected in the reduced root efficiency (i.e., square root of efficiency) and the coarse classification images for peripheral position discrimination. The peripheral position template is a low spatial frequency template.

Journal Article•DOI•
TL;DR: The ganglion cells of primate retina have center-surround receptive fields and it is proposed that the strong narrow field reflects direct synaptic input from the cones overlying the dendritic tree whereas the weak wide field reflects coupled inputs from neighboring H1 cells.
Abstract: The ganglion cells of primate retina have center-surround receptive fields. A strong candidate for mediating linear surround circuitry is negative feedback from the H1 horizontal cell onto the cone pedicle. We measured the spatial properties of H1 cell receptive fields in the in vitro macaque monkey retina using sinusoidal gratings, spots, and annuli. Spatial tuning curves ranged in shape from smoothly low pass to prominently notched. The tuning curves of approximately 80% of cells could be well described by a sum of two exponentials, giving a prominent central peak superimposed on a broad shallow skirt. The mean diameter of the combined receptive field decreased with eccentricity from 309 micro m at 11 mm to 122 micro m at 4 mm. We propose that the strong narrow field reflects direct synaptic input from the cones overlying the dendritic tree whereas the weak wide field reflects coupled inputs from neighboring H1 cells. Those cells not well fit by a sum of exponentials had tuning curves with additional peaks at higher spatial frequencies that were likely due to undersampling in the cone-H1 network. Unlike other vertebrates, the macaque H1 network is less strongly coupled, has smaller receptive fields, and shows no functional plasticity. Macaque H1 receptive fields are surprisingly small, suggesting a great reduction in electrical coupling. Because the center of the H1 receptive field gets only a small percentage of its total response from the coupled field, the smallest receptive fields are similar in diameter to the dendritic trees. They are probably small enough to form the surrounds of foveal midget cells. The H1 network is compatible with a mixed-surround model of spectral opponency.