scispace - formally typeset
Search or ask a question

Showing papers by "Eero P. Simoncelli published in 2006"


Journal ArticleDOI
TL;DR: An augmented observer model is presented that accounts for the variability of subjective responses in a speed discrimination task and is found that the fitted model provides an accurate description of the data across a wide range of stimulus parameters.
Abstract: Human visual speed perception is qualitatively consistent with a Bayesian observer that optimally combines noisy measurements with a prior preference for lower speeds. Quantitative validation of this model, however, is difficult because the precise noise characteristics and prior expectations are unknown. Here, we present an augmented observer model that accounts for the variability of subjective responses in a speed discrimination task. This allowed us to infer the shape of the prior probability as well as the internal noise characteristics directly from psychophysical data. For all subjects, we found that the fitted model provides an accurate description of the data across a wide range of stimulus parameters. The inferred prior distribution shows significantly heavier tails than a Gaussian, and the amplitude of the internal noise is approximately proportional to stimulus speed and depends inversely on stimulus contrast. The framework is general and should prove applicable to other experiments and perceptual modalities.

719 citations


Journal ArticleDOI
TL;DR: This work shows that the responses of MT cells can be captured by a linear-nonlinear model that operates not on the visual stimulus, but on the afferent responses of a population of nonlinear V1 cells, and robustly predicts the separately measured responses to gratings and plaids.
Abstract: Neurons in area MT (V5) are selective for the direction of visual motion. In addition, many are selective for the motion of complex patterns independent of the orientation of their components, a behavior not seen in earlier visual areas. We show that the responses of MT cells can be captured by a linear-nonlinear model that operates not on the visual stimulus, but on the afferent responses of a population of nonlinear V1 cells. We fit this cascade model to responses of individual MT neurons and show that it robustly predicts the separately measured responses to gratings and plaids. The model captures the full range of pattern motion selectivity found in MT. Cells that signal pattern motion are distinguished by having convergent excitatory input from V1 cells with a wide range of preferred directions, strong motion opponent suppression and a tuned normalization that may reflect suppressive input from the surround of V1 cells.

534 citations


Journal ArticleDOI
TL;DR: Spike-triggered average and covariance analyses can be used to estimate the filters and nonlinear combination rule from extracellular experimental data and demonstrated with simulated model neuron examples that emphasize practical issues that arise in experimental situations.
Abstract: Response properties of sensory neurons are commonly described using receptive fields. This description may be formalized in a model that operates with a small set of linear filters whose outputs are nonlinearly combined to determine the instantaneous firing rate. Spike-triggered average and covariance analyses can be used to estimate the filters and nonlinear combination rule from extracellular experimental data. We describe this methodology, demonstrating it with simulated model neuron examples that emphasize practical issues that arise in experimental situations.

466 citations


Journal ArticleDOI
TL;DR: A practical quality-aware image encoding, decoding and quality analysis system, which employs a novel reduced-reference image quality assessment algorithm based on a statistical model of natural images and a previously developed quantization watermarking-based data hiding technique in the wavelet transform domain.
Abstract: We propose the concept of quality-aware image , in which certain extracted features of the original (high-quality) image are embedded into the image data as invisible hidden messages. When a distorted version of such an image is received, users can decode the hidden messages and use them to provide an objective measure of the quality of the distorted image. To demonstrate the idea, we build a practical quality-aware image encoding, decoding and quality analysis system, which employs: 1) a novel reduced-reference image quality assessment algorithm based on a statistical model of natural images and 2) a previously developed quantization watermarking-based data hiding technique in the wavelet transform domain.

308 citations


Journal ArticleDOI
TL;DR: An information-theoretic framework for fitting neural spike responses with a Linear-Nonlinear-Poisson cascade model that provides an explicit "default" model of the nonlinear stage mapping the filter responses to spike rate, in the form of a ratio of Gaussians.
Abstract: We describe an information-theoretic framework for fitting neural spike responses with a Linear-Nonlinear-Poisson cascade model. This framework unifies the spike-triggered average and spike-triggered covariance approaches to neural characterization, and recovers a set of linear filters that maximize mean and variance-dependent information between stimuli and spike responses. The resulting approach has several useful properties: (1) it recovers a set of linear filters sorted according to their informativeness about the neural response; (2) it is both computationally efficient and robust, allowing recovery of multiple linear filters from a data set of relatively modest size; (3) it provides an explicit “default” model of the nonlinear stage mapping the filter responses to spike rate, in the form of a ratio of Gaussians. (4) it is equivalent to maximum likelihood estimation of this default model, but also converges to the correct filter estimates whenever the conditions for the consistency of spike-triggered average or covariance analysis are met; (5) it can be augmented with additional constraints, such as space-time separability, on the filters. We demonstrate the effectiveness of the method by applying it to simulated responses of a Hodgkin-Huxley neuron, and the recorded extracellular responses of macaque retinal ganglion cells and V1 cells.

154 citations


Journal ArticleDOI
TL;DR: This work proposes an adaptive nonlinear image representation in which each coefficient of a linear transform is divided by a weighted sum of coefficient amplitudes in a generalized neighborhood and shows that the divisive operation greatly reduces both the statistical and the perceptual redundancy amongst representation elements.
Abstract: Image compression systems commonly operate by transforming the input signal into a new representation whose elements are independently quantized. The success of such a system depends on two properties of the representation. First, the coding rate is minimized only if the elements of the representation are statistically independent. Second, the perceived coding distortion is minimized only if the errors in a reconstructed image arising from quantization of the different elements of the representation are perceptually independent. We argue that linear transforms cannot achieve either of these goals and propose, instead, an adaptive nonlinear image representation in which each coefficient of a linear transform is divided by a weighted sum of coefficient amplitudes in a generalized neighborhood. We then show that the divisive operation greatly reduces both the statistical and the perceptual redundancy amongst representation elements. We develop an efficient method of inverting this transformation, and we demonstrate through simulations that the dual reduction in dependency can greatly improve the visual quality of compressed images.

114 citations


Proceedings Article
04 Dec 2006
TL;DR: It is shown that for a wide variety of observation models, the Bayes least squares (BLS) estimator may be formulated without explicit reference to the prior, and a direct expression for the estimator is derived in terms of the density of the observed measurements.
Abstract: Bayesian estimators are defined in terms of the posterior distribution. Typically, this is written as the product of the likelihood function and a prior probability density, both of which are assumed to be known. But in many situations, the prior density is not known, and is difficult to learn from data since one does not have access to uncorrupted samples of the variable being estimated. We show that for a wide variety of observation models, the Bayes least squares (BLS) estimator may be formulated without explicit reference to the prior. Specifically, we derive a direct expression for the estimator, and a related expression for the mean squared estimation error, both in terms of the density of the observed measurements. Each of these prior-free formulations allows us to approximate the estimator given a sufficient amount of observed data. We use the first form to develop practical nonparametric approximations of BLS estimators for several different observation processes, and the second form to develop a parametric family of estimators for use in the additive Gaussian noise case. We examine the empirical performance of these estimators as a function of the amount of observed data.

67 citations


Proceedings Article
04 Dec 2006
TL;DR: An algorithm for image denoising based on the FoGSM model is developed, and substantial improvements over current state-of-the-art denoisers method are demonstrated.
Abstract: The local statistical properties of photographic images, when represented in a multi-scale basis, have been described using Gaussian scale mixtures (GSMs). Here, we use this local description to construct a global field of Gaussian scale mixtures (FoGSM). Specifically, we model subbands of wavelet coefficients as a product of an exponentiated homogeneous Gaussian Markov random field (hGMRF) and a second independent hGMRF. We show that parameter estimation for FoGSM is feasible, and that samples drawn from an estimated FoGSM model have marginal and joint statistics similar to wavelet coefficients of photographic images. We develop an algorithm for image denoising based on the FoGSM model, and demonstrate substantial improvements over current state-of-the-art denoising method based on the local GSM model.

63 citations


Proceedings ArticleDOI
01 Oct 2006
TL;DR: A statistical model for images that explicitly captures variations in local orientation and contrast is developed and an optimal Bayesian least squares estimator is developed by conditioning upon and integrating over the hidden orientation and scale variables.
Abstract: We develop a statistical model for images that explicitly captures variations in local orientation and contrast. Patches of wavelet coefficients are described as samples of a fixed Gaussian process that are rotated and scaled according to a set of hidden variables representing the local image contrast and orientation. An optimal Bayesian least squares estimator is developed by conditioning upon and integrating over the hidden orientation and scale variables. The resulting denoising procedure gives results that are visually superior to those obtained with a Gaussian scale mixture model that does not explicitly incorporate local image orientation.

22 citations