scispace - formally typeset
Search or ask a question
Proceedings ArticleDOI

Whitened Expectation Propagation: Non-Lambertian Shape from Shading and Shadow

23 Jun 2013-pp 1674-1681
TL;DR: This work proposes a variation of EP that exploits regularities in natural scene statistics to achieve run times that are linear in both number of pixels and clique size, and uses large, non-local cliques to exploit cast shadow, which is traditionally ignored in shape from shading.
Abstract: For problems over continuous random variables, MRFs with large cliques pose a challenge in probabilistic inference. Difficulties in performing optimization efficiently have limited the probabilistic models explored in computer vision and other fields. One inference technique that handles large cliques well is Expectation Propagation. EP offers run times independent of clique size, which instead depend only on the rank, or intrinsic dimensionality, of potentials. This property would be highly advantageous in computer vision. Unfortunately, for grid-shaped models common in vision, traditional Gaussian EP requires quadratic space and cubic time in the number of pixels. Here, we propose a variation of EP that exploits regularities in natural scene statistics to achieve run times that are linear in both number of pixels and clique size. We test these methods on shape from shading, and we demonstrate strong performance not only for Lambertian surfaces, but also on arbitrary surface reflectance and lighting arrangements, which requires highly non-Gaussian potentials. Finally, we use large, non-local cliques to exploit cast shadow, which is traditionally ignored in shape from shading.

Summary (2 min read)

1. Introduction

  • Probabilistic inference for large loopy graphical models has become an important subfield with a growing body of applications, including many in computer vision.
  • These methods have resulted in significant progress for several applications.
  • The principal difference between BP and Gaussian EP can thus be summarized by a trade-off in their respective approximating families: BP favors flexible non-Gaussian marginals, while Gaussian EP favors a flexible covariance structure.
  • Another possible explanation is that for a grid-based graphical model with D pixels, Gaussian EP requires O(D2) space and a run time of O(D3).
  • Finally, the authors use the method to efficiently perform inference over large cliques produced by cast shadows and by global spatial priors.

2. Expectation Propagation

  • The family P̃ is chosen so that EP̃ [τj( x)] can be estimated easily.
  • EP achieves this goal by approximating each potential function φi( x) with an exponential family distribution P̃i( xi| θ(i)).
  • Regardless of the rank of each potential, the covariance matrix of the posterior S remains full-rank, and must be stored as a D×D matrix.
  • For large problems with tens of thousands of variables or more, this becomes limiting.
  • When the underlying graphical model is highly sparse, such as a nearest-neighbor pairwiseconnected MRFs, each iteration can be performed in time O(D1.5) [2].

3. Whitened EP

  • For many problems of computer vision, both the number of variables D and the number of potentials N grow linearly with the number of pixels.
  • Low-rank potentials of large clique size have a wide array of promising applications in computer vision [17, 10].
  • Expectation propagation can be made more efficient by limiting the forms of covariance structure expressible by S. Let S denote the covariance matrix for natural scenes.

4. Shape from Shading

  • Whitened EP permits inference over images in linear time with respect to both pixels and clique size.
  • In particular, the authors are interested in whether Gaussian message approximation will be effective when the potentials φi are highly non-Gaussian.
  • In recent years, several methods have been developed that solve the classical SfS problem well as long as surface reflectance R is assumed to be Lambertian [19, 17, 6, 3, 7].
  • For each pixel, one potential φR(p, q|i) enforces the surface normal to be consistent with the known pixel intensity i(x, y).
  • Whitened EP provides two benefits for spatial priors.

5. Conclusions

  • The methods in this paper reduce the run time of EP from cubic to linear in the number of pixels for visual inference, while retaining a run time that is linear in clique size.
  • The computational expense of inference for large cliques has prohibited the investigation of complex probabilistic models for vision.
  • The authors hope is that whitened EP will facilitate further research in these directions.
  • Results for whitened EP on SfS shows that the sacrifice in performance for this approach is small, even in problems with highly non-Gaussian potentials.
  • Performance remained strong for surfaces with arbitrary reflectance and arbitrary lighting, which is a novel finding in SfS.

Did you find this useful? Give us your feedback

Content maybe subject to copyright    Report

Citations
More filters
Dissertation
31 May 2014
TL;DR: This thesis focuses on studying the statistical properties of single objects and their range images which can bene t shape inference techniques, including laser-acquired depth, binocular stereo, photometric stereo and High Dynamic Range (HDR) photography.
Abstract: Depth inference is a fundamental problem of computer vision with a broad range of potential applications. Monocular depth inference techniques, particularly shape from shading dates back to as early as the 40's when it was rst used to study the shape of the lunar surface. Since then there has been ample research to develop depth inference algorithms using monocular cues. Most of these are based on physical models of image formation and rely on a number of simplifying assumptions that do not hold for real world and natural imagery. Very few make use of the rich statistical information contained in real world images and their 3D information. There have been a few notable exceptions though. The study of statistics of natural scenes has been concentrated on outdoor scenes which are cluttered. Statistics of scenes of single objects has been less studied, but is an essential part of daily human interaction with the environment. Inferring shape of single objects is a very important computer vision problem which has captured the interest of many researchers over the past few decades and has applications in object recognition, robotic grasping, fault detection and Content Based Image Retrieval (CBIR). This thesis focuses on studying the statistical properties of single objects and their range images which can bene t shape inference techniques. I acquired two databases: Single Object Range and HDR (SORH) and the Eton Myers Database of single objects, including laser-acquired depth, binocular stereo, photometric stereo and High Dynamic Range (HDR) photography. I took a data driven approach and studied the statistics of color and range images of real scenes of single objects along with whole 3D objects and uncovered

2 citations

Posted Content
TL;DR: In this article, patch-based prior distributions are used to approximate the posterior distributions using products of multivariate Gaussian densities, imposing structural constraints on the covariance matrices of these densities allows for greater scalability and distributed computation.
Abstract: This paper presents a new Expectation Propagation (EP) framework for image restoration using patch-based prior distributions. While Monte Carlo techniques are classically used to sample from intractable posterior distributions, they can suffer from scalability issues in high-dimensional inference problems such as image restoration. To address this issue, EP is used here to approximate the posterior distributions using products of multivariate Gaussian densities. Moreover, imposing structural constraints on the covariance matrices of these densities allows for greater scalability and distributed computation. While the method is naturally suited to handle additive Gaussian observation noise, it can also be extended to non-Gaussian noise. Experiments conducted for denoising, inpainting and deconvolution problems with Gaussian and Poisson noise illustrate the potential benefits of such flexible approximate Bayesian method for uncertainty quantification in imaging problems, at a reduced computational cost compared to sampling techniques.
Dissertation
31 Aug 2013
TL;DR: This research builds an intelligent system based on brachiopod fossil images and their descriptions published in Treatise on Invertebrate Paleontology to compare fossil images directly, without referring to textual information.
Abstract: Science advances not only because of new discoveries, but also due to revolutionary ideas drawn from accumulated data. The quality of studies in paleontology, in particular, depends on accessibility of fossil data. This research builds an intelligent system based on brachiopod fossil images and their descriptions published in Treatise on Invertebrate Paleontology. The project is still on going and some significant developments will be discussed here. This thesis has two major parts. The first part describes the digitization, organization and integration of information extracted from the Treatise. The Treatise is in PDF format and it is non-trivial to convert large volumes into a structured, easily accessible digital library. Three important topics will be discussed: (1) how to extract data entries from the text, and save them in a structured manner; (2) how to crop individual specimen images from figures automatically, and associate each image with text entries; (3) how to build a search engine to perform both keyword search and natural language search. The search engine already has a web interface and many useful tasks can be done with ease. Verbal descriptions are second-hand information of fossil images and thus have limitations. The second part of the thesis develops an algorithm to compare fossil images directly, without referring to textual information. After similarities between fossil images are calculated, we can use the results in image search, fossil classification, and so on. The algorithm is based on deformable templates, and utilizes expectation propagation to find the optimal deformation. Specifically, I superimpose a “warp” on each image. Each node of the warp encapsulates a vector of local texture features, and comparing two images involves two steps: (1) deform the warp to the optimal configuration, so the energy function is minimized; and (2) based on the optimal configuration, compute the distance of two images. Experiment results confirmed that the method is reasonable and robust.
References
More filters
Proceedings ArticleDOI
17 Jun 2007
TL;DR: A new technique to compute belief propagation messages in time linear with respect to clique size for a large class of potential functions over real-valued variables and may benefit a variety of applications using belief propagation to infer images or range images.
Abstract: Belief propagation over pairwise connected Markov random fields has become a widely used approach, and has been successfully applied to several important computer vision problems However, pairwise interactions are often insufficient to capture the full statistics of the problem Higher-order interactions are sometimes required Unfortunately, the complexity of belief propagation is exponential in the size of the largest clique In this paper, we introduce a new technique to compute belief propagation messages in time linear with respect to clique size for a large class of potential functions over real-valued variables We demonstrate this technique in two applications First, we perform efficient inference in graphical models where the spatial prior of natural images is captured by 2 times 2 cliques This approach shows significant improvement over the commonly used pairwise-connected models, and may benefit a variety of applications using belief propagation to infer images or range images Finally, we apply these techniques to shape-from-shading and demonstrate significant improvement over previous methods, both in quality and in flexibility

83 citations


"Whitened Expectation Propagation: N..." refers background or methods or result in this paper

  • ...While there has been some success in applying methods such as Lax-Friedrichs and fastmarching to non-Lambertian reflectance [1, 23], these generalizations must proceed on a case-by-case basis for each class of reflectance functions....

    [...]

  • ...Other Lambertian SfS algorithms have reported image errors for the penny image of 0.0071 [9] and 0.0517 [13]....

    [...]

  • ...Subfigure c) shows the results of linear constraint node BP [17]....

    [...]

  • ...The SfS solution of [17] used p and q variables with 300 bins, and would thus sacrifice a 300-fold speed decrease to infer depth z directly....

    [...]

  • ...Lambertian SfS We first test our approach on Lambertian SfS, where it can be compared to past Lambertian SfS algorithms....

    [...]

Proceedings ArticleDOI
20 Jun 2011
TL;DR: Using models normally reserved for natural image statistics, “naturalness” priors are imposed over the albedo and shape of a scene, which allows us to simultaneously recover the most likely albeda and shape that explain a single image.
Abstract: We relax the long-held and problematic assumption in shape-from-shading (SFS) that albedo must be uniform or known, and address the problem of “shape and albedo from shading” (SAFS) Using models normally reserved for natural image statistics, we impose “naturalness” priors over the albedo and shape of a scene, which allows us to simultaneously recover the most likely albedo and shape that explain a single image A simplification of our algorithm solves classic SFS, and our SAFS algorithm can solve the intrinsic image decomposition problem, as it solves a superset of that problem We present results for SAFS, SFS, and intrinsic image decomposition on real lunar imagery from the Apollo missions, on our own pseudo-synthetic lunar dataset, and on a subset of the MIT Intrinsic Images dataset[15] Our one unified technique appears to outperform the previous best individual algorithms for all three tasks Our technique allows a coarse observation of shape (from a laser rangefinder or a stereo algorithm, etc) to be incorporated a priori We demonstrate that even a small amount of low-frequency information dramatically improves performance, and motivate the usage of shading for high-frequency shape (and albedo) recovery

73 citations


"Whitened Expectation Propagation: N..." refers methods in this paper

  • ...While there has been some success in applying methods such as Lax-Friedrichs and fastmarching to non-Lambertian reflectance [1, 23], these generalizations must proceed on a case-by-case basis for each class of reflectance functions....

    [...]

  • ...Other Lambertian SfS algorithms have reported image errors for the penny image of 0.0071 [9] and 0.0517 [13]....

    [...]

  • ...Lambertian SfS We first test our approach on Lambertian SfS, where it can be compared to past Lambertian SfS algorithms....

    [...]

  • ...We then test this approach on a problem with highly non-Gaussian potentials: non-Lambertian shape from shading (SfS)....

    [...]

  • ...SfS is one example, and non-Lambertian SfS produces especially non-Gaussian potentials....

    [...]

Proceedings ArticleDOI
17 Jun 2006
TL;DR: This paper proposes a new solution of the SFS problem based on a more comprehensive diffuse reflectance model: the Oren and Nayar model which eliminates the concave/convex ambiguity which is a well known problem in SFS.
Abstract: Lambert’s model for diffuse reflection is a main assumption in most of shape from shading (SFS) literature. Even with this simplified model, the SFS is still a difficult problem. Nevertheless, Lambert’s model has been proven to be an inaccurate approximation of the diffuse component of the surface reflectance. In this paper, we propose a new solution of the SFS problem based on a more comprehensive diffuse reflectance model: the Oren and Nayar model. In this work, we slightly modify this more realistic model in order to take into account the attenuation of the illumination due to distance. Using the modified non-Lambertian reflectance, we design a new explicit Partial Differential Equation (PDE) and then solve it using Lax-Friedrichs Sweeping method. Our experiments on synthetic data show that the proposed modeling gives a unique solution without any information about the height at the singular points of the surface. Additional results for real data are presented to show the efficiency of the proposed method . To the best of our knowledge, this is the first non-Lambertian SFS formulation that eliminates the concave/convex ambiguity which is a well known problem in SFS.

66 citations


"Whitened Expectation Propagation: N..." refers methods in this paper

  • ...While there has been some success in applying methods such as Lax-Friedrichs and fastmarching to non-Lambertian reflectance [1, 23], these generalizations must proceed on a case-by-case basis for each class of reflectance functions....

    [...]

Proceedings ArticleDOI
13 Jun 2010
TL;DR: This work examines the shape from shading problem without boundary conditions as a polynomial system, and proposes a semidefinite programming relaxation procedure and an exact line search iterative procedure with a new smoothness term that favors folds at edges.
Abstract: We examine the shape from shading problem without boundary conditions as a polynomial system. This view allows, in generic cases, a complete solution for ideal polyhedral objects. For the general case we propose a semidefinite programming relaxation procedure, and an exact line search iterative procedure with a new smoothness term that favors folds at edges. We use this numerical technique to inspect shading ambiguities.

52 citations


"Whitened Expectation Propagation: N..." refers background or methods in this paper

  • ...While there has been some success in applying methods such as Lax-Friedrichs and fastmarching to non-Lambertian reflectance [1, 23], these generalizations must proceed on a case-by-case basis for each class of reflectance functions....

    [...]

  • ...Other Lambertian SfS algorithms have reported image errors for the penny image of 0.0071 [9] and 0.0517 [13]....

    [...]

  • ...Lambertian SfS We first test our approach on Lambertian SfS, where it can be compared to past Lambertian SfS algorithms....

    [...]

  • ...We then test this approach on a problem with highly non-Gaussian potentials: non-Lambertian shape from shading (SfS)....

    [...]

  • ...SfS is one example, and non-Lambertian SfS produces especially non-Gaussian potentials....

    [...]

Journal ArticleDOI
TL;DR: This work introduces a novel method of accounting for variations in irradiance resulting from interreflections, complex sources and the like, which uses a spatially varying source model with a local shading model to yield a physically plausible shading model.
Abstract: The shading on curved surfaces is a cue to shape. Current computer vision methods for analyzing shading use physically unrealistic models, have serious mathematical problems, cannot exploit geometric information if it is available, and are not reliable in practice. We introduce a novel method of accounting for variations in irradiance resulting from interreflections, complex sources and the like. Our approach uses a spatially varying source model with a local shading model. Fast spatial variation in the source is penalised, consistent with the rendering community's insight that interreflections are spatially slow. This yields a physically plausible shading model. Because modern cameras can make accurate reports of observed radiance, our method compels the reconstructed surface to have shading exactly consistent with that of the image. For inference, we use a variational formulation, with a selection of regularization terms which guarantee that a solution exists. Our method is evaluated on physically accurate renderings of virtual objects, and on images of real scenes, for a variety of different kinds of boundary condition. Reconstructions for single sources compare well with photometric stereo reconstructions and with ground truth.

31 citations


"Whitened Expectation Propagation: N..." refers methods in this paper

  • ...SfS is one example, and non-Lambertian SfS produces especially non-Gaussian potentials....

    [...]

  • ...While there has been some success in applying methods such as Lax-Friedrichs and fastmarching to non-Lambertian reflectance [1, 23], these generalizations must proceed on a case-by-case basis for each class of reflectance functions....

    [...]

  • ...Other Lambertian SfS algorithms have reported image errors for the penny image of 0.0071 [9] and 0.0517 [13]....

    [...]

  • ...Lambertian SfS We first test our approach on Lambertian SfS, where it can be compared to past Lambertian SfS algorithms....

    [...]

  • ...We then test this approach on a problem with highly non-Gaussian potentials: non-Lambertian shape from shading (SfS)....

    [...]