scispace - formally typeset
Search or ask a question
Book ChapterDOI

Belief Propagation with Directional Statistics for Solving the Shape-from-Shading Problem

12 Oct 2008-pp 780-791
TL;DR: The Shape-from-Shading problem infers shape from reflected light, collected using a camera at a single point in space only using the Fisher-Bingham distribution to marginalise a probabilistic model.
Abstract: The Shape-from-Shading [SfS] problem infers shape from reflected light, collected using a camera at a single point in space only. Reflected light alone does not provide sufficient constraint and extra information is required; typically a smoothness assumption is made. A surface with Lambertian reflectance lit by a single infinitely distant light source is also typical. We solve this typical SfS problem using belief propagation to marginalise a probabilistic model. The key novel step is in using a directional probability distribution, the Fisher-Bingham distribution. This produces a fast and relatively simple algorithm that does an effective job of both extracting details and being robust to noise. Quantitative comparisons with past algorithms are provided using both synthetic and real data.

Summary (2 min read)

1 Introduction

  • A known or inferred reflectance function provides the relationship between irradiance and surface orientation.
  • This constrained scenario has been tackled many times since[2–7, to cite a few], and will again be the focus of this work.
  • More recent methods include Worthington and Hancock[5], which iterated between smoothing a normal map and correcting it to satisfy the reflectance information; Prados et al[6], which solved the problem with viscosity solutions; and Potetz[7] which used belief propagation.
  • Belief propagation estimates the marginals of a multivariate probability distribution, often represented by a graphical model.

2 Formulation

  • Using previously given assumptions, of Lambertian reflectance, constant known albedo, orthographic projection, an infinitely distant light source and no interreflection the irradiance at each pixel in the input image is given by Ix,y = A(̂l · n̂x,y) (1) where Ix,y is the irradiance provided by the input image.
  • The normal map can be integrated to obtain a depth map, a step with which the authors are not concerned.
  • By substituting the dot product with the cosine of the angle between the two vectors you get Ix,y A = cos θx,y (2) where θ is therefore the angle of a cone around l̂ which the normal is constrained to[5].
  • This leaves one degree of freedom per pixel that is not constrained by the available information.
  • The authors propose a new SfS algorithm using such distributions within a belief propagation framework.

3 Belief Propagation

  • Such an equation can be represented by a graphical model where each variable is a node and nodes that interact via ψ functions are linked.
  • Message passing then occurs within this model, with messages passed along the links between the nodes.
  • The method uses belief propagation to obtain the maximum a posteriori estimate of a pairwise Markov random field where each node represents the orientation of the surface at a pixel in the image.

5 Method

  • The authors construct a graphical model, specifically a pairwise Markov random field.
  • For each node the authors have an irradiance value.
  • The formulation presented so far will converge to a bi-modal distribution at each node, with the modes corresponding to the concave and convex interpretations.
  • Each level’s messages are initialised with the previous, lower resolution, levels messages.

6 Message Passing

  • Doing this directly is not tractable, so the authors propose a novel three step procedure to solve this problem: 1. Convert the FB8 distribution into a sum of Fisher distributions.
  • All three steps involve approximation, in practise this proves not to be a problem.
  • To derive a Fisher-Bingham distribution from the convolved sum of Fisher distributions the authors first need the rotational component of the Bingham distribution, which they calculate with principal component analysis.
  • This is irrelevant as multiplicative constants have no effect.

7 Results & Analysis

  • The authors compare the presented algorithm to two others, Lee & Kuo[4] and Worthington & Hancock[5], using both synthetic and real data.
  • Figure 1 gives the four synthetic inputs used, figure 2 gives the results and ground truth for just one of the four inputs.
  • For the Mozart 90◦ input their approach consistently exceeds Lee & Kuo but does not do so well at getting a high percentage of spot on estimates as Worthington & Hancock.
  • The presented algorithm doing poorly as the light source moves away from [0, 0, 1]T can be put down to the bias introduced to handle the concave/convex ambiguity[12].
  • Looking at figure 5 Worthington & Hancock is quantitatively ahead, but looking at the actual output it is more blob than face, though some features are recognisable.

8 Conclusion

  • The authors have presented a new algorithm for solving the classical shape from shading algorithm, and demonstrated its competitiveness with previously published algorithms.
  • The use of belief propagation with FB8 distributions is in itself new, and a method for the convolution of a FB8 distribution by a Fisher distribution has been devised.
  • The algorithm does suffer a noticeable flaw in that overcoming the convex/concave problem biases the result, making the algorithm weak in the presence of oblique lighting.
  • An alternative solution to the current bias is an obvious area for future research.

Did you find this useful? Give us your feedback

Content maybe subject to copyright    Report

Citation for published version:
Fincham Haines, T & Wilson, RC 2008, 'Belief propagation with directional statistics for solving the shape-from-
shading problem', Paper presented at European Conference on Computer Vision, Marseille, France, 12/10/08 -
18/10/08 pp. 780-791.
Publication date:
2008
Document Version
Peer reviewed version
Link to publication
University of Bath
Alternative formats
If you require this document in an alternative format, please contact:
openaccess@bath.ac.uk
General rights
Copyright and moral rights for the publications made accessible in the public portal are retained by the authors and/or other copyright owners
and it is a condition of accessing publications that users recognise and abide by the legal requirements associated with these rights.
Take down policy
If you believe that this document breaches copyright please contact us providing details, and we will remove access to the work immediately
and investigate your claim.
Download date: 10. Aug. 2022

Belief Propagation with Directional Statistics for
solving the Shape-from-Shading problem
Tom S. F. Haines and Richard C. Wilson
The University of York,
Heslington, YO10 5DD, U.K.
Abstract. The Shape-from-Shading [SfS] problem infers shape from re-
flected light, collected using a camera at a single point in space only.
Reflected light alone does not provide sufficient constraint and extra
information is required; typically a smoothness assumption is made. A
surface with Lambertian reflectance lit by a single infinitely distant light
source is also typical.
We solve this typical SfS problem using belief propagation to marginalise
a probabilistic model. The key novel step is in using a directional prob-
ability distribution, the Fisher-Bingham distribution. This produces a
fast and relatively simple algorithm that does an effective job of both
extracting details and being robust to noise. Quantitative comparisons
with past algorithms are provided using both synthetic and real data.
1 Intro duction
The classical problem of Shape-from-Shading [SfS] uses irradiance captured by a
photo to calculate the shape of a scene. A known or inferred reflectance function
provides the relationship between irradiance and surface orientation. Surface ori-
entation may then be integrated to obtain a depth map. Horn[1] introduced this
problem with the assumptions of Lambertian reflectance, orthographic projec-
tion, constant known albedo, a smooth surface, no surface inter-reflectance and
a single infinitely distant light source in a known relation with the photo. This
constrained scenario has been tackled many times since[2–7, to cite a few], and
will again be the focus of this work.
Zhang et al.[8] surveyed the area in 1999, concluding that Lee and Kuo[4]
was the then state of the art. Lee and Kuo iteratively linearised the reflectance
map and solved the resulting linear equation using the multigrid method. More
recent methods include Worthington and Hancock[5], which iterated between
smoothing a normal map and correcting it to satisfy the reflectance informa-
tion; Prados et al[6], which solved the problem with viscosity solutions; and
Potetz[7] which used belief propagation. This last work by Potetz is particularly
relevant due to it also using belief propagation, though in all further details it
differs. Belief propagation estimates the marginals of a multivariate probability
distribution, often represented by a graphical model. Potetz makes use of two
variables per pixel, δx/δz and δyz, and uses various factor nodes to provide
the reflectance information, smoothness assumption and integrability constraint.

2 T. S. F. Haines and R. C. Wilson
Whilst this model can be implemented simply with discrete belief propagation
it would never converge and require a large number of labels, instead advanced
continuous methods are used.
The following three sections, 2 through to 4, cover the component details,
starting with the formulation, then belief propagation and finally directional
statistics. Section 5 brings it all together into a cohesive whole, and is followed
by section 6 which solves a specific problem. Following these sections we give
results in section 7 and conclusions in the final section.
2 Formulation
Using previously given assumptions, of Lambertian reflectance, constant known
albedo, orthographic projection, an infinitely distant light source and no inter-
reflection the irradiance at each pixel in the input image is given by
I
x,y
= A(
ˆ
l ·
ˆ
n
x,y
) (1)
where I
x,y
is the irradiance provided by the input image. A is the albedo and
ˆ
l R
3
, |
ˆ
l| = 1 is the direction to the infinitely distant light source; these are both
provided by the user.
ˆ
n
x,y
R
3
, |
ˆ
n
x,y
| = 1 is the normal map to be inferred as
the algorithm’s output. The normal map can be integrated to obtain a depth
map, a step with which we are not concerned. By substituting the dot product
with the cosine of the angle between the two vectors you get
I
x,y
A
= cos θ
x,y
(2)
where θ is therefore the angle of a cone around
ˆ
l which the normal is constrained
to[5]. This leaves one degree of freedom per pixel that is not constrained by the
available information. A smoothness assumption provides the extra constraint.
Directional statistics is the field of statistics on directions, such as surface
normals. Using a directional distribution allows the representation of surface
orientation with a single variable, rather than the two used in Potetz[7] and
many others. We propose a new SfS algorithm using such distributions within
a belief propagation framework. This leads to a belief propagation formulation
not dissimilar to Gaussian belief propagation[9] in its simplicity and speed.
3 Belief Propagation
Loopy sum-product belief propagation is a message passing algorithm for marginal-
ising an equation of the form
P (x) =
Y
vV
ψ
v
(y
v
) (3)
where x is a set of random variables and v, y
v
x. Such an equation can be
represented by a graphical model where each variable is a node and nodes that

BP with Directional Statistics for solving the SfS problem 3
interact via ψ functions are linked. In this case the random variables are di-
rections, represented by normalised vectors. Message passing then occurs within
this model, with messages passed along the links between the nodes. As the vari-
ables are directions the messages are probability distributions on directions. The
method uses belief propagation to obtain the maximum a posteriori estimate of
a pairwise Markov random field where each node represents the orientation of
the surface at a pixel in the image. The message passed from node p to node q
at iteration t is
m
t
pq
(
ˆ
x
q
) =
Z
ˆ
x
p
ψ
pq
(
ˆ
x
p
,
ˆ
x
q
)ψ
p
(
ˆ
x
p
)
Y
u(N\q)
m
t1
up
(
ˆ
x
p
)δ
ˆ
x
p
(4)
where ψ
pq
(
ˆ
x
p
,
ˆ
x
q
) is the compatibility between adjacent nodes, ψ
p
(
ˆ
x
p
) is the
prior on each node’s orientation and N is the 4-way neighbourhood of each
node. Once message passing has iterated sufficiently for convergence to occur
the belief at each node is
b
p
(
ˆ
x
p
) = ψ
p
(
ˆ
x
p
)
Y
uN
m
t1
up
(
ˆ
x
p
) (5)
From b
p
(
ˆ
x
p
) the most probable direction is selected as output.
4 Directional Statistics
The Fisher distribution, using proportionality rather than a normalising con-
stant, is given by
P
F
(
ˆ
x; u) exp(u
T
ˆ
x) (6)
where
ˆ
x, u R
3
and |
ˆ
x| = 1. Similarly, the Bingham distribution may be defined
as
P
B
(
ˆ
x; A) exp(
ˆ
x
T
A
ˆ
x) (7)
where A = A
T
. By multiplying the above we get the 8 parameter Fisher-
Bingham[10] [FB
8
] distribution
P
FB
8
(
ˆ
x; u, A) exp(u
T
ˆ
x +
ˆ
x
T
A
ˆ
x) (8)
All three of these distributions have the advantage that they can be multiplied
together without introducing further variables, which is critical in a belief prop-
agation framework. We may decompose the FB
8
distribution. As A is symmetric
we may apply the eigen-decomposition to obtain A = BDB
T
, where B is or-
thogonal and D diagonal. This allows us to write
P
FB
8
(
ˆ
x; u, A) exp(v
T
ˆ
y +
ˆ
y
T
D
ˆ
y) (9)
where v = B
T
u and
ˆ
y = B
T
ˆ
x. As |
ˆ
y| = 1 we may offset D by an arbitrary
multiple of the identity matrix, this allows any given entry to be set to 0. We
can therefore consider it the case that D = Diag(α, β, 0), with α > 0 and β > 0
so that
P
FB
8
(
ˆ
x; u, A) exp(v
T
ˆ
y + α
ˆ
y
2
x
+ β
ˆ
y
2
y
) (10)

4 T. S. F. Haines and R. C. Wilson
For convenience we may represent the FB
8
distribution as
exp(u
T
ˆ
x +
ˆ
x
T
A
ˆ
x) = [u, A] (11)
Using this notation multiplication is
[u, A][v, B] = [u + v, A + B] (12)
Various distributions may be represented by the Fisher-Bingham distribu-
tion, of particular use is the Bingham-Mardia distribution[11]
exp(k(
ˆ
u
T
ˆ
x cos θ)
2
) = [2k cos(θ)
ˆ
u, k
ˆ
u
ˆ
u
T
] (13)
where
ˆ
u is the direction of the axis of a cone and θ the angle of that cone.
This distribution has a small circle as its maximum, which allows the irradiance
information (Eq. 2) to be expressed as a FB
8
distribution.
5 Method
We construct a graphical model, specifically a pairwise Markov random field.
Each node of the model is a random variable that represents an unknown normal
on the surface. Belief propagation, as described in section 3, is then used to
determine the marginal distribution for each node. To define the distribution to
be marginalised two sources are used: the irradiance information (Eq. 2) and a
smoothness assumption.
We model the smoothing assumption on the premise that adjacent points on
the surface will be more likely to have a small angular difference than a large
angular difference. We can express this idea by setting
ψ
pq
(
ˆ
x
p
,
ˆ
x
q
) = exp(k(
ˆ
x
T
p
ˆ
x
q
)) (14)
where ψ
pq
(
ˆ
x
p
,
ˆ
x
q
) is from the message passing equation (Eq. 4). This is a Fisher
distribution with concentration k. Using FB
8
for the messages and dropping
equation 14 into equation 4 we have
m
t
pq
(
ˆ
x
q
) =
Z
S
2
exp(k(
ˆ
x
T
p
ˆ
x
q
))t(
ˆ
x
p
)δ
ˆ
x
p
(15)
t(
ˆ
x
p
) = ψ
p
(
ˆ
x
p
)
Y
u(N\q)
m
t1
up
(
ˆ
x
p
) (16)
Message passing therefore consists of two steps: calculating t(
ˆ
x
p
) by multiplying
FB
8
distributions together using equation 12, followed by convolution of the
resulting FB
8
distribution by a Fisher distribution to get m
t
pq
(
ˆ
x
q
). The next
section documents a method for doing the convolution.
For each node we have an irradiance value. Using equations 2 and 13 we can
define a distribution
[2k
I
x,y
A
ˆ
l, k
ˆ
l
ˆ
l
T
] (17)

Citations
More filters
Journal ArticleDOI
TL;DR: Using results from stochastic theory, it is shown that the state-dependent measurement uncertainty can be evaluated exactly and derive linear measurement models for applications that use position, surface-normal, and pose measurements.
Abstract: Pose estimation is central to several robotics applications such as registration, hand–eye calibration, and simultaneous localization and mapping (SLAM). Online pose estimation methods typically us...

31 citations


Cites methods from "Belief Propagation with Directional..."

  • ...The Bingham distribution is widely used in analyzing palaeomagnetic data (Kunze and Schaeben, 2004; Onstott, 1980), computer vision (Haines and Wilson, 2008), and directional statistics (Bingham, 1974) and recently in robotics (Gilitschenski et al., 2014, 2016; Glover et al., 2012; Kurz et al.,…...

    [...]

  • ...The Bingham distribution is widely used in analyzing palaeomagnetic data (Kunze and Schaeben, 2004; Onstott, 1980), computer vision (Haines and Wilson, 2008), and directional statistics (Bingham, 1974) and recently in robotics (Gilitschenski et al....

    [...]

Proceedings ArticleDOI
12 Jul 2017
TL;DR: To the best of the knowledge, this approach is the first implementation to use a Bingham distribution for 6 DoF pose estimation, and takes fewer iterations to converge onto the correct pose estimate.
Abstract: Pose estimation is central to several robotics applications such as registration, hand-eye calibration, SLAM, etc. Online pose estimation methods typically use Gaussian distributions to describe the uncertainty in the pose parameters. Such a description can be inadequate when using parameters such as unit-quaternions that are not unimodally distributed. A Bingham distribution can effectively model the uncertainty in unit-quaternions, as it has antipodal symmetry and is defined on a unit-hypersphere. A combination of Gaussian and Bingham distributions is used to develop a linear filter that accurately estimates the distribution of the pose parameters, in their true space. To the best of our knowledge our approach is the first implementation to use a Bingham distribution for 6 DoF pose estimation. Experiments assert that this approach is robust to initial estimation errors as well as sensor noise. Compared to state of the art methods, our approach takes fewer iterations to converge onto the correct pose estimate. The efficacy of the formulation is illustrated with a number of simulated examples on standard datasets as well as real-world experiments.

23 citations


Cites methods from "Belief Propagation with Directional..."

  • ...The Bingham distribution is widely used to analyze paleomagnetic data [17], computer vision [12] and directional statistics [3] and recently in robotics [18, 9, 10, 8]....

    [...]

Proceedings ArticleDOI
16 Jun 2012
TL;DR: This paper shows that even if the knowledge of the reflectance/illumination is inaccurate, the first derivatives of the photometrically measured orientation can be accurately estimated at the surface points where they have small values and proposes a probabilistic framework to quantitate the (in)accuracy of the knowledge and connect it to the estimation accuracy of these derivatives.
Abstract: In this paper, we present a method for accurately estimating the shape of an object by integrating the surface orientation measured by photometric stereo and the position measured by some range-measuring method. We first show that even if the knowledge of the reflectance/illumination is inaccurate, the first derivatives of the photometrically measured orientation can be accurately estimated at the surface points where they have small values. We propose a probabilistic framework to quantitate the (in)accuracy of the knowledge and connect it to the estimation accuracy of these derivatives. Based on this framework, we consider optimally integrating the surface orientation and position to obtain the object shape with higher accuracy. The integration reduces to an optimization problem, and it is efficiently solved by belief propagation. We present several experimental results showing the effectiveness of the proposed approach.

16 citations


Cites methods from "Belief Propagation with Directional..."

  • ...[6, 7] describe a method for integrating binocular stereo with SFS that also uses LBP....

    [...]

Proceedings ArticleDOI
01 Sep 2009
TL;DR: This paper shows how machine learning can be applied to an SFS model with a large number of parameters, and learns a set of weighting parameters that use the intensity of each pixel in the image to gauge the importance of thatpixel in the shape reconstruction process.
Abstract: Shape-from-shading (SFS) methods tend to rely on models with few parameters because these parameters need to be hand-tuned. This limits the number of different cues that the SFS problem can exploit. In this paper, we show how machine learning can be applied to an SFS model with a large number of parameters. Our system learns a set of weighting parameters that use the intensity of each pixel in the image to gauge the importance of that pixel in the shape reconstruction process. We show empirically that this leads to a significant increase in the accuracy of the recovered surfaces. Our learning approach is novel in that the parameters are optimized with respect to actual surface output by the system. In the first, offline phase, a hemisphere is rendered using a known illumination direction. The isophotes in the resulting reflectance map are then modelled using Gaussian mixtures to obtain a parametric representation of the isophotes. This Gaussian parameterization is then used in the second phase to learn intensity-based weights using a database of 3D shapes. The weights can also be optimized for a particular input image.

12 citations


Cites background or methods from "Belief Propagation with Directional..."

  • ...Our untrained system is comparable to [6] and [23] while the trained one is quantitatively superior for more relaxed angular errors....

    [...]

  • ...Table 1 provides a quantitative comparison of our system with [6] and [23] for the reconstruction of the Mozart surface....

    [...]

  • ...Even without learning the weighting parameters, the system presented so far compares favorably against [6] and [23] while for the more relaxed angular errors, the trained system is quantitatively superior....

    [...]

Book ChapterDOI
29 Aug 2009
TL;DR: A new framework for shape-from-shading which relies on a novel regularisation term which preserves surface structure is presented, which can recover stable surface estimates from both synthetic and real world images of complex objects, even under extreme illumination.
Abstract: In this paper we present a new framework for shape-from-shading which relies on a novel regularisation term which preserves surface structure. The resulting algorithm is both robust and accurate. We show that it can recover stable surface estimates from both synthetic and real world images of complex objects, even under extreme illumination.

7 citations


Cites background or methods from "Belief Propagation with Directional..."

  • ...More recently, several authors have posed shape-from-shading in terms of pairwise Markov Random Fields [6, 7 ,8]....

    [...]

  • ...Ground truth in first column, remainder show: proposed algorithm, [4] and [ 7 ] respectively....

    [...]

  • ...Haines and Wilson [ 7 ] describe surface normal direction probabilistically in terms of a Fisher-Bingham distribution for each pixel....

    [...]

  • ...Coil database). In Fig. 1, we show a novel view of the surfaces recovered using the proposed algorithm, Worthington and Hancock [4] and Haines and Wilson [ 7 ]....

    [...]

  • ...Fig. 1. Surfaces recovered from the input image shown in the top left panel of Fig. 2. From left to right: ground truth, proposed algorithm, [4], [ 7 ]....

    [...]

References
More filters
Journal ArticleDOI
TL;DR: Six well-known SFS algorithms are implemented and compared, and the performance of the algorithms was analyzed on synthetic images using mean and standard deviation of depth error, mean of surface gradient error, and CPU timing.
Abstract: Since the first shape-from-shading (SFS) technique was developed by Horn in the early 1970s, many different approaches have emerged. In this paper, six well-known SFS algorithms are implemented and compared. The performance of the algorithms was analyzed on synthetic images using mean and standard deviation of depth (Z) error, mean of surface gradient (p, q) error, and CPU timing. Each algorithm works well for certain images, but performs poorly for others. In general, minimization approaches are more robust, while the other approaches are faster.

1,879 citations


"Belief Propagation with Directional..." refers background or methods in this paper

  • ...Synthetic inputs, derived from the set used by Zhang et al[8]....

    [...]

  • ...Zhang et al.[8] surveyed the area in 1999, concluding that Lee and Kuo[4] was the then state of the art....

    [...]

Journal ArticleDOI
TL;DR: Algorithmic techniques are presented that substantially improve the running time of the loopy belief propagation approach and reduce the complexity of the inference algorithm to be linear rather than quadratic in the number of possible labels for each pixel, which is important for problems such as image restoration that have a large label set.
Abstract: Markov random field models provide a robust and unified framework for early vision problems such as stereo and image restoration. Inference algorithms based on graph cuts and belief propagation have been found to yield accurate results, but despite recent advances are often too slow for practical use. In this paper we present some algorithmic techniques that substantially improve the running time of the loopy belief propagation approach. One of the techniques reduces the complexity of the inference algorithm to be linear rather than quadratic in the number of possible labels for each pixel, which is important for problems such as image restoration that have a large label set. Another technique speeds up and reduces the memory requirements of belief propagation on grid graphs. A third technique is a multi-grid method that makes it possible to obtain good results with a small fixed number of message passing iterations, independent of the size of the input images. Taken together these techniques speed up the standard algorithm by several orders of magnitude. In practice we obtain results that are as accurate as those of other global methods (e.g., using the Middlebury stereo benchmark) while being nearly as fast as purely local methods.

1,560 citations

Proceedings ArticleDOI
19 Jul 2004
TL;DR: New algorithmic techniques are presented that substantially improve the running time of the belief propagation approach and reduce the complexity of the inference algorithm to be linear rather than quadratic in the number of possible labels for each pixel.
Abstract: Markov random field models provide a robust and unified framework for early vision problems such as stereo, optical flow and image restoration. Inference algorithms based on graph cuts and belief propagation yield accurate results, but despite recent advances are often still too slow for practical use. In this paper we present new algorithmic techniques that substantially improve the running time of the belief propagation approach. One of our techniques reduces the complexity of the inference algorithm to be linear rather than quadratic in the number of possible labels for each pixel, which is important for problems such as optical flow or image restoration that have a large label set. A second technique makes it possible to obtain good results with a small fixed number of message passing iterations, independent of the size of the input images. Taken together these techniques speed up the standard algorithm by several orders of magnitude. In practice we obtain stereo, optical flow and image restoration algorithms that are as accurate as other global methods (e.g., using the Middlebury stereo benchmark) while being as fast as local techniques.

889 citations


"Belief Propagation with Directional..." refers background in this paper

  • ...This results in less message passes being required for overall convergence[13]....

    [...]

Journal ArticleDOI
14 Jan 1988-Nature
TL;DR: In this paper, the human visual system can rapidly and accurately derive the three-dimensional orientation of surfaces by using variations in image intensity alone, which is one of the most important yet poorly understood aspects of human vision.
Abstract: The human visual system can rapidly and accurately derive the three-dimensional orientation of surfaces by using variations in image intensity alone. This ability to perceive shape from shading is one of the most important yet poorly understood aspects of human vision. Here we present several findings which may help reveal computational mechanisms underlying this ability. First, we find that perception of shape from shading is a global operation which assumes that there is only one light source illuminating the entire visual image. This implies that if two identical objects are viewed simultaneously and illuminated from different angles, then we would be able to perceive three-dimensional shape accurately in only one of them at a time. Second, three-dimensional shapes that are defined exclusively by shading can provide tokens for the perception of apparent motion, suggesting that the motion mechanism is remarkably versatile in the kinds of inputs it can use. Lastly, the occluding edges which delineate an object from its background can also powerfully influence the perception of three-dimensional shape from shading.

695 citations

Dissertation
01 Nov 1970
TL;DR: A method will be described for finding the shape of a smooth opaque object from a monocular image, given a knowledge of the surface photometry, the position of the light-source and certain auxiliary information to resolve ambiguities, complementary to the use of stereoscopy.
Abstract: A method will be described for finding the shape of a smooth opaque object from a monocular image, given a knowledge of the surface photometry, the position of the light-source and certain auxiliary information to resolve ambiguities This method is complementary to the use of stereoscopy which relies on matching up sharp detail and will fail on smooth objects Until now the image processing of single views has been restricted to objects which can meaningfully be considered two-dimensional or bounded by plane surfaces It is possible to derive a first-order non-linear partial differential equation in two unknowns relating the intensity at the image points to the shape of the object This equation can be solved by means of an equivalent set of five ordinary differential equations A curve traced out by solving this set of equations for one set of starting values is called a characteristic strip Starting one of these strips from each point on some initial curve will produce the whole solution surface The initial curves can usually be constructed around so-called singular points A number of applications of this method will be discussed including one to lunar topography and one to the scanning electron microscope In both of these cases great simplifications occur in the equations A note on polyhedra follows and a quantitative theory of facial make-up is touched upon An implementation of some of these ideas on the PDP-6 computer with its attached image-dissector camera at the Artificial Intelligence Laboratory will be described, and also a nose-recognition program

634 citations


"Belief Propagation with Directional..." refers background in this paper

  • ...Horn[1] introduced this problem with the assumptions of Lambertian reflectance, orthographic projection, constant known albedo, a smooth surface, no surface inter-reflectance and a single infinitely distant light source in a known relation with the photo....

    [...]

Frequently Asked Questions (8)
Q1. What contributions have the authors mentioned in the paper "Belief propagation with directional statistics for solving the shape-from-shading problem" ?

In this paper, the authors use belief propagation to marginalise a probabilistic model and use a directional probability distribution, the Fisher-Bingham distribution, to estimate the marginals of a multivariate probability distribution. 

An alternative solution to the current bias is an obvious area for future research. 

The presented algorithm doing poorly as the light source moves away from [0, 0, 1]T can be put down to the bias introduced to handle the concave/convex ambiguity[12]. 

For the head image the run time is over 12 hours for Lee & Kuo, 54 minutes for Worthington and Hancock and 9.5 minutes for the presented algorithm on a 2Ghz Athlon. 

The method uses belief propagation to obtain the maximum a posteriori estimate of a pairwise Markov random field where each node represents the orientation of the surface at a pixel in the image. 

To define the distribution to be marginalised two sources are used: the irradiance information (Eq. 2) and a smoothness assumption. 

Using previously given assumptions, of Lambertian reflectance, constant known albedo, orthographic projection, an infinitely distant light source and no interreflection the irradiance at each pixel in the input image is given byIx,y = A(̂l · n̂x,y) (1)where Ix,y is the irradiance provided by the input image. 

Using equations 2 and 13 the authors can define a distributionΩ[2k Ix,y A l̂,−kl̂̂lT ] (17)In principle ψp(x̂p), from equation 16, can be set to this Bingham-Mardia distribution to complete the model to be marginalised.