scispace - formally typeset
Search or ask a question
Journal ArticleDOI

A survey of content based 3D shape retrieval methods

01 Sep 2008-Multimedia Tools and Applications (Springer US)-Vol. 39, Iss: 3, pp 441-471
TL;DR: A survey of content-based 3D shape retrieval methods can be found in this article, where the authors evaluate the suitability of several requirements of content based shape retrieval, such as shape representation requirements, properties of dissimilarity measures, efficiency, discrimination abilities, ability to perform partial matching, robustness, and necessity of pose normalization.
Abstract: Recent developments in techniques for modeling, digitizing and visualizing 3D shapes has led to an explosion in the number of available 3D models on the Internet and in domain-specific databases. This has led to the development of 3D shape retrieval systems that, given a query object, retrieve similar 3D objects. For visualization, 3D shapes are often represented as a surface, in particular polygonal meshes, for example in VRML format. Often these models contain holes, intersecting polygons, are not manifold, and do not enclose a volume unambiguously. On the contrary, 3D volume models, such as solid models produced by CAD systems, or voxels models, enclose a volume properly. This paper surveys the literature on methods for content based 3D retrieval, taking into account the applicability to surface models as well as to volume models. The methods are evaluated with respect to several requirements of content based 3D shape retrieval, such as: (1) shape representation requirements, (2) properties of dissimilarity measures, (3) efficiency, (4) discrimination abilities, (5) ability to perform partial matching, (6) robustness, and (7) necessity of pose normalization. Finally, the advantages and limitations of the several approaches in content based 3D shape retrieval are discussed.

Summary (6 min read)

1. Introduction

  • The advancement of modeling, digitizing and visualizing techniques for 3D shapes has led to an increasing amount of 3D models, both on the Internet and in domain-specific databases.
  • Unlike text documents, 3D models are not easily retrieved.
  • In contrast, content based 3D shape retrieval methods, that use shape properties of the 3D models to search for similar models, work better than text based methods [58].
  • Matching and indexing are often part of the retrieval process.

2.1. 3D shape retrieval framework

  • 1 consists of a database with an index structure created offline and an online query engine.
  • Each 3D model has to be identified with a shape descriptor, providing a compact overall description of the shape.
  • To efficiently search a large collection online, an indexing data structure and searching algorithm should be available.
  • The similarity between two descriptors is quantified by a dissimilarity measure.
  • Finally, the retrieved models can be visualized.

2.2. Shape representations

  • An important issue is the type of shape representation(s) that a shape retrieval system accepts.
  • Most of the 3D models found on the World Wide Web are meshes defined in a file format supporting visual appearance.
  • In particular, they are represented by “polygon soups”, consisting of unorganized sets of polygons.
  • Also, in general these models are not “watertight” meshes, i.e. they do not enclose a volume.
  • By contrast, for volume models retrieval methods depending on a properly defined volume can be applied.

2.3. Measuring similarity

  • In order to measure how similar two objects are, it is necessary to compute distances between pairs of descriptors using a dissimilarity measure.
  • This property is very strong for a high-level shape descriptor, and is often not satisfied.
  • This is not a severe drawback, if the loss of uniqueness depends on negligible details.
  • When all the properties (i)-(iv) hold, the dissimilarity measure is called a metric.
  • If a dissimilarity measure is a pseudo-metric, the triangle inequality can be applied to make retrieval more efficient [7, 83].

2.4. Efficiency

  • For large shape collections, it is inefficient to sequentially match all objects in the database with the query object.
  • Because retrieval should be fast, efficient indexing search structures are needed to support efficient retrieval.
  • Since for query by example the shape descriptor is computed online, it is reasonable to require that the shape descriptor computation is fast enough for interactive querying.

2.5. Discriminative power

  • A shape descriptor should capture properties that discriminate objects well.
  • The judgement of the similarity of the shapes of two 3D objects is somewhat subjective, depending on the user preference or the application at hand.
  • E.g. for solid modeling applications often topology properties such as the numbers of holes in a model are more important than minor differences in shapes.
  • On the contrary, if a user searches for models looking visually similar the existence of a small hole in the model, may be of no importance to the user.

2.6. Partial matching

  • In contrast to global shape matching, partial matching finds a shape of which a part is similar to a part of another shape.
  • Partial matching can be applied if 3D shape models are not complete, e.g. for objects obtained by laser scanning from one or two directions only.
  • Another application is the search for “3D scenes” containing an instance of the query object.
  • Also, this feature can potentially give the user flexibility towards the matching problem, if parts of interest of an object can be selected or weighted by the user.

2.7. Robustness

  • It is often desirable that a shape descriptor is insensitive to noise and small extra features, and robust against arbitrary topological degeneracies, e.g. if it is obtained by laser scanning.
  • Also, if a model is given in multiple levels-ofdetail, representations of different levels should not differ significantly from the original model.

2.8. Pose normalization

  • In the absence of prior knowledge, 3D models have arbitrary scale, orientation and position in the 3D space.
  • Because not all dissimilarity measures are invariant under rotation and translation, it may be necessary to place the 3D models into a canonical coordinate system.
  • For volume models it is natural to translate the center of mass to the origin.
  • Eral not possible, because they have not to enclose a volume.
  • Similar eigenvalues may imply an almost symmetrical mass distribution around an axis (e.g. nearly cylindrical shapes) or around the center of mass (e.g. nearly spherical shapes).

3. Shape matching methods

  • In this section the authors discuss 3D shape matching methods.
  • The authors divide shape matching methods in three broad categories: (1) feature based methods, (2) graph based methods and (3) other methods.
  • Note, that the classes of these methods are not completely disjoined.
  • A graph-based shape descriptor, in some way, describes also the global feature distribution.
  • By this point of view the taxonomy should be a graph.

3.1. Feature based methods

  • In the context of 3D shape matching, features denote geometric and topological properties of 3D shapes.
  • Feature based methods can be divided into four categories according to the type of shape features used: (1) global features, (2) global feature distributions, (3) spatial maps, and (4) local features.
  • Feature based Graph based Other Global features Spatial map Local features Global feature distribution Model graph Skeleton Reeb graph View based Volumetric error based Weighted point set based Deformation based Figure 3.
  • Retrieving the k best matches for a 3D query model is equivalent to solving the k nearest neighbors problem.
  • For this purpose, for each surface point a descriptor is used instead of a single descriptor.

3.1.1. Global feature based similarity

  • Global features characterize the global shape of a 3D model.
  • Zhang and Chen [88] describe methods to compute global features such as volume, area, statistical moments, and Fourier transform coefficients efficiently.
  • Paquet et al. [67] apply bounding boxes, cords-based, moments-based and wavelets-based descriptors for 3D shape matching.
  • Kazhdan et al. [42] describe a reflective symmetry descriptor as a 2D function associating a measure of reflective symmetry to every plane (specified by 2 parameters) through the model’s centroid.
  • They improve the retrieval performance by an active learning phase in which a human annotator assigns attributes such as airplane, car, body, and so on to a number of sample models.

3.1.2. Global feature distribution based similarity

  • The concept of global feature based similarity has been refined recently by comparing distributions of global features instead of the global features directly.
  • Osada et al. [66] introduce and compare shape distributions, which measure properties based on distance, angle, area and volume measurements between random surface points.
  • They evaluate the similarity between the objects using a pseudo-metric that measures distances between distributions.
  • Since their method requires that a line segment can be classified as lying inside or outside the model it is required that the model defines a volume properly.
  • Ohbuchi et al. [65] improved this method further by a multi-resolution approach computing a number of alpha-shapes at different scales, and computing for each alpha-shape their Absolute Angle-Distance descriptor.

3.1.3. Spatial map based similarity

  • Spatial maps are representations that capture the spatial location of an object.
  • The shape histograms are defined on concentric shells and sectors around a model’s centroid and compare shapes using a quadratic form distance measure to compare the histograms taking into account the distances between the shape histogram bins.
  • Their method requires pose normalization to provide rotational invariance.
  • Therefore, he compares his ray-based spherical harmonic method [85] and a variation of it using functions defined on concentric shells with the voxel-based spherical harmonics shape descriptor proposed by Funkhouser et al. [30].
  • Their experimental results show that the eigenvalue method and the cover sequence method outperform the volume and solid-angle feature method.

3.1.4. Local feature based similarity

  • Local feature based methods provide various approaches to take into account the surface shape in the neighbourhood of points on the boundary of the shape.
  • Unfortunately, the method is limited to objects which contain no holes, i.e. have genus zero.
  • Zaharia and Prêteux [87] describe the 3D Shape Spectrum Descriptor, which is defined as the histogram of shape index values, calculated over an entire mesh.
  • They apply spin images to recognize models in a cluttered 3D scene.
  • In the global matching stage, correspondences between similar sample points on the two shapes are found.

3.2. Graph based methods

  • In general, the feature based methods discussed in the previous section take into account only the pure geometry of the shape.
  • Graph based methods can be divided into three broad categories according to the type of graph used: (1) model graphs, (2) Reeb graphs, and (3) skeletons.
  • Efficient computation of existing graph metrics for general graphs is not possible: computing the edit distance is NP-hard [90] and computing the maximal common subgraph [32] is even NP-complete.
  • Polynomial solutions can be obtained for directed acyclic graphs such as shock graphs.
  • It is obtained by exhaustively searching for the optimal deformation path between two 2D shapes, and using the cost of this path as a distance between two shapes.

3.2.1. Model graph based similarity

  • Model graph based similarity methods are applicable to 3D solid models as produced by CAD most systems.
  • The most dominant solid modeling representation methods are Boundary Representation (B-rep) and Constructive Solid Geometry (CSG).
  • By contrast to the facets in meshes, the faces of a B-rep may be represented as freeform surfaces.
  • McWerther et al. [53, 54, 55], and El-Mehalawi and Miller [26, 27] apply model signature graphs, that both model the topology of a shape model by a graph structure, and map a number of engineering features to a highdimensional feature vector.
  • Therefore, McWerther et al. [53, 54, 55] apply approximate graph comparison using the spectrum of the graph.

3.2.2. Skeleton based similarity

  • Sundar et al. [76] use as a shape descriptor a skeletal graph that encodes geometric and topological information.
  • This topological signature vector is defined recursively over the subgraphs of the node using eigenvalues of their adjacency matrices.
  • Sundar et al. [76] match two shapes by approximate comparison of their hierarchical skeletal graphs using Figure 5.
  • They obtain a skeletal graph by a thinning algorithm iteratively eroding voxels until a one-voxel width skeleton is left.
  • Their results show the feasibility of their approach for relatively small volume models.

3.2.3. Reeb graph based similarity

  • Biasotti et al. [15] compare Reeb graphs obtained by using different quotient functions f and highlight how the choice of f determines the final matching result.
  • Their method uses Reeb graphs based on a quotient function defined by an integral geodesic distance.
  • Since for solid models topological insensitivity is important, they conclude that the Reeb graph technique requires some improvements.
  • Bespalov et al. [13] present preliminary research on a modification of Hilaga’s method, which computes a scale-space decomposition of a shape, represented as a rooted undirected tree instead of a Reeb graph.
  • In summary, Reeb graphs defined by a geodesic distance are suited for matching articulated objects, but they are sensitive to topological changes.

3.3.1. View based similarity

  • The main idea of view based similarity methods is that two 3D models are similar, if they look similar from all viewing angles.
  • Löffler [49] applies view based similarity to retrieve 3D models using a 2D query interface.
  • The number of views of each object is kept small by clustering views, and by representing each cluster with one view, which is represented by a shock graph.
  • Therefore, a lightfield descriptor is introduced, which compares ten silhouettes of the 3D shape obtained from ten viewing angles distributed evenly on the viewing sphere.
  • The running time of the retrieval process is reduced by a clever multi-step approach supporting early rejection of non-relevant models.

3.3.2. Volumetric error based similarity

  • Novotni and Klein [60] describe a geometry similarity approach to 3D shape matching based on calculating a volumetric error between one object and a sequence of offset hulls of the other object.
  • A drawback of their method is that their dissimilarity measure is not symmetric and does not obey the triangle inequality.
  • Sánchez-Cruz and Bribiesca present a method [69] relating the volumetric error between two voxelized shapes to a transportation distance measuring how many voxels have to move and how far to change one shape into another.
  • Since in general these voxelized shapes will have many voxels, the computation of this transportation distance will be high.

3.3.3. Weighted point set based similarity

  • Another approach is based on shape descriptors consisting of weighted 3D points.
  • They match weighted point sets by a measure which does not obey the triangle inequality.
  • Tangelder and Veltkamp [78] use as shape descriptor a weighted point sets consisting of points with a high curvature value.
  • They compare weighted point sets using a variant of the Earth Mover’s distance, the proportional transportation distance, which obeys the triangle inequality [33].
  • They utilize this multiresolution approximation to implement an algorithm to simultaneously align and compare two shapes.

3.3.4. Deformation based similarity

  • A number of methods [20, 8] compare a pair of 2D shapes by measuring the amount of deformation required to register the shapes exactly.
  • These methods depend on the natural arc length parameterization of their contours, which is not straightforwardly generalized to 3D.
  • As a result, methods that apply deformation for shape recovery [79] or shape evolution [23] are very difficult to apply for 3D shape matching.

4. Overview and conclusions

  • In this section the authors summarize their discussion on shape matching methods from the previous section and indicate directions for further research.
  • Hence, these methods are less robust than feature based methods.
  • The Princeton Shape Benchmark database contains 1,834 3D models downloaded from the web, subdivided into a training set and a test set, containing 907 models each, classified into 90 and 92 classes respectively.
  • The vantage method [83] can be applied to compute an efficient index structure for pseudo-metrics that require much computing time, also known as Efficient indexing.
  • Since the capabilities of feature based methods (fast computation, pseudo-metric, discriminative abilities, robustness) are orthogonal to the capabilities of graph based methods (partial matching, no normalization required), combining different approaches may produce more powerful shape matching methods.

Did you find this useful? Give us your feedback

Content maybe subject to copyright    Report

A Survey of Content Based 3D Shape Retrieval Methods
Johan W.H. Tangelder and Remco C. Veltkamp
Institute of Information and Computing Sciences, Utrecht University
hanst@cs.uu.nl, Remco.Veltkamp@cs.uu.nl
Abstract
Recent developments in techniques for modeling, digitiz-
ing and visualizing 3D shapes has led to an explosion in
the number of available 3D models on the Internet and in
domain-specific databases. This has led to the development
of 3D shape retrieval systems that, given a query object,
retrieve similar 3D objects. For visualization, 3D shapes
are often represented as a surface, in particular polygo-
nal meshes, for example in VRML format. Often these mod-
els contain holes, intersecting polygons, are not manifold,
and do not enclose a volume unambiguously. On the con-
trary, 3D volume models, such as solid models produced
by CAD systems, or voxels models, enclose a volume prop-
erly. This paper surveys the literature on methods for con-
tent based 3D retrieval, taking into account the applicabil-
ity to surface models as well as to volume models. The meth-
ods are evaluated with respect to several requirements of
content based 3D shape retrieval, such as: (1) shape repre-
sentation requirements, (2) properties of dissimilarity mea-
sures, (3) efficiency, (4) discrimination abilities, (5) ability
to perform partial matching, (6) robustness, and (7) neces-
sity of pose normalization. Finally, the advantages and lim-
its of the several approaches in content based 3D shape re-
trieval are discussed.
1. Introduction
The advancementof modeling, digitizing and visualizing
techniques for 3D shapes has led to an increasing amount
of 3D models, both on the Internet and in domain-specific
databases. This has led to the development of the first exper-
imental search engines for 3D shapes, such as the 3D model
search engine at Princeton university [2, 57], the 3D model
retrieval system at the National Taiwan University [1, 17],
the Ogden IV system at the National Institute of Multimedia
Education, Japan [62, 77], the 3D retrieval engine at Utrecht
University [4, 78], and the 3D model similarity search en-
gine at the University of Konstanz [3, 84].
Laser scanning has been applied to obtain archives
recording cultural heritage like the Digital Michelan-
gelo Project [25, 48], and the Stanford Digital Formae
Urbis Romae Project [75]. Furthermore, archives contain-
ing domain-specific shape models are now accessible by
the Internet. Examples are the National Design Repos-
itory, an online repository of CAD models [59, 68],
and the Protein Data Bank, an online archive of struc-
tural data of biological macromolecules [10, 80].
Unlike text documents, 3D models are not easily re-
trieved. Attempting to find a 3D model using textual an-
notation and a conventional text-based search engine would
not work in many cases. The annotations added by human
beings depend on language, culture, age, sex, and other fac-
tors. They may be too limited or ambiguous. In contrast,
content based 3D shape retrieval methods, that use shape
properties of the 3D models to search for similar models,
work better than text based methods [58].
Matching is the process of determining how similar two
shapes are. This is often done by computing a distance. A
complementary process is indexing. In this paper, indexing
is understood as the process of building a datastructure to
speed up the search. Note that the term indexing is also of-
ten used for the identification of features in models, or mul-
timedia documents in general. Retrieval is the process of
searching and delivering the query results. Matching and in-
dexing are often part of the retrieval process.
Recently, a lot of researchers have investigated the spe-
cific problem of content based 3D shape retrieval. Also, an
extensive amount of literature can be found in the related
fields of computer vision, object recognition and geomet-
ric modelling. Survey papers to this literature have been
provided by Besl and Jain [11], Loncaric [50] and Camp-
bell and Flynn [16]. For an overview of 2D shape match-
ing methods we refer the reader to the paper by Veltkamp
[82]. Unfortunately, most 2D methods do not generalize di-
rectly to 3D model matching. Work in progress by Iyer et
al. [40] provides an extensive overview of 3D shape search-
ing techniques. Atmosukarto and Naval [6] describe a num-
ber of 3D model retrieval systems and methods, but do not
provide a categorization and evaluation.
In contrast, this paper evaluates 3D shape retrieval meth-
ods with respect to several requirements on content based
3D shape retrieval, such as: (1) shape representation re-
quirements, (2) properties of dissimilarity measures, (3) ef-
ficiency, (4) discrimination abilities, (5) ability to perform
partial matching, (6) robustness, and (7) necessity of pose

3Dmodel
database
descriptors
index
structure
query
sketch
result
query
descriptor
fetching
descriptor
extraction
descriptor
extraction
matching
index
construction
visualization
query
formulation
browsing
queryby
example
directquery
model
id’s
offline
online
Figure 1. Conceptual framework for shape retrieval.
normalization. In section 2 we discuss several aspects of 3D
shape retrieval. The literature on 3D shape matching meth-
ods is discussed in section 3 and evaluated in section 4.
2. 3D shape retrieval aspects
In this section we discuss several issues related to 3D
shape retrieval.
2.1. 3D shape retrieval framework
At a conceptual level, a typical 3D shape retrieval frame-
work as illustrated by fig. 1 consists of a database with an
index structure created
offline
and an
online
query engine.
Each 3D model has to be identified with a shape descrip-
tor, providing a compact overall description of the shape.
To efficiently search a large collection online, an indexing
data structure and searching algorithm should be available.
The
online
query engine computes the query descriptor, and
models similar to the query model are retrieved by match-
ing descriptors to the query descriptor from the index struc-
ture of the database. The similarity between two descriptors
is quantified by a dissimilarity measure. Three approaches
can be distinguished to provide a query object: (1) browsing
to select a new query object from the obtained results, (2)
a direct query by providing a query descriptor, (3) query by
example by providing an existing 3D model or by creating
a 3D shape query from scratch using a 3D tool or sketch-
ing 2D projections of the 3D model. Finally, the retrieved
models can be visualized.
2.2. Shape representations
An important issue is the type of shape representation(s)
that a shape retrievalsystem accepts. Most of the 3D models
found on the World Wide Web are meshes defined in a file
format supporting visual appearance. Currently, the most
common format used for this purpose is the Virtual Real-
ity Modeling Language (VRML) format. Since these mod-
els have been designed for visualization, they often contain
only geometry and appearance attributes. In particular, they
are represented by “polygon soups”, consisting of unorga-
nized sets of polygons. Also, in general these models are
not “watertight” meshes, i.e. they do not enclose a volume.
By contrast, for volume models retrieval methods depend-
ing on a properly defined volume can be applied.
2.3. Measuring similarity
In order to measure how similar two objects are, it is nec-
essary to compute distances between pairs of descriptors us-
ing a dissimilarity measure. Although the term similarity is
often used, dissimilarity corresponds to the notion of dis-
tance: small distances means small dissimilarity, and large
similarity.
A dissimilarity measure can be formalized by a func-
tion defined on pairs of descriptors indicating the degree
of their resemblance. Formally speaking, a dissimilarity
measure d on a set S is a non-negative valued function
d : S × S R
+
{0}. Function d may have some of
the following properties:
i. Identity: For all x S, d(x, x) = 0.
ii. Positivity: For all x 6= y in S, d(x, y) > 0.
iii. Symmetry: For all x, y S, d(x, y) = d(y, x).
iv. Triangle inequality:
For all x, y, z S, d(x, z) d(x, y) + d(y, z).
v. Transformation invariance: For a chosen transforma-
tion group G, for all x, y S, g G, d(g(x), g(y)) =
d(x, y).
The identity property says that a shape is completely
similar to itself, while the positivity property claims that dif-
ferent shapes are never completely similar. This property is
very strong for a high-level shape descriptor, and is often
not satisfied. However, this is not a severe drawback, if the
loss of uniqueness depends on negligible details.
Symmetry is not always wanted. Indeed, human percep-
tion does not always find that shape x is equally similar to
shape y, as y is to x. In particular, a variant x of prototype
y, is often found more similar to y then vice versa [81].
Dissimilarity measures for partial matching, giving a
small distance d(x, y) if a part of x matches a part of y,
do not obey the triangle inequality.
Transformation invariance has to be satisfied, if the com-
parison and the extraction process of shape descriptors have
to be independent of the place, orientation and scale of the
object in its Cartesian coordinate system. If we want that
a dissimilarity measure is not affected by any transforma-
tion on x, then we may use as alternative formulation for
(v): Transformation invariance: For a chosen transforma-
tion group G, for all x, y S, g G, d(g(x), y) = d(x, y).
When all the properties (i)-(iv) hold, the dissimilarity
measure is called a
metric
. Other combinations are possi-
ble: a pseudo-metric is a dissimilarity measure that obeys
(i), (iii) and (iv) while a semi-metric obeys only (i), (ii) and

(iii). If a dissimilarity measure is a pseudo-metric, the tri-
angle inequality can be applied to make retrieval more effi-
cient [7, 83].
2.4. Efficiency
For large shape collections, it is inefficient to sequen-
tially match all objects in the database with the query object.
Because retrieval should be fast, efficient indexing search
structures are needed to support efficient retrieval. Since for
query by example the shape descriptor is computed online,
it is reasonable to require that the shape descriptor compu-
tation is fast enough for interactive querying.
2.5. Discriminative power
A shape descriptor should capture properties that dis-
criminate objects well. However, the judgement of the sim-
ilarity of the shapes of two 3D objects is somewhat sub-
jective, depending on the user preference or the application
at hand. E.g. for solid modeling applications often topol-
ogy properties such as the numbers of holes in a model are
more important than minor differences in shapes. On the
contrary, if a user searches for models looking visually sim-
ilar the existence of a small hole in the model, may be of no
importance to the user.
2.6. Partial matching
In contrast to global shape matching, partial matching
finds a shape of which a part is similar to a part of another
shape. Partial matching can be applied if 3D shape mod-
els are not complete, e.g. for objects obtained by laser scan-
ning from one or two directions only. Another application
is the search for “3D scenes” containing an instance of the
query object. Also, this feature can potentially give the user
flexibility towards the matching problem, if parts of inter-
est of an object can be selected or weighted by the user.
2.7. Robustness
It is often desirable that a shape descriptor is insensitive
to noise and small extra features, and robust against arbi-
trary topological degeneracies, e.g. if it is obtained by laser
scanning. Also, if a model is given in multiple levels-of-
detail, representations of different levels should not differ
significantly from the original model.
2.8. Pose normalization
In the absence of prior knowledge, 3D models have ar-
bitrary scale, orientation and position in the 3D space. Be-
cause not all dissimilarity measures are invariant under ro-
tation and translation, it may be necessary to place the 3D
models into a canonical coordinate system. This should be
the same for a translated, rotated or scaled copy of the
model.
A natural choice is to first translate the center to the ori-
gin. For volume models it is natural to translate the cen-
ter of mass to the origin. But for meshes this is in gen-
Figure 2. Similar mugs oriented by principal axes in dif-
ferent ways [30].
eral not possible, because they have not to enclose a vol-
ume. For meshes it is an alternative to translate the cen-
ter of mass of all the faces to the origin. For example
the Principal Component Analysis (PCA) method computes
for each model the principal axes of inertia e
1
, e
2
and e
3
and their eigenvalues λ
1
, λ
2
and λ
3
, and make the nec-
essary conditions to get right-handed coordinate systems.
These principal axes define an orthogonal coordinate sys-
tem (e
1
, e
2
, e
3
), with λ
1
λ
2
λ
3
. Next, the polyhe-
dral model is rotated around the origin such that the co-
ordinate system (e
x
, e
y
, e
z
) coincides with the coordinate
system(e
1
, e
2
, e
3
).
The PCA algorithm for pose estimation is fairly simple
and efficient. However, if the eigenvalues are equal, prin-
cipal axes may switch, without affecting the eigenvalues.
Similar eigenvalues may imply an almost symmetrical mass
distribution around an axis (e.g. nearly cylindrical shapes)
or around the center of mass (e.g. nearly spherical shapes).
Fig. 2 illustrates the problem.
3. Shape matching methods
In this section we discuss 3D shape matching methods.
We divide shape matching methods in three broad cate-
gories: (1) feature based methods, (2) graph based meth-
ods and (3) other methods. Fig. 3 illustrates a more detailed
categorization of shape matching methods. Note, that the
classes of these methods are not completely disjoined. For
instance, a graph-based shape descriptor, in some way, de-
scribes also the global feature distribution. By this point of
view the taxonomy should be a graph.
3.1. Feature based methods
In the context of 3D shape matching, features denote ge-
ometric and topological properties of 3D shapes. So 3D
shapes can be discriminated by measuring and comparing
their features. Feature based methods can be divided into
four categories according to the type of shape features used:
(1) global features, (2) global feature distributions, (3) spa-
tial maps, and (4) local features. Feature based methods
from the first three categories represent features of a shape
using a single descriptor consisting of a d-dimensional vec-
tor of values, where the dimension d is fixed for all shapes.

Featurebased
Graphbased
Other
Globalfeatures
Spatialmap
Localfeatures
Globalfeature
distribution
Modelgraph
Skeleton
Reebgraph
Viewbased
Volumetricerrorbased
Weightedpointsetbased
Deformationbased
Figure 3. Taxonomy of shape matching methods.
The value of d can easily be a few hundred. The descriptor
of a shape is a point in a high dimensional space, and two
shapes are considered to be similar if they are close in this
space. Retrieving the k best matches for a 3D query model is
equivalent to solving the k nearest neighbors problem. Us-
ing the Euclidean distance, matching feature descriptors can
be done efficiently in practice by searching in multiple 1D
spaces to solve the approximate k nearest neighbor prob-
lem as shown by Indyk and Motwani [36]. In contrast with
the feature based methods from the first three categories, lo-
cal feature based methods describe for a number of surface
points the 3D shape around the point. For this purpose, for
each surface point a descriptor is used instead of a single de-
scriptor.
3.1.1. Global feature based similarity
Global features characterize the global shape of a 3D model.
Examples of these features are the statistical moments of the
boundary or the volume of the model, volume-to-surface ra-
tio, or the Fourier transform of the volume or the boundary
of the shape.
Zhang and Chen [88] describe methods to com-
pute global features such as volume, area, statistical mo-
ments, and Fourier transform coefficients efficiently.
Paquet et al. [67] apply bounding boxes, cords-based,
moments-based and wavelets-based descriptors for 3D
shape matching.
Corney et al. [21] introduce convex-hull based indices
like hull crumpliness (the ratio of the object surface area
and the surface area of its convex hull), hull packing (the
percentage of the convex hull volume not occupied by the
object), and hull compactness (the ratio of the cubed sur-
face area of the hull and the squared volume of the convex
hull).
Kazhdan et al. [42] describe a reflective symmetry de-
scriptor as a 2D function associating a measure of reflec-
tive symmetry to every plane (specified by 2 parameters)
through the model’s centroid. Every function value provides
a measure of global shape, where peaks correspond to the
planes near reflective symmetry, and valleys correspond to
the planes of near anti-symmetry. Their experimental results
show that the combination of the reflective symmetry de-
scriptor with existing methods provides better results.
Since only global features are used to characterize the
overall shape of the objects, these methods are not very dis-
criminative about object details, but their implementation is
straightforward. Therefore, these methods can be used as an
active filter, after which more detailed comparisons can be
made, or they can be used in combination with other meth-
ods to improve results.
Global feature methods are able to support user feed-
back as illustrated by the following research. Zhang and
Chen [89] applied features such as volume-surface ratio,
moment invariants and Fourier transform coefficients for
3D shape retrieval. They improve the retrieval performance
by an active learning phase in which a human annotator as-
signs attributes such as airplane, car, body, and so on to a
number of sample models. Elad et al. [28] use a moments-
based classifier and a weighted Euclidean distance measure.
Their method supports iterative and interactive database
searching where the user can improve the weights of the
distance measure by marking relevant search results.
3.1.2. Global feature distribution based similarity
The concept of global feature based similarity has been re-
fined recently by comparing distributions of global features
instead of the global features directly.
Osada et al. [66] introduce and compare shape distribu-
tions, which measure properties based on distance, angle,
area and volume measurements between random surface
points. They evaluate the similarity between the objects us-
ing a pseudo-metric that measures distances between distri-
butions. In their experiments the D2 shape distribution mea-
suring distances between random surface points is most ef-
fective.
Ohbuchi et al. [64] investigate shape histograms that are
discretely parameterized along the principal axes of inertia
of the model. The shape descriptor consists of three shape
histograms: (1) the moment of inertia about the axis, (2)
the average distance from the surface to the axis, and (3)
the variance of the distance from the surface to the axis.
Their experiments show that the axis-parameterized shape
features work only well for shapes having some form of ro-
tational symmetry.
Ip et al. [37] investigate the application of shape distri-
butions in the context of CAD and solid modeling. They re-
fined Osada’s D2 shape distribution function by classifying

2 random points as 1) IN distances if the line segment con-
necting the points lies complete inside the model, 2) OUT
distances if the line segment connecting the points lies com-
plete outside the model, 3) MIXED distances if the line seg-
ment connecting the points lies passes both inside and out-
side the model. Their dissimilarity measure is a weighted
distance measure comparing D2, IN, OUT and MIXED dis-
tributions. Since their method requires that a line segment
can be classified as lying inside or outside the model it is
required that the model defines a volume properly. There-
fore it can be applied to volume models, but not to polyg-
onal soups. Recently, Ip et al. [38] extend this approach
with a technique to automatically categorize a large model
database, given a categorization on a number of training ex-
amples from the database.
Ohbuchi et al. [63], investigate another extension of the
D2 shape distribution function, called the Absolute Angle-
Distance histogram, parameterized by a parameter denot-
ing the distance between two random points and by a pa-
rameter denoting the angle between the surfaces on which
two random points are located. The latter parameter is ac-
tually computed as an inner product of the surface normal
vectors. In their evaluation experiment this shape distribu-
tion function outperformed the D2 distribution function at
about 1.5 times higher computational costs. Ohbuchi et al.
[65] improved this method further by a multi-resolution ap-
proach computing a number of alpha-shapes at different
scales, and computing for each alpha-shape their Absolute
Angle-Distance descriptor. Their experimental results show
that this approach outperforms the Angle-Distance descrip-
tor at the cost of high processing time needed to compute
the alpha-shapes.
Shape distributions distinguish models in broad cate-
gories very well: aircraft, boats, people, animals, etc. How-
ever, they perform often poorly when having to discrimi-
nate between shapes that have similar gross shape proper-
ties but vastly different detailed shape properties.
3.1.3. Spatial map based similarity
Spatial maps are representations that capture the spatial lo-
cation of an object. The map entries correspond to physi-
cal locations or sections of the object, and are arranged in a
manner that preserves the relative positions of the features
in an object. Spatial maps are in general not invariant to ro-
tations, except for specially designed maps. Therefore, typ-
ically a pose normalization is done first.
Ankerst et al. [5] use shape histograms as a means of an-
alyzing the similarity of 3D molecular surfaces. The his-
tograms are not built from volume elements but from uni-
formly distributed surface points taken from the molecular
surfaces. The shape histograms are defined on concentric
shells and sectors around a model’s centroid and compare
shapes using a quadratic form distance measure to compare
the histograms taking into account the distances between
the shape histogram bins.
Vrani´c et al. [85] describe a surface by associating to
each ray from the origin, the value equal to the distance to
the last point of intersection of the model with the ray and
compute spherical harmonics for this spherical extent func-
tion. Spherical harmonics form a Fourier basis on a sphere
much like the familiar sine and cosine do on a line or a cir-
cle. Their method requires pose normalization to provide
rotational invariance. Also, Yu et al. [86] propose a descrip-
tor similar to a spherical extent function and a descriptor
counting the number of intersections of a ray from the ori-
gin with the model. In both cases the dissimilarity between
two shapes is computed by the Euclidean distance of the
Fourier transforms of the descriptors of the shapes. Their
method requirespose normalization to provide rotational in-
variance.
Kazhdan et al. [43] present a general approach based on
spherical harmonics to transform rotation dependent shape
descriptors into rotation independent ones. Their method is
applicable to a shape descriptor which is defined as either a
collection of spherical functions or as a function on a voxel
grid. In the latter case a collection of spherical functions is
obtained from the function on the voxel grid by restricting
the grid to concentric spheres. From the collection of spher-
ical functions they compute a rotation invariant descriptor
by (1) decomposing the function into its spherical harmon-
ics, (2) summing the harmonics within each frequency, and
computing the L
2
-norm for each frequency component. The
resulting shape descriptor is a 2D histogram indexed by ra-
dius and frequency, which is invariant to rotations about the
center of the mass. This approach offers an alternative for
pose normalization, because their method obtains rotation
invariant shape descriptors. Their experimental results show
indeed that in general the performance of the obtained ro-
tation independent shape descriptors is better than the cor-
responding normalized descriptors. Their experiments in-
clude the ray-based spherical harmonic descriptor proposed
by Vrani´c et al. [85]. Finally, note that their approach gen-
eralizes the method to compute voxel-based spherical har-
monics shape descriptor, described by Funkhouser et al.
[30], which is defined as a binary function on the voxel grid,
where the value at each voxel is given by the negatively ex-
ponentiated Euclidean Distance Transform of the surface of
a 3D model.
Novotni and Klein [61] present a method to compute
3D Zernike descriptors from voxelized models as natural
extensions of spherical harmonics based descriptors. 3D
Zernike descriptors capture object coherence in the radial
direction as well as in the direction along a sphere. Both
3D Zernike descriptors and spherical harmonics based de-
scriptors achieve rotation invariance. However, by sampling
the space only in radial direction the latter descriptors do

Citations
More filters
Journal ArticleDOI
TL;DR: This article uses multiscale diffusion heat kernels as “geometric words” to construct compact and informative shape descriptors by means of the “bag of features” approach, and shows that shapes can be efficiently represented as binary codes.
Abstract: The computer vision and pattern recognition communities have recently witnessed a surge of feature-based methods in object recognition and image retrieval applications. These methods allow representing images as collections of “visual words” and treat them using text search approaches following the “bag of features” paradigm. In this article, we explore analogous approaches in the 3D world applied to the problem of nonrigid shape retrieval in large databases. Using multiscale diffusion heat kernels as “geometric words,” we construct compact and informative shape descriptors by means of the “bag of features” approach. We also show that considering pairs of “geometric words” (“geometric expressions”) allows creating spatially sensitive bags of features with better discriminative power. Finally, adopting metric learning approaches, we show that shapes can be efficiently represented as binary codes. Our approach achieves state-of-the-art results on the SHREC 2010 large-scale shape retrieval benchmark.

894 citations

Journal ArticleDOI
TL;DR: This survey is motivated in part by recent developments in space–time registration, where one seeks a correspondence between non‐rigid and time‐varying surfaces, and semantic shape analysis, which underlines a recent trend to incorporate shape understanding into the analysis pipeline.
Abstract: We review methods designed to compute correspondences between geometric shapes represented by triangle meshes, contours or point sets. This survey is motivated in part by recent developments in space–time registration, where one seeks a correspondence between non-rigid and time-varying surfaces, and semantic shape analysis, which underlines a recent trend to incorporate shape understanding into the analysis pipeline. Establishing a meaningful correspondence between shapes is often difficult because it generally requires an understanding of the structure of the shapes at both the local and global levels, and sometimes the functionality of the shape parts as well. Despite its inherent complexity, shape correspondence is a recurrent problem and an essential component of numerous geometry processing applications. In this survey, we discuss the different forms of the correspondence problem and review the main solution methods, aided by several classification criteria arising from the problem definition. The main categories of classification are defined in terms of the input and output representation, objective function and solution approach. We conclude the survey by discussing open problems and future perspectives.

617 citations

Proceedings ArticleDOI
13 Jun 2010
TL;DR: A scale-invariant version of the heat kernel descriptor that can be used in the bag-of-features framework for shape retrieval in the presence of transformations such as isometric deformations, missing data, topological noise, and global and local scaling.
Abstract: One of the biggest challenges in non-rigid shape retrieval and comparison is the design of a shape descriptor that would maintain invariance under a wide class of transformations the shape can undergo. Recently, heat kernel signature was introduced as an intrinsic local shape descriptor based on diffusion scale-space analysis. In this paper, we develop a scale-invariant version of the heat kernel descriptor. Our construction is based on a logarithmically sampled scale-space in which shape scaling corresponds, up to a multiplicative constant, to a translation. This translation is undone using the magnitude of the Fourier transform. The proposed scale-invariant local descriptors can be used in the bag-of-features framework for shape retrieval in the presence of transformations such as isometric deformations, missing data, topological noise, and global and local scaling. We get significant performance improvement over state-of-the-art algorithms on recently established non-rigid shape retrieval benchmarks.

613 citations

Journal ArticleDOI
TL;DR: A thorough experimental evaluation vouches that SHOT outperforms state-of-the-art local descriptors in experiments addressing descriptor matching for object recognition, 3D reconstruction and shape retrieval.

602 citations

Journal ArticleDOI
TL;DR: A hypergraph analysis approach to address the problem of view-based 3-D object retrieval and recognition by avoiding the estimation of the distance between objects by constructing multiple hypergraphs based on their 2-D views.
Abstract: View-based 3-D object retrieval and recognition has become popular in practice, e.g., in computer aided design. It is difficult to precisely estimate the distance between two objects represented by multiple views. Thus, current view-based 3-D object retrieval and recognition methods may not perform well. In this paper, we propose a hypergraph analysis approach to address this problem by avoiding the estimation of the distance between objects. In particular, we construct multiple hypergraphs for a set of 3-D objects based on their 2-D views. In these hypergraphs, each vertex is an object, and each edge is a cluster of views. Therefore, an edge connects multiple vertices. We define the weight of each edge based on the similarities between any two views within the cluster. Retrieval and recognition are performed based on the hypergraphs. Therefore, our method can explore the higher order relationship among objects and does not use the distance between objects. We conduct experiments on the National Taiwan University 3-D model dataset and the ETH 3-D object collection. Experimental results demonstrate the effectiveness of the proposed method by comparing with the state-of-the-art methods.

573 citations

References
More filters
Book
01 Jan 1979
TL;DR: The second edition of a quarterly column as discussed by the authors provides a continuing update to the list of problems (NP-complete and harder) presented by M. R. Garey and myself in our book "Computers and Intractability: A Guide to the Theory of NP-Completeness,” W. H. Freeman & Co., San Francisco, 1979.
Abstract: This is the second edition of a quarterly column the purpose of which is to provide a continuing update to the list of problems (NP-complete and harder) presented by M. R. Garey and myself in our book ‘‘Computers and Intractability: A Guide to the Theory of NP-Completeness,’’ W. H. Freeman & Co., San Francisco, 1979 (hereinafter referred to as ‘‘[G&J]’’; previous columns will be referred to by their dates). A background equivalent to that provided by [G&J] is assumed. Readers having results they would like mentioned (NP-hardness, PSPACE-hardness, polynomial-time-solvability, etc.), or open problems they would like publicized, should send them to David S. Johnson, Room 2C355, Bell Laboratories, Murray Hill, NJ 07974, including details, or at least sketches, of any new proofs (full papers are preferred). In the case of unpublished results, please state explicitly that you would like the results mentioned in the column. Comments and corrections are also welcome. For more details on the nature of the column and the form of desired submissions, see the December 1981 issue of this journal.

40,020 citations

Journal ArticleDOI
TL;DR: The goals of the PDB are described, the systems in place for data deposition and access, how to obtain further information and plans for the future development of the resource are described.
Abstract: The Protein Data Bank (PDB; http://www.rcsb.org/pdb/ ) is the single worldwide archive of structural data of biological macromolecules. This paper describes the goals of the PDB, the systems in place for data deposition and access, how to obtain further information, and near-term plans for the future development of the resource.

34,239 citations

Journal ArticleDOI
TL;DR: The metric and dimensional assumptions that underlie the geometric representation of similarity are questioned on both theoretical and empirical grounds and a set of qualitative assumptions are shown to imply the contrast model, which expresses the similarity between objects as a linear combination of the measures of their common and distinctive features.
Abstract: The metric and dimensional assumptions that underlie the geometric representation of similarity are questioned on both theoretical and empirical grounds. A new set-theoretical approach to similarity is developed in which objects are represented as collections of features, and similarity is described as a feature-matching process. Specifically, a set of qualitative assumptions is shown to imply the contrast model, which expresses the similarity between objects as a linear combination of the measures of their common and distinctive features. Several predictions of the contrast model are tested in studies of similarity with both semantic and perceptual stimuli. The model is used to uncover, analyze, and explain a variety of empirical phenomena such as the role of common and distinctive features, the relations between judgments of similarity and difference, the presence of asymmetric similarities, and the effects of context on judgments of similarity. The contrast model generalizes standard representations of similarity data in terms of clusters and trees. It is also used to analyze the relations of prototypicality and family resemblance

7,251 citations

Journal ArticleDOI
TL;DR: This paper presents work on computing shape models that are computationally fast and invariant basic transformations like translation, scaling and rotation, and proposes shape detection using a feature called shape context, which is descriptive of the shape of the object.
Abstract: We present a novel approach to measuring similarity between shapes and exploit it for object recognition. In our framework, the measurement of similarity is preceded by: (1) solving for correspondences between points on the two shapes; (2) using the correspondences to estimate an aligning transform. In order to solve the correspondence problem, we attach a descriptor, the shape context, to each point. The shape context at a reference point captures the distribution of the remaining points relative to it, thus offering a globally discriminative characterization. Corresponding points on two similar shapes will have similar shape contexts, enabling us to solve for correspondences as an optimal assignment problem. Given the point correspondences, we estimate the transformation that best aligns the two shapes; regularized thin-plate splines provide a flexible class of transformation maps for this purpose. The dissimilarity between the two shapes is computed as a sum of matching errors between corresponding points, together with a term measuring the magnitude of the aligning transform. We treat recognition in a nearest-neighbor classification framework as the problem of finding the stored prototype shape that is maximally similar to that in the image. Results are presented for silhouettes, trademarks, handwritten digits, and the COIL data set.

6,693 citations

Proceedings ArticleDOI
23 May 1998
TL;DR: In this paper, the authors present two algorithms for the approximate nearest neighbor problem in high-dimensional spaces, for data sets of size n living in R d, which require space that is only polynomial in n and d.
Abstract: We present two algorithms for the approximate nearest neighbor problem in high-dimensional spaces. For data sets of size n living in R d , the algorithms require space that is only polynomial in n and d, while achieving query times that are sub-linear in n and polynomial in d. We also show applications to other high-dimensional geometric problems, such as the approximate minimum spanning tree. The article is based on the material from the authors' STOC'98 and FOCS'01 papers. It unifies, generalizes and simplifies the results from those papers.

4,478 citations

Frequently Asked Questions (16)
Q1. What are the contributions mentioned in the paper "A survey of content based 3d shape retrieval methods" ?

This paper surveys the literature on methods for content based 3D retrieval, taking into account the applicability to surface models as well as to volume models. Finally, the advantages and limits of the several approaches in content based 3D shape retrieval are discussed. 

In this section the authors summarize their discussion on shape matching methods from the previous section and indicate directions for further research. Further research is needed to compare both approaches using the same benchmarks and the best PCA method. Because a feature vector is a point in a fixed d-dimensional space, two models can be compared fast by computing their distance in this space. Also, indexing is straightforward and retrieval can be implemented efficiently by nearest neighbour search. 

The most dominant solid modeling representation methods are Boundary Representation (B-rep) and Constructive Solid Geometry (CSG). 

From the collection of spherical functions they compute a rotation invariant descriptor by (1) decomposing the function into its spherical harmonics, (2) summing the harmonics within each frequency, and computing the L2-norm for each frequency component. 

Because not all dissimilarity measures are invariant under rotation and translation, it may be necessary to place the 3D models into a canonical coordinate system. 

The shape histograms are defined on concentric shells and sectors around a model’s centroid and compare shapes using a quadratic form distance measure to comparethe histograms taking into account the distances between the shape histogram bins. 

They improve the retrieval performance by an active learning phase in which a human annotator assigns attributes such as airplane, car, body, and so on to a number of sample models. 

In order to measure how similar two objects are, it is necessary to compute distances between pairs of descriptors using a dissimilarity measure. 

In the preprocessing phase a descriptor of each 3D model is obtained by 13 thumbnail images of boundary contours of the 3D object as seen from 13 orthographic view directions. 

Graph based methods can be divided into three broad categories according to the type of graph used: (1) model graphs, (2) Reeb graphs, and (3) skeletons. 

Since a topological signature vector has a fixed size, the size of its shape descriptor is a constant multiplied by the number of nodes of the skeleton. 

Compared to the methods from the first three categories, local feature methods, which compute feature value vectors for a number of surface points, matching is less efficient, efficient indexing is not straightforward, and the obtained dissimilarity measure is not obey the triangle inequality. 

note that their approach generalizes the method to compute voxel-based spherical harmonics shape descriptor, described by Funkhouser et al. [30], which is defined as a binary function on the voxel grid, where the value at each voxel is given by the negatively exponentiated Euclidean Distance Transform of the surface of a 3D model. 

Due to the lack of publicly available benchmarks in the past, it was not possible to compare the shape matching results obtained by different researchers. 

Three approaches can be distinguished to provide a query object: (1) browsing to select a new query object from the obtained results, (2) a direct query by providing a query descriptor, (3) query by example by providing an existing 3D model or by creating a 3D shape query from scratch using a 3D tool or sketching 2D projections of the 3D model. 

Dey et al. [24] present a method to obtain a descriptor of a shape, given by a point sample, by first decomposing the shape into its components.