Journal Article•DOI•

A survey of content based 3D shape retrieval methods

Johan W. H. Tangelder, Remco C. Veltkamp¹•Institutions (1)

01 Sep 2008-Multimedia Tools and Applications (Springer US)-Vol. 39, Iss: 3, pp 441-471

TL;DR: A survey of content-based 3D shape retrieval methods can be found in this article, where the authors evaluate the suitability of several requirements of content based shape retrieval, such as shape representation requirements, properties of dissimilarity measures, efficiency, discrimination abilities, ability to perform partial matching, robustness, and necessity of pose normalization.

read less

Abstract: Recent developments in techniques for modeling, digitizing and visualizing 3D shapes has led to an explosion in the number of available 3D models on the Internet and in domain-specific databases. This has led to the development of 3D shape retrieval systems that, given a query object, retrieve similar 3D objects. For visualization, 3D shapes are often represented as a surface, in particular polygonal meshes, for example in VRML format. Often these models contain holes, intersecting polygons, are not manifold, and do not enclose a volume unambiguously. On the contrary, 3D volume models, such as solid models produced by CAD systems, or voxels models, enclose a volume properly. This paper surveys the literature on methods for content based 3D retrieval, taking into account the applicability to surface models as well as to volume models. The methods are evaluated with respect to several requirements of content based 3D shape retrieval, such as: (1) shape representation requirements, (2) properties of dissimilarity measures, (3) efficiency, (4) discrimination abilities, (5) ability to perform partial matching, (6) robustness, and (7) necessity of pose normalization. Finally, the advantages and limitations of the several approaches in content based 3D shape retrieval are discussed.

...read moreread less

Summary (6 min read)

1. Introduction

The advancement of modeling, digitizing and visualizing techniques for 3D shapes has led to an increasing amount of 3D models, both on the Internet and in domain-specific databases.
Unlike text documents, 3D models are not easily retrieved.
In contrast, content based 3D shape retrieval methods, that use shape properties of the 3D models to search for similar models, work better than text based methods [58].
Matching and indexing are often part of the retrieval process.

2.1. 3D shape retrieval framework

1 consists of a database with an index structure created offline and an online query engine.
Each 3D model has to be identified with a shape descriptor, providing a compact overall description of the shape.
To efficiently search a large collection online, an indexing data structure and searching algorithm should be available.
The similarity between two descriptors is quantified by a dissimilarity measure.
Finally, the retrieved models can be visualized.

2.2. Shape representations

An important issue is the type of shape representation(s) that a shape retrieval system accepts.
Most of the 3D models found on the World Wide Web are meshes defined in a file format supporting visual appearance.
In particular, they are represented by “polygon soups”, consisting of unorganized sets of polygons.
Also, in general these models are not “watertight” meshes, i.e. they do not enclose a volume.
By contrast, for volume models retrieval methods depending on a properly defined volume can be applied.

2.3. Measuring similarity

In order to measure how similar two objects are, it is necessary to compute distances between pairs of descriptors using a dissimilarity measure.
This property is very strong for a high-level shape descriptor, and is often not satisfied.
This is not a severe drawback, if the loss of uniqueness depends on negligible details.
When all the properties (i)-(iv) hold, the dissimilarity measure is called a metric.
If a dissimilarity measure is a pseudo-metric, the triangle inequality can be applied to make retrieval more efficient [7, 83].

2.4. Efficiency

For large shape collections, it is inefficient to sequentially match all objects in the database with the query object.
Because retrieval should be fast, efficient indexing search structures are needed to support efficient retrieval.
Since for query by example the shape descriptor is computed online, it is reasonable to require that the shape descriptor computation is fast enough for interactive querying.

2.5. Discriminative power

A shape descriptor should capture properties that discriminate objects well.
The judgement of the similarity of the shapes of two 3D objects is somewhat subjective, depending on the user preference or the application at hand.
E.g. for solid modeling applications often topology properties such as the numbers of holes in a model are more important than minor differences in shapes.
On the contrary, if a user searches for models looking visually similar the existence of a small hole in the model, may be of no importance to the user.

2.6. Partial matching

In contrast to global shape matching, partial matching finds a shape of which a part is similar to a part of another shape.
Partial matching can be applied if 3D shape models are not complete, e.g. for objects obtained by laser scanning from one or two directions only.
Another application is the search for “3D scenes” containing an instance of the query object.
Also, this feature can potentially give the user flexibility towards the matching problem, if parts of interest of an object can be selected or weighted by the user.

2.7. Robustness

It is often desirable that a shape descriptor is insensitive to noise and small extra features, and robust against arbitrary topological degeneracies, e.g. if it is obtained by laser scanning.
Also, if a model is given in multiple levels-ofdetail, representations of different levels should not differ significantly from the original model.

2.8. Pose normalization

In the absence of prior knowledge, 3D models have arbitrary scale, orientation and position in the 3D space.
Because not all dissimilarity measures are invariant under rotation and translation, it may be necessary to place the 3D models into a canonical coordinate system.
For volume models it is natural to translate the center of mass to the origin.
Eral not possible, because they have not to enclose a volume.
Similar eigenvalues may imply an almost symmetrical mass distribution around an axis (e.g. nearly cylindrical shapes) or around the center of mass (e.g. nearly spherical shapes).

3. Shape matching methods

In this section the authors discuss 3D shape matching methods.
The authors divide shape matching methods in three broad categories: (1) feature based methods, (2) graph based methods and (3) other methods.
Note, that the classes of these methods are not completely disjoined.
A graph-based shape descriptor, in some way, describes also the global feature distribution.
By this point of view the taxonomy should be a graph.

3.1. Feature based methods

In the context of 3D shape matching, features denote geometric and topological properties of 3D shapes.
Feature based methods can be divided into four categories according to the type of shape features used: (1) global features, (2) global feature distributions, (3) spatial maps, and (4) local features.
Feature based Graph based Other Global features Spatial map Local features Global feature distribution Model graph Skeleton Reeb graph View based Volumetric error based Weighted point set based Deformation based Figure 3.
Retrieving the k best matches for a 3D query model is equivalent to solving the k nearest neighbors problem.
For this purpose, for each surface point a descriptor is used instead of a single descriptor.

3.1.1. Global feature based similarity

Global features characterize the global shape of a 3D model.
Zhang and Chen [88] describe methods to compute global features such as volume, area, statistical moments, and Fourier transform coefficients efficiently.
Paquet et al. [67] apply bounding boxes, cords-based, moments-based and wavelets-based descriptors for 3D shape matching.
Kazhdan et al. [42] describe a reflective symmetry descriptor as a 2D function associating a measure of reflective symmetry to every plane (specified by 2 parameters) through the model’s centroid.
They improve the retrieval performance by an active learning phase in which a human annotator assigns attributes such as airplane, car, body, and so on to a number of sample models.

3.1.2. Global feature distribution based similarity

The concept of global feature based similarity has been refined recently by comparing distributions of global features instead of the global features directly.
Osada et al. [66] introduce and compare shape distributions, which measure properties based on distance, angle, area and volume measurements between random surface points.
They evaluate the similarity between the objects using a pseudo-metric that measures distances between distributions.
Since their method requires that a line segment can be classified as lying inside or outside the model it is required that the model defines a volume properly.
Ohbuchi et al. [65] improved this method further by a multi-resolution approach computing a number of alpha-shapes at different scales, and computing for each alpha-shape their Absolute Angle-Distance descriptor.

3.1.3. Spatial map based similarity

Spatial maps are representations that capture the spatial location of an object.
The shape histograms are defined on concentric shells and sectors around a model’s centroid and compare shapes using a quadratic form distance measure to compare the histograms taking into account the distances between the shape histogram bins.
Their method requires pose normalization to provide rotational invariance.
Therefore, he compares his ray-based spherical harmonic method [85] and a variation of it using functions defined on concentric shells with the voxel-based spherical harmonics shape descriptor proposed by Funkhouser et al. [30].
Their experimental results show that the eigenvalue method and the cover sequence method outperform the volume and solid-angle feature method.

3.1.4. Local feature based similarity

Local feature based methods provide various approaches to take into account the surface shape in the neighbourhood of points on the boundary of the shape.
Unfortunately, the method is limited to objects which contain no holes, i.e. have genus zero.
Zaharia and Prêteux [87] describe the 3D Shape Spectrum Descriptor, which is defined as the histogram of shape index values, calculated over an entire mesh.
They apply spin images to recognize models in a cluttered 3D scene.
In the global matching stage, correspondences between similar sample points on the two shapes are found.

3.2. Graph based methods

In general, the feature based methods discussed in the previous section take into account only the pure geometry of the shape.
Graph based methods can be divided into three broad categories according to the type of graph used: (1) model graphs, (2) Reeb graphs, and (3) skeletons.
Efficient computation of existing graph metrics for general graphs is not possible: computing the edit distance is NP-hard [90] and computing the maximal common subgraph [32] is even NP-complete.
Polynomial solutions can be obtained for directed acyclic graphs such as shock graphs.
It is obtained by exhaustively searching for the optimal deformation path between two 2D shapes, and using the cost of this path as a distance between two shapes.

3.2.1. Model graph based similarity

Model graph based similarity methods are applicable to 3D solid models as produced by CAD most systems.
The most dominant solid modeling representation methods are Boundary Representation (B-rep) and Constructive Solid Geometry (CSG).
By contrast to the facets in meshes, the faces of a B-rep may be represented as freeform surfaces.
McWerther et al. [53, 54, 55], and El-Mehalawi and Miller [26, 27] apply model signature graphs, that both model the topology of a shape model by a graph structure, and map a number of engineering features to a highdimensional feature vector.
Therefore, McWerther et al. [53, 54, 55] apply approximate graph comparison using the spectrum of the graph.

3.2.2. Skeleton based similarity

Sundar et al. [76] use as a shape descriptor a skeletal graph that encodes geometric and topological information.
This topological signature vector is defined recursively over the subgraphs of the node using eigenvalues of their adjacency matrices.
Sundar et al. [76] match two shapes by approximate comparison of their hierarchical skeletal graphs using Figure 5.
They obtain a skeletal graph by a thinning algorithm iteratively eroding voxels until a one-voxel width skeleton is left.
Their results show the feasibility of their approach for relatively small volume models.

3.2.3. Reeb graph based similarity

Biasotti et al. [15] compare Reeb graphs obtained by using different quotient functions f and highlight how the choice of f determines the final matching result.
Their method uses Reeb graphs based on a quotient function defined by an integral geodesic distance.
Since for solid models topological insensitivity is important, they conclude that the Reeb graph technique requires some improvements.
Bespalov et al. [13] present preliminary research on a modification of Hilaga’s method, which computes a scale-space decomposition of a shape, represented as a rooted undirected tree instead of a Reeb graph.
In summary, Reeb graphs defined by a geodesic distance are suited for matching articulated objects, but they are sensitive to topological changes.

3.3.1. View based similarity

The main idea of view based similarity methods is that two 3D models are similar, if they look similar from all viewing angles.
Löffler [49] applies view based similarity to retrieve 3D models using a 2D query interface.
The number of views of each object is kept small by clustering views, and by representing each cluster with one view, which is represented by a shock graph.
Therefore, a lightfield descriptor is introduced, which compares ten silhouettes of the 3D shape obtained from ten viewing angles distributed evenly on the viewing sphere.
The running time of the retrieval process is reduced by a clever multi-step approach supporting early rejection of non-relevant models.

3.3.2. Volumetric error based similarity

Novotni and Klein [60] describe a geometry similarity approach to 3D shape matching based on calculating a volumetric error between one object and a sequence of offset hulls of the other object.
A drawback of their method is that their dissimilarity measure is not symmetric and does not obey the triangle inequality.
Sánchez-Cruz and Bribiesca present a method [69] relating the volumetric error between two voxelized shapes to a transportation distance measuring how many voxels have to move and how far to change one shape into another.
Since in general these voxelized shapes will have many voxels, the computation of this transportation distance will be high.

3.3.3. Weighted point set based similarity

Another approach is based on shape descriptors consisting of weighted 3D points.
They match weighted point sets by a measure which does not obey the triangle inequality.
Tangelder and Veltkamp [78] use as shape descriptor a weighted point sets consisting of points with a high curvature value.
They compare weighted point sets using a variant of the Earth Mover’s distance, the proportional transportation distance, which obeys the triangle inequality [33].
They utilize this multiresolution approximation to implement an algorithm to simultaneously align and compare two shapes.

3.3.4. Deformation based similarity

A number of methods [20, 8] compare a pair of 2D shapes by measuring the amount of deformation required to register the shapes exactly.
These methods depend on the natural arc length parameterization of their contours, which is not straightforwardly generalized to 3D.
As a result, methods that apply deformation for shape recovery [79] or shape evolution [23] are very difficult to apply for 3D shape matching.

4. Overview and conclusions

In this section the authors summarize their discussion on shape matching methods from the previous section and indicate directions for further research.
Hence, these methods are less robust than feature based methods.
The Princeton Shape Benchmark database contains 1,834 3D models downloaded from the web, subdivided into a training set and a test set, containing 907 models each, classified into 90 and 92 classes respectively.
The vantage method [83] can be applied to compute an efficient index structure for pseudo-metrics that require much computing time, also known as Efficient indexing.
Since the capabilities of feature based methods (fast computation, pseudo-metric, discriminative abilities, robustness) are orthogonal to the capabilities of graph based methods (partial matching, no normalization required), combining different approaches may produce more powerful shape matching methods.

Did you find this useful? Give us your feedback

Content maybe subject to copyright Report

A Survey of Content Based 3D Shape Retrieval Methods

Johan W.H. Tangelder and Remco C. Veltkamp

Institute of Information and Computing Sciences, Utrecht University

hanst@cs.uu.nl, Remco.Veltkamp@cs.uu.nl

Abstract

Recent developments in techniques for modeling, digitiz-

ing and visualizing 3D shapes has led to an explosion in

the number of available 3D models on the Internet and in

domain-speciﬁc databases. This has led to the development

of 3D shape retrieval systems that, given a query object,

retrieve similar 3D objects. For visualization, 3D shapes

are often represented as a surface, in particular polygo-

nal meshes, for example in VRML format. Often these mod-

els contain holes, intersecting polygons, are not manifold,

and do not enclose a volume unambiguously. On the con-

trary, 3D volume models, such as solid models produced

by CAD systems, or voxels models, enclose a volume prop-

erly. This paper surveys the literature on methods for con-

tent based 3D retrieval, taking into account the applicabil-

ity to surface models as well as to volume models. The meth-

ods are evaluated with respect to several requirements of

content based 3D shape retrieval, such as: (1) shape repre-

sentation requirements, (2) properties of dissimilarity mea-

sures, (3) efﬁciency, (4) discrimination abilities, (5) ability

to perform partial matching, (6) robustness, and (7) neces-

sity of pose normalization. Finally, the advantages and lim-

its of the several approaches in content based 3D shape re-

trieval are discussed.

1. Introduction

The advancementof modeling, digitizing and visualizing

techniques for 3D shapes has led to an increasing amount

of 3D models, both on the Internet and in domain-speciﬁc

databases. This has led to the development of the ﬁrst exper-

imental search engines for 3D shapes, such as the 3D model

search engine at Princeton university [2, 57], the 3D model

retrieval system at the National Taiwan University [1, 17],

the Ogden IV system at the National Institute of Multimedia

Education, Japan [62, 77], the 3D retrieval engine at Utrecht

University [4, 78], and the 3D model similarity search en-

gine at the University of Konstanz [3, 84].

Laser scanning has been applied to obtain archives

recording cultural heritage like the Digital Michelan-

gelo Project [25, 48], and the Stanford Digital Formae

Urbis Romae Project [75]. Furthermore, archives contain-

ing domain-speciﬁc shape models are now accessible by

the Internet. Examples are the National Design Repos-

itory, an online repository of CAD models [59, 68],

and the Protein Data Bank, an online archive of struc-

tural data of biological macromolecules [10, 80].

Unlike text documents, 3D models are not easily re-

trieved. Attempting to ﬁnd a 3D model using textual an-

notation and a conventional text-based search engine would

not work in many cases. The annotations added by human

beings depend on language, culture, age, sex, and other fac-

tors. They may be too limited or ambiguous. In contrast,

content based 3D shape retrieval methods, that use shape

properties of the 3D models to search for similar models,

work better than text based methods [58].

Matching is the process of determining how similar two

shapes are. This is often done by computing a distance. A

complementary process is indexing. In this paper, indexing

is understood as the process of building a datastructure to

speed up the search. Note that the term indexing is also of-

ten used for the identiﬁcation of features in models, or mul-

timedia documents in general. Retrieval is the process of

searching and delivering the query results. Matching and in-

dexing are often part of the retrieval process.

Recently, a lot of researchers have investigated the spe-

ciﬁc problem of content based 3D shape retrieval. Also, an

extensive amount of literature can be found in the related

ﬁelds of computer vision, object recognition and geomet-

ric modelling. Survey papers to this literature have been

provided by Besl and Jain [11], Loncaric [50] and Camp-

bell and Flynn [16]. For an overview of 2D shape match-

ing methods we refer the reader to the paper by Veltkamp

[82]. Unfortunately, most 2D methods do not generalize di-

rectly to 3D model matching. Work in progress by Iyer et

al. [40] provides an extensive overview of 3D shape search-

ing techniques. Atmosukarto and Naval [6] describe a num-

ber of 3D model retrieval systems and methods, but do not

provide a categorization and evaluation.

In contrast, this paper evaluates 3D shape retrieval meth-

ods with respect to several requirements on content based

3D shape retrieval, such as: (1) shape representation re-

quirements, (2) properties of dissimilarity measures, (3) ef-

ﬁciency, (4) discrimination abilities, (5) ability to perform

partial matching, (6) robustness, and (7) necessity of pose

3Dmodel

database

descriptors

index

structure

query

sketch

result

query

descriptor

fetching

descriptor

extraction

descriptor

extraction

matching

index

construction

visualization

query

formulation

browsing

queryby

example

directquery

model

id’s

offline

online

Figure 1. Conceptual framework for shape retrieval.

normalization. In section 2 we discuss several aspects of 3D

shape retrieval. The literature on 3D shape matching meth-

ods is discussed in section 3 and evaluated in section 4.

2. 3D shape retrieval aspects

In this section we discuss several issues related to 3D

shape retrieval.

2.1. 3D shape retrieval framework

At a conceptual level, a typical 3D shape retrieval frame-

work as illustrated by ﬁg. 1 consists of a database with an

index structure created

ofﬂine

and an

online

query engine.

Each 3D model has to be identiﬁed with a shape descrip-

tor, providing a compact overall description of the shape.

To efﬁciently search a large collection online, an indexing

data structure and searching algorithm should be available.

The

online

query engine computes the query descriptor, and

models similar to the query model are retrieved by match-

ing descriptors to the query descriptor from the index struc-

ture of the database. The similarity between two descriptors

is quantiﬁed by a dissimilarity measure. Three approaches

can be distinguished to provide a query object: (1) browsing

to select a new query object from the obtained results, (2)

a direct query by providing a query descriptor, (3) query by

example by providing an existing 3D model or by creating

a 3D shape query from scratch using a 3D tool or sketch-

ing 2D projections of the 3D model. Finally, the retrieved

models can be visualized.

2.2. Shape representations

An important issue is the type of shape representation(s)

that a shape retrievalsystem accepts. Most of the 3D models

found on the World Wide Web are meshes deﬁned in a ﬁle

format supporting visual appearance. Currently, the most

common format used for this purpose is the Virtual Real-

ity Modeling Language (VRML) format. Since these mod-

els have been designed for visualization, they often contain

only geometry and appearance attributes. In particular, they

are represented by “polygon soups”, consisting of unorga-

nized sets of polygons. Also, in general these models are

not “watertight” meshes, i.e. they do not enclose a volume.

By contrast, for volume models retrieval methods depend-

ing on a properly deﬁned volume can be applied.

2.3. Measuring similarity

In order to measure how similar two objects are, it is nec-

essary to compute distances between pairs of descriptors us-

ing a dissimilarity measure. Although the term similarity is

often used, dissimilarity corresponds to the notion of dis-

tance: small distances means small dissimilarity, and large

similarity.

A dissimilarity measure can be formalized by a func-

tion deﬁned on pairs of descriptors indicating the degree

of their resemblance. Formally speaking, a dissimilarity

measure d on a set S is a non-negative valued function

d : S × S → R

∪ {0}. Function d may have some of

the following properties:

i. Identity: For all x ∈ S, d(x, x) = 0.

ii. Positivity: For all x 6= y in S, d(x, y) > 0.

iii. Symmetry: For all x, y ∈ S, d(x, y) = d(y, x).

iv. Triangle inequality:

For all x, y, z ∈ S, d(x, z) ≤ d(x, y) + d(y, z).

v. Transformation invariance: For a chosen transforma-

tion group G, for all x, y ∈ S, g ∈ G, d(g(x), g(y)) =

d(x, y).

The identity property says that a shape is completely

similar to itself, while the positivity property claims that dif-

ferent shapes are never completely similar. This property is

very strong for a high-level shape descriptor, and is often

not satisﬁed. However, this is not a severe drawback, if the

loss of uniqueness depends on negligible details.

Symmetry is not always wanted. Indeed, human percep-

tion does not always ﬁnd that shape x is equally similar to

shape y, as y is to x. In particular, a variant x of prototype

y, is often found more similar to y then vice versa [81].

Dissimilarity measures for partial matching, giving a

small distance d(x, y) if a part of x matches a part of y,

do not obey the triangle inequality.

Transformation invariance has to be satisﬁed, if the com-

parison and the extraction process of shape descriptors have

to be independent of the place, orientation and scale of the

object in its Cartesian coordinate system. If we want that

a dissimilarity measure is not affected by any transforma-

tion on x, then we may use as alternative formulation for

(v): Transformation invariance: For a chosen transforma-

tion group G, for all x, y ∈ S, g ∈ G, d(g(x), y) = d(x, y).

When all the properties (i)-(iv) hold, the dissimilarity

measure is called a

metric

. Other combinations are possi-

ble: a pseudo-metric is a dissimilarity measure that obeys

(i), (iii) and (iv) while a semi-metric obeys only (i), (ii) and

(iii). If a dissimilarity measure is a pseudo-metric, the tri-

angle inequality can be applied to make retrieval more efﬁ-

cient [7, 83].

2.4. Efﬁciency

For large shape collections, it is inefﬁcient to sequen-

tially match all objects in the database with the query object.

Because retrieval should be fast, efﬁcient indexing search

structures are needed to support efﬁcient retrieval. Since for

query by example the shape descriptor is computed online,

it is reasonable to require that the shape descriptor compu-

tation is fast enough for interactive querying.

2.5. Discriminative power

A shape descriptor should capture properties that dis-

criminate objects well. However, the judgement of the sim-

ilarity of the shapes of two 3D objects is somewhat sub-

jective, depending on the user preference or the application

at hand. E.g. for solid modeling applications often topol-

ogy properties such as the numbers of holes in a model are

more important than minor differences in shapes. On the

contrary, if a user searches for models looking visually sim-

ilar the existence of a small hole in the model, may be of no

importance to the user.

2.6. Partial matching

In contrast to global shape matching, partial matching

ﬁnds a shape of which a part is similar to a part of another

shape. Partial matching can be applied if 3D shape mod-

els are not complete, e.g. for objects obtained by laser scan-

ning from one or two directions only. Another application

is the search for “3D scenes” containing an instance of the

query object. Also, this feature can potentially give the user

ﬂexibility towards the matching problem, if parts of inter-

est of an object can be selected or weighted by the user.

2.7. Robustness

It is often desirable that a shape descriptor is insensitive

to noise and small extra features, and robust against arbi-

trary topological degeneracies, e.g. if it is obtained by laser

scanning. Also, if a model is given in multiple levels-of-

detail, representations of different levels should not differ

signiﬁcantly from the original model.

2.8. Pose normalization

In the absence of prior knowledge, 3D models have ar-

bitrary scale, orientation and position in the 3D space. Be-

cause not all dissimilarity measures are invariant under ro-

tation and translation, it may be necessary to place the 3D

models into a canonical coordinate system. This should be

the same for a translated, rotated or scaled copy of the

model.

A natural choice is to ﬁrst translate the center to the ori-

gin. For volume models it is natural to translate the cen-

ter of mass to the origin. But for meshes this is in gen-

Figure 2. Similar mugs oriented by principal axes in dif-

ferent ways [30].

eral not possible, because they have not to enclose a vol-

ume. For meshes it is an alternative to translate the cen-

ter of mass of all the faces to the origin. For example

the Principal Component Analysis (PCA) method computes

for each model the principal axes of inertia e

, e

and e

and their eigenvalues λ

, λ

and λ

, and make the nec-

essary conditions to get right-handed coordinate systems.

These principal axes deﬁne an orthogonal coordinate sys-

tem (e

, e

), with λ

≥ λ

. Next, the polyhe-

dral model is rotated around the origin such that the co-

ordinate system (e

, e

) coincides with the coordinate

system(e

, e

The PCA algorithm for pose estimation is fairly simple

and efﬁcient. However, if the eigenvalues are equal, prin-

cipal axes may switch, without affecting the eigenvalues.

Similar eigenvalues may imply an almost symmetrical mass

distribution around an axis (e.g. nearly cylindrical shapes)

or around the center of mass (e.g. nearly spherical shapes).

Fig. 2 illustrates the problem.

3. Shape matching methods

In this section we discuss 3D shape matching methods.

We divide shape matching methods in three broad cate-

gories: (1) feature based methods, (2) graph based meth-

ods and (3) other methods. Fig. 3 illustrates a more detailed

categorization of shape matching methods. Note, that the

classes of these methods are not completely disjoined. For

instance, a graph-based shape descriptor, in some way, de-

scribes also the global feature distribution. By this point of

view the taxonomy should be a graph.

3.1. Feature based methods

In the context of 3D shape matching, features denote ge-

ometric and topological properties of 3D shapes. So 3D

shapes can be discriminated by measuring and comparing

their features. Feature based methods can be divided into

four categories according to the type of shape features used:

(1) global features, (2) global feature distributions, (3) spa-

tial maps, and (4) local features. Feature based methods

from the ﬁrst three categories represent features of a shape

using a single descriptor consisting of a d-dimensional vec-

tor of values, where the dimension d is ﬁxed for all shapes.

Featurebased

Graphbased

Other

Globalfeatures

Spatialmap

Localfeatures

Globalfeature

distribution

Modelgraph

Skeleton

Reebgraph

Viewbased

Volumetricerrorbased

Weightedpointsetbased

Deformationbased

Figure 3. Taxonomy of shape matching methods.

The value of d can easily be a few hundred. The descriptor

of a shape is a point in a high dimensional space, and two

shapes are considered to be similar if they are close in this

space. Retrieving the k best matches for a 3D query model is

equivalent to solving the k nearest neighbors problem. Us-

ing the Euclidean distance, matching feature descriptors can

be done efﬁciently in practice by searching in multiple 1D

spaces to solve the approximate k nearest neighbor prob-

lem as shown by Indyk and Motwani [36]. In contrast with

the feature based methods from the ﬁrst three categories, lo-

cal feature based methods describe for a number of surface

points the 3D shape around the point. For this purpose, for

each surface point a descriptor is used instead of a single de-

scriptor.

3.1.1. Global feature based similarity

Global features characterize the global shape of a 3D model.

Examples of these features are the statistical moments of the

boundary or the volume of the model, volume-to-surface ra-

tio, or the Fourier transform of the volume or the boundary

of the shape.

Zhang and Chen [88] describe methods to com-

pute global features such as volume, area, statistical mo-

ments, and Fourier transform coefﬁcients efﬁciently.

Paquet et al. [67] apply bounding boxes, cords-based,

moments-based and wavelets-based descriptors for 3D

shape matching.

Corney et al. [21] introduce convex-hull based indices

like hull crumpliness (the ratio of the object surface area

and the surface area of its convex hull), hull packing (the

percentage of the convex hull volume not occupied by the

object), and hull compactness (the ratio of the cubed sur-

face area of the hull and the squared volume of the convex

hull).

Kazhdan et al. [42] describe a reﬂective symmetry de-

scriptor as a 2D function associating a measure of reﬂec-

tive symmetry to every plane (speciﬁed by 2 parameters)

through the model’s centroid. Every function value provides

a measure of global shape, where peaks correspond to the

planes near reﬂective symmetry, and valleys correspond to

the planes of near anti-symmetry. Their experimental results

show that the combination of the reﬂective symmetry de-

scriptor with existing methods provides better results.

Since only global features are used to characterize the

overall shape of the objects, these methods are not very dis-

criminative about object details, but their implementation is

straightforward. Therefore, these methods can be used as an

active ﬁlter, after which more detailed comparisons can be

made, or they can be used in combination with other meth-

ods to improve results.

Global feature methods are able to support user feed-

back as illustrated by the following research. Zhang and

Chen [89] applied features such as volume-surface ratio,

moment invariants and Fourier transform coefﬁcients for

3D shape retrieval. They improve the retrieval performance

by an active learning phase in which a human annotator as-

signs attributes such as airplane, car, body, and so on to a

number of sample models. Elad et al. [28] use a moments-

based classiﬁer and a weighted Euclidean distance measure.

Their method supports iterative and interactive database

searching where the user can improve the weights of the

distance measure by marking relevant search results.

3.1.2. Global feature distribution based similarity

The concept of global feature based similarity has been re-

ﬁned recently by comparing distributions of global features

instead of the global features directly.

Osada et al. [66] introduce and compare shape distribu-

tions, which measure properties based on distance, angle,

area and volume measurements between random surface

points. They evaluate the similarity between the objects us-

ing a pseudo-metric that measures distances between distri-

butions. In their experiments the D2 shape distribution mea-

suring distances between random surface points is most ef-

fective.

Ohbuchi et al. [64] investigate shape histograms that are

discretely parameterized along the principal axes of inertia

of the model. The shape descriptor consists of three shape

histograms: (1) the moment of inertia about the axis, (2)

the average distance from the surface to the axis, and (3)

the variance of the distance from the surface to the axis.

Their experiments show that the axis-parameterized shape

features work only well for shapes having some form of ro-

tational symmetry.

Ip et al. [37] investigate the application of shape distri-

butions in the context of CAD and solid modeling. They re-

ﬁned Osada’s D2 shape distribution function by classifying

2 random points as 1) IN distances if the line segment con-

necting the points lies complete inside the model, 2) OUT

distances if the line segment connecting the points lies com-

plete outside the model, 3) MIXED distances if the line seg-

ment connecting the points lies passes both inside and out-

side the model. Their dissimilarity measure is a weighted

distance measure comparing D2, IN, OUT and MIXED dis-

tributions. Since their method requires that a line segment

can be classiﬁed as lying inside or outside the model it is

required that the model deﬁnes a volume properly. There-

fore it can be applied to volume models, but not to polyg-

onal soups. Recently, Ip et al. [38] extend this approach

with a technique to automatically categorize a large model

database, given a categorization on a number of training ex-

amples from the database.

Ohbuchi et al. [63], investigate another extension of the

D2 shape distribution function, called the Absolute Angle-

Distance histogram, parameterized by a parameter denot-

ing the distance between two random points and by a pa-

rameter denoting the angle between the surfaces on which

two random points are located. The latter parameter is ac-

tually computed as an inner product of the surface normal

vectors. In their evaluation experiment this shape distribu-

tion function outperformed the D2 distribution function at

about 1.5 times higher computational costs. Ohbuchi et al.

[65] improved this method further by a multi-resolution ap-

proach computing a number of alpha-shapes at different

scales, and computing for each alpha-shape their Absolute

Angle-Distance descriptor. Their experimental results show

that this approach outperforms the Angle-Distance descrip-

tor at the cost of high processing time needed to compute

the alpha-shapes.

Shape distributions distinguish models in broad cate-

gories very well: aircraft, boats, people, animals, etc. How-

ever, they perform often poorly when having to discrimi-

nate between shapes that have similar gross shape proper-

ties but vastly different detailed shape properties.

3.1.3. Spatial map based similarity

Spatial maps are representations that capture the spatial lo-

cation of an object. The map entries correspond to physi-

cal locations or sections of the object, and are arranged in a

manner that preserves the relative positions of the features

in an object. Spatial maps are in general not invariant to ro-

tations, except for specially designed maps. Therefore, typ-

ically a pose normalization is done ﬁrst.

Ankerst et al. [5] use shape histograms as a means of an-

alyzing the similarity of 3D molecular surfaces. The his-

tograms are not built from volume elements but from uni-

formly distributed surface points taken from the molecular

surfaces. The shape histograms are deﬁned on concentric

shells and sectors around a model’s centroid and compare

shapes using a quadratic form distance measure to compare

the histograms taking into account the distances between

the shape histogram bins.

Vrani´c et al. [85] describe a surface by associating to

each ray from the origin, the value equal to the distance to

the last point of intersection of the model with the ray and

compute spherical harmonics for this spherical extent func-

tion. Spherical harmonics form a Fourier basis on a sphere

much like the familiar sine and cosine do on a line or a cir-

cle. Their method requires pose normalization to provide

rotational invariance. Also, Yu et al. [86] propose a descrip-

tor similar to a spherical extent function and a descriptor

counting the number of intersections of a ray from the ori-

gin with the model. In both cases the dissimilarity between

two shapes is computed by the Euclidean distance of the

Fourier transforms of the descriptors of the shapes. Their

method requirespose normalization to provide rotational in-

variance.

Kazhdan et al. [43] present a general approach based on

spherical harmonics to transform rotation dependent shape

descriptors into rotation independent ones. Their method is

applicable to a shape descriptor which is deﬁned as either a

collection of spherical functions or as a function on a voxel

grid. In the latter case a collection of spherical functions is

obtained from the function on the voxel grid by restricting

the grid to concentric spheres. From the collection of spher-

ical functions they compute a rotation invariant descriptor

by (1) decomposing the function into its spherical harmon-

ics, (2) summing the harmonics within each frequency, and

computing the L

-norm for each frequency component. The

resulting shape descriptor is a 2D histogram indexed by ra-

dius and frequency, which is invariant to rotations about the

center of the mass. This approach offers an alternative for

pose normalization, because their method obtains rotation

invariant shape descriptors. Their experimental results show

indeed that in general the performance of the obtained ro-

tation independent shape descriptors is better than the cor-

responding normalized descriptors. Their experiments in-

clude the ray-based spherical harmonic descriptor proposed

by Vrani´c et al. [85]. Finally, note that their approach gen-

eralizes the method to compute voxel-based spherical har-

monics shape descriptor, described by Funkhouser et al.

[30], which is deﬁned as a binary function on the voxel grid,

where the value at each voxel is given by the negatively ex-

ponentiated Euclidean Distance Transform of the surface of

a 3D model.

Novotni and Klein [61] present a method to compute

3D Zernike descriptors from voxelized models as natural

extensions of spherical harmonics based descriptors. 3D

Zernike descriptors capture object coherence in the radial

direction as well as in the direction along a sphere. Both

3D Zernike descriptors and spherical harmonics based de-

scriptors achieve rotation invariance. However, by sampling

the space only in radial direction the latter descriptors do

HTML Viewer

Frequently Asked Questions (16)

Q1. What are the contributions mentioned in the paper "A survey of content based 3d shape retrieval methods" ?

This paper surveys the literature on methods for content based 3D retrieval, taking into account the applicability to surface models as well as to volume models. Finally, the advantages and limits of the several approaches in content based 3D shape retrieval are discussed.

Q2. What are the future works in "A survey of content based 3d shape retrieval methods" ?

In this section the authors summarize their discussion on shape matching methods from the previous section and indicate directions for further research. Further research is needed to compare both approaches using the same benchmarks and the best PCA method. Because a feature vector is a point in a fixed d-dimensional space, two models can be compared fast by computing their distance in this space. Also, indexing is straightforward and retrieval can be implemented efficiently by nearest neighbour search.

Q3. What are the dominant solid modeling representation methods?

The most dominant solid modeling representation methods are Boundary Representation (B-rep) and Constructive Solid Geometry (CSG).

Q4. How do they compute a rotation invariant descriptor?

From the collection of spherical functions they compute a rotation invariant descriptor by (1) decomposing the function into its spherical harmonics, (2) summing the harmonics within each frequency, and computing the L2-norm for each frequency component.

Q5. Why is it necessary to place 3D models into a canonical coordinate system?

Because not all dissimilarity measures are invariant under rotation and translation, it may be necessary to place the 3D models into a canonical coordinate system.

Q6. What is the method of comparing shape histograms?

The shape histograms are defined on concentric shells and sectors around a model’s centroid and compare shapes using a quadratic form distance measure to comparethe histograms taking into account the distances between the shape histogram bins.

Q7. How do they improve the retrieval performance of 3D shapes?

They improve the retrieval performance by an active learning phase in which a human annotator assigns attributes such as airplane, car, body, and so on to a number of sample models.

Q8. How can the authors measure distances between two objects?

In order to measure how similar two objects are, it is necessary to compute distances between pairs of descriptors using a dissimilarity measure.

Q9. How many thumbnail images are used to obtain the descriptor of each 3D object?

In the preprocessing phase a descriptor of each 3D model is obtained by 13 thumbnail images of boundary contours of the 3D object as seen from 13 orthographic view directions.

Q10. What are the three broad categories of graph based methods?

Graph based methods can be divided into three broad categories according to the type of graph used: (1) model graphs, (2) Reeb graphs, and (3) skeletons.

Q11. What is the size of the topological signature vector?

Since a topological signature vector has a fixed size, the size of its shape descriptor is a constant multiplied by the number of nodes of the skeleton.

Q12. What are the disadvantages of local feature methods?

Compared to the methods from the first three categories, local feature methods, which compute feature value vectors for a number of surface points, matching is less efficient, efficient indexing is not straightforward, and the obtained dissimilarity measure is not obey the triangle inequality.

Q13. What is the generalization of the method used to compute a spherical harmonic?

note that their approach generalizes the method to compute voxel-based spherical harmonics shape descriptor, described by Funkhouser et al. [30], which is defined as a binary function on the voxel grid, where the value at each voxel is given by the negatively exponentiated Euclidean Distance Transform of the surface of a 3D model.

Q14. Why was it not possible to compare the results of different researchers?

Due to the lack of publicly available benchmarks in the past, it was not possible to compare the shape matching results obtained by different researchers.

Q15. What are the three approaches to provide a query object?

Three approaches can be distinguished to provide a query object: (1) browsing to select a new query object from the obtained results, (2) a direct query by providing a query descriptor, (3) query by example by providing an existing 3D model or by creating a 3D shape query from scratch using a 3D tool or sketching 2D projections of the 3D model.

Q16. How do they obtain a shape descriptor?

Dey et al. [24] present a method to obtain a descriptor of a shape, given by a point sample, by first decomposing the shape into its components.

A survey of content based 3D shape retrieval methods

Summary (6 min read)

1. Introduction

2.1. 3D shape retrieval framework

2.2. Shape representations

2.3. Measuring similarity

2.4. Efficiency

2.5. Discriminative power

2.6. Partial matching

2.7. Robustness

2.8. Pose normalization

3. Shape matching methods

3.1. Feature based methods

3.1.1. Global feature based similarity

3.1.2. Global feature distribution based similarity

3.1.3. Spatial map based similarity

3.1.4. Local feature based similarity

3.2. Graph based methods

3.2.1. Model graph based similarity

3.2.2. Skeleton based similarity

3.2.3. Reeb graph based similarity

3.3.1. View based similarity

3.3.2. Volumetric error based similarity

3.3.3. Weighted point set based similarity

3.3.4. Deformation based similarity

4. Overview and conclusions

Citations

References

Related Papers (5)

Frequently Asked Questions (16)

Q1. What are the contributions mentioned in the paper "A survey of content based 3d shape retrieval methods" ?

Q2. What are the future works in "A survey of content based 3d shape retrieval methods" ?

Q3. What are the dominant solid modeling representation methods?

Q4. How do they compute a rotation invariant descriptor?

Q5. Why is it necessary to place 3D models into a canonical coordinate system?

Q6. What is the method of comparing shape histograms?

Q7. How do they improve the retrieval performance of 3D shapes?

Q8. How can the authors measure distances between two objects?

Q9. How many thumbnail images are used to obtain the descriptor of each 3D object?

Q10. What are the three broad categories of graph based methods?

Q11. What is the size of the topological signature vector?

Q12. What are the disadvantages of local feature methods?

Q13. What is the generalization of the method used to compute a spherical harmonic?

Q14. Why was it not possible to compare the results of different researchers?

Q15. What are the three approaches to provide a query object?

Q16. How do they obtain a shape descriptor?