Journal ArticleDOI

Isolated 3D object recognition through next view planning

01 Jan 2000-Vol. 30, Iss: 1, pp 67-76
TL;DR: A new online recognition scheme based on next view planning for the identification of an isolated 3D object using simple features is presented, built on a probabilistic reasoning framework for recognition and planning.
Abstract: In many cases, a single view of an object may not contain sufficient features to recognize it unambiguously. This paper presents a new online recognition scheme based on next view planning for the identification of an isolated 3D object using simple features. The scheme uses a probabilistic reasoning framework for recognition and planning. Our knowledge representation scheme encodes feature based information about objects as well as the uncertainty in the recognition process. This is used both in the probability calculations as well as in planning the next view. Results clearly demonstrate the effectiveness of our strategy for a reasonably complex experimental set.

Summary (2 min read)

Introduction

  • A hierarchical knowledge representation scheme facilitates recognition and the planning process.
  • A single view may not contain sufficient features to recognize the object unambiguously.
  • A simple feature set is applicable for a larger class of objects than a model base specific complex feature set.
  • The purpose of this paper is to investigate the use of suitably planned multiple views and two-dimensional (2-D) invariants for 3-D object recognition.

A. Relation with Other Work

  • Tarabanis et al. [5] survey the field of sensor planning for vision tasks.
  • The next view planning strategy acts on the basis of these hypotheses.
  • The authors use a hierarchical knowledge representation scheme which not only ensures a low-order polynomial-time complexity of the hypothesis generation process, but also plays an important role in planning the next view.
  • There are six aspects of the object shown, belonging to three classes.

A. Class Identification, Accounting for Uncertainty

  • 2) Class Probability Calculations Using the Knowledge Representation Scheme: In (2), P(f_jk | C_i) is 1 for those classes which have a link from feature-class f_jk.
  • The computation of (2) takes O(N_C) time—this is done for each feature-class.
  • Due to errors possible in the feature detection process, a degree of uncertainty is associated with the evidence.
  • The summation reduces to one term, P(C_i | f_jr) · p_jrk.

B. Object Identification

  • Based on the outcome of the class recognition scheme, the authors estimate the object probabilities as follows.
  • A particular movement may preclude the occurrence of some aspects for a given class observed.
  • Let c_ij and a_ij represent the minimum angles necessary to move out of the current assumed aspect in the clockwise and counterclockwise directions, respectively.
  • The authors construct search tree nodes corresponding to both moves.
  • From these, the authors finally select one with the minimum total movement.

A. The Planning Process and Object Recognition

  • In their object identification algorithm, aspect and object probabilities are initialized to their a priori values.
  • Else, the algorithm initiates the search process to get the best distinguishing move to resolve the ambiguity associated with this view.
  • The planning scheme is global—its reactive nature incorporates all previous movements and observations, both in the probability calculations (Section III-B) and in the planning process.
  • The authors' robust class recognition algorithm can recover from many feature detection errors at the class recognition phase itself (Section III-A-2).
  • Let denote the angular extent of the smallest aspect observed so far.

B. Bounds on the Number of Observations

  • It is instructive to consider bounds on T_avg(n), the number of observations required to disambiguate between a set of n aspects (corresponding to the initially observed class).
  • An interesting case is observed in Fig. 10(c) and (f)—an opportunistic case when the number of steps with primary moves is less than the one with both primary and auxiliary moves.
  • 3) Ordering of Feature Detectors: The third image in Fig. 9(a) shows the advantage of their scheduling of feature detectors.
  • 7) Average Number of Observations for a Given Number of Competing Aspects.

A. Experiments with Model Base II

  • The authors use the number of horizontal and vertical lines (⟨hv⟩) and the number of circles (⟨c⟩) as features.
  • The recognition scheme has the ability to correctly identify objects even when they have a large number of similar views.


IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS—PART A: SYSTEMS AND HUMANS, VOL. 30, NO. 1, JANUARY 2000 67
Correspondence
Isolated 3-D Object Recognition through Next View
Planning
Sumantra Dutta Roy, Santanu Chaudhury, and Subhashis Banerjee
Abstract—In many cases, a single view of an object may not contain sufficient features to recognize it unambiguously. This paper presents a new on-line recognition scheme based on next view planning for the identification of an isolated three-dimensional (3-D) object using simple features. The scheme uses a probabilistic reasoning framework for recognition and planning. Our knowledge representation scheme encodes feature based information about objects as well as the uncertainty in the recognition process. This is used both in the probability calculations as well as in planning the next view. Results clearly demonstrate the effectiveness of our strategy for a reasonably complex experimental set.
Index Terms—Active vision, reactive planning, 3-D object recognition.
I. INTRODUCTION
In this paper, we present a new on-line scheme for the recognition
of an isolated three-dimensional (3-D) object using reactive next view
planning. A hierarchical knowledge representation scheme facilitates
recognition and the planning process. The planning process utilizes
the current observation and past history for identifying a sequence of
moves to disambiguate between similar objects.
Most model-based object recognition systems consider the problem
of recognizing objects from the image of a single view [1]–[4]. How-
ever, a single view may not contain sufficient features to recognize
the object unambiguously. In fact, two objects may have all views in
common with respect to a given feature set, and may be distinguished
only through a sequence of views. Further, in recognizing 3-D objects
from a single view, recognition systems often use complex feature sets
[2]. In many cases, it may be possible to achieve the same, incurring less
error and smaller processing cost using a simpler feature set and suit-
ably planning multiple observations. A simple feature set is applicable
for a larger class of objects than a model base specific complex feature
set. Model base-specific complex features such as 3-D invariants have
been proposed only for special cases so far (e.g., [3]). The purpose of
this paper is to investigate the use of suitably planned multiple views
and two-dimensional (2-D) invariants for 3-D object recognition.
A. Relation with Other Work
With an active sensor, object recognition involves identification of a
view of an object and if necessary, planning further views. Tarabanis
et al. [5] survey the field of sensor planning for vision tasks. We can
compare various active 3-D object recognition systems on the basis of
the following four issues.
1) Nature of the Next View Planning Strategy: The system should
plan moves with maximum ability to discriminate between views
Manuscript received October 23, 1997; revised May 5, 1998.
S. Dutta Roy and S. Banerjee are with the Department of Computer Sci-
ence and Engineering, Indian Institute of Technology, New Delhi-110 016, India
(e-mail: sumantra@ee.iitd.ernet.in; suban@cse.iitd.ernet.in).
S. Chaudhury is with the Department of Electrical Engineering, Indian Insti-
tute of Technology, New Delhi 110 016, India.
Publisher Item Identifier S 1083-4427(00)01177-2.
common to more than one object in the model base. The cost in-
curred in this process should also be minimal. The system should,
preferably be on-line and reactive—the past and present inputs
should guide the planning mechanism at each stage.
While the scheme of Maver and Bajcsy [6] is on-line, that of
Gremban and Ikeuchi [7] is not. Due to the combinatorial nature
of the problem, an off-line approach may not always be feasible.
2) Uncertainty Handling Capability of the Hypothesis Generation
Mechanism: The occlusion-based next view planning approach
of Maver and Bajcsy [6], as well as that of Gremban and Ikeuchi
[7] are essentially deterministic. A probabilistic strategy can
make the system more robust and resistant to errors compared to
a deterministic one. Dickinson et al. [8] use Bayesian methods
to handle uncertainty, while Hutchinson and Kak [9] use the
Dempster–Shafer theory.
3) Efficient Representation of Domain Knowledge: The knowledge
representation scheme should support an efficient mechanism
to generate hypotheses on the basis of the evidence received. It
should also play a role in optimally planning the next view.
Dickinson et al. [8] use a hierarchical representation scheme
based on volumetric primitives, which are associated with a high
feature extraction cost. Due to the non-hierarchical nature of
Hutchinson and Kak’s system [9], many redundant hypotheses
are proposed, which have to be later removed through consis-
tency checks.
4) Speed and Efficiency of Algorithms for Both Hypothesis Gen-
eration and Next View Planning: It is desirable to have algo-
rithms with low order polynomial-time complexity to generate
hypotheses accurately and fast. The next view planning strategy
acts on the basis of these hypotheses.
In Hutchinson and Kak’s system [9], although the poly-
nomial-time formulation overcomes the exponential time
complexity associated with assigning beliefs to all possible
hypotheses, their system still has the overhead of intersection
computation in creating common frames of discernment. Con-
sistency checks have to be used to remove the many redundant
hypotheses produced earlier. Though Dickinson et al. [8] use
Bayes nets for hypothesis generation, their system incurs the
overhead of tracking the region of interest through successive
frames.
The next view planning strategy that this paper presents is reactive
and on-line—the evidence obtained from each view is used in the hy-
pothesis generation and the planning process. Our probabilistic hypoth-
esis generation mechanism can handle cases of feature detection errors.
We use a hierarchical knowledge representation scheme which not only
ensures a low-order polynomial-time complexity of the hypothesis gen-
eration process, but also plays an important role in planning the next
view. The hierarchy itself enforces different constraints to prune the
set of possible hypotheses. The scheme is independent of the type of
features used, unlike that of [8]. We present results of over 100 exper-
iments with our recognition scheme on two sets of models. Extensive
experimentation shows the effectiveness of our proposed strategy of
using simple features and multiple views for recognizing complex 3-D
shapes.
The organization of the rest of the paper is as follows: Section II
presents our knowledge representation scheme. We discuss hypothesis
generation for class and object recognition in Section III. Section IV
1083-4427/00$10.00 © 2000 IEEE

describes our algorithm for planning the next view. In Section V we
demonstrate the working of our system on two sets of objects. We sum-
marize the salient features of our scheme and discuss areas for further
work in Section VI.
II. THE KNOWLEDGE REPRESENTATION SCHEME
A view of a 3-D object is characterized by a set of features. With re-
spect to a particular feature set and over a particular range of viewing
angles, a view of a 3-D object is independent of the viewpoint. Koen-
derink and van Doorn [10] define aspects as topologically equivalent
classes of object appearances. Ikeuchi et al. generalize this definition:
object appearances may be grouped into equivalence classes with re-
spect to a feature set. These equivalence classes are aspects [11]. In this
context, we define the following terms:
Class A: Class (or, aspect-class) is a set of aspects, equiva-
lent with respect to a feature set.
Feature-Class: A feature-class is a set of equivalent aspects de-
fined for one particular feature.
Fig. 1 shows a simple example of an object with its associated aspects and classes. The locus of view-directions is one-dimensional (1-D) and we assume orthographic projection. The basis of the different classes is the number of horizontal lines (h) and vertical lines (v) in a particular view of the object. Thus, a class may be represented as ⟨hv⟩. There are six aspects of the object shown, belonging to three classes. In this example, for simplicity we assume only one feature detector so that each feature-class is also a class.
We propose a new knowledge representation scheme encoding domain knowledge about the object, relations between different aspects, and the correspondence of these aspects with feature detectors. Fig. 2 illustrates an example of this scheme. We use this knowledge representation scheme both in belief updating as well as in next view planning. Sections III and IV discuss these topics, respectively. The representation scheme consists of two parts.
1) The Feature-Dependence Subnet: In the feature-dependence subnet, F represents the complete set of features {F_j} used for characterizing views.

A feature node F_j is associated with feature-classes f_jk. Factors such as noise and nonadaptive thresholds can introduce errors in the feature detection process. Let p_jlk represent the probability that the feature-class present is f_jl, given that the detector for feature F_j detects it to be f_jk. We define p_jlk as the ratio of the number of times the detector for feature F_j interprets feature-class f_jl as f_jk, and the number of times the feature detector reports the feature-class as f_jk. The F_j node stores a table of these values for its corresponding feature detector.

A class node C_i stores its a priori probability, P(C_i). A link between class C_i and feature-class f_jk indicates that f_jk forms a subset of features observed in C_i. This accounts for a PART-OF relation between the two. Thus, a class represents an n-vector [f_1j f_2j ... f_nj]. Since a class cannot be independent of any feature, each class has n input edges corresponding to the n features.
2) The Class-Aspect Subnet: The class-aspect subnet encodes the relationships between classes, aspects, and objects. O represents the set of all objects {O_i}.

An object node O_i stores its probability, P(O_i). An aspect node a_ij stores its angular extent θ_ij (in degrees), its probability P(a_ij), its parent class C_j, and its neighboring aspects.

Aspect a_ij has a PART-OF relationship with its parent object O_i. Thus, the 3-tuple ⟨O_i, C_j, θ_ik⟩ represents an aspect.
Fig. 1. Aspects and classes of an object.
Fig. 2. Example of the knowledge representation scheme.
Aspect node a_ij has exactly one link to any object (O_i) and exactly one link to its parent class C_j.
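As a concrete illustration, the two subnets can be written down as plain data structures. The sketch below is hypothetical: the toy model base (two objects, three classes, two features) is ours, not the paper's.

```python
# Hypothetical toy model base illustrating the representation scheme.
# Class-aspect subnet: an aspect is a 3-tuple <object, class, extent>.
ASPECTS = [
    # (object, parent class, angular extent in degrees)
    ("O1", "C1", 120.0),
    ("O1", "C2", 240.0),
    ("O2", "C1", 200.0),
    ("O2", "C3", 160.0),
]

# Feature-dependence subnet: a class is an n-vector of feature-classes,
# one per feature (here n = 2).
CLASS_FEATURES = {
    "C1": {"F1": "f11", "F2": "f21"},
    "C2": {"F1": "f12", "F2": "f21"},
    "C3": {"F1": "f11", "F2": "f22"},
}

def extent_sum(obj):
    """The aspects of an object partition its 360-degree view circle."""
    return sum(e for o, _, e in ASPECTS if o == obj)

print(extent_sum("O1"), extent_sum("O2"))  # 360.0 360.0
```

The consistency constraints the hierarchy enforces (each class has exactly n incoming feature edges, each aspect links to exactly one object and one class) can be checked directly on such structures.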
III. HYPOTHESIS GENERATION
The recognition system takes any arbitrary view of an object as input.
Using a set of features (the feature-classes), it generates hypotheses
about the likely identity of the class. This is, in turn, used for gener-
ating hypotheses about the object’s identity. The interaction of the hy-
pothesis generation part with the rest of the system is shown in Fig. 3.
Hypothesis generation consists of two steps, namely, class identification and object identification.
A. Class Identification, Accounting for Uncertainty
Our algorithm suitably schedules feature detectors to perform prob-
abilistic class identification. In what follows, we discuss its various as-
pects. Fig. 4 presents the overall algorithm.
1) Ordering of Feature Detectors: A proper ordering of feature de-
tectors speeds up the class recognition process. At any stage, we choose
the hitherto unused feature detector for which the feature-class corre-
sponding to the most probable class has the least number of outgoing
arcs, i.e., the least out-degree. This is done in order to obtain that fea-
ture-class which has the largest discriminatory power in terms of the
number of classes it could correspond to. For example, in Fig. 2, if all feature detectors are unused and C_2 has the highest a priori probability, F_3 will be tried first, followed by F_2 and F_1, if required.
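A minimal sketch of this ordering rule on an invented model base; out_degree counts the classes a feature-class links to, and breaking ties by list order is our assumption, not the paper's:

```python
# Toy class -> (feature -> feature-class) table; illustrative only.
CLASS_FEATURES = {
    "C1": {"F1": "f11", "F2": "f21", "F3": "f31"},
    "C2": {"F1": "f11", "F2": "f22", "F3": "f32"},
    "C3": {"F1": "f12", "F2": "f22", "F3": "f32"},
}

def out_degree(feature, feature_class):
    """Number of classes this feature-class has links to."""
    return sum(1 for fc in CLASS_FEATURES.values()
               if fc[feature] == feature_class)

def next_detector(most_probable_class, unused):
    """Pick the unused detector whose feature-class for the most
    probable class has the least out-degree (Section III-A-1)."""
    fcs = CLASS_FEATURES[most_probable_class]
    return min(unused, key=lambda f: out_degree(f, fcs[f]))

# For C1: f11 has out-degree 2; f21 and f31 each have out-degree 1,
# so F2 is scheduled first (it wins the tie with F3 by list order).
print(next_detector("C1", ["F1", "F2", "F3"]))  # F2
```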

Fig. 3. Flow diagram depicting the flow of information and control in our system.
Fig. 4. Class recognition algorithm.
2) Class Probability Calculations Using the Knowledge Representation Scheme: We obtain the a priori probability of class C_i as

    P(C_i) = Σ_p P(O_p) · Σ_q P(a_pq | O_p).    (1)

Here, aspects a_pq belong to class C_i. Let N_F, N_C, and N_a denote the number of feature-classes associated with feature detector F_j, the number of classes, and the number of aspects, respectively. P(a_pq | O_p) is θ_pq / 360. We can compute P(C_i) from our knowledge representation scheme by considering each aspect node belonging to an object and testing if it has a link to node C_i; this takes O(N_C + N_a) time. (The N_C term is for the initialization of class probabilities to 0.)
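A sketch of (1) on a toy model base with equiprobable objects and P(a_pq | O_p) = θ_pq/360; all numbers are illustrative:

```python
ASPECTS = [  # (object, class, angular extent in degrees); toy values
    ("O1", "C1", 120.0), ("O1", "C2", 240.0),
    ("O2", "C1", 200.0), ("O2", "C3", 160.0),
]
P_OBJ = {"O1": 0.5, "O2": 0.5}

def class_prior(ci):
    """P(C_i) = sum_p P(O_p) * sum_q P(a_pq | O_p) -- Eq. (1),
    where the inner sum runs over aspects of O_p belonging to C_i."""
    return sum(P_OBJ[o] * extent / 360.0
               for o, c, extent in ASPECTS if c == ci)

print(class_prior("C1"))  # 0.5*120/360 + 0.5*200/360
```

Because each object's aspects cover its whole view circle, the class priors sum to one.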
Let the detector for feature F_j report the feature-class obtained to be f_jk. Given this evidence, we obtain the probability of class C_i from the Bayes rule

    P(C_i | f_jk) = P(C_i) · P(f_jk | C_i) / Σ_m [P(C_m) · P(f_jk | C_m)].    (2)

P(f_jk | C_i) is 1 for those classes which have a link from feature-class f_jk. It is 0 for the rest. The computation of (2) takes O(N_C) time—this is done for each feature-class. Hence, the computation of P(f_jk | C_i) for all feature-classes f_jk for feature detector F_j takes time O(N_F · N_C).
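The update in (2) amounts to normalizing prior-times-likelihood over all classes, with a 0/1 likelihood read off the class/feature-class links. A sketch with hypothetical priors and links:

```python
# Toy priors and links; illustrative only. P(f_jk | C_i) is 1 iff
# class C_i has a link from feature-class f_jk, else 0.
PRIORS = {"C1": 0.4, "C2": 0.35, "C3": 0.25}
LINKS = {"f11": {"C1", "C2"}, "f12": {"C3"}}  # feature-class -> classes

def class_posterior(fjk):
    """Eq. (2): normalize P(C_i) * P(f_jk | C_i) over all classes."""
    num = {c: PRIORS[c] * (1.0 if c in LINKS[fjk] else 0.0)
           for c in PRIORS}
    z = sum(num.values())
    return {c: v / z for c, v in num.items()}

p = class_posterior("f11")
print(p)  # mass is shared by C1 and C2; C3 drops to 0
```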
For an error-free situation, P(C_i | f_jk) is P'(C_i), the a posteriori probability of class C_i. However, due to errors possible in the feature detection process, a degree of uncertainty is associated with the evidence. The value of P'(C_i) is, then,

    P'(C_i) = Σ_l P(C_i | f_jl) · p_jlk    (3)

where the f_jl's are feature-classes associated with feature F_j. According to our knowledge representation scheme, only one feature-class under feature F_j, say f_jr, has a link to class C_i. The summation reduces to one term, P(C_i | f_jr) · p_jrk. Thus, our knowledge representation scheme also enables recovery from feature detection errors.
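A sketch of (3). The confusion table p_jlk (probability that the true feature-class is f_jl when the detector reports f_jk) is invented for illustration; as in the text, only one feature-class per feature links to a given class, so each corrected value reduces to a single term:

```python
# p[l][k] = p_jlk: P(true feature-class f_jl | detector reports f_jk).
# Hypothetical values; each reported column sums to 1.
P_CONFUSION = {
    "f11": {"f11": 0.9, "f12": 0.2},
    "f12": {"f11": 0.1, "f12": 0.8},
}
# P(C_i | f_jl): indicator-like toy values, since exactly one
# feature-class under the feature links to each class.
POSTERIOR = {"f11": {"C1": 1.0, "C2": 0.0},
             "f12": {"C1": 0.0, "C2": 1.0}}

def corrected_posterior(reported):
    """P'(C_i) = sum_l P(C_i | f_jl) * p_jlk -- Eq. (3)."""
    classes = POSTERIOR[reported].keys()
    return {c: sum(POSTERIOR[fl][c] * P_CONFUSION[fl][reported]
                   for fl in POSTERIOR) for c in classes}

print(corrected_posterior("f11"))  # {'C1': 0.9, 'C2': 0.1}
```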
B. Object Identification

Based on the outcome of the class recognition scheme, we estimate the object probabilities as follows. Initially, we calculate the a priori probability of each aspect as

    P(a_jk) = P(O_j) · P(a_jk | O_j).    (4)

Fig. 5. (a) The notation used (Section IV) and (b) a case when our algorithm is not guaranteed to succeed (Section IV-A).
If there are N objects in the model base, we initialize P(O_j) to 1/N before the first observation. For the first observation, P(a_jk | O_j) is θ_jk / 360. A priori aspect probability calculations take O(N_a) time.

For any subsequent observation, we have to account for the movement in the probability calculations. For example, a particular movement may preclude the occurrence of some aspects for a given class observed. The value of P(a_jk | O_j) is given by

    P(a_jk | O_j) = θ'_jk / 360    (5)

where θ'_jk (θ'_jk ∈ [0, θ_jk]) represents the angular range possible within aspect a_jk for the move(s) taken to reach this position. Due to the movement made, we could have observed only m (0 ≤ m ≤ r) aspects out of a total of r aspects belonging to class C_i.
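Equations (4) and (5) can be sketched in one function: before the first view the reachable range θ' is the full aspect extent, and after a move it shrinks to the range still consistent with the movement. Numbers are illustrative:

```python
def aspect_prior(p_obj, extent, reachable=None):
    """Eq. (4): P(a_jk) = P(O_j) * P(a_jk | O_j), with P(a_jk | O_j)
    from Eq. (5): theta'/360. For the first observation theta' is the
    full extent; after a move it is the still-possible range."""
    theta = extent if reachable is None else reachable
    assert 0.0 <= theta <= extent  # theta' lies in [0, theta_jk]
    return p_obj * theta / 360.0

print(aspect_prior(0.5, 120.0))        # first view: 0.5 * 120/360
print(aspect_prior(0.5, 120.0, 40.0))  # after a move: 0.5 * 40/360
```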
Let the class recognition phase report the observed class to be C_i. Let us assume that C_i could have come from aspects a_{j_1 k_1}, a_{j_2 k_2}, ..., a_{j_m k_m}, where j_1, j_2, ..., j_m are not necessarily different. We obtain the a posteriori probability of aspect a_jk given this evidence using the Bayes rule

    P(a_jk | C_i) = P(a_jk) · P(C_i | a_jk) / Σ_{p=1}^{m} [P(a_{j_p k_p}) · P(C_i | a_{j_p k_p})].    (6)

P(C_i | a_jk) is 1 for aspects with a link to class C_i, 0 otherwise. Finally, we obtain the a posteriori probability

    P(O_j) = Σ_l P(a_{j k_l} | C_i)    (7)

where the aspects a_{j k_l} belong to class C_i.
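A sketch of (6) and (7) over a hypothetical set of competing aspects of the observed class; the 0/1 factor P(C_i | a) is implicit because only matching aspects are listed:

```python
# aspect -> (parent object, prior P(a)); toy values for the aspects
# whose class matches the observed one.
COMPETING = {"a11": ("O1", 0.10), "a21": ("O2", 0.25), "a22": ("O2", 0.15)}

def aspect_posteriors(competing):
    """Eq. (6): renormalize the priors of the competing aspects."""
    z = sum(p for _, p in competing.values())
    return {a: p / z for a, (_, p) in competing.items()}

def object_posteriors(competing):
    """Eq. (7): P(O_j) is the sum over O_j's competing aspects."""
    post = aspect_posteriors(competing)
    out = {}
    for a, (obj, _) in competing.items():
        out[obj] = out.get(obj, 0.0) + post[a]
    return out

print(object_posteriors(COMPETING))  # O2 collects most of the mass
```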
If the probability of some object is above a predetermined threshold
(experimentally determined, e.g., 0.87 for Model Base I), the algorithm
reports a success, and stops. If not, it means that the view of the object
is not sufficient to identify the object unambiguously. We have to take
the next view.
In our hierarchical scheme, the link conditional probabilities (rep-
resenting relations between nodes) themselves enforce consistency
checks at each level of evidence. The feature evidence is progressively
refined as it passes through different levels in the hierarchy, leading to
simpler evidence propagation and less computational cost. This is an
advantage of our scheme over that proposed in [9].
IV. NEXT VIEW PLANNING
The class observed in the class recognition phase could have come
from many aspects in the model base, each with its own range of po-
sitions within the aspect. Due to this ambiguity, one has to search for
Fig. 6. Partially constructed search tree.
Fig. 7. Object recognition algorithm.
the best move to discern between these competing aspects subject to
memory and processing limitations, if any. The parameters described
above characterize the state of the system. The planning process aims
to determine a move from the current step, which would uniquely iden-
tify the given object. We pose the planning problem as that of a forward
search in the state space which takes us to a state in which the aspect
list corresponding to the class observed has exactly one node. We use a
search tree for this purpose. A search tree node represents the following
information: [Fig. 5(a)] the unique class observed for the angular movement made so far, the aspects possible for this angle-class pair, and for each aspect, the range of positions possible within it (s_ij – e_ij). Here, s_ij and e_ij denote the two positions within aspect a_ij where the current viewpoint can be, as a result of the movement made thus far; s_ij ≤ e_ij, and s_ij, e_ij ∈ [0, θ_ij], where θ_ij is the angular extent of aspect a_ij. A leaf node is one which has either one aspect associated with it or corresponds to a total angular movement of 360° or more from the root node.
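A node of this search tree might be sketched as below; the field names and example values are ours, not the authors':

```python
from dataclasses import dataclass, field

@dataclass
class SearchNode:
    observed_class: str          # unique class for the movement so far
    total_move: float            # degrees moved from the root node
    # aspect -> (s, e): possible viewpoint positions inside the aspect
    ranges: dict = field(default_factory=dict)

    def is_leaf(self):
        """Leaf: one aspect left, or >= 360 degrees moved in total."""
        return len(self.ranges) == 1 or self.total_move >= 360.0

n = SearchNode("C1", 40.0, {"a11": (10.0, 30.0), "a21": (0.0, 20.0)})
print(n.is_leaf())  # False: two aspects still compete
```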
Fig. 6 shows an example of a partially constructed search tree. From
a view point, we categorize possible moves as follows.

Fig. 8. Model Base I: the objects, from left, are O_1, O_2, O_3, O_4, O_5, O_6, O_7, and O_8, respectively.
Fig. 9. Some experiments with Model Base I: initial class ⟨232⟩. The objects are O [(a) and (c)] and O [(b) and (d)], respectively. (a) ⟨232⟩ → ⟨231(221)⟩ → ⟨232⟩ → ⟨221⟩ → ⟨232⟩. (b) ⟨232⟩ → ⟨221⟩ → ⟨221⟩ → ⟨221⟩. (c) ⟨232⟩ → ⟨232⟩ → ⟨221⟩. (d) ⟨232⟩ → ⟨221⟩ → ⟨221⟩ → ⟨221⟩. The numbers above the arrows denote the number of turntable steps. A negative sign indicates a clockwise movement. (The figure in parentheses shows an example of recovery from feature detection errors.)
Primary Move: A primary move represents a move from an aspect by the minimum angle needed to move out of it.

Auxiliary Move: An auxiliary move represents a move from an aspect by an angle corresponding to the primary move of another competing aspect.

Let c_ij and a_ij represent the minimum angles necessary to move out of the current assumed aspect in the clockwise and counterclockwise directions, respectively. Three cases are possible.

1) Type I Move: c_ij and a_ij both take us out of the current aspect to a single aspect in each of the two directions—a_ip and a_iq, respectively. We construct search tree nodes corresponding to both moves.

2) Type II Move: Exactly one out of c_ij and a_ij takes us to a single aspect a_ip. For the other direction, the aspect we would reach depends upon the initial position (∈ [s_ij, e_ij]) in the current aspect. We construct a search tree node corresponding to the former move.

3) Type III Move: Whether we move in the clockwise or the counterclockwise direction, the aspect reached depends on the initial position in the current aspect. We choose the move which leads

Citations
Journal ArticleDOI
TL;DR: A broad survey of developments in active vision in robotic applications over the last 15 years is provided, e.g. object recognition and modeling, site reconstruction and inspection, surveillance, tracking and search, as well as robotic manipulation and assembly, localization and mapping, navigation and exploration.
Abstract: In this paper we provide a broad survey of developments in active vision in robotic applications over the last 15 years. With increasing demand for robotic automation, research in this area has received much attention. Among the many factors that can be attributed to a high-performance robotic system, the planned sensing or acquisition of perceptions on the operating environment is a crucial component. The aim of sensor planning is to determine the pose and settings of vision sensors for undertaking a vision-based task that usually requires obtaining multiple views of the object to be manipulated. Planning for robot vision is a complex problem for an active system due to its sensing uncertainty and environmental uncertainty. This paper describes such problems arising from many applications, e.g. object recognition and modeling, site reconstruction and inspection, surveillance, tracking and search, as well as robotic manipulation and assembly, localization and mapping, navigation and exploration. A bundle of solutions and methods have been proposed to solve these problems in the past. They are summarized in this review while enabling readers to easily refer solution methods for practical applications. Representative contributions, their evaluations, analyses, and future research trends are also addressed in an abstract level.

398 citations

Journal ArticleDOI
TL;DR: It is argued that the next step in the evolution of object recognition algorithms will require radical and bold steps forward in terms of the object representations, as well as the learning and inference algorithms used.

312 citations



Journal ArticleDOI
TL;DR: This paper surveys important approaches to active 3-D object recognition and reviews existing approaches towards another important application of an active sensor namely, that of scene analysis and interpretation.

138 citations



Patent
20 Feb 2008
TL;DR: In this paper, a view-based approach is presented that does not show the drawbacks of previous methods because it is robust to image noise, object occlusions, clutter, and contrast changes.
Abstract: The present invention provides a system and method for recognizing a 3D object in a single camera image and for determining the 3D pose of the object with respect to the camera coordinate system. In one typical application, the 3D pose is used to make a robot pick up the object. A view-based approach is presented that does not show the drawbacks of previous methods because it is robust to image noise, object occlusions, clutter, and contrast changes. Furthermore, the 3D pose is determined with a high accuracy. Finally, the presented method allows the recognition of the 3D object as well as the determination of its 3D pose in a very short computation time, making it also suitable for real-time applications. These improvements are achieved by the methods disclosed herein.

117 citations

Journal ArticleDOI
TL;DR: A hierarchical view-based approach that addresses typical problems of previous methods is applied and is robust to noise, occlusions, and clutter to an extent that is sufficient for many practical applications, and is invariant to contrast changes.
Abstract: This paper describes an approach for recognizing instances of a 3D object in a single camera image and for determining their 3D poses. A hierarchical model is generated solely based on the geometry information of a 3D CAD model of the object. The approach does not rely on texture or reflectance information of the object's surface, making it useful for a wide range of industrial and robotic applications, e.g., bin-picking. A hierarchical view-based approach that addresses typical problems of previous methods is applied: It handles true perspective, is robust to noise, occlusions, and clutter to an extent that is sufficient for many practical applications, and is invariant to contrast changes. For the generation of this hierarchical model, a new model image generation technique by which scale-space effects can be taken into account is presented. The necessary object views are derived using a similarity-based aspect graph. The high robustness of an exhaustive search is combined with an efficient hierarchical search. The 3D pose is refined by using a least-squares adjustment that minimizes geometric distances in the image, yielding a position accuracy of up to 0.12 percent with respect to the object distance, and an orientation accuracy of up to 0.35 degree in our tests. The recognition time is largely independent of the complexity of the object, but depends mainly on the range of poses within which the object may appear in front of the camera. For efficiency reasons, the approach allows the restriction of the pose range depending on the application. Typical runtimes are in the range of a few hundred ms.

115 citations


Cites background from "Isolated 3D object recognition thro..."

  • ...Because no 3D data but only a single monocular image is available in many cases, the automation level of various industrial processes can be improved significantly if the pose of such objects can be determined reliably from a single image....

    [...]

References
Book
31 Jul 1985
TL;DR: The book updates the research agenda with chapters on possibility theory, fuzzy logic and approximate reasoning, expert systems, fuzzy control, fuzzy data analysis, decision making and fuzzy set models in operations research.
Abstract: Fuzzy Set Theory - And Its Applications, Third Edition is a textbook for courses in fuzzy set theory. It can also be used as an introduction to the subject. The character of a textbook is balanced with the dynamic nature of the research in the field by including many useful references to develop a deeper understanding among interested readers. The book updates the research agenda (which has witnessed profound and startling advances since its inception some 30 years ago) with chapters on possibility theory, fuzzy logic and approximate reasoning, expert systems, fuzzy control, fuzzy data analysis, decision making and fuzzy set models in operations research. All chapters have been updated. Exercises are included.

7,877 citations


"Isolated 3D object recognition thro..." refers background in this paper

  • ...Introduction: Most model-based object recognition systems consider the problem of recognizing objects from the image of a single view. However, a single view may not contain sufficient features to recognize the object unambiguously. In single-view object recognition, systems often need to use complex…...

    [...]

Book
01 Sep 1991
TL;DR: This two-volume set is an authoritative, comprehensive, modern work on computer vision that covers all of the different areas of vision with a balanced and unified approach.
Abstract: From the Publisher: This two-volume set is an authoritative, comprehensive, modern work on computer vision that covers all of the different areas of vision with a balanced and unified approach. The discussion in "Volume I" focuses on image in, and image out or feature set out. "Volume II" covers the higher level techniques of illumination, perspective projection, analytical photogrammetry, motion, image matching, consistent labeling, model matching, and knowledge-based vision systems.

3,571 citations


"Isolated 3D object recognition thro..." refers methods in this paper

  • ...We represent a class as . We use Hough transform-based line and circle detectors [12]....

    [...]

  • ...1) Polyhedral Objects: We use as features the number of horizontal and vertical lines, and the number of nonbackground segmented regions in an image. We represent a class as . We use a Hough transform-based line detector [12]....

    [...]

  • ...For getting the number of regions in the image, we perform sequential labeling (connected components: pixel labeling) [12] on a thresholded gradient image....

    [...]
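The sequential-labeling step quoted above can be illustrated with a minimal two-pass connected-components sketch in Python (the binary image below is a made-up stand-in for a thresholded gradient image, not data from the paper):

```python
def label_regions(img):
    """Two-pass sequential labeling (4-connectivity) of a binary image.

    img: list of rows of 0/1 values. Returns (labels, n_regions), where
    labels assigns each foreground pixel a compact region number 1..n_regions.
    """
    rows, cols = len(img), len(img[0])
    labels = [[0] * cols for _ in range(rows)]
    parent = {}  # union-find over provisional labels

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    def union(a, b):
        ra, rb = find(a), find(b)
        if ra != rb:
            parent[rb] = ra

    next_label = 1
    # First pass: assign provisional labels, record label equivalences.
    for r in range(rows):
        for c in range(cols):
            if not img[r][c]:
                continue
            up = labels[r - 1][c] if r > 0 else 0
            left = labels[r][c - 1] if c > 0 else 0
            if up and left:
                labels[r][c] = up
                union(up, left)
            elif up or left:
                labels[r][c] = up or left
            else:
                parent[next_label] = next_label
                labels[r][c] = next_label
                next_label += 1

    # Second pass: resolve equivalences and renumber compactly.
    remap = {}
    for r in range(rows):
        for c in range(cols):
            if labels[r][c]:
                root = find(labels[r][c])
                if root not in remap:
                    remap[root] = len(remap) + 1
                labels[r][c] = remap[root]
    return labels, len(remap)

# Hypothetical binary image, e.g. the result of thresholding a gradient image.
img = [
    [1, 1, 0, 0, 1],
    [0, 1, 0, 0, 1],
    [0, 0, 0, 0, 0],
    [1, 0, 0, 1, 1],
]
labels, n_regions = label_regions(img)  # n_regions == 4
```

The region count `n_regions` is the kind of simple feature the paper uses alongside the Hough-based line counts.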

Journal ArticleDOI
TL;DR: This book introduces many novel mathematical operations based on the concept of level of confidence, presents many generalizations, and covers several operations and functions of fuzzy numbers, such as integer modulo operations, trigonometric functions, and hyperbolic functions.
Abstract: We were rather pleased to read the review of our book, Introduction to Fuzzy Arithmetic: Theory and Applications. This review was done quite carefully by Caroline M. Eastman of the University of South Carolina, and we are grateful to her for pointing out many interesting, positive aspects as well as some shortcomings of our book. As members of the fuzzy community, we are concerned with studies and developments of concepts and techniques basic to the analysis of uncertainty arising from human perception, thinking, and reasoning processes. In this book we present such concepts and some novel tools for dealing with uncertainties. We start our introduction with the definition for the interval of confidence [a1, a2], where a1 and a2 represent, respectively, the lower and upper bounds of our (subjective) confidence. Next, we introduce some arithmetic operations on these numbers. We then introduce the level of presumption α ∈ [0, 1] and, using it, introduce the uncertain or fuzzy number that is so pervasive in our reasoning process. The reviewer has rightly pointed out that in certain situations, interval arithmetic can be considered a subset of fuzzy arithmetic, the main topic of our book. However, we intentionally did not want to confuse the issue by introducing interval arithmetic and then giving a generalization. We liked our approach, as have many other researchers and students who have used the book. In our approach, we have been guided throughout by a desire to lay a firm foundation for the definition of fuzzy numbers using the basic concept of level of confidence. We have introduced many novel mathematical operations based on this concept and have presented many generalizations. In addition, we have presented several operations and functions of fuzzy numbers, such as integer modulo operations, trigonometric functions, and hyperbolic functions.
These studies have been included for students as well as researchers who wish to have an extended view of the theory. We have attempted to give a thorough exposition of fuzzy numbers; this exposition is illustrated by about 115 worked-out examples, 150 diagrams, and 90 tables. We did not include problems or exercises, which would have put this book in the category of a textbook. The subtitle of the book is "Theory and Applications," but as is rightly

2,238 citations


"Isolated 3D object recognition thro..." refers background in this paper

  • ... [3] A. Zisserman, D. Forsyth, J. Mundy, C. Rothwell, J. Liu, and N. Pillow,...

    [...]

  • ...From [3] and [6] we have the following properties of binary operations....

    [...]

  • ...Model base-specific complex features such as 3-D invariants have been proposed only for special cases so far (e.g., [3])....

    [...]

Journal ArticleDOI
TL;DR: In this paper, a precise definition of the 3D object recognition problem is proposed, and basic concepts associated with this problem are discussed, and a review of relevant literature is provided.
Abstract: A general-purpose computer vision system must be capable of recognizing three-dimensional (3-D) objects. This paper proposes a precise definition of the 3-D object recognition problem, discusses basic concepts associated with this problem, and reviews the relevant literature. Because range images (or depth maps) are often used as sensor input instead of intensity images, techniques for obtaining, processing, and characterizing range data are also surveyed.

1,146 citations

Frequently Asked Questions (14)
Q1. What have the authors contributed in "Isolated 3D object recognition through next view planning"?

This paper presents a new on-line recognition scheme based on next view planning for the identification of an isolated three-dimensional ( 3-D ) object using simple features. 

The next view planning strategy that this paper presents is reactive and on-line—the evidence obtained from each view is used in the hypothesis generation and the planning process. 

For getting the number of regions in the image, the authors perform sequential labeling (connected components: pixel labeling) [12] on a thresholded gradient image. 

While the authors use simple features for the purpose of illustration, one may use other features such as texture, color, specularities, and reflectance ratios. 

With an active sensor, object recognition involves identification of a view of an object and if necessary, planning further views. 

Over 100 experiments demonstrate the effectiveness of using simple features and multiple views even on a relatively complex class of objects with a high degree of ambiguity associated with a view of the object. 

Due to the non-hierarchical nature of Hutchinson and Kak’s system [9], many redundant hypotheses are proposed, which have to be later removed through consistency checks. 

The sequence of moves until observation 3 could correspond to O4, O5, O6, and O7 with probabilities 0.877, 0.102, 0.014, and 0.007, respectively. 
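Posterior probabilities like those quoted above come from a standard Bayesian evidence step, which can be sketched as follows (a minimal illustration; the prior and likelihood values below are hypothetical, not the paper's):

```python
def update_beliefs(priors, likelihoods):
    """One Bayesian evidence step: posterior_i ∝ prior_i · P(evidence | O_i),
    normalized so the posteriors over all candidate objects sum to 1."""
    unnorm = {o: priors[o] * likelihoods[o] for o in priors}
    total = sum(unnorm.values())
    return {o: p / total for o, p in unnorm.items()}

# Hypothetical example: four candidate objects with a uniform prior;
# the features seen in the current view strongly favour O4.
priors = {"O4": 0.25, "O5": 0.25, "O6": 0.25, "O7": 0.25}
likelihoods = {"O4": 0.80, "O5": 0.10, "O6": 0.06, "O7": 0.04}
posterior = update_beliefs(priors, likelihoods)
```

Applying such a step after each observation concentrates probability mass on the object consistent with the whole sequence of views.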

The knowledge representation scheme should support an efficient mechanism to generate hypotheses on the basis of the evidence received. 

The authors can compute P(Ci) from their knowledge representation scheme by considering each aspect node belonging to an object and testing if it has a link to node Ci; this takes O(NC + Na) time. 
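A minimal sketch of that computation, assuming the aspect-to-class links have been flattened into (class, probability) pairs (the data below is hypothetical, not the paper's knowledge base):

```python
def class_priors(aspect_links):
    """Accumulate P(C_i) with one scan over the aspect nodes.

    aspect_links: iterable of (class_id, p_aspect) pairs, one per aspect node,
    where p_aspect is that aspect's a priori probability. A single pass over
    all Na aspects (plus the per-class dictionary entries) gives the
    O(NC + Na) behaviour described above.
    """
    p = {}
    for class_id, p_aspect in aspect_links:
        p[class_id] = p.get(class_id, 0.0) + p_aspect
    return p

# Hypothetical aspect table: each aspect node links to the class whose
# feature values it exhibits.
aspect_links = [("C1", 0.2), ("C1", 0.3), ("C2", 0.4), ("C3", 0.1)]
pC = class_priors(aspect_links)
```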

If the view indeed corresponds to the most probable aspect at a particular stage, then their search process using primary and auxiliary moves is guaranteed to perform aspect resolution and uniquely identify the object in the following step, assuming no feature detection errors. 

Though Dickinson et al. [8] use Bayes nets for hypothesis generation, their system incurs the overhead of tracking the region of interest through successive frames. 

In the first image in Fig. 13(b), due to the shadow of the wing on the fuselage of the aircraft, the feature detector detects four vertical lines instead of three, the correct number. 

Their robust class recognition algorithm can recover from many feature detection errors at the class recognition phase itself (Section III-A-2).