REIN - A Fast, Robust, Scalable REcognition INfrastructure
Marius Muja, Radu Bogdan Rusu, Gary Bradski, David G. Lowe
University of British Columbia, Canada
{mariusm,lowe}@cs.ubc.ca
Willow Garage, 68 Willow Rd., Menlo Park, CA 94025, USA
{rusu, bradski}@willowgarage.com
Abstract: A robust robot perception system intended to
enable object manipulation needs to be able to accurately
identify objects and their pose at high speeds. Since objects vary
considerably in surface properties, rigidity and articulation, no
single detector or object estimation method has been shown
to provide reliable detection across object types to date. This
indicates the need for an architecture that is able to quickly
swap detectors, pose estimators, and filters, or to run them in
parallel or serial and combine their results, preferably without
any code modifications at all. In this paper, we present our
implementation of such an infrastructure, ReIn (REcognition
INfrastructure), to answer these needs. ReIn is able to combine
a multitude of 2D/3D object recognition and pose estimation
techniques in parallel as dynamically loadable plugins. It also
provides an extremely efficient data passing architecture, and
offers the possibility to change the parameters and initial
settings of these techniques during their execution. In the course
of this work we introduce two new classifiers designed for robot
perception needs: BiGGPy (Binarized Gradient Grid Pyramids)
for scalable 2D classification and VFH (Viewpoint Feature
Histograms) for 3D classification and pose. We then show how
these two classifiers can be easily combined using ReIn to solve
object recognition and pose identification problems.
I. INTRODUCTION
In this paper we focus our efforts on the design of a scal-
able, efficient, and modular architecture (ReIn - pronounced
“reyn”) for the problem of object recognition and pose
estimation from 2D/3D imagery. ReIn is motivated by the
recent advances in object recognition such as reported in the
PASCAL VOC challenge [1]. The latest challenge achieved
classification rates of 48.6-93.0% and detection rates of
10.2-55.3%. These results, while encouraging for computer
vision algorithm research, are nowhere near acceptable for
robotics. Missing even five percent of the objects on a table is
unacceptable for a table clearing robot (one out of 20 objects
is left on the table or is perhaps broken by the robot due to
mis-detection). Where humans are involved, missing even 1
percent is unacceptable due to safety reasons. These results
indicate a need to combine various detection, recognition and
pose algorithms and to combine different sensing modalities
in order to attain robust performance. Combining different
algorithms and sensing modalities is a non-trivial task. Quite
often, recognition performance is traded for speed, and
tuning parameters can become complex. We typically might
use one or more fast 2D algorithms with low threshold
settings to over-detect objects over the whole scene in order
to avoid missed detections (“propose”) and then use one
or more slower algorithms with higher recognition perfor-
mance to filter out the correct objects from their proposed
(sparse) locations (“dispose”) followed perhaps by the use
of 3D information to get 6 degree of freedom (DOF) object
orientation (“6DOF pose”). Thus, recognition algorithms can
be used as detectors, recognizers and as filters. Parameters
for these algorithms must be adjusted over large amounts of
data in order to play these roles.
Fig. 1. Object recognition using BiGG and VFH within our ReIn infras-
tructure.
In order to facilitate the above needs, we propose a new
modular software architecture, ReIn^1, that lightly wraps
existing detection, recognition and pose algorithms so that
they may be used in parallel and in serial without the need
to write further code. ReIn runs efficiently, taking advantage
of shared memory where it is available to reduce data
copying. In addition, the architecture automatically provides
an online interface to allow changing/tuning parameters for
each algorithm as it runs. We demonstrate this architecture
showing experimental results combining two of our most
recent detectors: BiGG and VFH [2] (see Figure 1). ReIn is
an open source architecture that defines common interfaces
that are shareable by a large pool of object detectors, and
^1 ReIn (REcognition INfrastructure) is a BSD-licensed, open source
project, available as part of ROS, the Robot Operating System
(http://www.ros.org/wiki/rein).

creates a unified methodology for swapping these detectors
at run-time using data passing redirections.
Though there are many object recognition architectures
in the literature, there are not many generalized (or rather,
standardized) recognition infrastructures. This is mostly
due to the fact that researchers usually focus on individual
detectors in their publications, and though they compare
them with other detectors, the incentive to combine them
is small.
While there are some industrial object recognition sys-
tems such as Cognex’s library [3] and Evolution Robotics
ViPR [4], these tend to be domain specific to factory in-
spection and navigation respectively. There have also been
attempts at cognitive perceptual architectures for robotics
such as COG at MIT [5]. There are far fewer object
recognition architectures devoted to general purpose robotics.
Stanford has developed the STAIR Vision Library [6], which
is centered around a sliding-window approach, now mod-
ifiable by masks, and CMU has produced a system for
textured objects [7]. OpenCV [8] is a computer vision library
containing many object recognition techniques, including a
feature detector-descriptor pipeline; OpenCV is in fact
called by the BiGGPy recognition routine described below.
But none of the above addresses run-time configurable
general object recognition and object pose systems in a
generic way. And none does this in a way where we can
have the reconfigurable benefits of message passing over a
distributed system but still automatically take advantage of
shared memory when possible.
The remainder of this paper is organized as follows: a
brief description of the ReIn system architecture is presented
in Section II. The two detectors used to demonstrate ReIn,
namely BiGG and VFH, are presented in Section III. We
validate the framework and provide experimental results in
Section IV, and conclude with hints towards our related work
in Section V.
II. ARCHITECTURE
To obtain reliable recognition for multiple types of objects
we often need to combine different object detectors, each
with its own strengths and weaknesses. These detectors
can be combined in different configurations, in parallel, in
cascade or some mixture of the two. Usually each algorithm
has a different interface and combining any two of them
involves converting between different data structures which
can be inefficient. Also the task of integrating many different
detection algorithms into a running system can be non-trivial.
We developed ReIn, a Recognition Infrastructure imple-
mented on top of ROS (Robot Operating System), to address
these concerns. In ReIn, an algorithm is viewed as a black
box with a well-defined interface that consumes a set of
inputs, produces some outputs, and is configured by a set of
parameters. We define a set of interfaces shareable by a large
number of object detectors (see Figure 5; a minimal code
sketch of the three roles follows the list below):
Fig. 2. An overall snapshot of the ReIn system architecture, together
with its three major components: attention operators, detectors, and pose
estimators.
• Attention operator: Takes as input an image and/or a
3D point cloud and produces as output a mask, a region
of interest (ROI) in the image or a segmentation of the
point cloud. An attention operator is usually placed in
front of a detector to find “interesting” regions in the
image/point cloud where to perform the detection, thus
reducing the detector’s search space. For example an
attention operator could use stereo 3D information to
produce regions of interest in an image for a vision-
only object detector. An attention operator could also
be used to find interesting regions in the environment
that the robot should examine in more detail.
• Detector: Takes as input an image, a 3D point cloud, a
list of ROIs/masks or a list of detections, and produces
as output a list of detections and potentially a list of
poses. Since different detection algorithms may only
require some of the inputs (for example some algorithms
only use images, some don’t take advantage of regions
of interest in the image), the inputs can be used in any
combination, configurable by a set of parameters.
• Pose estimator: Takes as input an image and/or a point
cloud and a set of detections and computes the poses
of the detected objects. A pose estimator is used when
a pose is required for tasks such as grasping, but the
detection algorithms used are not capable of computing
the poses of the detected objects.
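To make the three roles above concrete, the following is a minimal C++ sketch of what such interfaces could look like. The type and method names are illustrative only (the paper names the Detector and Trainable interfaces but does not list their signatures); ReIn's actual API passes ROS image, point cloud, and detection messages.

#include <vector>

// Stand-ins for the ROS message types ReIn actually exchanges
// (sensor_msgs::Image, sensor_msgs::PointCloud2, detection lists, ...).
struct Image {};
struct PointCloud {};
struct Mask {};
struct Detection {};   // object class label (+ optional pose)
struct Pose {};

// Attention operator: narrows the search space for a detector.
class AttentionOperator {
public:
  virtual ~AttentionOperator() {}
  virtual std::vector<Mask> findRegions(const Image& image,
                                        const PointCloud& cloud) = 0;
};

// Detector: consumes an image/cloud plus optional ROIs or prior detections,
// produces detections (and possibly poses).
class Detector {
public:
  virtual ~Detector() {}
  virtual std::vector<Detection> detect(const Image& image,
                                        const PointCloud& cloud,
                                        const std::vector<Mask>& rois,
                                        const std::vector<Detection>& prior) = 0;
};

// Pose estimator: computes 6DOF poses for already-detected objects.
class PoseEstimator {
public:
  virtual ~PoseEstimator() {}
  virtual std::vector<Pose> estimate(const Image& image,
                                     const PointCloud& cloud,
                                     const std::vector<Detection>& detections) = 0;
};

Because each role consumes and produces the same small set of data types, any concrete detector or estimator can be swapped for another simply by redirecting its inputs and outputs.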
Adding existing attention, detection or pose estimation
algorithms to this infrastructure is accomplished by lightly
wrapping them so they implement the above interfaces.
Once wrapped, they can be freely combined in different
configurations by redirecting their inputs and outputs.
An additional advantage of wrapping existing algorithms
in our infrastructure is the fact that they automatically be-
come plugins (ROS nodelets^2), capable of being dynamically
loaded/unloaded from a system. The plugin system allows
for great flexibility, making it possible for different algorithms
to be loaded as part of the same process, as part of different
processes, or even on different machines (on a compute
cluster, for example). When loaded as part of the same
process, the data exchange between the different algorithms
happens very efficiently with zero copying, by passing shared
pointers.
^2 nodelet: a ROS plugin system that provides a way to run multiple
algorithms as part of the same process with zero copy cost between them.
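The following is a minimal sketch of how a wrapped algorithm could be exposed as a ROS nodelet and pass data by shared pointer; it is not ReIn's code, and the class name, topic names, and export macro (which varies across ROS versions) are illustrative.

#include <nodelet/nodelet.h>
#include <pluginlib/class_list_macros.h>
#include <ros/ros.h>
#include <sensor_msgs/PointCloud2.h>

namespace rein_example {

// Illustrative nodelet: when loaded into the same nodelet manager as its
// subscribers, publishing a shared pointer hands the message over without
// serializing or copying it.
class PassThroughNodelet : public nodelet::Nodelet {
  ros::Subscriber sub_;
  ros::Publisher pub_;

  virtual void onInit() {
    ros::NodeHandle& nh = getNodeHandle();
    pub_ = nh.advertise<sensor_msgs::PointCloud2>("cloud_out", 1);
    sub_ = nh.subscribe("cloud_in", 1, &PassThroughNodelet::callback, this);
  }

  // Receiving a ConstPtr and republishing it forwards the same shared
  // pointer to intra-process subscribers (zero copy).
  void callback(const sensor_msgs::PointCloud2ConstPtr& msg) {
    pub_.publish(msg);
  }
};

}  // namespace rein_example

PLUGINLIB_EXPORT_CLASS(rein_example::PassThroughNodelet, nodelet::Nodelet)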
Since ReIn is built on a distributed message passing ar-
chitecture (that will take advantage of shared memory where
available to avoid copying data), it is simple to configure
the “roles” that classifiers will take. The configuration of
BiGGPy as classifier and VFH as filter used in this paper is
shown in Figure 3. Figure 4 shows how easy it is to reverse
the roles so that VFH plays the main classifier and BiGGPy
the filter. This is done by remapping expected message names
such as “/bigg/image” to look for the raw “/image” and so
on. In this example, the raw images and point clouds are
messages produced by another launch file responsible for
sensing (not shown).
Fig. 3. Launch file for ReIn where BiGGPy classifies and VFH filters.
Fig. 4. Launch file example reversing the roles so that VFH is the classifier
and BiGGPy the filter.
In addition to the features presented above, ReIn in-
cludes a framework for training object detectors. In order to
use this framework, an object detector needs to implement
the Trainable interface in addition to the Detector
interface. The advantage of doing this is that all detectors
implementing the Trainable interface can be trained in a
uniform manner, using the same data formats (for example
bag files^3 or sets of annotated images) and the same tools.
Fig. 5. The set of common interfaces and operators in ReIn.
ReIn contains support for the saving and loading of the
trained models, with the serialization backends configurable
at launch time. The currently available backends allow for
serialization on the local filesystem, as either regular files or
in a SQLite database, or on a centralized relational database
such as MySQL, PostgreSQL or Oracle.
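A hypothetical sketch of what a Trainable interface with pluggable model serialization could look like follows; the paper confirms only the interface name, so the method names and the TrainingSample type are assumptions made for illustration.

#include <string>
#include <vector>

struct TrainingSample {        // e.g. one annotated image or bag-file frame
  std::string object_class;
  // image, point cloud, mask, ground-truth pose, ...
};

class Trainable {
public:
  virtual ~Trainable() {}
  // Called with labeled samples; the detector updates its model.
  virtual void train(const std::vector<TrainingSample>& samples) = 0;
  // Model serialization, delegated to a backend chosen at launch time
  // (local file, SQLite, or a relational database in ReIn).
  virtual void saveModel(const std::string& destination) const = 0;
  virtual void loadModel(const std::string& source) = 0;
};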
III. BIGG AND VFH
An application example that we are currently pursuing is
table clearing with our PR2 platform^4, which involves the
recognition of plates, cups, and common household items.
Many of these items have no internal texture and many
of them are fairly confusable (different types of cups for
example). For this task, it is convenient to use fairly dense
stereo (using textured light projection) combined with 2D
imagery. The addition of 3D information will help identify
table planes as well as verify objects and their pose.
Our object recognition and object pose strategy is to use a
fast 2D classifier set at a low recognition threshold to rapidly
over-detect objects in order to minimize mis-detections. We
call this the object “Proposal” stage, where the hope is that no
object is missed and the correct object is identified in each
location even if there are several false positives. We will then
use a 3D object and pose detection algorithm to filter out the
incorrect object proposals from the correct ones, which we
term the “Disposal” stage. Finally, the 3D data will also be
used to give us object pose in 6DOF, called the “6DOF Pose”
stage.
To validate ReIn we implement the above strategy using
the following two algorithms: i) BiGG (Binarized Gradient
Grids) and its Pyramid extension (BiGGPy), and ii) VFH
(Viewpoint Feature Histogram), which we previously intro-
duced in [2].
^3 “Bag file” is the common format for storing and accessing ROS messages
in an efficient way.
^4 PR2 (Personal Robot 2) is a robotic platform developed by Willow
Garage – http://www.willowgarage.com

Fig. 6. Pan-Tilt rotating unit used for capturing ground truth for the object
pose (both for BiGG and VFH).
A. BiGG and BiGGPy
To develop a rapid object detector for our object proposal
stage, we drew on ideas from the HoG detector [9], which
is essentially a grid of gradient histograms. For speed we
adapted ideas from DOT [10], which binarized image gradi-
ents, used a logical OR instead of histogram bins in each
grid cell, and matched with a cosine function. In BiGG, described
below, we use a simpler normalized count of matching gradients.
The stages of the resulting BiGG algorithm are shown
in Figure 7 and described as follows:
1) Gradients are computed from a gray scale input image
using OpenCV’s [8] Scharr [11] gradient detector.
2) Small magnitude gradients are then removed by thresh-
olding (we used a threshold of 200) and then the
gradients are discretized into one of 8 directions ig-
noring direction of contrast (dark-light and light-dark
are equivalent) since object boundaries might be dark
to light or light to dark depending on the background
the robot observes them against.
3) To remove spurious gradients, a 3x3 filter is next run
that eliminates binarized gradient directions that only
appear once in a given 3x3 region.
4) We next compute a gradient “Summary Image” where
in each n x n block (typically 7x7) we OR the
gradients together to provide some generalization to
exact alignment and pose.
5) In the training phase, the above summary gradients are
recorded as a gradient template for each view of the
object^5. We used a template of 32x32 in the summary
image. In test, the gradient templates are compared,
one by one in a sliding window over the image
scene. Matching is done by taking the logical AND
of each memorized template with a given window of
the summary image. Results are normalized between
0 and 1 by dividing the match result by the total
number of non-zero gradients in the template. Results
are then reported out (optionally with the 6DOF pose
memorized with the matched view) and thresholded to
declare recognition (thresholds from 0.7 to 0.85 work
well).
6) Finally, the learned templates are stored in a database
or on disk.
^5 More intuitive training view coverage may be done by mechanically or
perspectively OR’ing together gradients at each view over a solid angular
part of the view sphere.
Training BiGG is often done by presetting a threshold
level, say 0.82, learning an initial view and only learning a
new view when none of the set of existing templates for that
object is above the threshold.
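As an illustration of steps 1, 2, 4 and 5 above, the following OpenCV-based sketch computes binarized gradients, ORs them into a summary image, and scores a template against a window. The Scharr operator and the parameter values (magnitude threshold 200, 7x7 blocks, 0.7-0.85 recognition threshold) follow the text; everything else is a simplified stand-in for the authors' implementation, and the 3x3 singleton filter of step 3 is omitted.

#include <opencv2/imgproc/imgproc.hpp>
#include <algorithm>
#include <cmath>

// Binarize gradients: one bit per direction (8 directions, contrast ignored).
cv::Mat computeBinaryGradients(const cv::Mat& gray, double mag_thresh = 200.0) {
  cv::Mat dx, dy;
  cv::Scharr(gray, dx, CV_32F, 1, 0);   // step 1: Scharr gradients
  cv::Scharr(gray, dy, CV_32F, 0, 1);

  cv::Mat bits(gray.size(), CV_8U, cv::Scalar(0));
  for (int y = 0; y < gray.rows; ++y)
    for (int x = 0; x < gray.cols; ++x) {
      float gx = dx.at<float>(y, x), gy = dy.at<float>(y, x);
      if (std::sqrt(gx * gx + gy * gy) < mag_thresh) continue;   // step 2
      double angle = std::atan2(gy, gx);             // (-pi, pi]
      if (angle < 0) angle += CV_PI;                 // ignore contrast direction
      int bin = std::min(7, int(angle / CV_PI * 8)); // 8 direction bins
      bits.at<uchar>(y, x) = uchar(1 << bin);
    }
  return bits;
}

// Step 4: OR gradients over n x n blocks to build the summary image.
cv::Mat computeSummaryImage(const cv::Mat& bits, int n = 7) {
  cv::Mat summary(bits.rows / n, bits.cols / n, CV_8U, cv::Scalar(0));
  for (int y = 0; y < summary.rows * n; ++y)
    for (int x = 0; x < summary.cols * n; ++x)
      summary.at<uchar>(y / n, x / n) |= bits.at<uchar>(y, x);
  return summary;
}

// Step 5: normalized count of template gradients present in a window of the
// summary image (window must have the same size as the template).
double matchScore(const cv::Mat& templ, const cv::Mat& window) {
  int matched = 0, nonzero = 0;
  for (int y = 0; y < templ.rows; ++y)
    for (int x = 0; x < templ.cols; ++x) {
      uchar t = templ.at<uchar>(y, x);
      if (!t) continue;
      ++nonzero;
      if (t & window.at<uchar>(y, x)) ++matched;   // logical AND per cell
    }
  return nonzero ? double(matched) / nonzero : 0.0; // threshold ~0.7-0.85
}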
BiGG has several advantages. It can be trained at frame
rate: we use a precise pan-tilt turntable to learn views of
the object together with a ground truth object pose (see
Figure 6), and about 350 views are learned in a 15 second
sweep covering half the object's view sphere. Since BiGG uses just grayscale
gradients without regard to direction of contrast, it is very
tolerant of lighting conditions. The summary image collects
all gradients in each patch (here 7x7), and while testing we
can sample every 7th pixel in each direction for a 49 times
speedup without loss of accuracy. Because BiGG captures
gradients in their context, it can take advantage of the interior
texture where it can find it but can also recognize textureless
and even transparent objects just from their outer contour.
Finally, if cleanly written, BiGG can be quite fast and can
take advantage of SSE or CUDA instructions to parallelize
matching via parallel AND’ing of the summary image patch
with the template.
BiGG also has disadvantages. It uses only logical
matches of gradients in its template (zeros do not count),
so highly textured scenes will cause many false positives.
And although BiGG is quite fast, it still scales linearly with the
number of objects learned, and this will become a limiting
factor for an autonomous robot.
In order to retain the advantages and minimize the dis-
advantages, we developed a pyramid form of BiGG, termed
“BiGGPy”. Figure 8 gives a flow chart of the change in
moving from BiGG to BiGGPy. Instead of computing the
summary image, we start with the full resolution binary
gradient image at the bottom of the pyramid and go up
the pyramid in each stage by logically OR’ing 2x2 gradient
cells from the lower level together forming a pyramid level
of half the size in each dimension. Typically we use a 4
level pyramid which forms our data structure to train and
test against.
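The pyramid construction can be sketched as follows; this is an illustration of the 2x2 OR'ing described above, not the authors' code.

#include <opencv2/core/core.hpp>
#include <vector>

std::vector<cv::Mat> buildGradientPyramid(const cv::Mat& binary_gradients,
                                          int levels = 4) {
  std::vector<cv::Mat> pyramid;
  pyramid.push_back(binary_gradients);            // full-resolution level
  for (int l = 1; l < levels; ++l) {
    const cv::Mat& prev = pyramid.back();
    cv::Mat next(prev.rows / 2, prev.cols / 2, CV_8U, cv::Scalar(0));
    // Each coarser cell is the bitwise OR of the 2x2 cells below it.
    for (int y = 0; y < next.rows; ++y)
      for (int x = 0; x < next.cols; ++x)
        next.at<uchar>(y, x) = prev.at<uchar>(2 * y,     2 * x)
                             | prev.at<uchar>(2 * y,     2 * x + 1)
                             | prev.at<uchar>(2 * y + 1, 2 * x)
                             | prev.at<uchar>(2 * y + 1, 2 * x + 1);
    pyramid.push_back(next);
  }
  return pyramid;  // pyramid[0] = finest, pyramid[levels-1] = coarsest
}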
Training BiGGPy then goes from top to bottom of the
pyramid. Templates at the top levels of the pyramid are
typically associated with many objects, and those at the
bottom level with one or a few objects. At the top level, we
have a very blurred, non-discriminative gradient summary
image, so we set a high threshold in order to break up the
learned objects into many different subtrees, a threshold that
we decay as we descend the pyramid levels and the templates
get more discriminative.
We learn a new top template every time no preexisting
template matches the object. After the top level, learning
proceeds recursively down the best matching template sub-

Fig. 7. BiGG (Binarized Gradient Grids) detector architecture. Starting from the left, gradients and their magnitudes of an image are computed. Small
magnitude gradients are removed and the rest are binarized into 8 directions ignoring direction of contrast. Next, singleton (noisy) gradients are removed
and a gradient summary image is created by OR’ing gradients together over a local patch. Finally, in recognition mode, a sliding window search is used
to find objects in the scene.
Fig. 8. BiGGPy: Moving from BiGG at left to Pyramid of BiGG (BiGGPy) on the right. (BiGG: input image, compute gradient image, discretize gradients,
filter noisy gradients, compute summary image, sliding window matching, detections. BiGGPy: the last two stages become compute image pyramid and
pyramid matching.)
tree. If no existing template is found, another is learned
and so on until the bottom of the tree. At the bottom,
we record the object class, the segmentation mask (given by
depth cues plus GrabCut [12]), the bounding box from the
segmentation mask, and the object pose from the pan-tilt table.
In this way, we learn a tree of BiGG masks whose search
time is (on average) logarithmic in the number of objects
learned.
In test mode for BiGGPy, we compute a pyramid summary
image as above. At the top we do a sliding window search over
the smallest summary image. This produces candidate detec-
tion locations. In each candidate location, we descend to the
next level of the pyramid and search that vicinity with a
lowered threshold (plus and minus a pixel in order to avoid
misses due to slight misalignments). The search proceeds recursively
until the bottom layer where recognitions are reported or until
no template matches. Figure 10 depicts this process.
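A sketch of this coarse-to-fine search follows; the template-tree structure, the threshold decay factor, and the reuse of matchScore from the earlier BiGG sketch are illustrative assumptions, not the authors' implementation.

#include <opencv2/core/core.hpp>
#include <vector>

struct TemplateTreeNode {
  cv::Mat templ;                            // summary-gradient template
  std::vector<TemplateTreeNode> children;   // finer-level templates
  int object_id;                            // valid only at leaf nodes
};

double matchScore(const cv::Mat& templ, const cv::Mat& window);  // earlier sketch

// Refine one candidate location at the given pyramid level (pyramid[0] is the
// finest level, as in the construction sketch above).
void refine(const std::vector<cv::Mat>& pyramid, int level,
            const TemplateTreeNode& node, cv::Point candidate,
            double threshold, std::vector<int>& detections) {
  const cv::Mat& image = pyramid[level];
  // Examine the +/- 1 pixel vicinity of the candidate at this level.
  for (int dy = -1; dy <= 1; ++dy)
    for (int dx = -1; dx <= 1; ++dx) {
      cv::Point p = candidate + cv::Point(dx, dy);
      if (p.x < 0 || p.y < 0 ||
          p.x + node.templ.cols > image.cols ||
          p.y + node.templ.rows > image.rows)
        continue;
      if (matchScore(node.templ, image(cv::Rect(p, node.templ.size()))) < threshold)
        continue;
      if (level == 0) {
        detections.push_back(node.object_id);   // bottom level: report match
      } else {
        // Descend: project the location to the next (finer) level and try the
        // children templates with a decayed threshold (factor is illustrative).
        for (const TemplateTreeNode& child : node.children)
          refine(pyramid, level - 1, child, cv::Point(p.x * 2, p.y * 2),
                 threshold * 0.9, detections);
      }
    }
}

The initial candidates would come from a sliding window search over the coarsest level, after which refine() is called once per candidate.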
Note that, unlike BiGGPy’s logarithmically growing
recognition search time with each new object, the memory
requirements of BiGGPy grow linearly with new objects.
Memory is however much less of an issue than search
time. We need the robot to remain rapidly responsive even
as it learns a large number of objects. But in any given
situation, such as clearing a kitchen table for example, we
only need to load in the BiGGPy templates that we need.
When object recognition search requires templates that are
not in memory, the templates can be pulled in from disk.
The initial recognition time may be slower but the robot will
quickly come up to speed in that given context. Old templates
or template trees that have gone stale can similarly be pruned
from memory. Thus, memory requirements are much less of
a problem than search times.
BiGGPy not only allows recognition to scale to many
objects, but the more detailed gradients at the bottom levels
of the pyramid are less likely to produce false positives.
Although there are a fair number of parameters in BiGGPy
such as pyramid levels, pyramid blur, gradient magnitudes
etc., in practice we use the default values mentioned above
which have performed well and mainly just tune the top
threshold value and how fast it decays through lower pyramid
levels.
B. VFH
VFH was already presented in our previous work [2]
as a standalone 3D meta-local descriptor that is extremely
efficient for object class recognition and pose estimation at
high speeds. The meta locality of the descriptor comes from
the fact that it is usually applied to a cluster of 3D points that
contains the object to be recognized with a high probability.
In our previous work, we assumed that the objects of interest
are supported by horizontal planes, and used segmentation
and clustering techniques to extract individual objects as
separate clusters.
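Assuming an object cluster has already been isolated (for example by removing the supporting plane and clustering the remaining points), a VFH signature can be computed with PCL roughly as follows; the search radius and the use of the current PCL API are illustrative choices, not the settings used in the paper.

#include <pcl/point_types.h>
#include <pcl/features/normal_3d.h>
#include <pcl/features/vfh.h>
#include <pcl/search/kdtree.h>

pcl::VFHSignature308 computeVFH(
    const pcl::PointCloud<pcl::PointXYZ>::ConstPtr& cluster) {
  pcl::search::KdTree<pcl::PointXYZ>::Ptr tree(
      new pcl::search::KdTree<pcl::PointXYZ>);

  // Estimate surface normals for the cluster.
  pcl::NormalEstimation<pcl::PointXYZ, pcl::Normal> ne;
  pcl::PointCloud<pcl::Normal>::Ptr normals(new pcl::PointCloud<pcl::Normal>);
  ne.setInputCloud(cluster);
  ne.setSearchMethod(tree);
  ne.setRadiusSearch(0.03);     // 3 cm radius, illustrative
  ne.compute(*normals);

  // Compute the single Viewpoint Feature Histogram for the whole cluster.
  pcl::VFHEstimation<pcl::PointXYZ, pcl::Normal, pcl::VFHSignature308> vfh;
  pcl::PointCloud<pcl::VFHSignature308> signature;
  vfh.setInputCloud(cluster);
  vfh.setInputNormals(normals);
  vfh.setSearchMethod(tree);
  vfh.compute(signature);       // yields exactly one 308-bin descriptor

  return signature.points[0];
}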

References
Histograms of oriented gradients for human detection (Proceedings Article)
“GrabCut”: interactive foreground extraction using iterated graph cuts (Journal Article)
Fast Point Feature Histograms (FPFH) for 3D registration (Proceedings Article)
Fast approximate nearest neighbors with automatic algorithm configuration (Proceedings Article)
A discriminatively trained, multiscale, deformable part model (Proceedings Article)