scispace - formally typeset
Search or ask a question
Book ChapterDOI

A comparison of features in parts-based object recognition hierarchies

09 Sep 2007-pp 210-219
TL;DR: A comparative investigation of different feature types with regard to their suitability for category discrimination in patches of gray-scale images were compared with SIFT descriptors and patches from the high-level output of a feedforward hierarchy related to the ventral visual pathway.
Abstract: Parts-based recognition has been suggested for generalizing from few training views in categorization scenarios. In this paper we present the results of a comparative investigation of different feature types with regard to their suitability for category discrimination. So patches of gray-scale images were compared with SIFT descriptors and patches from the high-level output of a feedforward hierarchy related to the ventral visual pathway. We discuss the conceptual differences, resulting performance and consequences for hierarchical models of visual recognition.

Summary (2 min read)

1 Introduction

  • The human brain employs different kinds of interrelated representations and processes to recognize objects, depending on the familiarity of the object and the required level of recognition, which is defined by the current task.
  • A parts-based representation is especially efficient for storing and categorizing novel objects, because the largest variance in unseen views of an object can be expected in the position and arrangement of parts, while each part of an object will be visible under a large variety of 3D object transformations.
  • Here hierarchies of feature layers are used, like in the ventral visual pathway, where they combine specificity and invariance of features.
  • In other holistic methods the receptive fields of the features cover the whole image.
  • The approach selects features based on the maximization of mutual information for a single class.

2 Analytic Features

  • To generalize from few training examples, parts-based recognition follows the notion that similar combinations of parts are specific for a certain category over a wide range of variations.
  • So the authors need a reasonable feature selection strategy that evaluates which and how many views of a certain category a feature can separate from other categories and, based on those results, choose the subset of features that in combination can describe the whole scenario best.
  • The maximum activated bin in this histogram is used to normalize the rotation of the patch in advance.
  • A similar cluster step was also done in [15] to improve the generalization performance of the otherwise very specific SIFT descriptors.
  • The feedforward hierarchy proposed in [3] is shown in Fig. 1a.

3 Results

  • The authors tested the performance of the different feature types on the categorization scenario shown in Fig.
  • For the different tests the authors then varied the number of used features and the number of training views that were used by a single layer perceptron (SLP), as the final classifier.
  • C2-H is similar to C2-P and GRAY-P, and takes the lead when using a large number of views.
  • The performance of SIFT-P and GRAY-P on cups(7) is very poor and does not improve with more training views.
  • This is especially true for categories where the rotation in depth looks like rotation in plane (bottle(2), brush(4), phone(9), tool(10)).

4 Conclusion

  • The authors evaluated the performance of different types of local feature when used in parts-based recognition.
  • The biological motivated feedforward hierarchy in [3] is powerful in holistic recognition with a sufficient number of training examples, but the patches from the output layer are too general and therefore show weak performance in parts-based recognition.
  • First features are used that extract the magnitudes for 8 different local gradient directions.
  • This could be beneficial for both feature types.
  • The most related work in the direction of analytic features was done in [16], where Ullman introduced invariance over viewpoint in his fragments approach, or in the work of Dorko et al. in [15], where highly informative clusters of SIFT descriptors are used.

Did you find this useful? Give us your feedback

Content maybe subject to copyright    Report

A Comparison of Features in Parts-Based
Object Recognition Hierarchies
Stephan Hasler, Heiko Wersing, and Edgar orner
Honda Research Institute Europe GmbH
D-63073 Offenbach/Germany
stephan.hasler@honda-ri.de
Abstract. Parts-based recognition has been suggested for generalizing
from few training views in categorization scenarios. In this paper we
present the results of a comparative investigation of different feature
types with regard to their suitability for category discrimination. So
patches of gray-scale images were compared with SIFT descriptors and
patches from the high-level output of a feedforward hierarchy related to
the ventral visual pathway. We discuss the conceptual differences, re-
sulting performance and consequences for hierarchical models of visual
recognition.
1 Introduction
The human brain employs different kinds of interrelated representations and
processes to recognize objects, depending on the familiarity of the object and
the required level of recognition, which is defined by the current task. There is
evidence that for identifying highly familiar objects, like faces, holistic templates
are used that emphasize the spatial layout of the object’s parts but neglect de-
tails of the parts themselves. This holistic prototypical representation requires
a lot of experience and coding capacity and therefore can not be used for all
the objects in every day’s life. A more compact representation can be obtained
when handling objects as combinations of shared parts. There is various biologi-
cal motivation for such a representation. The experiments of Tanaka [1] revealed
that there are high-level areas in primates ventral visual pathway that predict
the presence of a large set of features with intermediate complexity, generaliz-
ing over small variations and being invariant to retinotopical position and scale.
The combinatorial use of those features was shown by Tsunoda [2]. He observed
that complex objects simultaneously activate different spots in those areas and
that this activation is caused by the constituent parts. A parts-based represen-
tation is especially efficient for storing and categorizing novel objects, because
the largest variance in unseen views of an object can be expected in the position
and arrangement of parts, while each part of an object will be visible under a
large variety of 3D object transformations.
In computer vision literature there is a similar distinction into holistic and
parts-based approaches, depending on how feature responses are aggregated over
J. Marques de a et al. (Eds.): ICANN 2007, Part II, LNCS 4669, pp. 210–219, 2007.
c
Springer-Verlag Berlin Heidelberg 2007

A Comparison of Features in Parts-Based Object Recognition Hierarchies 211
the image. Parts can be local features of any kind. The response of a part detector
at different positions in an image means that the part might be present several
times but not that the probability is higher that the part is present at all. So
each peak in the multimodal response map is handled as a possible instance
of the part. In contrary to this, holistic approaches contain a layer that simply
accumulates the real-valued response of single features of the previous layer
over the whole image. This is only comparable to the biological definition if the
configurational information is kept.
Approaches with strong biological motivation are presented in [3,4]. Here hi-
erarchies of feature layers are used, like in the ventral visual pathway, where they
combine specificity and invariance of features. So there are cells that are either
sensitive to a specific pattern of activation in lower layers, in this way increas-
ing the feature’s complexity, or that pool the responses of similar features, so
generalizing over small variations. The output layer of the feedforward hierarchy
proposed in [3] contains several topographically organized feature maps which
are used directly by the final classifier. Following the above definition this is a
holistic approach. The similar hierarchy of [4] employs in the highest feature
layer a spatial max-pooling over each feature map in the previous layer, which
makes it a parts-based approach. Multimodal response characteristics and the
position of the parts are neglected.
Most other approaches work more directly on the images. Very typical holis-
tic approaches apply histograms, so e.g. in [5] the responses to local features are
simply summed and in [6] it is counted how often a response lies in a certain
range. In other holistic methods the receptive fields of the features cover the
whole image. So e.g. in [7] features obtained by principal component analysis
(PCA) on gray-scale images were used to classify faces. These features, so called
eigenfaces, show a very global activation and do not reflect parts of a face. In
contrary to PCA other methods produce so called parts-based features like the
nonnegative matrix factorization (NMF) proposed in [8] or a similar scheme pro-
posed in [9] yielding more class-specific features. Although during training the
receptive field of each feature covers the whole image, it learns to reconstruct
a certain localized region that contains the same part in many training views
(e.g. parts of normalized frontal views of faces). But usually those features are
used in a holistic manner, meaning that they are extracted at a single position
in the test image and in this way are only sensitive to the rigid constellation of
parts that was present during training. This limits the possibilities to general-
ize over geometric transformations, which is especially a drawback when using
few training examples in an unnormalized setting. Also the holistic approaches
perform bad in the presence of clutter and occlusion and often require extensive
preprocessing as localization and segmentation.
Other parts-based recognition approaches also use the maximum activation
of each feature, like the highest layer in [4]. In [10] the features are fragments of
gray-scale images. The response of a feature is binary and obtained by thresh-
olding the maximum activation in the image. The approach selects features
based on the maximization of mutual information for a single class. This yields

212 S. Hasler, H. Wersing, and E. orner
fragments of intermediate complexity. An image is classified by comparing bi-
nary activation vectors to stored representatives in a nearest neighbor fashion.
Other approaches make use of the position and treat each peak in the response
map as possible part instance. In the scale invariant feature transform (SIFT)
approach in [11] gradient-histograms are extracted for small patches around in-
teresting points (see Fig. 1c). Each such patch descriptor is compared against a
large repertoire of stored descriptors, where the best match votes for the presence
of an object at a certain position, scale and rotation. The votes are combined
using a Generalized Hough Transform and the maximally activated hypothesis
is chosen. A similar scheme is proposed in [12]. Here image patches are used as
features and the algorithm is capable to produce a segmentation mask for the
object hypothesis that can be used for a further refinement process. In the bags
of keypoints approaches, e.g. [13], it is counted how often parts are detected in
an image. In contrast to holistic histogram-based approaches the presence of a
part is the result of a strong local competition of parts. Therefore it is more a
counting of symbol-type information than a summation over real-valued signal-
type responses. Parts-based recognition can be used to localize and recognize
objects at the same time and works well in the presence of clutter and occlusion.
In Sect. 2 we first comment on the task we want to solve and the nature of
the features required for this. Then we describe the investigated feature types
and our feature selection strategy. We give results for a categorization problem
in Sect. 3 and present our conclusions in Sect. 4.
2 Analytic Features
To generalize from few training examples, parts-based recognition follows the
notion that similar combinations of parts are specific for a certain category over
a wide range of variations. In this work we investigate how suitable different
feature types are for this purpose and which effort is needed in terms of the
number of used features. As has been argued in [10], it is beneficial that a single
part can be detected in many views of one category, while being absent in other
categories. So we need a reasonable feature selection strategy that evaluates
which and how many views of a certain category a feature can separate from
other categories and, based on those results, choose the subset of features that in
combination can describe the whole scenario best. For simple categories a single
feature can separate many views and therefore only few features are necessary to
represent the whole category. For categories with more variation more features
have to be selected to cover the whole appearance. This dynamic distribution of
resources is necessary to make best use of the limited number of features.
How well certain local descriptors can be re-detected under different image
transformations, as scale, rotation and viewpoint changes, was investigated in
[14]. Although this is a desired quality, it does not necessarily state something on
the usefulness in object recognition tasks. To underline that the desired features
should be meaningful, i.e. offer a compromise between specificity and generality

A Comparison of Features in Parts-Based Object Recognition Hierarchies 213
at low costs, and to avoid confusion with approaches that learn parts-based
features, we will use the term analytic features.
We decided to compare patches of gray-scale images, for their simplicity, SIFT
descriptors, for their known invariance, and patches of the output of the feed-
forward hierarchy in [3], because of the biological background.
A SIFT descriptor as proposed in [11] describes a gray-scale patch of 16x16
pixels using a grid of 4x4 gradient-histograms (see Fig. 1c). Each histogram in
the grid is made up of eight orientation bins. The magnitude of the gradient at a
certain pixel is distributed in a bilinear fashion over the neighboring histograms
(in general four), where the orientation of the gradient determines the bin. The
gradient magnitudes are scaled with a Gaussian that is centered on the patch, in
this way reducing the influence of border pixels. Prior to the calculation of the
histogram grid a single histogram with a higher number of orientation bins is
computed for the whole patch. The maximum activated bin in this histogram is
used to normalize the rotation of the patch in advance. Finally the energy of the
whole descriptor is normalized to obtain invariance to illumination. In contrast
to [11], we do not extract SIFT descriptors at a small number of interesting
keypoints, but for all locations where at least a minimum of structure is present.
In this way only uniform, dark background is neglected and on the category
scenario in Fig. 3 on average one third of all descriptors is kept. We reduce the
number of descriptors for each image by applying a k-means algorithm with
200 components. A similar cluster step was also done in [15] to improve the
generalization performance of the otherwise very specific SIFT descriptors.
For the gray-scale patches we decided to use the same patch size as for the
SIFT approach and the influence of the pixels is also weighted with a Gaussian
that is centered on the patch.
The feedforward hierarchy proposed in [3] is shown in Fig. 1a. The S1-layer
computes the magnitudes of the response to four differently oriented gabor filters.
This activation is pooled to a lower resolution in the C1-layer performing a local
OR-operation. The 50 features used in S2 are trained as to efficiently reconstruct
a large set of random 4x4x4 C1-patches from natural images and are therefore
sensitive to local patterns in C1. Layer C2 performs a further pooling operation
and is the output of the hierarchy. Columns of 2x2 pixels are cut from the C2-
layer as shown in Fig. 1b and used as feature candidates. Because of the two
pooling layers, which offer a small degree of invariance to translation, a column of
2x2 pixels in C2 corresponds roughly to a patch of 16x16 pixels in the gray-scale
image.
We will refer to the parts-based approaches as GRAY-P, SIFT-P, and C2-P.
For SIFT-P each image i is described by the J =4× 4 × 8 = 128 dimensional
representatives of the 200 k-means clusters p
in
, n =1...200. For GRAY-P the
p
in
are the patches of image i at all distinct positions n (J =16× 16 = 256).
Similar to this for C2 each p
in
is a column through the feature maps of image
i at a distinct position n asshowninFig.1b(J =2× 2 × 50 = 200). The p
in
show a large variety. Therefore we will use all p
in
directly as feature candidates
w
m
,wherem is an index over all combinations of i and n, and select a subset

214 S. Hasler, H. Wersing, and E. orner
I
S1
C1
S2
Pooling
Input
4 Gabors
50 Combinations
C2
Pooling
C2
16x16 Pixels
a)
b)
c)
50 Planes
Fig. 1. a) Feedforward hierarchy in [3]. b) Columns of C2-layer are used as local fea-
tures. c) SIFT descriptor [11] is grid of gradient histograms each with 8 orientations.
of those candidates with a strategy that is described later. The response r
mi
of
feature w
m
on the image i is given by:
r
mi
=max
n
(G(w
m
, p
in
)) . (1)
For GRAY-P G(w
m
, p
in
)=
J
j=1
h
j
(w
j
m
w
m
)(p
j
in
p
in
)
j
h
j
(w
j
m
w
m
)
2
j
h
j
(p
j
in
p
in
)
2
is used which is
the normalized cross-correlation, where
w
m
and p
in
are the means of vector
w
m
and p
in
respectively, and h
j
is a weighting which decreases the influence
of border pixels with a Gaussian. For C2-P the negative Euclidean distance
G(w
m
, p
in
)=
J
j=1
(w
j
m
p
j
in
)
2
shows better performance because of the
sparseness in this layer. The similarity between SIFT descriptors is given by their
dot product G(w
m
, p
in
)=
J
j=1
w
j
m
p
j
in
. The maximum activation per image is
chosen as response and spatial information is neglected.
Reflecting the remarks on feature selection given above, we decided to use
the following strategy: First we determine which views of a certain category
each individual candidate feature w
m
can separate. Therefore we compute the
response r
mi
for every training image with (1). Then the minimal threshold t
m
is chosen that guarantees that all images with r
mi
above or equal to t
m
belong
the same category (see Fig. 2):
t
m
=min
t|∀
i|r
mi
t
j|r
mj
t
l
i
= l
j
. (2)
Here l
i
denotes the category label of image i. The images separated by the
threshold is assigned a constant score s
mi
= k with respect to the feature w
m
.

Citations
More filters
Journal ArticleDOI
TL;DR: To achieve the life-long learning ability for a cognitive system, a new learning vector quantization approach combined with a category-specific feature selection method to allow several metrical "views" on the representation space of each individual vector quantification node.

53 citations


Cites methods from "A comparison of features in parts-b..."

  • ...The parts-based feature detection (see Hasler et al. (2007) for details) is based on a preselected set of SIFT-descriptors (Lowe, 2004), which are designed to be invariant with regard to rotations in the image plane....

    [...]

Journal ArticleDOI
TL;DR: This work presents an integrated vision architecture capable of incrementally learning several visual categories based on natural hand-held objects and imposes no restrictions on the viewing angle of presented objects, relaxing the common constraint on canonical views.
Abstract: We present an integrated vision architecture capable of incrementally learning several visual categories based on natural hand-held objects. Additionally we focus on interactive learning, which requires real-time image processing methods and a fast learning algorithm. The overall system is composed of a figure-ground segregation part, several feature extraction methods and a life-long learning approach combining incremental learning with category-specific feature selection. In contrast to most visual categorization approaches, where typically each view is assigned to a single category, we allow labeling with an arbitrary number of shape and color categories. We also impose no restrictions on the viewing angle of presented objects, relaxing the common constraint on canonical views.

30 citations

Patent
Andreas Knoblauch1
15 Dec 2009
TL;DR: In this paper, a neural associative memory structure for storing and maintaining associations between memory address patterns and memory content patterns using a neural network is presented, as well as methods for retrieving such associations.
Abstract: This invention is in the field of machine learning and neural associative memory. In particular the invention discloses a neural associative memory structure for storing and maintaining associations between memory address patterns and memory content patterns using a neural network, as well as methods for retrieving such associations. A method for a non-linear synaptic learning of discrete synapses is disclosed, and its application on neural networks is laid out.

25 citations

Patent
Andreas Knoblauch1
28 May 2010
TL;DR: In this paper, a neural associative memory structure for storing and maintaining associations between memory address patterns and memory content patterns using a neural network, as well as methods for storing, retrieving and storing such associations.
Abstract: This invention is in the field of machine learning and neural associative memory. In particular the invention discloses a neural associative memory structure for storing and maintaining associations between memory address patterns and memory content patterns using a neural network, as well as methods for storing and retrieving such associations. Bayesian learning is applied to achieve non-linear learning.

25 citations

Proceedings ArticleDOI
18 Jul 2010
TL;DR: An exemplar-based learning approach for incremental and life-long learning of visual categories and it is argued that contextual information is beneficial for this process.
Abstract: We present an exemplar-based learning approach for incremental and life-long learning of visual categories. The basic concept of the proposed learning method is to subdivide the learning process into two phases. In the first phase we utilize supervised learning to generate an appropriate category seed, while in the second phase this seed is used to autonomously bootstrap the visual representation. This second learning phase is especially useful for assistive systems like a mobile robot, because the visual knowledge can be enhanced even if no tutor is present. Although for this autonomous bootstrapping no category labels are provided, we argue that contextual information is beneficial for this process. Finally we investigate the effect of the proposed second learning phase with respect to the overall categorization performance.

11 citations

References
More filters
01 Feb 2005
TL;DR: A scale-invariant feature selection method that learns to recognize and detect object classes from images of natural scenes that uses local regions to realize robust and sparse part and texture selection invariant to changes in scale, orientation and affine deformation.
Abstract: In this paper, we introduce a scale-invariant feature selection method that learns to recognize and detect object classes from images of natural scenes The first step of our method consists of clustering local scale-invariant descriptors to characterize object class appearance Next, we train on the groups, and perform feature selection to determine the most discriminative parts We use local regions to realize robust and sparse part and texture selection invariant to changes in scale, orientation and affine deformation and, as a result, we avoid image normalization in both training and prediction phases We train our object models without requiring image parts to be labeled or objects to be separated from the background Moreover, our method continues to work well when images have cluttered background and occluded objects We evaluate our method on seven recently proposed datasets, and quantitatively compare the effect of different types of local regions and feature selection criteria on object recognition Our experiments show that local invariant descriptors are an appropriate representation for many different object classes Our results also confirm the importance of appearance-based discriminative feature selection

149 citations


"A comparison of features in parts-b..." refers background or methods in this paper

  • ...A similar cluster step was also done in [15] to improve the generalization performance of the otherwise very specific SIFT descriptors....

    [...]

  • ...in [15], where highly informative clusters of SIFT descriptors are used....

    [...]

BookDOI
01 Jan 2005
TL;DR: The Computational Model to Simulate the Progress of Perceiving Patterns in Neuron Population and the Development of Cognitive Powers in Embodied Systems is presented.
Abstract: Modeling the Brain and Cognitive Functions.- Novelty Analysis in Dynamic Scene for Autonomous Mental Development.- The Computational Model to Simulate the Progress of Perceiving Patterns in Neuron Population.- Short Term Memory and Pattern Matching with Simple Echo State Networks.- Analytical Solution for Dynamic of Neuronal Populations.- Dynamics of Cortical Columns - Sensitive Decision Making.- Dynamics of Cortical Columns - Self-organization of Receptive Fields.- Optimal Information Transmission Through Cortico-Cortical Synapses.- Ensemble of SVMs for Improving Brain Computer Interface P300 Speller Performances.- Modelling Path Integrator Recalibration Using Hippocampal Place Cells.- Coding of Objects in Low-Level Visual Cortical Areas.- A Gradient Rule for the Plasticity of a Neuron's Intrinsic Excitability.- Building the Cerebellum in a Computer.- Special Session: The Development of Cognitive Powers in Embodied Systems.- Combining Attention and Value Maps.- Neural Network with Memory and Cognitive Functions.- Associative Learning in Hierarchical Self Organizing Learning Arrays.- A Review of Cognitive Processing in the Brain.- Spiking Neural Networks.- Neuronal Behavior with Sub-threshold Oscillations and Spiking/Bursting Activity Using a Piecewise Linear Two-Dimensional Map.- On-Line Real-Time Oriented Application for Neuronal Spike Sorting with Unsupervised Learning.- A Spiking Neural Sparse Distributed Memory Implementation for Learning and Predicting Temporal Sequences.- ANN-Based System for Sorting Spike Waveforms Employing Refractory Periods.- Emergence of Oriented Cell Assemblies Associated with Spike-Timing-Dependent Plasticity.- An Information Geometrical Analysis of Neural Spike Sequences.- Perceptual Binding by Coupled Oscillatory Neural Network.- Experimental Demonstration of Learning Properties of a New Supervised Learning Method for the Spiking Neural Networks.- Single-Unit Recordings Revisited: Activity in Recurrent Microcircuits.- A Hardware/Software Framework for Real-Time Spiking Systems.- Efficient Source Detection Using Integrate-and-Fire Neurons.- Associative Memory Models.- A Model for Hierarchical Associative Memories via Dynamically Coupled GBSB Neural Networks.- Balance Algorithm for Point-Feature Label Placement Problem.- Models of Self-correlation Type Complex-Valued Associative Memories and Their Dynamics.- Recovery of Performance in a Partially Connected Associative Memory Network Through Coding.- Optimal Triangle Stripifications as Minimum Energy States in Hopfield Nets.- Models of Biological Functions.- A Biophysical Model of Decision Making in an Antisaccade Task Through Variable Climbing Activity.- Can Dynamic Neural Filters Produce Pseudo-Random Sequences?.- Making Competition in Neural Fields Suitable for Computational Architectures.- Neural Network Computations with Negative Triggering Thresholds.- A Model for Delay Activity Without Recurrent Excitation.- Neuronal Coding Strategies for Two-Alternative Forced Choice Tasks.- Learning Features of Intermediate Complexity for the Recognition of Biological Motion.- Study of Nitric Oxide Effect in the Hebbian Learning: Towards a Diffusive Hebb's Law.- Special Session: Projects in the Area of NeuroIT.- Deterministic Modelling of Randomness with Recurrent Artificial Neural Networks.- Action Understanding and Imitation Learning in a Robot-Human Task.- Comparative Investigation into Classical and Spiking Neuron Implementations on FPGAs.- HYDRA: From Cellular Biology to Shape-Changing Artefacts.- The CIRCE Head: A Biomimetic Sonar System.- Tools for Address-Event-Representation Communication Systems and Debugging.- New Ears for a Robot Cricket.- Reinforcement Learning in MirrorBot.- Evolutionary and Other Biological Inspirations.- Varying the Population Size of Artificial Foraging Swarms on Time Varying Landscapes.- Lamarckian Clonal Selection Algorithm with Application.- Analysis for Characteristics of GA-Based Learning Method of Binary Neural Networks.- Immune Clonal Selection Wavelet Network Based Intrusion Detection.- Investigation of Evolving Populations of Adaptive Agents.- Enhancing Cellular Automata by an Embedded Generalized Multi-layer Perceptron.- Intelligent Pattern Generation for a Tactile Communication System.- Self-organizing Maps and Their Applications.- Self-organizing Map Initialization.- Principles of Employing a Self-organizing Map as a Frequent Itemset Miner.- Spatio-Temporal Organization Map: A Speech Recognition Application.- Residual Activity in the Neurons Allows SOMs to Learn Temporal Order.- Ordering of the RGB Space with a Growing Self-organizing Network. Application to Color Mathematical Morphology.- SOM of SOMs: Self-organizing Map Which Maps a Group of Self-organizing Maps.- The Topographic Product of Experts.- Self Organizing Map (SOM) Approach for Classification of Power Quality Events.- SOM-Based Method for Process State Monitoring and Optimization in Fluidized Bed Energy Plant.- A New Extension of Self-optimizing Neural Networks for Topology Optimization.- A Novel Technique for Data Visualization Based on SOM.- Statistical Properties of Lattices Affect Topographic Error in Self-organizing Maps.- Increasing Reliability of SOMs' Neighbourhood Structure with a Bootstrap Process.- Computer Vision.- Artificial Neural Receptive Field for Stereovision.- Pattern Detection Using Fast Normalized Neural Networks.- Neural Network Model for Extracting Optic Flow.- A Modular Single-Hidden-Layer Perceptron for Letter Recognition.- Fast Color-Based Object Recognition Independent of Position and Orientation.- Class-Specific Sparse Coding for Learning of Object Representations.- Neural Network Based Adult Image Classification.- Online Learning for Object Recognition with a Hierarchical Visual Cortex Model.- Extended Hopfield Network for Sequence Learning: Application to Gesture Recognition.- Accurate and Robust Image Superresolution by Neural Processing of Local Image Representations.- The Emergence of Visual Object Recognition.- Implicit Relevance Feedback from Eye Movements.- Image Segmentation by Complex-Valued Units.- Cellular Neural Networks for Color Image Segmentation.- Image Segmentation Using Watershed Transform and Feed-Back Pulse Coupled Neural Network.- Adaptive Switching Median Filter with Neural Network Impulse Detection Step.- Face Recognition and Detection.- Human Face Detection Using New High Speed Modular Neural Networks.- Face Detection Using Convolutional Neural Networks and Gabor Filters.- Face Identification Performance Using Facial Expressions as Perturbation.- Discriminative Common Images for Face Recognition.- Classification of Face Images for Gender, Age, Facial Expression, and Identity.- Sound and Speech Recognition.- Classifying Unprompted Speech by Retraining LSTM Nets.- Temporal Sound Processing by Cochlear Nucleus Octopus Neurons.- A SOM Based 2500 - Isolated - Farsi - Word Speech Recognizer.- Training HMM/ANN Hybrid Speech Recognizers by Probabilistic Sampling.- Chord Classifications by Artificial Neural Networks Revisited: Internal Representations of Circles of Major Thirds and Minor Thirds.- Bioinformatics.- Biclustering Gene Expression Data in the Presence of Noise.- Gene Extraction for Cancer Diagnosis by Support Vector Machines.- High-Throughput Multi-dimensional Scaling (HiT-MDS) for cDNA-Array Expression Data.- Biomedical Applications.- Comparing Neural Network Architecture for Pattern Recognize System on Artificial Noses.- Medical Document Categorization Using a Priori Knowledge.- A Neurofuzzy Methodology for the Diagnosis of Wireless-Capsule Endoscopic Images.- Neural Network Use for the Identification of Factors Related to Common Mental Disorders.- Development and Realization of the Artificial Neural Network for Diagnostics of Stroke Type.- Special Session: Information-Theoretic Concepts in Biomedical Data Analysis.- A First Attempt at Constructing Genetic Programming Expressions for EEG Classification.- SOM-Based Wavelet Filtering for the Exploration of Medical Images.- Functional MRI Analysis by a Novel Spatiotemporal ICA Algorithm.- Early Detection of Alzheimer's Disease by Blind Source Separation, Time Frequency Representation, and Bump Modeling of EEG Signals.

70 citations

Book
01 Jan 2005
TL;DR: In this article, the authors present a model for real-time spiking in the brain using an ANN-based system for Sorting Spike Waveforms employing Refractory Periods.
Abstract: Modeling the Brain and Cognitive Functions.- Novelty Analysis in Dynamic Scene for Autonomous Mental Development.- The Computational Model to Simulate the Progress of Perceiving Patterns in Neuron Population.- Short Term Memory and Pattern Matching with Simple Echo State Networks.- Analytical Solution for Dynamic of Neuronal Populations.- Dynamics of Cortical Columns - Sensitive Decision Making.- Dynamics of Cortical Columns - Self-organization of Receptive Fields.- Optimal Information Transmission Through Cortico-Cortical Synapses.- Ensemble of SVMs for Improving Brain Computer Interface P300 Speller Performances.- Modelling Path Integrator Recalibration Using Hippocampal Place Cells.- Coding of Objects in Low-Level Visual Cortical Areas.- A Gradient Rule for the Plasticity of a Neuron's Intrinsic Excitability.- Building the Cerebellum in a Computer.- Special Session: The Development of Cognitive Powers in Embodied Systems.- Combining Attention and Value Maps.- Neural Network with Memory and Cognitive Functions.- Associative Learning in Hierarchical Self Organizing Learning Arrays.- A Review of Cognitive Processing in the Brain.- Spiking Neural Networks.- Neuronal Behavior with Sub-threshold Oscillations and Spiking/Bursting Activity Using a Piecewise Linear Two-Dimensional Map.- On-Line Real-Time Oriented Application for Neuronal Spike Sorting with Unsupervised Learning.- A Spiking Neural Sparse Distributed Memory Implementation for Learning and Predicting Temporal Sequences.- ANN-Based System for Sorting Spike Waveforms Employing Refractory Periods.- Emergence of Oriented Cell Assemblies Associated with Spike-Timing-Dependent Plasticity.- An Information Geometrical Analysis of Neural Spike Sequences.- Perceptual Binding by Coupled Oscillatory Neural Network.- Experimental Demonstration of Learning Properties of a New Supervised Learning Method for the Spiking Neural Networks.- Single-Unit Recordings Revisited: Activity in Recurrent Microcircuits.- A Hardware/Software Framework for Real-Time Spiking Systems.- Efficient Source Detection Using Integrate-and-Fire Neurons.- Associative Memory Models.- A Model for Hierarchical Associative Memories via Dynamically Coupled GBSB Neural Networks.- Balance Algorithm for Point-Feature Label Placement Problem.- Models of Self-correlation Type Complex-Valued Associative Memories and Their Dynamics.- Recovery of Performance in a Partially Connected Associative Memory Network Through Coding.- Optimal Triangle Stripifications as Minimum Energy States in Hopfield Nets.- Models of Biological Functions.- A Biophysical Model of Decision Making in an Antisaccade Task Through Variable Climbing Activity.- Can Dynamic Neural Filters Produce Pseudo-Random Sequences?.- Making Competition in Neural Fields Suitable for Computational Architectures.- Neural Network Computations with Negative Triggering Thresholds.- A Model for Delay Activity Without Recurrent Excitation.- Neuronal Coding Strategies for Two-Alternative Forced Choice Tasks.- Learning Features of Intermediate Complexity for the Recognition of Biological Motion.- Study of Nitric Oxide Effect in the Hebbian Learning: Towards a Diffusive Hebb's Law.- Special Session: Projects in the Area of NeuroIT.- Deterministic Modelling of Randomness with Recurrent Artificial Neural Networks.- Action Understanding and Imitation Learning in a Robot-Human Task.- Comparative Investigation into Classical and Spiking Neuron Implementations on FPGAs.- HYDRA: From Cellular Biology to Shape-Changing Artefacts.- The CIRCE Head: A Biomimetic Sonar System.- Tools for Address-Event-Representation Communication Systems and Debugging.- New Ears for a Robot Cricket.- Reinforcement Learning in MirrorBot.- Evolutionary and Other Biological Inspirations.- Varying the Population Size of Artificial Foraging Swarms on Time Varying Landscapes.- Lamarckian Clonal Selection Algorithm with Application.- Analysis for Characteristics of GA-Based Learning Method of Binary Neural Networks.- Immune Clonal Selection Wavelet Network Based Intrusion Detection.- Investigation of Evolving Populations of Adaptive Agents.- Enhancing Cellular Automata by an Embedded Generalized Multi-layer Perceptron.- Intelligent Pattern Generation for a Tactile Communication System.- Self-organizing Maps and Their Applications.- Self-organizing Map Initialization.- Principles of Employing a Self-organizing Map as a Frequent Itemset Miner.- Spatio-Temporal Organization Map: A Speech Recognition Application.- Residual Activity in the Neurons Allows SOMs to Learn Temporal Order.- Ordering of the RGB Space with a Growing Self-organizing Network. Application to Color Mathematical Morphology.- SOM of SOMs: Self-organizing Map Which Maps a Group of Self-organizing Maps.- The Topographic Product of Experts.- Self Organizing Map (SOM) Approach for Classification of Power Quality Events.- SOM-Based Method for Process State Monitoring and Optimization in Fluidized Bed Energy Plant.- A New Extension of Self-optimizing Neural Networks for Topology Optimization.- A Novel Technique for Data Visualization Based on SOM.- Statistical Properties of Lattices Affect Topographic Error in Self-organizing Maps.- Increasing Reliability of SOMs' Neighbourhood Structure with a Bootstrap Process.- Computer Vision.- Artificial Neural Receptive Field for Stereovision.- Pattern Detection Using Fast Normalized Neural Networks.- Neural Network Model for Extracting Optic Flow.- A Modular Single-Hidden-Layer Perceptron for Letter Recognition.- Fast Color-Based Object Recognition Independent of Position and Orientation.- Class-Specific Sparse Coding for Learning of Object Representations.- Neural Network Based Adult Image Classification.- Online Learning for Object Recognition with a Hierarchical Visual Cortex Model.- Extended Hopfield Network for Sequence Learning: Application to Gesture Recognition.- Accurate and Robust Image Superresolution by Neural Processing of Local Image Representations.- The Emergence of Visual Object Recognition.- Implicit Relevance Feedback from Eye Movements.- Image Segmentation by Complex-Valued Units.- Cellular Neural Networks for Color Image Segmentation.- Image Segmentation Using Watershed Transform and Feed-Back Pulse Coupled Neural Network.- Adaptive Switching Median Filter with Neural Network Impulse Detection Step.- Face Recognition and Detection.- Human Face Detection Using New High Speed Modular Neural Networks.- Face Detection Using Convolutional Neural Networks and Gabor Filters.- Face Identification Performance Using Facial Expressions as Perturbation.- Discriminative Common Images for Face Recognition.- Classification of Face Images for Gender, Age, Facial Expression, and Identity.- Sound and Speech Recognition.- Classifying Unprompted Speech by Retraining LSTM Nets.- Temporal Sound Processing by Cochlear Nucleus Octopus Neurons.- A SOM Based 2500 - Isolated - Farsi - Word Speech Recognizer.- Training HMM/ANN Hybrid Speech Recognizers by Probabilistic Sampling.- Chord Classifications by Artificial Neural Networks Revisited: Internal Representations of Circles of Major Thirds and Minor Thirds.- Bioinformatics.- Biclustering Gene Expression Data in the Presence of Noise.- Gene Extraction for Cancer Diagnosis by Support Vector Machines.- High-Throughput Multi-dimensional Scaling (HiT-MDS) for cDNA-Array Expression Data.- Biomedical Applications.- Comparing Neural Network Architecture for Pattern Recognize System on Artificial Noses.- Medical Document Categorization Using a Priori Knowledge.- A Neurofuzzy Methodology for the Diagnosis of Wireless-Capsule Endoscopic Images.- Neural Network Use for the Identification of Factors Related to Common Mental Disorders.- Development and Realization of the Artificial Neural Network for Diagnostics of Stroke Type.- Special Session: Information-Theoretic Concepts in Biomedical Data Analysis.- A First Attempt at Constructing Genetic Programming Expressions for EEG Classification.- SOM-Based Wavelet Filtering for the Exploration of Medical Images.- Functional MRI Analysis by a Novel Spatiotemporal ICA Algorithm.- Early Detection of Alzheimer's Disease by Blind Source Separation, Time Frequency Representation, and Bump Modeling of EEG Signals.

70 citations

Journal ArticleDOI
TL;DR: The results of the study suggest that the capacity of the recognition system to generalize to novel conditions in an efficient and flexible manner depends on the ongoing extraction of different families of informative features, acquired for different tasks and different object classes.

52 citations


"A comparison of features in parts-b..." refers background in this paper

  • ...The most related work in the direction of analytic features was done in [16], where Ullman introduced invariance over viewpoint in his fragments approach,...

    [...]

Book ChapterDOI
11 Sep 2005
TL;DR: Two new methods which extend the traditional sparse coding approach with supervised components are presented to increase the suitability of the learned features for classification tasks while keeping most of their general representation performance.
Abstract: We present two new methods which extend the traditional sparse coding approach with supervised components. The goal of these extensions is to increase the suitability of the learned features for classification tasks while keeping most of their general representation performance. A special visualization is introduced which allows to show the principal effect of the new methods. Furthermore some first experimental results are obtained for the COIL-100 database.

6 citations


"A comparison of features in parts-b..." refers methods in this paper

  • ...In contrary to PCA other methods produce so called parts-based features like the nonnegative matrix factorization (NMF) proposed in [8] or a similar scheme proposed in [9] yielding more class-specific features....

    [...]

Frequently Asked Questions (2)
Q1. What are the contributions mentioned in the paper "A comparison of features in parts-based object recognition hierarchies" ?

In this paper the authors present the results of a comparative investigation of different feature types with regard to their suitability for category discrimination. The authors discuss the conceptual differences, resulting performance and consequences for hierarchical models of visual recognition. 

Besides the normalization of rotation for SIFT, it would be interesting to investigate other reasons for the differences in performance in future work. Since both approaches have not been applied to scenarios with multiple categories, the authors hope that their comparative study provides further helpful inside into parts-based 3D object recognition.