scispace - formally typeset
Proceedings ArticleDOI

Real-time Instance Detection with Fast Incremental Learning

TLDR
In this article, an ensemble of efficient single-class instance detectors capable of fast and incremental adaptation to new object sets is proposed, which can be obtained within less than 40 minutes on a consumer GPU while only a small percentage of the existing detection models need to be updated.
Abstract
Object instance detection is a highly relevant task to several robotic applications such as automated order picking, or household and hospital assistance robots. In these applications, a holistic scene labeling is often not required whereas it is sufficient to find a certain object type of interest, e.g. for picking it up. At the same time, large and continuously changing object sets are characteristic in such applications, requiring efficient model update capabilities from the object detector. Today’s monolithic multi-class detectors do not fulfill this criterion for fast and flexible model updates.This paper introduces InstanceNet, an ensemble of efficient single-class instance detectors capable of fast and incremental adaptation to new object sets. Due to a dynamic sampling-based training strategy, accurate detection models for new objects can be obtained within less than 40 minutes on a consumer GPU while only a small percentage of the existing detection models needs to be updated in a very efficient manner. The new detector has been thoroughly evaluated on the basis of a novel dataset of 100 grocery store objects.

read more

Citations
More filters
Proceedings ArticleDOI

Fast Hierarchical Learning for Few-Shot Object Detection

TL;DR: This work poses few-shot detection as a hierarchical learning problem, where the novel classes are treated as the child classes of existing base classes and the background class, leading to lower training times compared to stochastic gradient descent.
Journal ArticleDOI

Data-Driven Robotic Tactile Grasping for Hyper-Personalization Line Pick-and-Place

TL;DR: In this article , the Rochu two-fingered soft gripper with customized force-sensing resistor (FSR) force sensors mounted on the fingertips was used to improve the robotic grasping capability for novel objects.
Proceedings ArticleDOI

Fast Hierarchical Learning for Few-Shot Object Detection

TL;DR: In this paper , the authors pose few-shot detection as a hierarchical learning problem, where the novel classes are treated as the child classes of existing base classes and the background class.
References
More filters
Proceedings ArticleDOI

Deep Residual Learning for Image Recognition

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.
Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

TL;DR: The state-of-the-art performance of CNNs was achieved by Deep Convolutional Neural Networks (DCNNs) as discussed by the authors, which consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.
Journal ArticleDOI

Distinctive Image Features from Scale-Invariant Keypoints

TL;DR: This paper presents a method for extracting distinctive invariant features from images that can be used to perform reliable matching between different views of an object or scene and can robustly identify objects among clutter and occlusion while achieving near real-time performance.
Proceedings ArticleDOI

You Only Look Once: Unified, Real-Time Object Detection

TL;DR: Compared to state-of-the-art detection systems, YOLO makes more localization errors but is less likely to predict false positives on background, and outperforms other detection methods, including DPM and R-CNN, when generalizing from natural images to other domains like artwork.
Proceedings ArticleDOI

Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation

TL;DR: RCNN as discussed by the authors combines CNNs with bottom-up region proposals to localize and segment objects, and when labeled training data is scarce, supervised pre-training for an auxiliary task, followed by domain-specific fine-tuning, yields a significant performance boost.
Related Papers (5)