
Showing papers by "Emmanuel Dellandréa published in 2018"


Proceedings ArticleDOI
29 Mar 2018
TL;DR: The Jacquard dataset as mentioned in this paper is a large-scale synthetic dataset with ground truth, which contains both RGB-D images and annotations of successful grasping positions based on grasp attempts performed in a simulated environment.
Abstract: Grasping skill is a major ability that many real-life applications require for robotisation. State-of-the-art robotic grasping methods predict object grasp locations with deep neural networks. However, such networks require huge amounts of labeled training data, which often makes this approach impracticable in robotics. In this paper, we propose a method to generate a large-scale synthetic dataset with ground truth, which we refer to as the Jacquard grasping dataset. Jacquard is built on a subset of ShapeNet, a large CAD model dataset, and contains both RGB-D images and annotations of successful grasping positions based on grasp attempts performed in a simulated environment. We carried out experiments using an off-the-shelf CNN, with three different evaluation metrics, including real robot grasping trials. The results show that Jacquard enables much better generalization than a human-labeled dataset, thanks to its diversity of objects and grasping positions. For the purpose of reproducible research in robotics, we are releasing, along with the Jacquard dataset, a web interface for researchers to evaluate the success of their grasping position detections on our dataset.
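A common correctness criterion in grasp-detection work of this kind is the rectangle metric: a predicted grasp counts as correct when its orientation is within 30 degrees of a ground-truth grasp and the Jaccard (IoU) index of the two rectangles exceeds 0.25. The sketch below illustrates that criterion under two simplifying assumptions not taken from the paper: boxes are axis-aligned (real grasp rectangles are oriented), and the thresholds are the conventional 0.25/30° values rather than values confirmed by this abstract.

```python
def iou(box_a, box_b):
    """Jaccard index of two axis-aligned boxes given as (x1, y1, x2, y2).
    Simplification: real grasp rectangles are oriented, not axis-aligned."""
    x1 = max(box_a[0], box_b[0]); y1 = max(box_a[1], box_b[1])
    x2 = min(box_a[2], box_b[2]); y2 = min(box_a[3], box_b[3])
    inter = max(0.0, x2 - x1) * max(0.0, y2 - y1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union else 0.0

def grasp_is_correct(pred_box, pred_angle, gt_box, gt_angle,
                     iou_thresh=0.25, angle_thresh=30.0):
    """Rectangle metric: both the angle difference (modulo 180 degrees,
    since a gripper is symmetric) and the overlap must pass a threshold."""
    angle_diff = abs((pred_angle - gt_angle + 90.0) % 180.0 - 90.0)
    return angle_diff <= angle_thresh and iou(pred_box, gt_box) >= iou_thresh
```

The modulo-180 angle comparison reflects that a parallel-jaw grasp rotated by 180 degrees is the same grasp.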

203 citations


Journal ArticleDOI
TL;DR: A new discriminative TL (DTL) method is proposed, combining a series of hypotheses made by both the model learned with target training samples and the additional models learned with source category samples to improve classifier performance.
Abstract: Transfer learning (TL) aims at solving the problem of learning an effective classification model for a target category, which has few training samples, by leveraging knowledge from source categories with far more training data. We propose a new discriminative TL (DTL) method, combining a series of hypotheses made by both the model learned with target training samples and additional models learned with source category samples. Specifically, we use the sparse reconstruction residual as a basic discriminant and enhance its discriminative power by comparing two residuals from a positive and a negative dictionary. On this basis, we make use of similarities and dissimilarities by choosing both positively correlated and negatively correlated source categories to form additional dictionaries. A new Wilcoxon–Mann–Whitney statistic-based cost function is proposed to choose the additional dictionaries with unbalanced training data. Also, two parallel boosting processes are applied to both the positive and negative data distributions to further improve classifier performance. On two different image classification databases, the proposed DTL consistently outperforms other state-of-the-art TL methods while maintaining a very efficient runtime.
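The core discriminant described above can be sketched in a few lines: reconstruct a sample from a positive and a negative dictionary and compare the two residuals. This is only an illustration of the idea, not the paper's method; in particular, plain least-squares coding stands in for true sparse coding (e.g. OMP or Lasso) to keep the sketch dependency-free.

```python
import numpy as np

def reconstruction_residual(x, dictionary):
    """Residual norm when reconstructing sample x from a dictionary whose
    columns are atoms. The paper uses sparse coding; least-squares is a
    stand-in here for illustration."""
    coeffs, *_ = np.linalg.lstsq(dictionary, x, rcond=None)
    return np.linalg.norm(x - dictionary @ coeffs)

def discriminant(x, pos_dict, neg_dict):
    """Positive score when x is reconstructed better (smaller residual)
    by the positive dictionary than by the negative one."""
    return (reconstruction_residual(x, neg_dict)
            - reconstruction_residual(x, pos_dict))
```

In the paper's setting, the positive and negative dictionaries would be augmented with atoms from positively and negatively correlated source categories, which is where the transfer happens.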

76 citations


Journal ArticleDOI
TL;DR: Strong evidence is found that visual similarity and semantic relatedness are complementary for the task, and when combined notably improve detection, achieving state-of-the-art detection performance in a semi-supervised setting.
Abstract: Deep CNN-based object detection systems have achieved remarkable success on several large-scale object detection benchmarks. However, training such detectors requires a large number of labeled bounding boxes, which are more difficult to obtain than image-level annotations. Previous work addresses this issue by transforming image-level classifiers into object detectors. This is done by modeling the differences between the two on categories with both image-level and bounding box annotations, and transferring this information to convert classifiers to detectors for categories without bounding box annotations. We improve on this previous work by incorporating knowledge about object similarities from the visual and semantic domains during the transfer process. The intuition behind our proposed method is that visually and semantically similar categories should exhibit more common transferable properties than dissimilar ones; e.g., a better cat detector results from transferring the differences between a dog classifier and a dog detector than from transferring those of the violin class. Experimental results on the challenging ILSVRC2013 detection dataset demonstrate that each of our proposed object similarity based knowledge transfer methods outperforms the baseline methods. We found strong evidence that visual similarity and semantic relatedness are complementary for the task and, when combined, notably improve detection, achieving state-of-the-art performance in a semi-supervised setting.
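The transfer intuition in the abstract can be written down as a small sketch: take the classifier-to-detector weight difference of each source category and combine them, weighted by similarity to the target. All names and the exact weighting scheme here are hypothetical; the paper's actual transfer functions are more elaborate.

```python
import numpy as np

def transfer_detector(target_clf, src_clfs, src_dets, similarities):
    """Hypothetical sketch of similarity-weighted classifier-to-detector
    transfer: each source category contributes its (detector - classifier)
    weight difference, weighted by its visual/semantic similarity to the
    target category."""
    sims = np.asarray(similarities, dtype=float)
    weights = sims / sims.sum()          # normalise similarities
    diffs = np.asarray(src_dets) - np.asarray(src_clfs)
    return np.asarray(target_clf) + weights @ diffs
```

With this formulation, a highly similar source (dog, for a cat target) dominates the transferred correction, while a dissimilar one (violin) contributes little, which is exactly the intuition stated above.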

66 citations


Journal ArticleDOI
TL;DR: A multidisciplinary state-of-the-art for affective movie content analysis is given, in order to promote and encourage exchanges between researchers from a very wide range of fields.
Abstract: In our present society, the cinema has become one of the major forms of entertainment, providing unlimited contexts of emotion elicitation for the emotional needs of human beings. Since emotions are universal and shape all aspects of our interpersonal and intellectual experience, they have proved to be a highly multidisciplinary research field, ranging from psychology, sociology, and neuroscience to computer science. However, affective multimedia content analysis work from the computer science community benefits only marginally from the progress achieved in these other research fields. In this paper, a multidisciplinary state of the art for affective movie content analysis is given, in order to promote and encourage exchanges between researchers from a very wide range of fields. In contrast to other state-of-the-art papers on affective video content analysis, this work confronts the ideas and models of psychology, sociology, neuroscience, and computer science. The concepts of aesthetic emotions and emotion induction, as well as the different representations of emotions, are introduced, based on psychological and sociological theories. Previous global and continuous affective video content analysis work, including video emotion recognition and violence detection, is also presented in order to point out its limitations.

65 citations



Book ChapterDOI
03 Jan 2018
TL;DR: In this paper, a fully convolutional object contour detector is used for instance segmentation in top views of piles of bulk objects given a pixel location, provided interactively by a human operator.
Abstract: With more and more household objects built on planned obsolescence and consumed by a fast-growing population, hazardous waste recycling has become a critical challenge. Given the large variability of household waste, current recycling platforms mostly rely on human operators to analyze the scene, typically composed of many object instances piled up in bulk. Helping them by robotizing the unitary extraction is a key challenge to speed up this tedious process. Whereas supervised deep learning has proven very efficient for such object-level scene understanding, e.g., generic object detection and segmentation in everyday scenes, it requires large sets of per-pixel labeled images that are hardly available for numerous application contexts, including industrial robotics. We thus propose a step towards a practical interactive application for generating an object-oriented robotic grasp, requiring as inputs only one depth map of the scene and one user click on the next object to extract. More precisely, we address in this paper the intermediate problem of object segmentation in top views of piles of bulk objects, given a pixel location, namely a seed, provided interactively by a human operator. We propose a twofold framework for generating edge-driven instance segments. First, we repurpose a state-of-the-art fully convolutional object contour detector for seed-based instance segmentation by introducing the notion of edge-mask duality with a novel patch-free and contour-oriented loss function. Second, we train one model using only synthetic scenes, instead of manually labeled training data. Our experimental results show that considering edge-mask duality for training an encoder-decoder network, as we suggest, outperforms a state-of-the-art patch-based network in the present application context.
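The edge-mask duality exploited above rests on a simple observation: once a contour map is available, a click inside an object determines its mask by region growing up to the contours. The toy sketch below illustrates that duality with a plain flood fill on a given binary edge map; the paper's contribution is learning the contour map with a CNN and a contour-oriented loss, which is not shown here.

```python
from collections import deque

def seed_segment(edge_map, seed):
    """Toy edge-driven seed segmentation: flood-fill from the operator's
    click over non-contour pixels. edge_map is a 2D grid of 0/1 values
    (1 = contour); seed is a (row, col) tuple."""
    h, w = len(edge_map), len(edge_map[0])
    mask = [[False] * w for _ in range(h)]
    queue = deque([seed])
    while queue:
        r, c = queue.popleft()
        if not (0 <= r < h and 0 <= c < w):
            continue                      # outside the image
        if mask[r][c] or edge_map[r][c]:
            continue                      # already visited, or a contour
        mask[r][c] = True
        queue.extend([(r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)])
    return mask
```

This makes the duality concrete: a perfect edge map and a single seed are sufficient to recover an instance mask, so supervising edges (patch-free) is an alternative to supervising masks directly.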

13 citations


Posted Content
TL;DR: The results show that Jacquard enables much better generalization skills than a human labeled dataset thanks to its diversity of objects and grasping positions.
Abstract: Grasping skill is a major ability that many real-life applications require for robotisation. State-of-the-art robotic grasping methods predict object grasp locations with deep neural networks. However, such networks require huge amounts of labeled training data, which often makes this approach impracticable in robotics. In this paper, we propose a method to generate a large-scale synthetic dataset with ground truth, which we refer to as the Jacquard grasping dataset. Jacquard is built on a subset of ShapeNet, a large CAD model dataset, and contains both RGB-D images and annotations of successful grasping positions based on grasp attempts performed in a simulated environment. We carried out experiments using an off-the-shelf CNN, with three different evaluation metrics, including real robot grasping trials. The results show that Jacquard enables much better generalization than a human-labeled dataset, thanks to its diversity of objects and grasping positions. For the purpose of reproducible research in robotics, we are releasing, along with the Jacquard dataset, a web interface for researchers to evaluate the success of their grasping position detections on our dataset.

8 citations


Proceedings ArticleDOI
16 Sep 2018
TL;DR: A developmental framework based on a long-term memory and reasoning mechanisms (Vision Similarity and Bayesian Optimisation) allows a robot to optimize autonomously hyper-parameters that need to be tuned from any action and/or vision module, treated as a black-box.
Abstract: We present a developmental framework based on a long-term memory and reasoning mechanisms (Vision Similarity and Bayesian Optimisation). This architecture allows a robot to autonomously optimize hyper-parameters that need to be tuned in any action and/or vision module, treated as a black box. The learning can take advantage of past experiences (stored in the episodic and procedural memories) in order to warm-start the exploration with a set of hyper-parameters previously optimized for objects similar to the new, unknown one (stored in a semantic memory). As an example, the system has been used to optimize 9 continuous hyper-parameters of a professional software package (Kamido), both in simulation and with a real robot (an industrial Fanuc robotic arm), on a total of 13 different objects. The robot is able to find a good object-specific optimization in 68 (simulation) or 40 (real) trials. In simulation, we demonstrate the benefit of transfer learning based on visual similarity, as opposed to amnesic learning (i.e., learning from scratch every time). Moreover, with the real robot, we show that the method consistently outperforms manual optimization by an expert, achieving more than 88% success with less than 2 hours of training time.
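The warm-start mechanism can be sketched as two pieces: a semantic-memory lookup that retrieves the hyper-parameters of the most similar known object, and an optimiser seeded with them. Everything below is a hypothetical illustration: the memory layout, function names, and the optimiser itself are assumptions, and simple local random search stands in for the paper's Bayesian Optimisation.

```python
import random

def warm_start_params(memory, new_descriptor, similarity):
    """Semantic-memory lookup: return the hyper-parameters previously
    optimized for the stored object most similar to the new one.
    `similarity` is any descriptor similarity function (the paper uses a
    vision-based one)."""
    best = max(memory, key=lambda e: similarity(e["descriptor"],
                                                new_descriptor))
    return dict(best["params"])

def optimise(score_fn, init_params, n_trials=40, step=0.1, rng=None):
    """Stand-in optimiser: random local perturbation around the warm
    start, keeping only improvements. The paper uses Bayesian
    Optimisation; this sketch only shows why a good warm start reduces
    the number of trials needed."""
    rng = rng or random.Random(0)
    best_params, best_score = dict(init_params), score_fn(init_params)
    for _ in range(n_trials):
        cand = {k: v + rng.uniform(-step, step)
                for k, v in best_params.items()}
        s = score_fn(cand)
        if s > best_score:
            best_params, best_score = cand, s
    return best_params, best_score
```

Starting from a similar object's parameters means the search begins near a good region, which is the claimed advantage over amnesic (from-scratch) learning.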

5 citations


Posted Content
TL;DR: In this paper, a developmental framework based on a long-term memory and reasoning mechanisms (Vision Similarity and Bayesian Optimisation) is presented, which allows a robot to optimize autonomously hyperparameters that need to be tuned from any action and/or vision module, treated as a black-box.
Abstract: We present a developmental framework based on a long-term memory and reasoning mechanisms (Vision Similarity and Bayesian Optimisation). This architecture allows a robot to autonomously optimize hyper-parameters that need to be tuned in any action and/or vision module, treated as a black box. The learning can take advantage of past experiences (stored in the episodic and procedural memories) in order to warm-start the exploration with a set of hyper-parameters previously optimized for objects similar to the new, unknown one (stored in a semantic memory). As an example, the system has been used to optimize 9 continuous hyper-parameters of a professional software package (Kamido), both in simulation and with a real robot (an industrial Fanuc robotic arm), on a total of 13 different objects. The robot is able to find a good object-specific optimization in 68 (simulation) or 40 (real) trials. In simulation, we demonstrate the benefit of transfer learning based on visual similarity, as opposed to amnesic learning (i.e., learning from scratch every time). Moreover, with the real robot, we show that the method consistently outperforms manual optimization by an expert, achieving more than 88% success with less than 2 hours of training time.

1 citation


Posted Content
TL;DR: In this article, a fully convolutional object contour detector is used for instance segmentation in top views of piles of bulk objects given a pixel location, provided interactively by a human operator.
Abstract: With more and more household objects built on planned obsolescence and consumed by a fast-growing population, hazardous waste recycling has become a critical challenge. Given the large variability of household waste, current recycling platforms mostly rely on human operators to analyze the scene, typically composed of many object instances piled up in bulk. Helping them by robotizing the unitary extraction is a key challenge to speed up this tedious process. Whereas supervised deep learning has proven very efficient for such object-level scene understanding, e.g., generic object detection and segmentation in everyday scenes, it requires large sets of per-pixel labeled images that are hardly available for numerous application contexts, including industrial robotics. We thus propose a step towards a practical interactive application for generating an object-oriented robotic grasp, requiring as inputs only one depth map of the scene and one user click on the next object to extract. More precisely, we address in this paper the intermediate problem of object segmentation in top views of piles of bulk objects, given a pixel location, namely a seed, provided interactively by a human operator. We propose a twofold framework for generating edge-driven instance segments. First, we repurpose a state-of-the-art fully convolutional object contour detector for seed-based instance segmentation by introducing the notion of edge-mask duality with a novel patch-free and contour-oriented loss function. Second, we train one model using only synthetic scenes, instead of manually labeled training data. Our experimental results show that considering edge-mask duality for training an encoder-decoder network, as we suggest, outperforms a state-of-the-art patch-based network in the present application context.