Author

Romain Thibaux

Bio: Romain Thibaux is an academic researcher from Willow Garage. The author has contributed to research in topics: Stereo camera & Stereo cameras. The author has an hindex of 1, co-authored 1 publications receiving 777 citations.

Topics: Stereo camera, Stereo cameras, Pose, Feature extraction, Histogram ...read more

Papers

PDF

Open Access

More filters

Proceedings Article•DOI•

Fast 3D recognition and pose using the Viewpoint Feature Histogram

[...]

Radu Bogdan Rusu¹, Gary Bradski¹, Romain Thibaux¹, John Hsu¹•Institutions (1)

Willow Garage¹

03 Dec 2010

TL;DR: The Viewpoint Feature Histogram (VFH) is presented, a descriptor for 3D point cloud data that encodes geometry and viewpoint that is robust to large surface noise and missing depth information in order to work reliably on stereo data.

...read moreread less

Abstract: We present the Viewpoint Feature Histogram (VFH), a descriptor for 3D point cloud data that encodes geometry and viewpoint. We demonstrate experimentally on a set of 60 objects captured with stereo cameras that VFH can be used as a distinctive signature, allowing simultaneous recognition of the object and its pose. The pose is accurate enough for robot manipulation, and the computational cost is low enough for real time operation. VFH was designed to be robust to large surface noise and missing depth information in order to work reliably on stereo data.

...read moreread less

874 citations

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

Deep learning for detecting robotic grasps

[...]

Ian Lenz¹, Honglak Lee², Ashutosh Saxena¹•Institutions (2)

Cornell University¹, University of Michigan²

01 Apr 2015-The International Journal of Robotics Research

TL;DR: This work presents a two-step cascaded system with two deep networks, where the top detections from the first are re-evaluated by the second, and shows that this method improves performance on an RGBD robotic grasping dataset, and can be used to successfully execute grasps on two different robotic platforms.

...read moreread less

Abstract: We consider the problem of detecting robotic grasps in an RGB-D view of a scene containing objects. In this work, we apply a deep learning approach to solve this problem, which avoids time-consuming hand-design of features. This presents two main challenges. First, we need to evaluate a huge number of candidate grasps. In order to make detection fast and robust, we present a two-step cascaded system with two deep networks, where the top detections from the first are re-evaluated by the second. The first network has fewer features, is faster to run, and can effectively prune out unlikely candidate grasps. The second, with more features, is slower but has to run only on the top few detections. Second, we need to handle multimodal inputs effectively, for which we present a method that applies structured regularization on the weights based on multimodal group regularization. We show that our method improves performance on an RGBD robotic grasping dataset, and can be used to successfully execute grasps on two different robotic platforms.

...read moreread less

1,144 citations

Proceedings Article•DOI•

LeGO-LOAM: Lightweight and Ground-Optimized Lidar Odometry and Mapping on Variable Terrain

[...]

Tixiao Shan¹, Brendan Englot¹•Institutions (1)

Stevens Institute of Technology¹

01 Oct 2018

TL;DR: A lightweight and ground-optimized lidar odometry and mapping method, LeGO-LOAM, for realtime six degree-of-freedom pose estimation with ground vehicles and integrated into a SLAM framework to eliminate the pose estimation error caused by drift is integrated.

...read moreread less

Abstract: We propose a lightweight and ground-optimized lidar odometry and mapping method, LeGO-LOAM, for realtime six degree-of-freedom pose estimation with ground vehicles. LeGO-LOAM is lightweight, as it can achieve realtime pose estimation on a low-power embedded system. LeGO-LOAM is ground-optimized, as it leverages the presence of a ground plane in its segmentation and optimization steps. We first apply point cloud segmentation to filter out noise, and feature extraction to obtain distinctive planar and edge features. A two-step Levenberg-Marquardt optimization method then uses the planar and edge features to solve different components of the six degree-of-freedom transformation across consecutive scans. We compare the performance of LeGO-LOAM with a state-of-the-art method, LOAM, using datasets gathered from variable-terrain environments with ground vehicles, and show that LeGO-LOAM achieves similar or better accuracy with reduced computational expense. We also integrate LeGO-LOAM into a SLAM framework to eliminate the pose estimation error caused by drift, which is tested using the KITTI dataset.

...read moreread less

960 citations

Journal Article•DOI•

Data-Driven Grasp Synthesis—A Survey

[...]

Jeannette Bohg, Antonio Morales, Tamim Asfour¹, Danica Kragic•Institutions (1)

Karlsruhe Institute of Technology¹

01 Apr 2014-IEEE Transactions on Robotics

TL;DR: A review of the work on data-driven grasp synthesis and the methodologies for sampling and ranking candidate grasps and an overview of the different methodologies are provided, which draw a parallel to the classical approaches that rely on analytic formulations.

...read moreread less

Abstract: We review the work on data-driven grasp synthesis and the methodologies for sampling and ranking candidate grasps. We divide the approaches into three groups based on whether they synthesize grasps for known, familiar, or unknown objects. This structure allows us to identify common object representations and perceptual processes that facilitate the employed data-driven grasp synthesis technique. In the case of known objects, we concentrate on the approaches that are based on object recognition and pose estimation. In the case of familiar objects, the techniques use some form of a similarity matching to a set of previously encountered objects. Finally, for the approaches dealing with unknown objects, the core part is the extraction of specific features that are indicative of good grasps. Our survey provides an overview of the different methodologies and discusses open problems in the area of robot grasping. We also draw a parallel to the classical approaches that rely on analytic formulations.

...read moreread less

859 citations

Proceedings Article•

Deep Learning for Detecting Robotic Grasps

[...]

Ian Lenz¹, Honglak Lee², Ashutosh Saxena¹•Institutions (2)

Cornell University¹, University of Michigan²

01 Jan 2013

TL;DR: In this paper, a two-step cascaded system with two deep networks is proposed to detect robotic grasps in an RGB-D view of a scene containing objects, where the top detections from the first are re-evaluated by the second.

...read moreread less

824 citations

Journal Article•DOI•

Learning human activities and object affordances from RGB-D videos

[...]

Hema Swetha Koppula¹, Rudhir Gupta¹, Ashutosh Saxena¹•Institutions (1)

Cornell University¹

01 Jul 2013-The International Journal of Robotics Research

TL;DR: In this paper, a structural support vector machine (SSVM) was used to extract a descriptive labeling of the sequence of sub-activities being performed by a human, and more importantly, their interactions with the objects in the form of associated affordances.

...read moreread less

Abstract: Understanding human activities and object affordances are two very important skills, especially for personal robots which operate in human environments. In this work, we consider the problem of extracting a descriptive labeling of the sequence of sub-activities being performed by a human, and more importantly, of their interactions with the objects in the form of associated affordances. Given a RGB-D video, we jointly model the human activities and object affordances as a Markov random field where the nodes represent objects and sub-activities, and the edges represent the relationships between object affordances, their relations with sub-activities, and their evolution over time. We formulate the learning problem using a structural support vector machine (SSVM) approach, where labelings over various alternate temporal segmentations are considered as latent variables. We tested our method on a challenging dataset comprising 120 activity videos collected from 4 subjects, and obtained an accuracy of 79.4% for affordance, 63.4% for sub-activity and 75.0% for high-level activity labeling. We then demonstrate the use of such descriptive labeling in performing assistive tasks by a PR2 robot.

...read moreread less

666 citations

Collapse