Open Access Journal Article

Depth Image-Based Deep Learning of Grasp Planning for Textureless Planar-Faced Objects in Vision-Guided Robotic Bin-Picking.

TL;DR
A surface feature descriptor is proposed to extract surface features (center position and normal) and refine the predicted grasp point position, removing the need for texture features in vision-guided robot control and for sim-to-real modification in DCNN model training.
Abstract
Bin-picking of small parcels and other textureless planar-faced objects is a common task at warehouses. A typical color image-based, vision-guided robot picking system requires feature extraction and goal-image preparation for various objects. However, feature extraction for goal-image matching is difficult for textureless objects, and preparing a huge number of goal images in advance is impractical in a warehouse. In this paper, we propose a novel depth image-based vision-guided robot bin-picking system for textureless planar-faced objects. Our method uses a deep convolutional neural network (DCNN) model, trained on 15,000 annotated depth images synthetically generated in a physics simulator, to directly predict grasp points without object segmentation. Unlike previous studies that predicted grasp points for a robot suction hand with only one vacuum cup, our DCNN also predicts the optimal grasp pattern for a hand with two vacuum cups (left cup on, right cup on, or both cups on). Further, we propose a surface feature descriptor that extracts surface features (center position and normal) and refines the predicted grasp point position, removing the need for texture features in vision-guided robot control and for sim-to-real modification in DCNN model training. Experimental results demonstrate the efficiency of our system: a robot with 7 degrees of freedom picks randomly posed textureless boxes in a cluttered environment with a 97.5% success rate at speeds exceeding 1,000 pieces per hour.
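To illustrate the surface feature descriptor idea, the following is a minimal sketch of estimating a surface center and normal by fitting a plane (via PCA) to the depth pixels around a predicted grasp point. The camera intrinsics, patch size, and function names here are assumptions for illustration, not the authors' implementation.

```python
# Minimal sketch of the surface feature descriptor idea: fit a plane to the
# depth pixels around a predicted grasp point and read off the surface center
# and normal. Intrinsics (fx, fy, cx, cy), patch size, and all function names
# are illustrative assumptions, not the authors' implementation.
import numpy as np

def patch_to_points(depth, u0, v0, fx, fy, cx, cy, half=10):
    """Back-project a (2*half+1)^2 depth patch around pixel (u0, v0)
    into 3D camera coordinates (assumes the patch lies inside the image)."""
    us = np.arange(u0 - half, u0 + half + 1)
    vs = np.arange(v0 - half, v0 + half + 1)
    uu, vv = np.meshgrid(us, vs)
    z = depth[vv, uu]
    valid = z > 0                      # drop missing depth readings
    x = (uu[valid] - cx) * z[valid] / fx
    y = (vv[valid] - cy) * z[valid] / fy
    return np.stack([x, y, z[valid]], axis=1)

def surface_center_and_normal(points):
    """Plane fit by PCA: the centroid is the surface center; the eigenvector
    with the smallest eigenvalue of the covariance is the surface normal."""
    center = points.mean(axis=0)
    cov = np.cov((points - center).T)
    _, eigvecs = np.linalg.eigh(cov)   # eigenvalues in ascending order
    normal = eigvecs[:, 0]
    if normal[2] > 0:                  # orient the normal toward the camera
        normal = -normal
    return center, normal
```

A predicted grasp point could then be snapped to the fitted center, with the suction axis aligned to the estimated normal. Because only geometry is used, the same computation applies to simulated and real depth images, which is the stated reason no sim-to-real modification is needed.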



Citations
Journal Article

Manipulation Planning for Object Re-Orientation Based on Semantic Segmentation Keypoint Detection

TL;DR: In this paper, a manipulation planning method for object re-orientation based on semantic segmentation keypoint detection is proposed; the robot manipulator detects randomly placed objects and reorients them to a specified position and pose.
Journal Article

Object Identification for Task-Oriented Communication with Industrial Robots.

TL;DR: A novel method for contour identification based on flexible editable contour templates (FECT) is developed; the core of the solution is the FCD (flexible contour description) format for describing flexible templates.
Journal Article

Smart Pack: Online Autonomous Object-Packing System Using RGB-D Sensor Data.

TL;DR: A novel online object-packing system measures the dimensions of every incoming object and calculates its desired position in a given container; experimental results show that the proposed system successfully places incoming objects of various shapes in their proper positions.
Journal Article

Robotics Dexterous Grasping: The Methods Based on Point Cloud and Deep Learning.

TL;DR: This article gives a comprehensive review of point cloud- and deep learning-based methods for robotic dexterous grasping from three perspectives, with the proposed generation-evaluation framework as the core concept of the classification.
References
Journal Article

Distinctive Image Features from Scale-Invariant Keypoints

TL;DR: This paper presents a method for extracting distinctive invariant features from images that can be used to perform reliable matching between different views of an object or scene; the features robustly identify objects among clutter and occlusion while achieving near real-time performance.
Proceedings Article

You Only Look Once: Unified, Real-Time Object Detection

TL;DR: Compared to state-of-the-art detection systems, YOLO makes more localization errors but is less likely to predict false positives on background, and outperforms other detection methods, including DPM and R-CNN, when generalizing from natural images to other domains like artwork.
Book Chapter

SSD: Single Shot MultiBox Detector

TL;DR: The approach, named SSD, discretizes the output space of bounding boxes into a set of default boxes over different aspect ratios and scales per feature-map location, and combines predictions from multiple feature maps of different resolutions to naturally handle objects of various sizes; this makes SSD easy to train and straightforward to integrate into systems that require a detection component.
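To make the default box mechanism concrete, here is a minimal sketch assuming a single square feature map; the scale and aspect ratios are illustrative, not the paper's exact configuration (which also adds an extra box for aspect ratio 1).

```python
# Minimal sketch of SSD-style default box generation for one square
# feature map; scale and aspect ratios below are illustrative assumptions.
import itertools
import math

def default_boxes(fmap_size, scale, aspect_ratios=(1.0, 2.0, 0.5)):
    """Return (cx, cy, w, h) boxes in normalized [0, 1] image coordinates,
    one set per feature-map cell."""
    boxes = []
    for i, j in itertools.product(range(fmap_size), repeat=2):
        cx = (j + 0.5) / fmap_size     # cell center, x
        cy = (i + 0.5) / fmap_size     # cell center, y
        for ar in aspect_ratios:
            boxes.append((cx, cy, scale * math.sqrt(ar), scale / math.sqrt(ar)))
    return boxes

# e.g. an 8x8 feature map whose boxes cover roughly 20% of the image side
boxes = default_boxes(fmap_size=8, scale=0.2)
```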
Journal Article

SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation

TL;DR: Quantitative assessments show that SegNet provides good performance with competitive inference time, and is the most memory-efficient at inference compared with other architectures, including FCN and DeconvNet.