Open Access Proceedings ArticleDOI

PVN3D: A Deep Point-Wise 3D Keypoints Voting Network for 6DoF Pose Estimation

TLDR
PVN3D proposes a deep Hough voting network to detect the 3D keypoints of objects and then estimates the 6DoF pose parameters with least-squares fitting.
Abstract
In this work, we present a novel data-driven method for robust 6DoF object pose estimation from a single RGBD image. Unlike previous methods that directly regress pose parameters, we tackle this challenging task with a keypoint-based approach. Specifically, we propose a deep Hough voting network to detect 3D keypoints of objects and then estimate the 6D pose parameters with least-squares fitting. Our method is a natural extension of 2D-keypoint approaches that work successfully on RGB-based 6DoF estimation. It allows us to fully utilize the geometric constraints of rigid objects with the extra depth information and is easy for a network to learn and optimize. Extensive experiments were conducted to demonstrate the effectiveness of 3D-keypoint detection in the 6D pose estimation task. Experimental results also show that our method outperforms state-of-the-art methods by large margins on several benchmarks. Code and video are available at https://github.com/ethnhe/PVN3D.git.
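The pipeline described in the abstract ends with two geometric steps that can be written down compactly: per-point keypoint votes are aggregated into keypoint estimates, and the 6DoF pose is then recovered in closed form by least-squares fitting of the detected keypoints to the model keypoints. The sketch below illustrates those two steps under our own assumptions (a plain mean instead of a clustering step for vote aggregation, hypothetical function names); it is not the released implementation, which is available at the repository linked above.

```python
import numpy as np


def aggregate_keypoint_votes(points, offsets):
    """Aggregate per-point votes into keypoint estimates.

    points:  (N, 3) visible scene points of one object instance.
    offsets: (N, K, 3) predicted offset from each point to each of K keypoints
             (the per-point voting output of the network).
    Returns (K, 3) estimated keypoint locations. A plain mean is used here;
    a clustering step (e.g. MeanShift) would be more robust to outlier votes.
    """
    votes = points[:, None, :] + offsets   # (N, K, 3) voted keypoint positions
    return votes.mean(axis=0)              # (K, 3)


def fit_pose_least_squares(model_kps, detected_kps):
    """Closed-form least-squares rigid fit (Kabsch/Umeyama, no scale).

    model_kps:    (K, 3) keypoints in the object's canonical frame.
    detected_kps: (K, 3) keypoints detected in the camera frame.
    Returns R (3, 3) and t (3,) such that detected ~= R @ model + t.
    """
    mu_m = model_kps.mean(axis=0)
    mu_d = detected_kps.mean(axis=0)
    H = (model_kps - mu_m).T @ (detected_kps - mu_d)   # 3x3 cross-covariance
    U, _, Vt = np.linalg.svd(H)
    d = np.sign(np.linalg.det(Vt.T @ U.T))             # guard against a reflection
    R = Vt.T @ np.diag([1.0, 1.0, d]) @ U.T
    t = mu_d - R @ mu_m
    return R, t
```

This closed-form fit is part of why 3D-3D keypoint correspondences are attractive compared with regressing pose parameters directly: the network only has to predict Euclidean offsets, which the abstract notes are easy to learn and optimize, and the pose then follows from a standard SVD.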



Citations
Proceedings ArticleDOI

FFB6D: A Full Flow Bidirectional Fusion Network for 6D Pose Estimation

TL;DR: FFB6D proposes a full flow bidirectional fusion network that combines appearance and geometry information for representation learning as well as output representation selection, so that each branch can leverage local and global complementary information from the other to obtain better representations.
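As a rough illustration of the fusion idea summarized in that TL;DR, the sketch below exchanges features between an image branch and a point-cloud branch through pixel-point correspondences and fuses them by concatenation. The shapes, index arrays, and function name are our own assumptions for illustration, not FFB6D's implementation.

```python
import numpy as np


def fuse_bidirectional(pix_feat, pt_feat, pix2pt, pt2pix):
    """pix_feat: (H*W, C1) image features, pt_feat: (N, C2) point features.
    pix2pt: (H*W,) index of the nearest 3D point for each pixel.
    pt2pix: (N,)   index of the pixel each point projects to.
    Returns fused features for both branches (concatenation fusion)."""
    pix_fused = np.concatenate([pix_feat, pt_feat[pix2pt]], axis=1)  # image <- geometry
    pt_fused = np.concatenate([pt_feat, pix_feat[pt2pix]], axis=1)   # points <- appearance
    return pix_fused, pt_fused
```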
Journal ArticleDOI

Vision-based robotic grasping from object localization, object pose estimation to grasp estimation for parallel grippers: a review

TL;DR: This review identifies three key tasks in vision-based robotic grasping: object localization, object pose estimation, and grasp estimation; the surveyed grasp-estimation methods include both 2D planar grasp methods and 6DoF grasp methods.
Proceedings ArticleDOI

FS-Net: Fast Shape-based Network for Category-Level 6D Object Pose Estimation with Decoupled Rotation Mechanism

TL;DR: The authors propose a fast shape-based network (FS-Net) with efficient category-level feature extraction for 6D pose estimation from a monocular RGB-D image.
Book ChapterDOI

Cascade Graph Neural Networks for RGB-D Salient Object Detection

TL;DR: Cascade Graph Neural Networks (Cas-GNN) is a unified framework that comprehensively distills and reasons about the mutual benefits of the RGB and depth data sources through a set of cascade graphs, learning powerful representations for RGB-D salient object detection.
Proceedings ArticleDOI

RGB Matters: Learning 7-DoF Grasp Poses on Monocular RGBD Images

TL;DR: RGBD-Grasp decouples grasp detection into two sub-tasks in which RGB and depth information are processed separately, and achieves state-of-the-art results on the GraspNet-1Billion dataset compared with several baselines.
References
Proceedings ArticleDOI

Deep Residual Learning for Image Recognition

TL;DR: The authors propose a residual learning framework that eases the training of networks substantially deeper than those used previously and won 1st place in the ILSVRC 2015 classification task.
Proceedings ArticleDOI

ImageNet: A large-scale hierarchical image database

TL;DR: A new database called “ImageNet” is introduced, a large-scale ontology of images built upon the backbone of the WordNet structure, much larger in scale and diversity and much more accurate than the current image datasets.
Proceedings ArticleDOI

Object recognition from local scale-invariant features

TL;DR: Experimental results show that robust object recognition can be achieved in cluttered, partially occluded images with a computation time of under 2 seconds.
Proceedings ArticleDOI

Mask R-CNN

TL;DR: This work presents a conceptually simple, flexible, and general framework for object instance segmentation, which extends Faster R-CNN by adding a branch for predicting an object mask in parallel with the existing branch for bounding box recognition.
Book ChapterDOI

SURF: speeded up robust features

TL;DR: A novel scale- and rotation-invariant interest point detector and descriptor, coined SURF (Speeded Up Robust Features), which approximates or even outperforms previously proposed schemes with respect to repeatability, distinctiveness, and robustness, yet can be computed and compared much faster.