Category-Level Articulated Object Pose Estimation

doi:10.1109/CVPR42600.2020.00376

Open Access, Proceedings Article

Xiaolong Li, +5 more

- pp 3706-3715

TLDR

A deep network based on PointNet++ is developed that predicts ANCSH from a single depth point cloud, including part segmentation, normalized coordinates, and joint parameters in the canonical object space. The canonicalized joints are then leveraged to improve part pose and scale estimation and to estimate joint parameters in camera space.

Abstract:

This paper addresses the task of category-level pose estimation for articulated objects from a single depth image. We present a novel category-level approach that correctly accommodates object instances previously unseen during training. We introduce Articulation-aware Normalized Coordinate Space Hierarchy (ANCSH), a canonical representation for different articulated objects in a given category. As the key to achieving intra-category generalization, the representation constructs a canonical object space as well as a set of canonical part spaces. The canonical object space normalizes the object orientation, scale, and articulations (e.g. joint parameters and states), while each canonical part space further normalizes its part pose and scale. We develop a deep network based on PointNet++ that predicts ANCSH from a single depth point cloud, including part segmentation, normalized coordinates, and joint parameters in the canonical object space. By leveraging the canonicalized joints, we demonstrate: 1) improved performance in part pose and scale estimation using the kinematic constraints induced by the joints; 2) high accuracy for joint parameter estimation in camera space.
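To make the role of the normalized coordinates concrete: if every observed point of a part is paired with a predicted coordinate in that part's canonical space, the part's scale, rotation, and translation can be recovered with a closed-form least-squares similarity fit of the kind described in the Umeyama reference listed below. The following Python sketch illustrates only that single step, under stated assumptions. It is not the paper's pipeline: it assumes perfect, outlier-free correspondences and ignores part segmentation and the joint-induced kinematic constraints mentioned in the abstract. The function name `fit_similarity` and the synthetic data are hypothetical.

```python
import numpy as np


def fit_similarity(canonical_pts, camera_pts):
    """Least-squares fit of scale s, rotation R, translation t such that
    camera_pts[i] ~= s * R @ canonical_pts[i] + t (Umeyama-style solution)."""
    x = np.asarray(canonical_pts, dtype=float)
    y = np.asarray(camera_pts, dtype=float)
    n = x.shape[0]

    # Center both point sets.
    mu_x, mu_y = x.mean(axis=0), y.mean(axis=0)
    xc, yc = x - mu_x, y - mu_y

    # Cross-covariance between target and source, then its SVD.
    cov = yc.T @ xc / n
    u, d, vt = np.linalg.svd(cov)

    # Flip the last axis if needed so that R is a proper rotation (det = +1).
    sign = np.ones(3)
    if np.linalg.det(u) * np.linalg.det(vt) < 0:
        sign[-1] = -1.0
    rot = u @ np.diag(sign) @ vt

    # Scale is the ratio of aligned covariance to source variance.
    var_x = (xc ** 2).sum() / n
    scale = float((d * sign).sum() / var_x)
    trans = mu_y - scale * rot @ mu_x
    return scale, rot, trans


# Synthetic example (assumed data): canonical coordinates in a unit cube,
# mapped to "camera space" by a known similarity transform.
rng = np.random.default_rng(0)
canonical = rng.uniform(-0.5, 0.5, size=(200, 3))
angle = np.deg2rad(30.0)
true_rot = np.array([
    [np.cos(angle), -np.sin(angle), 0.0],
    [np.sin(angle), np.cos(angle), 0.0],
    [0.0, 0.0, 1.0],
])
true_scale = 0.8
true_trans = np.array([0.1, -0.2, 1.5])
camera = true_scale * canonical @ true_rot.T + true_trans

scale, rot, trans = fit_similarity(canonical, camera)
print("recovered scale:", scale)
print("recovered translation:", trans)
```

With noisy network predictions, a robust wrapper such as RANSAC (also among the references) would typically be placed around a fit like this; that part is left out of the sketch.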

Citations

Proceedings Article

Semi-Supervised 3D Hand-Object Poses Estimation with Interactions in Time

Shaowei Liu, +4 more

TL;DR: In this article, a joint learning framework for estimating 3D hand and object pose from a single image is proposed, where the spatial-temporal consistency in large-scale hand-object videos is used as a constraint for generating pseudo labels in semi-supervised learning.

Posted Content

CaSPR: Learning Canonical Spatiotemporal Point Cloud Representations

Davis Rempe, +5 more

- 06 Aug 2020 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: In this paper, the authors propose a method to learn object-centric Canonical Spatiotemporal Point Cloud Representations (CaSPR) to enable information aggregation over time and the interrogation of object state at any spatio-temporal neighborhood.

Posted Content

ScrewNet: Category-Independent Articulation Model Estimation From Depth Images Using Screw Theory

Ajinkya Jain, +3 more

- 24 Aug 2020 -

arXiv: Robotics

TL;DR: Results demonstrate that ScrewNet can successfully estimate the articulation models and their parameters for novel objects across articulation model categories, with better average accuracy than the prior state-of-the-art method.

Proceedings Article

MultiBodySync: Multi-Body Segmentation and Motion Estimation via 3D Scan Synchronization

Jiahui Huang, +6 more

TL;DR: MultiBodySync is an end-to-end trainable multi-body motion segmentation and rigid registration framework for multiple input 3D point clouds that incorporates spectral synchronization into an iterative deep declarative network.

Posted Content

Vector Neurons: A General Framework for SO(3)-Equivariant Networks

Congyue Deng, +5 more

- 25 Apr 2021 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: In this paper, the authors propose Vector Neurons, a general framework for building SO(3)-equivariant networks for point clouds by extending neurons from 1D scalars to 3D vectors.

References

Journal Article

Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography

Martin A. Fischler, +1 more

- 01 Jun 1981 -

Communications of The ACM

TL;DR: New results are derived on the minimum number of landmarks needed to obtain a solution, and algorithms are presented for computing these minimum-landmark solutions in closed form that provide the basis for an automatic system that can solve the Location Determination Problem under difficult viewing.

Journal Article

A method for registration of 3-D shapes

Paul J. Besl, +1 more

- 01 Feb 1992 -

IEEE Transactions on Pattern Analysis an...

TL;DR: In this paper, the authors describe a general-purpose representation-independent method for the accurate and computationally efficient registration of 3D shapes including free-form curves and surfaces, based on the iterative closest point (ICP) algorithm, which requires only a procedure to find the closest point on a geometric entity to a given point.

Posted Content

PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space

Charles R. Qi, +3 more

- 07 Jun 2017 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: A hierarchical neural network that applies PointNet recursively on a nested partitioning of the input point set, with novel set learning layers that adaptively combine features from multiple scales to learn deep point set features efficiently and robustly.

Proceedings Article

PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space

Charles R. Qi, +3 more

TL;DR: PointNet++ applies PointNet recursively on a nested partitioning of the input point set to learn local features with increasing contextual scales, and introduces novel set learning layers to adaptively combine features from multiple scales.

Journal Article

Least-squares estimation of transformation parameters between two point patterns

S. Umeyama

- 01 Apr 1991 -

IEEE Transactions on Pattern Analysis an...

TL;DR: The proposed theorem is a strict solution of the problem, and it always gives the correct transformation parameters even when the data is corrupted.

Related Papers (5)

ShapeNet: An Information-Rich 3D Model Repository

Angel X. Chang, +12 more

- 09 Dec 2015 -

arXiv: Graphics

Deep Residual Learning for Image Recognition

Normalized Object Coordinate Space for Category-Level 6D Object Pose and Size Estimation

Manipulating articulated objects with interactive perception

PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation