scispace - formally typeset
Search or ask a question
Author

Yijia He

Bio: Yijia He is an academic researcher from Chinese Academy of Sciences. The author has contributed to research in topics: Inscribed angle & Line segment. The author has an hindex of 2, co-authored 2 publications receiving 19 citations.

Papers
More filters
Journal ArticleDOI
TL;DR: It is shown that knowing the value of the inscribed angle between the two 3D rays poses additional constraints on the relative orientation, and using the latter enables the solution of the relative pose problem with as few as 3 correspondences across the two images.
Abstract: Corners are popular features for relative pose computation with 2D-2D point correspondences. Stable corners may be formed by two 3D rays sharing a common starting point. We call such elements ray-point-ray (RPR) structures. Besides a local invariant keypoint given by the lines’ intersection, their reprojection also defines a corner orientation and an inscribed angle in the image plane. The present paper investigates such RPR features, and aims at answering the fundamental question of what additional constraints can be formed from correspondences between RPR features in two views. In particular, we show that knowing the value of the inscribed angle between the two 3D rays poses additional constraints on the relative orientation. Using the latter enables the solution of the relative pose problem with as few as 3 correspondences across the two images. We provide a detailed analysis of all minimal cases distinguishing between 90-degree RPR-structures and structures with an arbitrary, known inscribed angle. We furthermore investigate the special cases of a known directional correspondence and planar motion, the latter being solvable with only a single RPR correspondence. We complete the exposition by outlining an image processing technique for robust RPR-feature extraction. Our results suggest high practicality in man-made environments, where 90-degree RPR-structures naturally occur.

25 citations

Journal ArticleDOI
TL;DR: The experimental results demonstrate that the proposed SLAM method exhibits more accurate localization and reconstruction than state-of-the-art line-based SLAM systems in line-rich environments.
Abstract: In this paper, we propose a stereo simultaneous localization and mapping (SLAM) method based on line segments. For the front-end module of SLAM, we designed a novel method based on the coplanar junction detection, description, and matching. Then the junctions along with their multi-scale rotated BRIEF descriptors are used in other SLAM modules, including line tracking, mapping, and loop closure. The line extraction and matching thread runs at 20 ~ 40Hz for stereo image sequences on a laptop, making it a practical front-end for line-based SLAM system. For the back-end module, a cost function is designed to minimize both of the reprojection error of line segments and alignment error of the vanishing points. The experimental results demonstrate that the proposed method exhibits more accurate localization and reconstruction than state-of-the-art line-based SLAM systems in line-rich environments.

10 citations


Cited by
More filters
01 Jan 2013
TL;DR: This paper revisits the classical perspective-n-point (PnP) problem, and proposes the first non-iterative O(n) solution that is fast, generally applicable and globally optimal.
Abstract: In this paper, we revisit the classical perspective-n-point (PnP) problem, and propose the first non-iterative O(n) solution that is fast, generally applicable and globally optimal. Our basic idea is to formulate the PnP problem into a functional minimization problem and retrieve all its stationary points by using the Gr¨obner basis technique. The novelty lies in a non-unit quaternion representation to parameterize the rotation and a simple but elegant formulation of the PnP problem into an unconstrained optimization problem. Interestingly, the polynomial system arising from its first-order optimality condition assumes two-fold symmetry, a nice property that can be utilized to improve speed and numerical stability of a Gr¨obner basis solver. Experiment results have demonstrated that, in terms of accuracy, our proposed solution is definitely better than the state-ofthe- art O(n) methods, and even comparable with the reprojection error minimization method. (Less)

55 citations

Proceedings ArticleDOI
14 Jun 2020
TL;DR: In this paper, the affine transformation between feature points is exploited to recover the relative camera pose under the planar motion assumption or with knowledge of a vertical direction, and a single affine correspondence is sufficient to recover camera pose.
Abstract: In this paper we present four cases of minimal solutions for two-view relative pose estimation by exploiting the affine transformation between feature points and we demonstrate efficient solvers for these cases. It is shown, that under the planar motion assumption or with knowledge of a vertical direction, a single affine correspondence is sufficient to recover the relative camera pose. The four cases considered are two-view planar relative motion for calibrated cameras as a closed-form and a least-squares solution, a closed-form solution for unknown focal length and the case of a known vertical direction. These algorithms can be used efficiently for outlier detection within a RANSAC loop and for initial motion estimation. All the methods are evaluated on both synthetic data and real-world datasets from the KITTI benchmark. The experimental results demonstrate that our methods outperform comparable state-of-the-art methods in accuracy with the benefit of a reduced number of needed RANSAC iterations.

36 citations

Proceedings ArticleDOI
01 Oct 2019
TL;DR: In this paper, a complete classification of all minimal problems for generic arrangements of points and lines completely observed by calibrated perspective cameras is presented, and their algebraic degrees, i.e. the number of solutions, which measure their intrinsic difficulty.
Abstract: We present a complete classification of all minimal problems for generic arrangements of points and lines completely observed by calibrated perspective cameras. We show that there are only 30 minimal problems in total, no problems exist for more than 6 cameras, for more than 5 points, and for more than 6 lines. We present a sequence of tests for detecting minimality starting with counting degrees of freedom and ending with full symbolic and numeric verification of representative examples. For all minimal problems discovered, we present their algebraic degrees, i.e. the number of solutions, which measure their intrinsic difficulty. It shows how exactly the difficulty of problems grows with the number of views. Importantly, several new mini- mal problems have small degrees that might be practical in image matching and 3D reconstruction.

23 citations

Journal ArticleDOI
TL;DR: A knowledge memory embedding model with mutual modulation for visual reasoning that learns not only knowledge-based embeddings derived from key–value memory network to make the full and joint of textual and visual information, but also exploits the prior knowledge to improve the performance withknowledge-based representation learning for applying other general reasoning tasks.

20 citations

Proceedings ArticleDOI
14 Jun 2020
TL;DR: This work shows in real experiment that (i) SIFT features provide good enough point-and-line correspondences for three-view reconstruction and (ii) that it can solve difficult cases with too few or too noisy tentative matches where the state of the art structure from motion initialization fails.
Abstract: We present a method for solving two minimal problems for relative camera pose estimation from three views, which are based on three view correspondences of (i) three points and one line and (ii) three points and two lines through two of the points. These problems are too difficult to be efficiently solved by the state of the art Grobner basis methods. Our method is based on a new efficient homotopy continuation (HC) solver, which dramatically speeds up previous HC solving by specializing HC methods to generic cases of our problems. We show in simulated experiments that our solvers are numerically robust and stable under image noise. We show in real experiment that (i) SIFT features provide good enough point-and-line correspondences for three-view reconstruction and (ii) that we can solve difficult cases with too few or too noisy tentative matches where the state of the art structure from motion initialization fails.

17 citations