Showing papers on "Orientation (computer vision) published in 2022"

PDF

Open Access

Journal Article•DOI•

Multi-attention augmented network for single image super-resolution

[...]

Rui Chen¹, james, Zhang Heng¹, Liu Jixin¹•Institutions (1)

01 Feb 2022-Pattern Recognition

TL;DR: A multi-attention augmented network, which mainly consists of content-, orientation- and position-aware modules, is proposed, which develops an attention augmented U-net structure to form the content-aware module in order to learn and combine multi-scale informative features within a large receptive field.

...read moreread less

24 citations

Journal Article•DOI•

A novel vision-based calibration framework for industrial robotic manipulators

[...]

Hamid Majidi Balanji¹, Ali Emre Turgut¹, Lutfi Taner Tunc²•Institutions (2)

Middle East Technical University¹, Sabancı University²

01 Feb 2022-Robotics and Computer-integrated Manufacturing

TL;DR: A novel calibration framework is proposed based on a single camera and computer vision techniques using ArUco markers used for kinematic modeling of the robot to avoid the singularity and shows that the method is usable in real world scenarios.

...read moreread less

Abstract: With the increasing involvement of industrial robots in manufacturing processes, the demand for high quality robots has increased considerably. A high-quality robot is a robot having good repeatability and accuracy. Industrial robots are known to have very good repeatability, however it is not the same with accuracy. Due to harsh working conditions, accuracy of robots deteriorate over time. Calibration is a practical approach to sustain accuracy. In calibration, position and orientation of the tool center point (TCP) of a robot arm should be corrected using a tracking device with higher accuracy. Different devices such as laser-trackers, optical CMMs, and stereo cameras have been used in the literature. In this paper, a novel calibration framework is proposed based on a single camera and computer vision techniques using ArUco markers. The product of exponentials method is used for kinematic modeling of the robot to avoid the singularity. The performance of the framework is tested using computer-based simulations and using a six degree of freedom (6-DOF) UR5 robotic manipulator. Position and orientation errors are used as metrics in the experiments. The position and orientation errors in real world experiments reached to 2.5 mm and 0.2°, respectively. The result shows that the method is usable in real world scenarios.

...read moreread less

20 citations

Journal Article•DOI•

Robust Image Encryption and Zero-watermarking Scheme Using SCA and Modified Logistic Map

[...]

Achraf Daoui¹, Hicham Karmouni¹, Omar El Ogri¹, Mhamed Sayyouri¹, Hassan Qjidaa¹ - Show less +1 more•Institutions (1)

SIDI¹

01 Mar 2022-Expert Systems With Applications

Abstract: In this work, we first present a modified version of the traditional logistic chaotic map. The proposed version contains an additional parameter that is used to increase the security level of the proposed digital image copyright protection scheme. The latter merges two methods of image copyright protection, namely the image zero-watermarking and image encryption, which provides a high level of security when communicating images via the Internet. Next, we discuss the influence of geometric attacks on the efficiency of the proposed scheme, and then we introduce an efficient solution that can resist such attacks. The proposed solution involves the use of Sine Cosine Algorithm (SCA) with an appropriate algorithm suitable for the correction of geometric attacks (image translation, orientation and its combination) applied to the encrypted image. On the one hand, the simulation results show that the proposed scheme provides a high level of security and can resist various attacks (differential, common image processing, geometric, etc.). On the other hand, the conducted comparison in terms of robustness against geometric attacks clearly demonstrates the superiority of our scheme over recent image encryption ones.

...read moreread less

19 citations

Journal Article•DOI•

Learning conditional photometric stereo with high-resolution features

[...]

Yakun Ju¹, Yuxin Peng², Muwei Jian³, Feng Gao¹, Junyu Dong¹ - Show less +1 more•Institutions (3)

Ocean University of China¹, Peking University², Shandong University of Finance and Economics³

01 Mar 2022-Computational Visual Media

TL;DR: Li et al. as discussed by the authors proposed a self-learning conditional network with multi-scale features for photometric stereo, avoiding blurred reconstruction in high-frequency regions (such as crinkles and edges), caused by spectral bias.

...read moreread less

Abstract: Photometric stereo aims to reconstruct 3D geometry by recovering the dense surface orientation of a 3D object from multiple images under differing illumination. Traditional methods normally adopt simplified reflectance models to make the surface orientation computable. However, the real reflectances of surfaces greatly limit applicability of such methods to real-world objects. While deep neural networks have been employed to handle non-Lambertian surfaces, these methods are subject to blurring and errors, especially in high-frequency regions (such as crinkles and edges), caused by spectral bias: neural networks favor low-frequency representations so exhibit a bias towards smooth functions. In this paper, therefore, we propose a self-learning conditional network with multi-scale features for photometric stereo, avoiding blurred reconstruction in such regions. Our explorations include: (i) a multi-scale feature fusion architecture, which keeps high-resolution representations and deep feature extraction, simultaneously, and (ii) an improved gradient-motivated conditionally parameterized convolution (GM-CondConv) in our photometric stereo network, with different combinations of convolution kernels for varying surfaces. Extensive experiments on public benchmark datasets show that our calibrated photometric stereo method outperforms the state-of-the-art.

...read moreread less

11 citations

Journal Article•DOI•

Stereo CenterNet-based 3D object detection for autonomous driving

[...]

Yuguang Shi¹, شذى عبدالله مساعد الفایز², Yu Guo¹, Zhenqiang Mi¹, Xinjie Li¹ - Show less +1 more•Institutions (2)

University of Science and Technology Beijing¹, China Academy of Chinese Medical Sciences²

01 Jan 2022-Neurocomputing

TL;DR: Wang et al. as mentioned in this paper proposed a 3D object detection method, Stereo CenterNet (SC), using geometric information in stereo images, which predicts the four semantic key points of the 3D bounding box of the object in space and utilizes 2D left and right boxes, 3D dimension, orientation, and key points to restore the bounding boxes of an object in the space.

...read moreread less

9 citations

Journal Article•DOI•

UAV/RTS system based on MMCPF theory for fast and precise determination of position and orientation

[...]

Di Zhang¹, Tianye Fang¹, Jiahao Ai¹, Yapeng Wang¹, Lv Zhou², Jiming Guo¹, Mei Wensheng¹, Yehao Zhao¹ - Show less +4 more•Institutions (2)

Wuhan University¹, Guilin University of Technology²

01 Jan 2022-Measurement

TL;DR: A system to determine the position and orientation fast and precisely, by the collaboration measurement of an Unmanned Aerial Vehicle and two Robotic Total Stations, based on the theory of Massive Moving Control Points for Free-stationing (MMCPF).

...read moreread less

7 citations

Journal Article•DOI•

Histograms of oriented mosaic gradients for snapshot spectral image description

[...]

Lulu Chen¹, Yongqiang Zhao¹, Jonathan Cheung-Wai Chan², Seong G. Kong³•Institutions (3)

Northwestern Polytechnical University¹, Vrije Universiteit Brussel², Sejong University³

01 Jan 2022-Isprs Journal of Photogrammetry and Remote Sensing

TL;DR: In this article, a feature descriptor using Histogram of Oriented Mosaic Gradient (HOMG) was proposed to extract spatial-spectral features directly from mosaic spectral images.

...read moreread less

Abstract: This paper presents a feature descriptor using Histogram of Oriented Mosaic Gradient (HOMG) that extracts spatial-spectral features directly from mosaic spectral images. Spectral imaging utilizes unique spectral signatures to distinguish objects of interest in the scene more discriminatively. Snapshot spectral cameras equipped with spectral filter arrays (SFAs) capture spectral videos in real time, making it possible to detect/track fast moving targets based on spectral imaging. How to effectively extract the spatial-spectral feature directly from the mosaic spectral images acquired by snapshot spectral cameras is a core issue for detection/tracking. So far, there is a lack of comprehensive and in-depth research on this issue. To this end, this paper proposed a new spatial-spectral feature extractor for mosaic spectral images. The proposed scheme finds two forms of SFA neighborhood (SFAN) to construct a feature extractor suitable for any SFA structure. Exploiting the spatial-spectral correlation in two SFANs, we design six mosaic spatial-spectral gradient operators to compute spatial-spectral gradient maps (SGMs). HOMG descriptors are constructed using the magnitude and orientation of SGMs. The effectiveness and generalizability of the proposed method have been verified with object tracking experiments. Compared to the state-of-the-art feature descriptors, HOMG ranked first on two datasets captured with snapshot spectral camera with different SFAs, achieving a gain of 3.9% and 5.9% in average success rate over the second-ranked feature.

...read moreread less

6 citations

Journal Article•DOI•

Deep neural models for automated multi-task diagnostic scan management-quality enhancement, view classification and report generation.

[...]

Karthik K¹, Sowmya Kamath S¹•Institutions (1)

National Institute of Technology, Karnataka¹

01 Jan 2022-Biomedical Physics & Engineering Express

TL;DR: In this article, a multi-task neural network is proposed for improving the quality and visualization of medical chest X-ray images. But, the proposed model is not suitable for the medical image data management, as the diagnostic scans are essentially stored as unstructured data, lacking a well-defined processing methodology for enabling intelligent image management for supporting applications like similar patient retrieval, automated disease prediction etc.

...read moreread less

Abstract: The detailed physiological perspectives captured by medical imaging provides actionable insights to doctors to manage comprehensive care of patients. However, the quality of such diagnostic image modalities is often affected by mismanagement of the image capturing process by poorly trained technicians and older/poorly maintained imaging equipment. Further, a patient is often subjected to scanning at different orientations to capture the frontal, lateral and sagittal views of the affected areas. Due to the large volume of diagnostic scans performed at a modern hospital, adequate documentation of such additional perspectives is mostly overlooked, which is also an essential key element of quality diagnostic systems and predictive analytics systems. Another crucial challenge affecting effective medical image data management is that the diagnostic scans are essentially stored as unstructured data, lacking a well-defined processing methodology for enabling intelligent image data management for supporting applications like similar patient retrieval , automated disease prediction etc. One solution is to incorporate automated diagnostic image descriptions of the observation/findings by leveraging computer vision and natural language processing. In this work, we present multi-task neural models capable of addressing these critical challenges. We propose ESRGAN, an image enhancement technique for improving the quality and visualization of medical chest x-ray images, thereby substantially improving the potential for accurate diagnosis, automatic detection and region-of-interest segmentation. We also propose a CNN-based model called ViewNet for predicting the view orientation of the x-ray image and generating a medical report using Xception net, thus facilitating a robust medical image management system for intelligent diagnosis applications. Experimental results are demonstrated using standard metrics like BRISQUE, PIQE and BLEU scores, indicating that the proposed models achieved excellent performance. Further, the proposed deep learning approaches enable diagnosis in a lesser time and their hybrid architecture shows significant potential for supporting many intelligent diagnosis applications.

...read moreread less

3 citations

Journal Article•DOI•

3D reconstruction and positioning of surface features based on a monocular camera and geometric constraints

[...]

Guanyu Hou¹, Weibin Zhang¹, Bin Wu², He Rongfang¹•Institutions (2)

China Academy of Engineering Physics¹, Tianjin University²

20 Feb 2022-Applied Optics

TL;DR: Wang et al. as mentioned in this paper proposed a 3D reconstruction and positioning method based on a single detecting camera and geometric constraint to evaluate the defect quantitatively, which can limit the accuracy of surface feature positioning to 0.7 mm.

...read moreread less

Abstract: This paper proposes a 3D reconstruction and positioning method based on a single detecting camera and geometric constraint to evaluate the defect quantitatively. We realize the camera orientation through datum point coordinates acquired by a photogrammetry system and then solve the equation of the defect detection camera model and surface constraints to reconstruct the color point cloud in 3D space. To realize the reconstruction and orientation of defects on objects without marker points, the system is operated by a repetitive moving of robotic arms. Finally, the experiment achieves the reconstruction of three types of products with marker points and one of the same types of objects without marks. The experiment shows that this method can limit the accuracy of surface feature positioning to 0.7 mm, and relative accuracy of the area calculation to 0.125%.

...read moreread less

2 citations

Book Chapter•DOI•

Stereophotogrammetry Camera Pose Optimization

[...]

Bryan L. Witt¹, J. Justin Wilbanks¹, Brian C. Owens¹, Daniel P. Rohe¹•Institutions (1)

Sandia National Laboratories¹

01 Jan 2022

TL;DR: A planning tool that utilizes projective and Euclidian geometry to iteratively estimate optimal camera poses for available equipment, determines the most efficient image size, and also performs checks for lens diffraction, minimum focal distance, and adequate depth of field is described.

...read moreread less

Abstract: Stereophotogrammetry makes use of calibrated camera pairs to obtain three-dimensional information from two-dimensional images. The accuracy of the extracted measurements is extremely dependent on the selection and setup of the camera system. For a given test object and desired viewing orientation, there is no one “correct” stereo camera setup, but rather a range of potential setups with some approaching an optimal system with respect to maximizing the measurement resolution. The open-ended nature of this test design exercise is compounded by equipment availability and the fact that many of the setup parameters have dependent characteristics, e.g., changing focal distance will affect stand-off distance, field of view, and image projection, among others. This work describes a planning tool that utilizes projective and Euclidian geometry to iteratively estimate optimal camera poses for available equipment, determines the most efficient image size, and also performs checks for lens diffraction, minimum focal distance, and adequate depth of field. Integrating a finite element model with these calculations further extends planning capabilities by allowing (1) an accurate definition of the volume to be imaged and (2) the ability to estimate response displacements in pixels due to an arbitrary excitation applied to the test object. This latter capability is critical for pre-test determination of the chosen camera setup’s ability to successfully extract three-dimensional measurements. The theory and workflow are presented along with an experimental demonstration.

...read moreread less

1 citations

Journal Article•DOI•

Precise calibration of dynamic geometric parameters cameras for aerial mapping

[...]

Zhang Hongwen¹, Guoqin Yuan¹, Liu Xueji¹•Institutions (1)

Chinese Academy of Sciences¹

01 Feb 2022-Optics and Lasers in Engineering

TL;DR: In this paper, a calibration method for aerial mapping on a moving platform can be transformed into a mapping of a stationary platform by using the proposed method, which is shown that the dynamic measurement is identical to static measurement.

...read moreread less

Journal Article•DOI•

Single-shot imaging through scattering medium with a deterministic phase-retrieval algorithm

[...]

Bingxin Tian, Lei Zhu¹, Bingcai Liu, Jun Han•Institutions (1)

Xidian University¹

01 Mar 2022-Optics Communications

TL;DR: Wang et al. as discussed by the authors proposed a deterministic phase-retrieval algorithm to fully reconstruct the object from its speckle pattern and experimentally demonstrated that the new method can retrieve a high-quality image from a single-shot specke pattern.

...read moreread less

Journal Article•DOI•

Multi-scale analysis on the tensile properties of UHPC considering fiber orientation

[...]

Jiangjiang Yu¹, Boshan Zhang¹, Weizhen Chen¹, Hang Liu•Institutions (1)

Tongji University¹

15 Jan 2022-Composite Structures

TL;DR: In this paper, the effect of fiber orientation distribution on the multi-scale tensile properties of ultra-high performance concrete (UHPC) was investigated and the orientation distribution function of non-aligned inclusions was reconstructed by using the hybrid closure approximation method.

...read moreread less