scispace - formally typeset
Search or ask a question

Showing papers on "Orientation (computer vision) published in 2022"


Journal ArticleDOI
TL;DR: A multi-attention augmented network, which mainly consists of content-, orientation- and position-aware modules, is proposed, which develops an attention augmented U-net structure to form the content-aware module in order to learn and combine multi-scale informative features within a large receptive field.

24 citations


Journal ArticleDOI
TL;DR: A novel calibration framework is proposed based on a single camera and computer vision techniques using ArUco markers used for kinematic modeling of the robot to avoid the singularity and shows that the method is usable in real world scenarios.
Abstract: With the increasing involvement of industrial robots in manufacturing processes, the demand for high quality robots has increased considerably. A high-quality robot is a robot having good repeatability and accuracy. Industrial robots are known to have very good repeatability, however it is not the same with accuracy. Due to harsh working conditions, accuracy of robots deteriorate over time. Calibration is a practical approach to sustain accuracy. In calibration, position and orientation of the tool center point (TCP) of a robot arm should be corrected using a tracking device with higher accuracy. Different devices such as laser-trackers, optical CMMs, and stereo cameras have been used in the literature. In this paper, a novel calibration framework is proposed based on a single camera and computer vision techniques using ArUco markers. The product of exponentials method is used for kinematic modeling of the robot to avoid the singularity. The performance of the framework is tested using computer-based simulations and using a six degree of freedom (6-DOF) UR5 robotic manipulator. Position and orientation errors are used as metrics in the experiments. The position and orientation errors in real world experiments reached to 2.5 mm and 0.2°, respectively. The result shows that the method is usable in real world scenarios.

20 citations


Journal ArticleDOI
Achraf Daoui1, Hicham Karmouni1, Omar El Ogri1, Mhamed Sayyouri1, Hassan Qjidaa1 
Abstract: In this work, we first present a modified version of the traditional logistic chaotic map. The proposed version contains an additional parameter that is used to increase the security level of the proposed digital image copyright protection scheme. The latter merges two methods of image copyright protection, namely the image zero-watermarking and image encryption, which provides a high level of security when communicating images via the Internet. Next, we discuss the influence of geometric attacks on the efficiency of the proposed scheme, and then we introduce an efficient solution that can resist such attacks. The proposed solution involves the use of Sine Cosine Algorithm (SCA) with an appropriate algorithm suitable for the correction of geometric attacks (image translation, orientation and its combination) applied to the encrypted image. On the one hand, the simulation results show that the proposed scheme provides a high level of security and can resist various attacks (differential, common image processing, geometric, etc.). On the other hand, the conducted comparison in terms of robustness against geometric attacks clearly demonstrates the superiority of our scheme over recent image encryption ones.

19 citations


Journal ArticleDOI
TL;DR: Li et al. as discussed by the authors proposed a self-learning conditional network with multi-scale features for photometric stereo, avoiding blurred reconstruction in high-frequency regions (such as crinkles and edges), caused by spectral bias.
Abstract: Photometric stereo aims to reconstruct 3D geometry by recovering the dense surface orientation of a 3D object from multiple images under differing illumination. Traditional methods normally adopt simplified reflectance models to make the surface orientation computable. However, the real reflectances of surfaces greatly limit applicability of such methods to real-world objects. While deep neural networks have been employed to handle non-Lambertian surfaces, these methods are subject to blurring and errors, especially in high-frequency regions (such as crinkles and edges), caused by spectral bias: neural networks favor low-frequency representations so exhibit a bias towards smooth functions. In this paper, therefore, we propose a self-learning conditional network with multi-scale features for photometric stereo, avoiding blurred reconstruction in such regions. Our explorations include: (i) a multi-scale feature fusion architecture, which keeps high-resolution representations and deep feature extraction, simultaneously, and (ii) an improved gradient-motivated conditionally parameterized convolution (GM-CondConv) in our photometric stereo network, with different combinations of convolution kernels for varying surfaces. Extensive experiments on public benchmark datasets show that our calibrated photometric stereo method outperforms the state-of-the-art.

11 citations


Journal ArticleDOI
TL;DR: Wang et al. as mentioned in this paper proposed a 3D object detection method, Stereo CenterNet (SC), using geometric information in stereo images, which predicts the four semantic key points of the 3D bounding box of the object in space and utilizes 2D left and right boxes, 3D dimension, orientation, and key points to restore the bounding boxes of an object in the space.

9 citations


Journal ArticleDOI
TL;DR: A system to determine the position and orientation fast and precisely, by the collaboration measurement of an Unmanned Aerial Vehicle and two Robotic Total Stations, based on the theory of Massive Moving Control Points for Free-stationing (MMCPF).

7 citations


Journal ArticleDOI
TL;DR: In this article, a feature descriptor using Histogram of Oriented Mosaic Gradient (HOMG) was proposed to extract spatial-spectral features directly from mosaic spectral images.
Abstract: This paper presents a feature descriptor using Histogram of Oriented Mosaic Gradient (HOMG) that extracts spatial-spectral features directly from mosaic spectral images. Spectral imaging utilizes unique spectral signatures to distinguish objects of interest in the scene more discriminatively. Snapshot spectral cameras equipped with spectral filter arrays (SFAs) capture spectral videos in real time, making it possible to detect/track fast moving targets based on spectral imaging. How to effectively extract the spatial-spectral feature directly from the mosaic spectral images acquired by snapshot spectral cameras is a core issue for detection/tracking. So far, there is a lack of comprehensive and in-depth research on this issue. To this end, this paper proposed a new spatial-spectral feature extractor for mosaic spectral images. The proposed scheme finds two forms of SFA neighborhood (SFAN) to construct a feature extractor suitable for any SFA structure. Exploiting the spatial-spectral correlation in two SFANs, we design six mosaic spatial-spectral gradient operators to compute spatial-spectral gradient maps (SGMs). HOMG descriptors are constructed using the magnitude and orientation of SGMs. The effectiveness and generalizability of the proposed method have been verified with object tracking experiments. Compared to the state-of-the-art feature descriptors, HOMG ranked first on two datasets captured with snapshot spectral camera with different SFAs, achieving a gain of 3.9% and 5.9% in average success rate over the second-ranked feature.

6 citations


Journal ArticleDOI
TL;DR: In this article, a multi-task neural network is proposed for improving the quality and visualization of medical chest X-ray images. But, the proposed model is not suitable for the medical image data management, as the diagnostic scans are essentially stored as unstructured data, lacking a well-defined processing methodology for enabling intelligent image management for supporting applications like similar patient retrieval, automated disease prediction etc.
Abstract: The detailed physiological perspectives captured by medical imaging provides actionable insights to doctors to manage comprehensive care of patients. However, the quality of such diagnostic image modalities is often affected by mismanagement of the image capturing process by poorly trained technicians and older/poorly maintained imaging equipment. Further, a patient is often subjected to scanning at different orientations to capture the frontal, lateral and sagittal views of the affected areas. Due to the large volume of diagnostic scans performed at a modern hospital, adequate documentation of such additional perspectives is mostly overlooked, which is also an essential key element of quality diagnostic systems and predictive analytics systems. Another crucial challenge affecting effective medical image data management is that the diagnostic scans are essentially stored as unstructured data, lacking a well-defined processing methodology for enabling intelligent image data management for supporting applications like similar patient retrieval , automated disease prediction etc. One solution is to incorporate automated diagnostic image descriptions of the observation/findings by leveraging computer vision and natural language processing. In this work, we present multi-task neural models capable of addressing these critical challenges. We propose ESRGAN, an image enhancement technique for improving the quality and visualization of medical chest x-ray images, thereby substantially improving the potential for accurate diagnosis, automatic detection and region-of-interest segmentation. We also propose a CNN-based model called ViewNet for predicting the view orientation of the x-ray image and generating a medical report using Xception net, thus facilitating a robust medical image management system for intelligent diagnosis applications. Experimental results are demonstrated using standard metrics like BRISQUE, PIQE and BLEU scores, indicating that the proposed models achieved excellent performance. Further, the proposed deep learning approaches enable diagnosis in a lesser time and their hybrid architecture shows significant potential for supporting many intelligent diagnosis applications.

3 citations


Journal ArticleDOI
TL;DR: Wang et al. as mentioned in this paper proposed a 3D reconstruction and positioning method based on a single detecting camera and geometric constraint to evaluate the defect quantitatively, which can limit the accuracy of surface feature positioning to 0.7 mm.
Abstract: This paper proposes a 3D reconstruction and positioning method based on a single detecting camera and geometric constraint to evaluate the defect quantitatively. We realize the camera orientation through datum point coordinates acquired by a photogrammetry system and then solve the equation of the defect detection camera model and surface constraints to reconstruct the color point cloud in 3D space. To realize the reconstruction and orientation of defects on objects without marker points, the system is operated by a repetitive moving of robotic arms. Finally, the experiment achieves the reconstruction of three types of products with marker points and one of the same types of objects without marks. The experiment shows that this method can limit the accuracy of surface feature positioning to 0.7 mm, and relative accuracy of the area calculation to 0.125%.

2 citations


Book ChapterDOI
01 Jan 2022
TL;DR: A planning tool that utilizes projective and Euclidian geometry to iteratively estimate optimal camera poses for available equipment, determines the most efficient image size, and also performs checks for lens diffraction, minimum focal distance, and adequate depth of field is described.
Abstract: Stereophotogrammetry makes use of calibrated camera pairs to obtain three-dimensional information from two-dimensional images. The accuracy of the extracted measurements is extremely dependent on the selection and setup of the camera system. For a given test object and desired viewing orientation, there is no one “correct” stereo camera setup, but rather a range of potential setups with some approaching an optimal system with respect to maximizing the measurement resolution. The open-ended nature of this test design exercise is compounded by equipment availability and the fact that many of the setup parameters have dependent characteristics, e.g., changing focal distance will affect stand-off distance, field of view, and image projection, among others. This work describes a planning tool that utilizes projective and Euclidian geometry to iteratively estimate optimal camera poses for available equipment, determines the most efficient image size, and also performs checks for lens diffraction, minimum focal distance, and adequate depth of field. Integrating a finite element model with these calculations further extends planning capabilities by allowing (1) an accurate definition of the volume to be imaged and (2) the ability to estimate response displacements in pixels due to an arbitrary excitation applied to the test object. This latter capability is critical for pre-test determination of the chosen camera setup’s ability to successfully extract three-dimensional measurements. The theory and workflow are presented along with an experimental demonstration.

1 citations


Journal ArticleDOI
TL;DR: In this paper, a calibration method for aerial mapping on a moving platform can be transformed into a mapping of a stationary platform by using the proposed method, which is shown that the dynamic measurement is identical to static measurement.

Journal ArticleDOI
TL;DR: Wang et al. as discussed by the authors proposed a deterministic phase-retrieval algorithm to fully reconstruct the object from its speckle pattern and experimentally demonstrated that the new method can retrieve a high-quality image from a single-shot specke pattern.

Journal ArticleDOI
TL;DR: In this paper, the effect of fiber orientation distribution on the multi-scale tensile properties of ultra-high performance concrete (UHPC) was investigated and the orientation distribution function of non-aligned inclusions was reconstructed by using the hybrid closure approximation method.