scispace - formally typeset
Proceedings ArticleDOI

Depth estimation from single image using machine learning techniques

TLDR
The predicted depth maps are reliable, accurate and very close to ground truth depths which is validated using objective measures: RMSE, PSNR, SSIM and subjective measure: MOS score.
Abstract
In this paper, the problem of depth estimation from single monocular image is considered. The depth cues such as motion, stereo correspondences are not present in single image which makes the task more challenging. We propose a machine learning based approach for extracting depth information from single image. The deep learning is used for extracting features, then, initial depths are generated using manifold learning in which neighborhood preserving embedding algorithm is used. Then, fixed point supervised learning is applied for sequential labeling to obtain more consistent and accurate depth maps. The features used are initial depths obtained from manifold learning and various image based features including texture, color and edges which provide useful information about depth. A fixed point contraction mapping function is generated using which depth map is predicted for new structured input image. The transfer learning approach is also used for improvement in learning in a new task through the transfer of knowledge from a related task that has already been learned. The predicted depth maps are reliable, accurate and very close to ground truth depths which is validated using objective measures: RMSE, PSNR, SSIM and subjective measure: MOS score.

read more

Citations
More filters
Journal ArticleDOI

Identification of Fruit Tree Pests With Deep Learning on Embedded Drone to Achieve Accurate Pesticide Spraying

TL;DR: In this paper, a Tiny-YOLOv3 neural network model built on an embedded system NVIDIA Jetson TX2 was used to recognize T papillosa in the orchard to determine the position of the pests in real-time.
Proceedings ArticleDOI

Real-Time Hazard Symbol Detection and Localization Using UAV Imagery

TL;DR: An architecture that identifies and locates objects of interest in real-time using low-cost hardware and a state of the art object detection algorithm that avoids the use of expensive LIDAR sensors and UAVs, but still manages to determine the position of specific predefined objects in the field with high precision.
References
More filters
Proceedings ArticleDOI

Deep face recognition

TL;DR: It is shown how a very large scale dataset can be assembled by a combination of automation and human in the loop, and the trade off between data purity and time is discussed.
Proceedings Article

Depth Map Prediction from a Single Image using a Multi-Scale Deep Network

TL;DR: In this article, two deep network stacks are employed to make a coarse global prediction based on the entire image, and another to refine this prediction locally, which achieves state-of-the-art results on both NYU Depth and KITTI.
Journal ArticleDOI

Make3D: Learning 3D Scene Structure from a Single Still Image

TL;DR: This work considers the problem of estimating detailed 3D structure from a single still image of an unstructured environment and uses a Markov random field (MRF) to infer a set of "plane parameters" that capture both the 3D location and 3D orientation of the patch.
Book ChapterDOI

Efficient large-scale stereo matching

TL;DR: A novel approach to binocular stereo for fast matching of high-resolution images by building a prior on the disparities by forming a triangulation on a set of support points which can be robustly matched, reducing the matching ambiguities of the remaining points.
Journal ArticleDOI

3-D Depth Reconstruction from a Single Still Image

TL;DR: This work proposes a model that incorporates both monocular cues and stereo (triangulation) cues, to obtain significantly more accurate depth estimates than is possible using either monocular or stereo cues alone.
Related Papers (5)