One Stage Monocular 3D Object Detection Utilizing Discrete Depth and Orientation Representation

doi:10.1109/tits.2022.3175198

Journal ArticleDOI

One Stage Monocular 3D Object Detection Utilizing Discrete Depth and Orientation Representation

- 01 Nov 2022 -

IEEE Transactions on Intelligent Transpo...

- Vol. 23, Iss: 11, pp 21630-21640

Chats0

TLDR

In this article , a monocular 3D object detection method that utilizes the discrete depth and orientation representation was proposed to predict object locations on 3D space utilizing keypoint detection on the object's center point.

Abstract:

On-road object detection is a critical component in an autonomous driving system. The safety of the vehicle can only be as good as the reliability of the on-road object detection system. Thus, developing a fast and robust object detection algorithm has been the primary goal of many automotive industries and institutes. In recent years, multi-purpose vision-based driver assistance systems have gained popularity with the emergence of a deep neural network. A monocular camera has been developed to locate an object in the image plane and estimate the distance of the said object in the real world or the vehicle plane. In this work, we present a monocular 3D object detection method that utilizes the discrete depth and orientation representation. Our proposed method strives to predict object locations on 3D space utilizing keypoint detection on the object’s center point. To improve the point detection, we employ center regression on the objects segmentation mask, reducing the detection offset significantly. The simplicity of our proposed network architecture and its one-stage approach allows our algorithm to achieve competitive speed compared with prior methods. Our proposed method is able to achieve 26.93% detection score on the Cityscapes 3D object detection dataset, outperforming the preceding monocular method by a margin of 2.8 points.

One Stage Monocular 3D Object Detection Utilizing Discrete Depth and Orientation Representation

Citations

Touchless Head-Control (THC): Head Gesture Recognition for Cursor and Orientation Control

Real-Time 3D Object Detection and Classification in Autonomous Driving Environment Using 3D LiDAR and Camera Sensors

Unsupervised Cross-Domain Adaptation through Mutual Mean Learning and GANs for Person Re-identification

References

Multi-view 3D Object Detection Network for Autonomous Driving

Object scene flow for autonomous vehicles

Deep Ordinal Regression Network for Monocular Depth Estimation

On-road vehicle detection: a review

Pyramid Stereo Matching Network

Related Papers (5)

Efficient pedestrian detection with enhanced object segmentation in far IR night vision

One Stage Monocular 3D Object Detection Utilizing Discrete Depth and Orientation Representation

Monocular-based pose estimation using vanishing points for indoor image correction

Pointing gesture-based unknown object extraction for learning objects with robot

A Novel Illumination-Invariant Loss for Monocular 3D Pose Estimation