scispace - formally typeset
Open AccessProceedings ArticleDOI

VPGNet: Vanishing Point Guided Network for Lane and Road Marking Detection and Recognition

TLDR
In this paper, a unified end-to-end trainable multi-task network that jointly handles lane and road marking detection and recognition that is guided by a vanishing point under adverse weather conditions is proposed.
Abstract
In this paper, we propose a unified end-to-end trainable multi-task network that jointly handles lane and road marking detection and recognition that is guided by a vanishing point under adverse weather conditions. We tackle rainy and low illumination conditions, which have not been extensively studied until now due to clear challenges. For example, images taken under rainy days are subject to low illumination, while wet roads cause light reflection and distort the appearance of lane and road markings. At night, color distortion occurs under limited illumination. As a result, no benchmark dataset exists and only a few developed algorithms work under poor weather conditions. To address this shortcoming, we build up a lane and road marking benchmark which consists of about 20,000 images with 17 lane and road marking classes under four different scenarios: no rain, rain, heavy rain, and night. We train and evaluate several versions of the proposed multi-task network and validate the importance of each task. The resulting approach, VPGNet, can detect and classify lanes and road markings, and predict a vanishing point with a single forward pass. Experimental results show that our approach achieves high accuracy and robustness under various conditions in realtime (20 fps). The benchmark and the VPGNet model will be publicly available

read more

Citations
More filters
Proceedings ArticleDOI

BRNet: a Bilateral Lane Detection Framework Based on Recurrent Feature-Shift Aggregator

TL;DR: In this paper , a bilateral lane detection framework (BRNet) is proposed to acquire spatial features and rich contextual features by designing Spatial Path and Context Path and utilize a feature fusion module to achieve information integration for different-level features.
Journal ArticleDOI

Deep Inference Networks for Reliable Vehicle Lateral Position Estimation in Congested Urban Environments

TL;DR: Li et al. as mentioned in this paper proposed a novel deep inference network (DINet) to estimate vehicle lateral position, which can adequately address the challenges of road occlusion and the unreliability of employed reference objects.
Proceedings ArticleDOI

Human Knowledge and Visual Intelligence on SDX<sup>tension</sup>B

TL;DR: In this paper , the Self-Driving Sweeper Bot (SDSB) was reported to be extendable to be SDX, which can be used for road segmentation, rubbish, pedestrian and vehicle detections.
Journal ArticleDOI

Lane Marker Detection Based on Multihead Self-Attention

TL;DR: Wang et al. as discussed by the authors proposed a lane mark detection network based on multi-head self-attention, which can find spatial relationships among lane mark points in the global viewpoint and enlarge its feature map's receptive field equally.
References
More filters
Proceedings ArticleDOI

ImageNet: A large-scale hierarchical image database

TL;DR: A new database called “ImageNet” is introduced, a large-scale ontology of images built upon the backbone of the WordNet structure, much larger in scale and diversity and much more accurate than the current image datasets.
Proceedings ArticleDOI

Histograms of oriented gradients for human detection

TL;DR: It is shown experimentally that grids of histograms of oriented gradient (HOG) descriptors significantly outperform existing feature sets for human detection, and the influence of each stage of the computation on performance is studied.
Book ChapterDOI

Microsoft COCO: Common Objects in Context

TL;DR: A new dataset with the goal of advancing the state-of-the-art in object recognition by placing the question of object recognition in the context of the broader question of scene understanding by gathering images of complex everyday scenes containing common objects in their natural context.
Proceedings ArticleDOI

Fully convolutional networks for semantic segmentation

TL;DR: The key insight is to build “fully convolutional” networks that take input of arbitrary size and produce correspondingly-sized output with efficient inference and learning.
Proceedings ArticleDOI

You Only Look Once: Unified, Real-Time Object Detection

TL;DR: Compared to state-of-the-art detection systems, YOLO makes more localization errors but is less likely to predict false positives on background, and outperforms other detection methods, including DPM and R-CNN, when generalizing from natural images to other domains like artwork.