scispace - formally typeset
Open AccessProceedings ArticleDOI

VPGNet: Vanishing Point Guided Network for Lane and Road Marking Detection and Recognition

TLDR
In this paper, a unified end-to-end trainable multi-task network that jointly handles lane and road marking detection and recognition that is guided by a vanishing point under adverse weather conditions is proposed.
Abstract
In this paper, we propose a unified end-to-end trainable multi-task network that jointly handles lane and road marking detection and recognition that is guided by a vanishing point under adverse weather conditions. We tackle rainy and low illumination conditions, which have not been extensively studied until now due to clear challenges. For example, images taken under rainy days are subject to low illumination, while wet roads cause light reflection and distort the appearance of lane and road markings. At night, color distortion occurs under limited illumination. As a result, no benchmark dataset exists and only a few developed algorithms work under poor weather conditions. To address this shortcoming, we build up a lane and road marking benchmark which consists of about 20,000 images with 17 lane and road marking classes under four different scenarios: no rain, rain, heavy rain, and night. We train and evaluate several versions of the proposed multi-task network and validate the importance of each task. The resulting approach, VPGNet, can detect and classify lanes and road markings, and predict a vanishing point with a single forward pass. Experimental results show that our approach achieves high accuracy and robustness under various conditions in realtime (20 fps). The benchmark and the VPGNet model will be publicly available

read more

Citations
More filters
Book ChapterDOI

DILane: Dynamic Instance-Aware Network for Lane Detection

TL;DR: Li et al. as discussed by the authors proposed a self-attention module to gather global information in parallel, which remarkably improves detection accuracy, achieving an F1 score of 79.43% for CULane and 97.80% for TuSimple dataset while achieving 148+ FPS.
Proceedings ArticleDOI

Real-time End-to-End Lane ID Estimation Using Recurrent Networks

TL;DR: In this paper, a real-time, vision-only (i.e. monocular camera) solution is proposed to estimate the relative number or "ID" of the current driven lane within a multi-lane road or a highway.
Book ChapterDOI

Multi-Class Lane Semantic Segmentation of Expressway Dataset Based on Aerial View

TL;DR: In this paper , Deeplab-ERFC (DeepLab with Erosion Loss and a Fully-Connected Conditional Random Field) was proposed to improve boundary prediction performance using corrosion operation to estimate Hausdorff Distance (HD) and to reduce the generation of cavities through two gaussian kernel functions considering the color intensity and position relationship between pixels.
Journal ArticleDOI

Understanding Drivers’ Visual and Comprehension Loads in Traffic Violation Hotspots Leveraging Crowd-Based Driving Simulation

TL;DR: Wang et al. as mentioned in this paper proposed a driver-centered simulation platform to inspect drivers' visual and comprehension loads in traffic violation hotspots, based on the 3D point clouds of real-world traffic violations hotspots.
Journal ArticleDOI

Lane Detection Based on Instance Segmentation of BiSeNet V2 Backbone Network

TL;DR: In this article , a dual-branch neural network model for lane line image segmentation was designed based on BiSeNet V2 and discrete lane line feature points were operated through the clustering model.
References
More filters
Proceedings ArticleDOI

ImageNet: A large-scale hierarchical image database

TL;DR: A new database called “ImageNet” is introduced, a large-scale ontology of images built upon the backbone of the WordNet structure, much larger in scale and diversity and much more accurate than the current image datasets.
Proceedings ArticleDOI

Histograms of oriented gradients for human detection

TL;DR: It is shown experimentally that grids of histograms of oriented gradient (HOG) descriptors significantly outperform existing feature sets for human detection, and the influence of each stage of the computation on performance is studied.
Book ChapterDOI

Microsoft COCO: Common Objects in Context

TL;DR: A new dataset with the goal of advancing the state-of-the-art in object recognition by placing the question of object recognition in the context of the broader question of scene understanding by gathering images of complex everyday scenes containing common objects in their natural context.
Proceedings ArticleDOI

Fully convolutional networks for semantic segmentation

TL;DR: The key insight is to build “fully convolutional” networks that take input of arbitrary size and produce correspondingly-sized output with efficient inference and learning.
Proceedings ArticleDOI

You Only Look Once: Unified, Real-Time Object Detection

TL;DR: Compared to state-of-the-art detection systems, YOLO makes more localization errors but is less likely to predict false positives on background, and outperforms other detection methods, including DPM and R-CNN, when generalizing from natural images to other domains like artwork.