Open Access Proceedings Article

Learning Lightweight Lane Detection CNNs by Self Attention Distillation

TL;DR
Self Attention Distillation (SAD), as discussed by the authors, is a knowledge distillation approach that allows a model to learn from itself and gain substantial improvement without any additional supervision or labels.
Abstract
Training deep models for lane detection is challenging due to the very subtle and sparse supervisory signals inherent in lane annotations. Without learning from much richer context, these models often fail in challenging scenarios, e.g., severe occlusion, ambiguous lanes, and poor lighting conditions. In this paper, we present a novel knowledge distillation approach, i.e., Self Attention Distillation (SAD), which allows a model to learn from itself and gain substantial improvement without any additional supervision or labels. Specifically, we observe that attention maps extracted from a model trained to a reasonable level encode rich contextual information. This valuable contextual information can be used as a form of 'free' supervision for further representation learning, by performing top-down and layer-wise attention distillation within the network itself. SAD can be easily incorporated into any feed-forward convolutional neural network (CNN) and does not increase the inference time. We validate SAD on three popular lane detection benchmarks (TuSimple, CULane and BDD100K) using lightweight models such as ENet, ResNet-18 and ResNet-34. The lightest model, ENet-SAD, performs comparably to or even surpasses existing algorithms. Notably, ENet-SAD has 20× fewer parameters and runs 10× faster than the state-of-the-art SCNN, while still achieving compelling performance in all benchmarks.
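To make the mechanism concrete, below is a minimal PyTorch sketch of layer-wise attention distillation in the spirit of SAD: a spatial attention map is formed by aggregating squared activations across channels, and the deeper block's map serves as a 'free' target for the shallower block. Function and variable names are illustrative, not the authors' code.

```python
import torch
import torch.nn.functional as F

def attention_map(feat, out_size):
    # Collapse channels into a spatial attention map (squared activations),
    # resize to a common resolution, then L2-normalize.
    amap = feat.pow(2).mean(dim=1, keepdim=True)            # (N, 1, H, W)
    amap = F.interpolate(amap, size=out_size, mode='bilinear',
                         align_corners=False)
    return F.normalize(amap.flatten(1), p=2, dim=1)         # (N, H*W)

def sad_loss(feat_shallow, feat_deep):
    # Top-down distillation: the deeper block's attention map is the
    # (detached) target for the shallower block's map.
    size = feat_shallow.shape[-2:]
    target = attention_map(feat_deep, size).detach()
    return F.mse_loss(attention_map(feat_shallow, size), target)

# Usage: add one sad_loss term per successive block pair to the task loss.
f2 = torch.randn(4, 64, 45, 80)     # dummy block-2 features
f3 = torch.randn(4, 128, 23, 40)    # dummy block-3 features
loss = sad_loss(f2, f3)
```

Because the distillation terms only touch training, the deployed network is unchanged, which is why SAD adds no inference cost.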



Citations
Journal Article

Knowledge Distillation: A Survey

TL;DR: A comprehensive survey of knowledge distillation from the perspectives of knowledge categories, training schemes, teacher-student architecture, distillation algorithms, performance comparison and applications can be found in this paper.
Posted Content

Ultra Fast Structure-aware Deep Lane Detection

TL;DR: A novel, simple, yet effective formulation aimed at extremely fast speed and challenging scenarios, which treats lane detection as a row-based selecting problem using global features and proposes a structural loss to explicitly model the structure of lanes.
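A hedged sketch of what such a row-based formulation can look like in PyTorch: instead of segmenting every pixel, each lane is predicted as one cell choice per row anchor (plus an 'absent' class) from a global feature vector. The dimensions below are assumptions for illustration, not the paper's exact configuration.

```python
import torch
import torch.nn as nn

class RowAnchorHead(nn.Module):
    """For each lane and each row anchor, classify which of num_cols
    horizontal cells the lane crosses, plus one extra 'absent' class."""
    def __init__(self, feat_dim=1800, num_rows=18, num_cols=100, num_lanes=4):
        super().__init__()
        self.shape = (num_lanes, num_rows, num_cols + 1)
        self.fc = nn.Linear(feat_dim, num_lanes * num_rows * (num_cols + 1))

    def forward(self, global_feat):                  # (N, feat_dim)
        return self.fc(global_feat).view(-1, *self.shape)

head = RowAnchorHead()
logits = head(torch.randn(2, 1800))                 # (2, 4, 18, 101)
```

Selecting one cell per row is a far smaller prediction space than per-pixel segmentation, which is where the speed advantage comes from.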
Journal Article

Visual Perception Enabled Industry Intelligence: State of the Art, Challenges and Prospects

TL;DR: This paper reviews previous research and applications of visual perception in industrial fields such as product surface defect detection, intelligent agricultural production, intelligent driving, image synthesis, and event reconstruction.
Book Chapter

Ultra Fast Structure-aware Deep Lane Detection

TL;DR: In this paper, the authors propose a novel, simple, yet effective formulation aiming at extremely fast speed and challenging scenarios, which treats lane detection as a row-based selecting problem using global features.
Posted Content

Key Points Estimation and Point Instance Segmentation Approach for Lane Detection.

TL;DR: A traffic line detection method called Point Instance Network (PINet), based on key point estimation and an instance segmentation approach, which achieves competitive accuracy and false positive rates on the TuSimple and CULane datasets.
References
Proceedings Article

Deep Residual Learning for Image Recognition

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won 1st place on the ILSVRC 2015 classification task.
Posted Content

Distilling the Knowledge in a Neural Network

TL;DR: This work shows that the acoustic model of a heavily used commercial system can be significantly improved by distilling the knowledge in an ensemble of models into a single model, and introduces a new type of ensemble composed of one or more full models and many specialist models which learn to distinguish fine-grained classes that the full models confuse.
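The soft-target objective described here can be sketched in a few lines of PyTorch: the student matches the teacher's temperature-softened class distribution while also fitting the hard labels. The temperature T and weight alpha below are illustrative hyperparameters, not values from the paper.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.9):
    # Soft targets: KL divergence between temperature-softened distributions,
    # scaled by T^2 to keep gradient magnitudes comparable across temperatures.
    soft = F.kl_div(F.log_softmax(student_logits / T, dim=1),
                    F.softmax(teacher_logits / T, dim=1),
                    reduction='batchmean') * (T * T)
    # Hard targets: ordinary cross-entropy with the true labels.
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1 - alpha) * hard
```

SAD borrows this distillation idea but removes the separate teacher: the deeper layers of the same network play the teacher's role.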
Proceedings Article

Spatial Transformer Networks

TL;DR: This work introduces a new learnable module, the Spatial Transformer, which explicitly allows the spatial manipulation of data within the network, and can be inserted into existing convolutional architectures, giving neural networks the ability to actively spatially transform feature maps.
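A minimal PyTorch sketch of such a module for the affine case: a small localization network predicts a 2x3 transform, which is turned into a sampling grid and applied to the feature map. The tiny localization net here is an assumption for brevity, not the paper's architecture.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SpatialTransformer(nn.Module):
    """Predict a 2x3 affine matrix from the feature map, then resample it."""
    def __init__(self, channels):
        super().__init__()
        self.loc = nn.Sequential(
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
            nn.Linear(channels, 6))
        # Initialize to the identity transform so training starts stably.
        self.loc[-1].weight.data.zero_()
        self.loc[-1].bias.data.copy_(
            torch.tensor([1, 0, 0, 0, 1, 0], dtype=torch.float))

    def forward(self, x):
        theta = self.loc(x).view(-1, 2, 3)
        grid = F.affine_grid(theta, x.size(), align_corners=False)
        return F.grid_sample(x, grid, align_corners=False)

y = SpatialTransformer(16)(torch.randn(2, 16, 32, 32))
```

Because sampling is differentiable, the transform parameters are learned end-to-end with the rest of the network, which is what makes the module drop-in for existing architectures.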
Proceedings Article

Multi-Scale Context Aggregation by Dilated Convolutions

TL;DR: This work develops a new convolutional network module that is specifically designed for dense prediction, and shows that the presented context module increases the accuracy of state-of-the-art semantic segmentation systems.
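A hedged sketch of a context module in this spirit: stacked 3x3 convolutions with exponentially increasing dilation grow the receptive field without pooling, so spatial resolution is preserved for dense prediction. The channel count and exact dilation schedule below are illustrative.

```python
import torch
import torch.nn as nn

def context_module(channels=64, dilations=(1, 1, 2, 4, 8, 16, 1)):
    # With kernel 3, setting padding equal to the dilation keeps the
    # spatial size unchanged while the receptive field grows rapidly.
    layers = []
    for d in dilations:
        layers += [nn.Conv2d(channels, channels, 3, padding=d, dilation=d),
                   nn.ReLU(inplace=True)]
    return nn.Sequential(*layers)

out = context_module()(torch.randn(1, 64, 64, 64))   # still 64x64
```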
Book Chapter

Large-Scale Machine Learning with Stochastic Gradient Descent

Léon Bottou
TL;DR: A more precise analysis uncovers qualitatively different tradeoffs between small-scale and large-scale learning problems.
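For reference, a single stochastic gradient step has this form in PyTorch; the learning rate and squared-error loss are illustrative choices, not specifics from the paper.

```python
import torch

# One stochastic gradient step, w <- w - lr * grad, on a single example.
w = torch.zeros(3, requires_grad=True)
x, y = torch.randn(3), torch.tensor(1.0)      # one (example, label) pair
loss = (w @ x - y).pow(2)                     # squared error on that example
loss.backward()
with torch.no_grad():
    w -= 0.01 * w.grad                        # lr = 0.01, an assumption
    w.grad.zero_()
```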