scispace - formally typeset
Open AccessJournal ArticleDOI

Classification With an Edge: Improving Semantic Image Segmentation with Boundary Detection

Reads0
Chats0
TLDR
This work constructs a comparatively simple, memory-efficient model by adding boundary detection to the Segnet encoder-decoder architecture, and includes boundary detection in FCN-type models and sets up a high-end classifier ensemble, showing that boundary detection significantly improves semantic segmentation with CNNs.
Abstract
We present an end-to-end trainable deep convolutional neural network (DCNN) for semantic segmentation with built-in awareness of semantically meaningful boundaries. Semantic segmentation is a fundamental remote sensing task, and most state-of-the-art methods rely on DCNNs as their workhorse. A major reason for their success is that deep networks learn to accumulate contextual information over very large receptive fields. However, this success comes at a cost, since the associated loss of effective spatial resolution washes out high-frequency details and leads to blurry object boundaries. Here, we propose to counter this effect by combining semantic segmentation with semantically informed edge detection, thus making class boundaries explicit in the model. First, we construct a comparatively simple, memory-efficient model by adding boundary detection to the segnet encoder-decoder architecture. Second, we also include boundary detection in fcn-type models and set up a high-end classifier ensemble. We show that boundary detection significantly improves semantic segmentation with CNNs in an end-to-end training scheme. Our best model achieves >90% overall accuracy on the ISPRS Vaihingen benchmark.

read more

Citations
More filters
Proceedings ArticleDOI

DeepGlobe 2018: A Challenge to Parse the Earth through Satellite Images

TL;DR: The DeepGlobe 2018 Satellite Image Understanding Challenge is presented, which includes three public competitions for segmentation, detection, and classification tasks on satellite images, and characteristics of each dataset are analyzed, and evaluation criteria for each task are defined.
Journal ArticleDOI

Deep learning based multi-temporal crop classification

TL;DR: This study shows that the Conv1D-based deep learning framework provides an effective and efficient method of time series representation in multi-temporal classification tasks.
Journal ArticleDOI

ResUNet-a: A deep learning framework for semantic segmentation of remotely sensed data

TL;DR: In this article, a novel deep learning architecture, ResUNet-a, is proposed for the task of semantic segmentation of monotemporal very high-resolution aerial images.
Proceedings ArticleDOI

Look at Boundary: A Boundary-Aware Face Alignment Algorithm

TL;DR: Wu et al. as mentioned in this paper proposed a boundary-aware face alignment algorithm by utilizing boundary lines as the geometric structure of a human face to help facial landmark localisation, which achieves 3.49% mean error on 300-W Fullset, which outperforms state-of-the-art methods by a large margin.
Journal ArticleDOI

Unmanned Aerial Vehicle for Remote Sensing Applications—A Review

TL;DR: This paper performs a critical review on RS tasks that involve UAV data and their derived products as their main sources including raw perspective images, digital surface models, and orthophotos and focuses on solutions that address the “new” aspects of the U drone data including ultra-high resolution; availability of coherent geometric and spectral data; and capability of simultaneously using multi-sensor data for fusion.
References
More filters
Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

TL;DR: This work investigates the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting using an architecture with very small convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers.
Proceedings ArticleDOI

Fully convolutional networks for semantic segmentation

TL;DR: The key insight is to build “fully convolutional” networks that take input of arbitrary size and produce correspondingly-sized output with efficient inference and learning.
Journal ArticleDOI

SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation

TL;DR: Quantitative assessments show that SegNet provides good performance with competitive inference time and most efficient inference memory-wise as compared to other architectures, including FCN and DeconvNet.
Posted Content

Distilling the Knowledge in a Neural Network

TL;DR: This work shows that it can significantly improve the acoustic model of a heavily used commercial system by distilling the knowledge in an ensemble of models into a single model and introduces a new type of ensemble composed of one or more full models and many specialist models which learn to distinguish fine-grained classes that the full models confuse.
Book ChapterDOI

Visualizing and Understanding Convolutional Networks

TL;DR: A novel visualization technique is introduced that gives insight into the function of intermediate feature layers and the operation of the classifier in large Convolutional Network models, used in a diagnostic role to find model architectures that outperform Krizhevsky et al on the ImageNet classification benchmark.
Related Papers (5)