scispace - formally typeset
Open AccessProceedings ArticleDOI

Decoupled Dynamic Filter Networks

Reads0
Chats0
Abstract
Convolution is one of the basic building blocks of CNN architectures. Despite its common use, standard convolution has two main shortcomings: Content-agnostic and Computation-heavy. Dynamic filters are content-adaptive, while further increasing the computational overhead. Depth-wise convolution is a lightweight variant, but it usually leads to a drop in CNN performance or requires a larger number of channels. In this work, we propose the Decoupled Dynamic Filter (DDF) that can simultaneously tackle both of these shortcomings. Inspired by recent advances in attention, DDF decouples a depth-wise dynamic filter into spatial and channel dynamic filters. This decomposition considerably reduces the number of parameters and limits computational costs to the same level as depth-wise convolution. Meanwhile, we observe a significant boost in performance when replacing standard convolution with DDF in classification networks. ResNet50 / 101 get improved by 1.9% and 1.3% on the top-1 accuracy, while their computational costs are reduced by nearly half. Experiments on the detection and joint upsampling networks also demonstrate the superior performance of the DDF upsampling variant (DDF-Up) in comparison with standard convolution and specialized content-adaptive layers. The project page with code is available 1.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Dynamic Spatial Propagation Network for Depth Completion

TL;DR: The Dynamic Spatial Propagation Network (DySPN) is introduced, an efficient model that learns the affinity among neighboring pixels with an attention-based, dynamic approach and outperforms other state-of-the-art (SoTA) methods on KITTI Depth Completion evaluation by the time of submission.
Journal ArticleDOI

LAGConv: Local-Context Adaptive Convolution Kernels with Global Harmonic Bias for Pansharpening

TL;DR: In this article , a local context adaptive (LCA) convolution kernel was proposed for remote sensing pansharpening, which can replace the standard convolution that is context-agnostic to fully perceive the particularity of each pixel.
Journal ArticleDOI

BFP Net: Balanced Feature Pyramid Network for Small Apple Detection in Complex Orchard Environment

TL;DR: A balanced feature pyramid network (BFP Net) for small apple detection that can balance information mapped to small apples from two perspectives: multiple-scale fruits on the different layers of FPN and a characteristic of a new extended feature from the output of ResNet50 conv1.
Journal ArticleDOI

Generative Adaptive Convolutions for Real-World Noisy Image Denoising

TL;DR: This work proposes a novel flexible and adaptive denoising network, coined as FADNet, equipped with a plane dynamic filter module, which generates weight filters with flexibility that can adapt to the specific input and thereby impedes the FAD net from overfitting to the training data.
Journal ArticleDOI

Personalized motion kernel learning for human pose estimation

TL;DR: This paper proposes a novel approach to learn specific keypoint motion representations for each person, termed Personalized Motion‐Aware Network (PMAN), which surpasses the state‐of‐the‐art method by +1.7 mAP and achieves 82.9 mAP on PoseTrack2017 dataset.
References
More filters
Proceedings ArticleDOI

Deep Residual Learning for Image Recognition

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.
Proceedings Article

Adam: A Method for Stochastic Optimization

TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.
Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

TL;DR: The state-of-the-art performance of CNNs was achieved by Deep Convolutional Neural Networks (DCNNs) as discussed by the authors, which consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.
Proceedings ArticleDOI

ImageNet: A large-scale hierarchical image database

TL;DR: A new database called “ImageNet” is introduced, a large-scale ontology of images built upon the backbone of the WordNet structure, much larger in scale and diversity and much more accurate than the current image datasets.
Book ChapterDOI

Microsoft COCO: Common Objects in Context

TL;DR: A new dataset with the goal of advancing the state-of-the-art in object recognition by placing the question of object recognition in the context of the broader question of scene understanding by gathering images of complex everyday scenes containing common objects in their natural context.
Related Papers (5)