Open Access Proceedings ArticleDOI

Butterfly Transform: An Efficient FFT Based Neural Architecture Design

TLDR
In this paper, the butterfly operations from the FFT algorithm are extended to a general Butterfly Transform (BFT), which reduces the computational complexity of channel fusions, the main bottleneck in state-of-the-art efficient CNNs.
Abstract
In this paper, we show that extending the butterfly operations from the FFT algorithm to a general Butterfly Transform (BFT) can be beneficial in building an efficient block structure for CNN designs. Pointwise convolutions, which we refer to as channel fusions, are the main computational bottleneck in the state-of-the-art efficient CNNs (e.g. MobileNets). We introduce a set of criteria for channel fusion, and prove that BFT yields an asymptotically optimal FLOP count with respect to these criteria. By replacing pointwise convolutions with BFT, we reduce the computational complexity of these layers from O(n^2) to O(n log n) with respect to the number of channels. Our experimental evaluations show that our method results in significant accuracy gains across a wide range of network architectures, especially at low FLOP ranges. For example, BFT results in up to a 6.75% absolute Top-1 improvement for MobileNetV1, 4.4% for ShuffleNetV2 and 5.4% for MobileNetV3 on ImageNet under a similar number of FLOPs. Notably, ShuffleNetV2+BFT outperforms state-of-the-art architecture search methods MNasNet, FBNet and MobileNetV3 in the low FLOP regime.
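
To make the channel-fusion idea concrete, here is a rough sketch (not the authors' released code) of a butterfly-structured replacement for a dense 1x1 convolution: log2(n) stages of learnable 2x2 mixing blocks, with the pairing stride doubling at each stage. The class name, initialization, and power-of-two channel assumption are choices made for this example.

```python
import math
import torch
import torch.nn as nn


class ButterflyPointwise(nn.Module):
    """Sketch of a butterfly-structured channel fusion: log2(n) stages of
    2x2 channel mixing instead of a dense n x n pointwise (1x1) convolution,
    so the per-pixel fusion cost is O(n log n) rather than O(n^2)."""

    def __init__(self, channels):
        super().__init__()
        # This sketch assumes a power-of-two number of channels.
        assert channels > 1 and channels & (channels - 1) == 0
        self.channels = channels
        self.num_stages = int(math.log2(channels))
        # One learnable 2x2 mixing block per channel pair per stage.
        self.weights = nn.Parameter(0.1 * torch.randn(self.num_stages, channels // 2, 2, 2))

    def forward(self, x):
        # x: (batch, channels, height, width)
        b, c, h, w = x.shape
        stride = 1
        for s in range(self.num_stages):
            # Pair channel i with channel i + stride inside blocks of 2 * stride channels.
            xs = x.reshape(b, c // (2 * stride), 2, stride, h, w)
            top, bottom = xs[:, :, 0], xs[:, :, 1]  # each: (b, groups, stride, h, w)
            wmat = self.weights[s].reshape(c // (2 * stride), stride, 2, 2)
            new_top = wmat[..., 0, 0, None, None] * top + wmat[..., 0, 1, None, None] * bottom
            new_bottom = wmat[..., 1, 0, None, None] * top + wmat[..., 1, 1, None, None] * bottom
            x = torch.stack((new_top, new_bottom), dim=2).reshape(b, c, h, w)
            stride *= 2
        return x
```

For 64 channels, this sketch uses 6 · 32 · 4 = 768 mixing weights and roughly 2n·log2(n) = 768 multiply-adds per spatial position, versus 64² = 4096 for a dense pointwise convolution, which is the O(n log n) vs. O(n^2) trade-off described above.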

Citations
Posted Content

HiPPO: Recurrent Memory with Optimal Polynomial Projections

TL;DR: This formal framework yields a new memory update mechanism (HiPPO-LegS) that scales through time to remember all history, avoids priors on the timescale, and enjoys the theoretical benefits of timescale robustness, fast updates, and bounded gradients.
Posted Content

Graph Structure of Neural Networks

TL;DR: A novel graph-based representation of neural networks called the relational graph is developed, in which layers of neural network computation correspond to rounds of message exchange along the graph structure; the analysis shows that a "sweet spot" of relational graphs leads to neural networks with significantly improved predictive performance.
Posted Content

Mobile-Former: Bridging MobileNet and Transformer.

TL;DR: Mobile-Former, as presented in this paper, is a parallel design of MobileNet and Transformer with a two-way bridge in between that enables bidirectional fusion of local and global features.
Posted Content

Soft Threshold Weight Reparameterization for Learnable Sparsity

TL;DR: STR is a simple mechanism that learns effective sparsity budgets, in contrast with popular heuristics; it boosts accuracy over existing results by up to 10% in the ultra-sparse (99%) regime and can also be used to induce low-rank (structured sparsity) in RNNs.
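
As a hedged illustration of the soft-threshold reparameterization the title refers to (the module name, layer type, and initialization below are assumptions for the example, not the paper's code), the effective weight can be written as sign(w) · ReLU(|w| − sigmoid(s)) with a learnable threshold parameter s:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class STRLinear(nn.Module):
    """Sketch of soft threshold weight reparameterization: the effective
    weight is sign(w) * relu(|w| - sigmoid(s)), where s is learnable, so the
    layer's sparsity level is learned jointly with the weights."""

    def __init__(self, in_features, out_features, init_threshold=-10.0):
        super().__init__()
        self.weight = nn.Parameter(0.01 * torch.randn(out_features, in_features))
        self.bias = nn.Parameter(torch.zeros(out_features))
        # A very negative initial value keeps sigmoid(s) near 0, i.e. the layer starts dense.
        self.threshold = nn.Parameter(torch.tensor(init_threshold))

    def sparse_weight(self):
        t = torch.sigmoid(self.threshold)
        return torch.sign(self.weight) * F.relu(self.weight.abs() - t)

    def forward(self, x):
        return F.linear(x, self.sparse_weight(), self.bias)
```
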
Journal ArticleDOI

Enabling Design Methodologies and Future Trends for Edge AI: Specialization and Codesign

TL;DR: The authors argue that workloads formerly performed in the cloud are increasingly moving to resource-limited edge computing systems, which raises a new set of challenges, as well as new opportunities, for machine learning.
References
Proceedings ArticleDOI

Deep Residual Learning for Image Recognition

TL;DR: In this article, the authors propose a residual learning framework to ease the training of networks that are substantially deeper than those used previously; the resulting networks won first place in the ILSVRC 2015 classification task.
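
A minimal sketch of the residual learning idea (the layer sizes and normalization choices here are illustrative, not the paper's exact configuration): the block learns a residual F(x) and outputs F(x) + x via an identity shortcut, which is what makes very deep networks easier to optimize.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class BasicResidualBlock(nn.Module):
    """Sketch of a residual block: two 3x3 convolutions form F(x) and the
    identity shortcut adds x back, so the block only has to learn a residual."""

    def __init__(self, channels):
        super().__init__()
        self.conv1 = nn.Conv2d(channels, channels, kernel_size=3, padding=1, bias=False)
        self.bn1 = nn.BatchNorm2d(channels)
        self.conv2 = nn.Conv2d(channels, channels, kernel_size=3, padding=1, bias=False)
        self.bn2 = nn.BatchNorm2d(channels)

    def forward(self, x):
        out = F.relu(self.bn1(self.conv1(x)))
        out = self.bn2(self.conv2(out))
        return F.relu(out + x)  # identity shortcut: output = F(x) + x
```
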
Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

TL;DR: The authors achieve state-of-the-art ImageNet classification performance with a deep convolutional neural network consisting of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.
Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

TL;DR: In this paper, the authors investigated the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting and showed that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 layers.
Proceedings ArticleDOI

ImageNet: A large-scale hierarchical image database

TL;DR: A new database called “ImageNet” is introduced, a large-scale ontology of images built upon the backbone of the WordNet structure, much larger in scale and diversity and much more accurate than the current image datasets.
Proceedings ArticleDOI

Going deeper with convolutions

TL;DR: Inception, as presented in this paper, is a deep convolutional neural network architecture that set a new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC14).