Open Access · Posted Content

PC-DARTS: Partial Channel Connections for Memory-Efficient Architecture Search

TLDR
Partially-Connected Differentiable Architecture Search (PC-DARTS) performs operation search in a subset of channels while bypassing the held-out part in a shortcut, and uses edge normalization to alleviate the undesired inconsistency in selecting super-network edges that is caused by sampling different channels.
Abstract
Differentiable architecture search (DARTS) provided a fast solution for finding effective network architectures, but suffered from large memory and computing overheads in jointly training a super-network and searching for an optimal architecture. In this paper, we present a novel approach, namely, Partially-Connected DARTS, by sampling a small part of the super-network to reduce the redundancy in exploring the network space, thereby performing a more efficient search without compromising the performance. In particular, we perform operation search in a subset of channels while bypassing the held-out part in a shortcut. This strategy may suffer from an undesired inconsistency in selecting the edges of the super-network caused by sampling different channels. We alleviate it using edge normalization, which adds a new set of edge-level parameters to reduce uncertainty in search. Thanks to the reduced memory cost, PC-DARTS can be trained with a larger batch size and, consequently, enjoys both faster speed and higher training stability. Experimental results demonstrate the effectiveness of the proposed method. Specifically, we achieve an error rate of 2.57% on CIFAR10 with merely 0.1 GPU-days for architecture search, and a state-of-the-art top-1 error rate of 24.2% on ImageNet (under the mobile setting) using 3.8 GPU-days for search. Our code has been made available at: this https URL.
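To make the partial-channel idea concrete, here is a minimal sketch (assuming PyTorch, an illustrative three-operation candidate set, and K=4): the candidate operations are applied only to a sampled 1/K of the channels, the held-out channels bypass them through a shortcut, and edge normalization keeps a separate set of learnable edge-level weights. This is an illustration of the described mechanism under those assumptions, not the authors' released implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class PartialChannelMixedOp(nn.Module):
    """Sketch of a PC-DARTS style mixed operation: only C/K of the input
    channels pass through the candidate operations; the rest bypass them."""

    def __init__(self, channels, k=4):
        super().__init__()
        self.k = k
        sampled = channels // k
        # Illustrative candidate operations (the real search space is larger).
        self.ops = nn.ModuleList([
            nn.Identity(),
            nn.Conv2d(sampled, sampled, 3, padding=1, bias=False),
            nn.AvgPool2d(3, stride=1, padding=1),
        ])
        # Architecture parameters (alpha), one per candidate operation.
        self.alpha = nn.Parameter(1e-3 * torch.randn(len(self.ops)))

    def forward(self, x):
        c = x.size(1)
        sampled = c // self.k
        # Sample a random subset of channels for operation search.
        perm = torch.randperm(c, device=x.device)
        x_active, x_bypass = x[:, perm[:sampled]], x[:, perm[sampled:]]
        weights = F.softmax(self.alpha, dim=0)
        mixed = sum(w * op(x_active) for w, op in zip(weights, self.ops))
        # Held-out channels bypass the operations through a shortcut.
        # (The paper additionally shuffles channels after concatenation; omitted here.)
        return torch.cat([mixed, x_bypass], dim=1)


class EdgeNormalizedNode(nn.Module):
    """Sketch of edge normalization: learnable edge-level weights (beta)
    rescale the contribution of each incoming edge to a node."""

    def __init__(self, num_inputs, channels):
        super().__init__()
        self.edges = nn.ModuleList(
            [PartialChannelMixedOp(channels) for _ in range(num_inputs)])
        self.beta = nn.Parameter(1e-3 * torch.randn(num_inputs))

    def forward(self, inputs):
        edge_weights = F.softmax(self.beta, dim=0)
        return sum(w * edge(x) for w, x, edge in
                   zip(edge_weights, inputs, self.edges))
```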


Citations
Journal ArticleDOI

AutoML: A survey of the state-of-the-art

TL;DR: A comprehensive and up-to-date review of the state-of-the-art (SOTA) in AutoML methods, organized according to the pipeline: data preparation, feature engineering, hyperparameter optimization, and neural architecture search (NAS).
Posted Content

FairNAS: Rethinking Evaluation Fairness of Weight Sharing Neural Architecture Search

TL;DR: This paper proves that the biased evaluation of candidate models within a predefined search space is due to inherent unfairness in the supernet training, and proposes two levels of constraints: expectation fairness and strict fairness.
Proceedings ArticleDOI

Searching Central Difference Convolutional Networks for Face Anti-Spoofing

TL;DR: Proposes a frame-level face anti-spoofing (FAS) method based on Central Difference Convolution (CDC), which is able to capture intrinsic detailed patterns by aggregating both intensity and gradient information.
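As background, central difference convolution blends a vanilla convolution (intensity) with a term built from differences against the central pixel (gradient), controlled by a hyperparameter theta. Below is a minimal, assumed PyTorch sketch of that idea, using the common trick of computing the difference term with a 1x1 convolution whose kernel is the spatially summed 3x3 kernel; it is not the authors' code, and the default theta shown is only an assumption.

```python
import torch.nn as nn
import torch.nn.functional as F

class CentralDifferenceConv2d(nn.Module):
    """Sketch: y = (1 - theta) * vanilla_conv(x) + theta * central_diff_conv(x).

    The central-difference term equals conv(x) - x_center * sum(kernel), so the
    blended output reduces to conv(x) - theta * x_center * sum(kernel)."""

    def __init__(self, in_ch, out_ch, theta=0.7):
        super().__init__()
        self.theta = theta
        self.conv = nn.Conv2d(in_ch, out_ch, kernel_size=3, padding=1, bias=False)

    def forward(self, x):
        out_normal = self.conv(x)
        if self.theta == 0:
            return out_normal  # falls back to vanilla convolution
        # Sum the 3x3 kernel over its spatial extent -> effective 1x1 kernel.
        kernel_sum = self.conv.weight.sum(dim=(2, 3), keepdim=True)
        out_center = F.conv2d(x, kernel_sum, padding=0)
        # Blend intensity (vanilla conv) with gradient (central difference) cues.
        return out_normal - self.theta * out_center
```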
Posted Content

DARTS+: Improved Differentiable Architecture Search with Early Stopping.

TL;DR: It is claimed that overfitting exists in the optimization of DARTS, and a simple and effective algorithm, named "DARTS+", is proposed to avoid the collapse and improve the original DARTS by "early stopping" the search procedure when a certain criterion is met.
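Since the stopping criterion itself is not spelled out here, the sketch below only illustrates the general pattern of criterion-based early stopping during architecture search; the skip-connection count used as the example criterion, and the search_step/derive_cell helpers, are hypothetical placeholders rather than the paper's actual procedure.

```python
# Minimal sketch: stop the architecture search once a collapse indicator fires.
# The criterion shown (counting skip-connections in the derived normal cell)
# is an illustrative assumption; consult the paper for its exact rules.

def should_stop(derived_normal_cell, max_skip_connections=2):
    """derived_normal_cell: list of (op_name, input_node) pairs of the
    architecture currently derived from the search."""
    num_skips = sum(1 for op, _ in derived_normal_cell if op == "skip_connect")
    return num_skips >= max_skip_connections

def search_loop(search_step, derive_cell, max_epochs=50):
    for epoch in range(max_epochs):
        search_step(epoch)        # one epoch of weight/architecture updates
        cell = derive_cell()      # decode the currently preferred architecture
        if should_stop(cell):
            print(f"Early stopping at epoch {epoch}")
            break
    return derive_cell()
```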
Posted Content

A Comprehensive Survey of Neural Architecture Search: Challenges and Solutions

TL;DR: This survey provides a new perspective on NAS, starting with an overview of the characteristics of the earliest NAS algorithms, summarizing the problems in these early NAS algorithms, and then presenting the solutions proposed in subsequent research.
References
Proceedings ArticleDOI

Deep Residual Learning for Image Recognition

TL;DR: In this article, the authors propose a residual learning framework to ease the training of networks that are substantially deeper than those used previously; the resulting models won 1st place on the ILSVRC 2015 classification task.
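The core idea of residual learning, letting a block learn a residual F(x) that is added back to an identity shortcut, can be sketched as below. This is a minimal, assumed PyTorch sketch of a basic block, not the reference implementation; projection shortcuts, strides, and layer counts are omitted.

```python
import torch.nn as nn
import torch.nn.functional as F

class ResidualBlock(nn.Module):
    """Basic residual block: output = ReLU(F(x) + x)."""

    def __init__(self, channels):
        super().__init__()
        self.conv1 = nn.Conv2d(channels, channels, 3, padding=1, bias=False)
        self.bn1 = nn.BatchNorm2d(channels)
        self.conv2 = nn.Conv2d(channels, channels, 3, padding=1, bias=False)
        self.bn2 = nn.BatchNorm2d(channels)

    def forward(self, x):
        residual = self.bn2(self.conv2(F.relu(self.bn1(self.conv1(x)))))
        # Identity shortcut: the block only needs to learn the residual F(x).
        return F.relu(residual + x)
```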
Proceedings Article

Adam: A Method for Stochastic Optimization

TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.
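For reference, the update rule summarized above (adaptive estimates of the first and second moments of the gradient, with bias correction) can be written as a short sketch; the defaults shown are the commonly cited hyperparameter values, and the function name is illustrative.

```python
import numpy as np

def adam_step(theta, grad, m, v, t, lr=1e-3, beta1=0.9, beta2=0.999, eps=1e-8):
    """One Adam update. m, v are running first/second moment estimates; t >= 1."""
    m = beta1 * m + (1 - beta1) * grad          # first moment (mean of gradients)
    v = beta2 * v + (1 - beta2) * grad ** 2     # second moment (uncentered variance)
    m_hat = m / (1 - beta1 ** t)                # bias correction for initialization at zero
    v_hat = v / (1 - beta2 ** t)
    theta = theta - lr * m_hat / (np.sqrt(v_hat) + eps)
    return theta, m, v
```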
Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

TL;DR: Describes a deep convolutional neural network consisting of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax, which achieved state-of-the-art performance on ImageNet classification.
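A rough sketch of the architecture described above (five convolutional layers with interleaved max-pooling, then three fully-connected layers ending in a 1000-way classifier) is given below; the channel counts follow the figures commonly quoted for this network, and details such as local response normalization and the original two-GPU split are omitted as simplifying assumptions.

```python
import torch.nn as nn

class AlexNetSketch(nn.Module):
    """Sketch: 5 conv layers (some followed by max-pooling) and 3 FC layers
    producing logits for a 1000-way softmax."""

    def __init__(self, num_classes=1000):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(3, 96, kernel_size=11, stride=4, padding=2), nn.ReLU(inplace=True),
            nn.MaxPool2d(kernel_size=3, stride=2),
            nn.Conv2d(96, 256, kernel_size=5, padding=2), nn.ReLU(inplace=True),
            nn.MaxPool2d(kernel_size=3, stride=2),
            nn.Conv2d(256, 384, kernel_size=3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(384, 384, kernel_size=3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(384, 256, kernel_size=3, padding=1), nn.ReLU(inplace=True),
            nn.MaxPool2d(kernel_size=3, stride=2),
        )
        self.classifier = nn.Sequential(
            nn.Dropout(), nn.Linear(256 * 6 * 6, 4096), nn.ReLU(inplace=True),
            nn.Dropout(), nn.Linear(4096, 4096), nn.ReLU(inplace=True),
            nn.Linear(4096, num_classes),  # logits for the final 1000-way softmax
        )

    def forward(self, x):
        # Expects 224x224 RGB input so the feature map is 256 x 6 x 6 here.
        x = self.features(x)
        return self.classifier(x.flatten(1))
```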
Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

TL;DR: In this paper, the authors investigated the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting and showed that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 layers.
Proceedings ArticleDOI

ImageNet: A large-scale hierarchical image database

TL;DR: A new database called “ImageNet” is introduced, a large-scale ontology of images built upon the backbone of the WordNet structure, much larger in scale and diversity and much more accurate than the current image datasets.