scispace - formally typeset
Open AccessBook ChapterDOI

AMC: AutoML for Model Compression and Acceleration on Mobile Devices

Reads0
Chats0
TLDR
This paper proposes AutoML for Model Compression (AMC) which leverages reinforcement learning to efficiently sample the design space and can improve the model compression quality and achieves state-of-the-art model compression results in a fully automated way without any human efforts.
Abstract
Model compression is an effective technique to efficiently deploy neural network models on mobile devices which have limited computation resources and tight power budgets. Conventional model compression techniques rely on hand-crafted features and require domain experts to explore the large design space trading off among model size, speed, and accuracy, which is usually sub-optimal and time-consuming. In this paper, we propose AutoML for Model Compression (AMC) which leverages reinforcement learning to efficiently sample the design space and can improve the model compression quality. We achieved state-of-the-art model compression results in a fully automated way without any human efforts. Under 4\(\times \) FLOPs reduction, we achieved 2.7% better accuracy than the hand-crafted model compression method for VGG-16 on ImageNet. We applied this automated, push-the-button compression pipeline to MobileNet-V1 and achieved a speedup of 1.53\(\times \) on the GPU (Titan Xp) and 1.95\(\times \) on an Android phone (Google Pixel 1), with negligible loss of accuracy.

read more

Content maybe subject to copyright    Report

Citations
More filters
Posted Content

BreakingBED -- Breaking Binary and Efficient Deep Neural Networks by Adversarial Attacks

TL;DR: In this article, the robustness of CNNs against white-box and black-box adversarial attacks is investigated, where the attack is detected and the input is discarded and/or cleaned.
Posted Content

Design and Scaffolded Training of an Efficient DNN Operator for Computer Vision on the Edge.

TL;DR: FuSeConv as discussed by the authors is a drop-in replacement for depthwise separable convolutions that factorizes convolution fully along their spatial and depth dimensions, and the resultant computation efficiently maps to systolic arrays.
Journal ArticleDOI

Enabling Harmonious Human-Machine Interaction with Visual-Context Augmented Dialogue System: A Review

TL;DR: Some open issues and promising research trends for VAD are put forward, e.g., the cognitive mechanisms of human-machine dialogue under cross-modal dialogue context, and knowledge-enhanced cross- modal semantic interaction.
Journal Article

Going Deeper into Recognizing Actions in Dark Environments: A Comprehensive Benchmark Study

TL;DR: The UG+ Challenge Track 2 (UG2-2) in IEEE CVPR 2021 is launched, with a goal of evaluating and advancing the robustness of AR models in dark environments and guides models to tackle such a task in both fully and semi-supervised manners.
Posted Content

Semantic Relation Preserving Knowledge Distillation for Image-to-Image Translation

TL;DR: In this article, the authors propose a method to address the problem of large model size and long inference time by applying knowledge distillation together with distillation of a semantic relation preserving matrix.
References
More filters
Proceedings ArticleDOI

Deep Residual Learning for Image Recognition

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.
Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

TL;DR: This work investigates the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting using an architecture with very small convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers.
Proceedings ArticleDOI

Going deeper with convolutions

TL;DR: Inception as mentioned in this paper is a deep convolutional neural network architecture that achieves the new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC14).
Posted Content

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

TL;DR: Batch Normalization as mentioned in this paper normalizes layer inputs for each training mini-batch to reduce the internal covariate shift in deep neural networks, and achieves state-of-the-art performance on ImageNet.
Dissertation

Learning Multiple Layers of Features from Tiny Images

TL;DR: In this paper, the authors describe how to train a multi-layer generative model of natural images, using a dataset of millions of tiny colour images, described in the next section.
Related Papers (5)