Showing papers by "Andrew Howard" published in 2017


Posted Content
TL;DR: This work introduces two simple global hyper-parameters that efficiently trade off between latency and accuracy and demonstrates the effectiveness of MobileNets across a wide range of applications and use cases, including object detection, fine-grained classification, face attributes, and large-scale geo-localization.
Abstract: We present a class of efficient models called MobileNets for mobile and embedded vision applications. MobileNets are based on a streamlined architecture that uses depth-wise separable convolutions to build lightweight deep neural networks. We introduce two simple global hyper-parameters that efficiently trade off between latency and accuracy. These hyper-parameters allow the model builder to choose the right-sized model for their application based on the constraints of the problem. We present extensive experiments on resource and accuracy tradeoffs and show strong performance compared to other popular models on ImageNet classification. We then demonstrate the effectiveness of MobileNets across a wide range of applications and use cases, including object detection, fine-grained classification, face attributes, and large-scale geo-localization.
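The core building block here is the depthwise separable convolution: a depthwise 3x3 convolution that filters each input channel independently, followed by a pointwise 1x1 convolution that mixes channels. A minimal sketch, assuming PyTorch (the abstract names no framework; the class and argument names are illustrative), with `width_mult` standing in for the paper's width-multiplier hyper-parameter that thins every layer uniformly:

```python
# Minimal sketch of a depthwise separable convolution block, assuming
# PyTorch. Class and argument names are illustrative, not the paper's
# reference code.
import torch
import torch.nn as nn

class DepthwiseSeparableConv(nn.Module):
    def __init__(self, in_ch, out_ch, stride=1, width_mult=1.0):
        super().__init__()
        in_ch = max(1, int(in_ch * width_mult))
        out_ch = max(1, int(out_ch * width_mult))
        # Depthwise: one 3x3 filter per input channel (groups=in_ch).
        self.depthwise = nn.Conv2d(in_ch, in_ch, 3, stride=stride,
                                   padding=1, groups=in_ch, bias=False)
        self.bn1 = nn.BatchNorm2d(in_ch)
        # Pointwise: a 1x1 convolution that mixes channels.
        self.pointwise = nn.Conv2d(in_ch, out_ch, 1, bias=False)
        self.bn2 = nn.BatchNorm2d(out_ch)
        self.relu = nn.ReLU(inplace=True)

    def forward(self, x):
        x = self.relu(self.bn1(self.depthwise(x)))
        return self.relu(self.bn2(self.pointwise(x)))

# With width_mult=0.5, a nominal 32->64 block becomes 16->32:
block = DepthwiseSeparableConv(32, 64, width_mult=0.5)
y = block(torch.randn(1, 16, 112, 112))  # -> shape (1, 32, 112, 112)
```

Factoring a standard convolution into these two steps is what cuts the multiply-accumulate count, roughly by a factor of the kernel area; the second hyper-parameter, a resolution multiplier, simply shrinks the input image.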

14,406 citations


Posted Content
TL;DR: In this article, the authors propose a quantization scheme that allows inference to be carried out using integer-only arithmetic, which can be implemented more efficiently than floating-point inference on commonly available integer-only hardware.
Abstract: The rising popularity of intelligent mobile devices and the daunting computational cost of deep learning-based models call for efficient and accurate on-device inference schemes. We propose a quantization scheme that allows inference to be carried out using integer-only arithmetic, which can be implemented more efficiently than floating-point inference on commonly available integer-only hardware. We also co-design a training procedure to preserve end-to-end model accuracy post-quantization. As a result, the proposed quantization scheme improves the tradeoff between accuracy and on-device latency. The improvements are significant even on MobileNets, a model family known for run-time efficiency, and are demonstrated in ImageNet classification and COCO detection on popular CPUs.
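A scheme of this kind represents each real value r by an 8-bit code q through an affine map r = S * (q - Z), where the scale S is real and the zero-point Z is an integer, so that matrix products can be accumulated entirely in int32. A hedged NumPy sketch of the idea (function names are illustrative, and the float rescaling step stands in for what hardware would do in fixed point; this is not the paper's reference implementation):

```python
# Hedged NumPy sketch of affine quantization, r = S * (q - Z), with a
# matrix multiply whose accumulation is pure integer arithmetic.
import numpy as np

def quantize(r, scale, zero_point):
    """Map real values to uint8 codes: q = round(r / S) + Z."""
    q = np.round(r / scale) + zero_point
    return np.clip(q, 0, 255).astype(np.uint8)

def dequantize(q, scale, zero_point):
    """Recover approximate reals: r = S * (q - Z)."""
    return scale * (q.astype(np.int32) - zero_point)

def quantized_matmul(qa, pa, qb, pb, pout):
    """Multiply two quantized matrices; accumulate in int32, emit uint8.

    pa, pb, pout are (scale, zero_point) pairs. On integer-only hardware
    the real multiplier m = Sa * Sb / Sout would be realized as a
    fixed-point multiply plus a bit shift; a float stands in for it here.
    """
    (sa, za), (sb, zb), (so, zo) = pa, pb, pout
    acc = (qa.astype(np.int32) - za) @ (qb.astype(np.int32) - zb)
    m = sa * sb / so
    return np.clip(np.round(m * acc) + zo, 0, 255).astype(np.uint8)

# Round trip recovers values to within half a scale step:
w = np.array([-0.50, 0.00, 0.70])
print(dequantize(quantize(w, 0.01, 128), 0.01, 128))  # [-0.5  0.  0.7]
```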

42 citations


Patent
06 Oct 2017
TL;DR: In this patent, a neural network system is configured to receive an input image and to generate a classification output for the input image using a separable convolution subnetwork comprising a plurality of separable CNN layers arranged in a stack one after the other.
Abstract: A neural network system is configured to receive an input image and to generate a classification output for the input image. The neural network system includes a separable convolution subnetwork comprising a plurality of separable convolutional neural network layers arranged in a stack one after the other, in which each separable convolutional neural network layer is configured to separately apply both a depthwise convolution and a pointwise convolution during processing of an input to generate a layer output.
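Read concretely, the claim describes something like the following: a stack of separable layers (each a depthwise 3x3 followed by a pointwise 1x1) feeding a classification head. A hedged PyTorch sketch, with layer count, channel widths, and class count chosen for illustration since the abstract fixes none of them:

```python
# Hedged PyTorch sketch of the claimed system: stacked separable
# convolutional layers followed by a classification output. All sizes
# below are illustrative assumptions.
import torch
import torch.nn as nn

def separable_layer(in_ch, out_ch, stride=1):
    """One separable layer: depthwise conv, then pointwise conv."""
    return nn.Sequential(
        nn.Conv2d(in_ch, in_ch, 3, stride=stride, padding=1,
                  groups=in_ch, bias=False),      # depthwise
        nn.Conv2d(in_ch, out_ch, 1, bias=False),  # pointwise
        nn.BatchNorm2d(out_ch),
        nn.ReLU(inplace=True),
    )

class SeparableClassifier(nn.Module):
    """Receives an input image; generates a classification output."""
    def __init__(self, num_classes=1000):
        super().__init__()
        self.stem = nn.Conv2d(3, 32, 3, stride=2, padding=1)
        # Separable layers arranged in a stack, one after the other.
        self.stack = nn.Sequential(
            separable_layer(32, 64),
            separable_layer(64, 128, stride=2),
            separable_layer(128, 128),
        )
        self.head = nn.Linear(128, num_classes)

    def forward(self, image):
        x = self.stack(self.stem(image))
        x = x.mean(dim=(2, 3))  # global average pool
        return self.head(x)     # classification output

logits = SeparableClassifier()(torch.randn(1, 3, 224, 224))  # (1, 1000)
```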

7 citations