Journal ArticleDOI

Scaling for edge inference of deep neural networks

TLDR
There are increasing gaps between the computational complexity and energy efficiency required for the continued scaling of deep neural networks and the hardware capacity actually available with current CMOS technology scaling, in situations where edge inference is required.
Abstract
Deep neural networks offer considerable potential across a range of applications, from advanced manufacturing to autonomous cars. A clear trend in deep neural networks is the exponential growth of network size and the associated increases in computational complexity and memory consumption. However, the performance and energy efficiency of edge inference, in which the inference (the application of a trained network to new data) is performed locally on embedded platforms that have limited area and power budget, is bounded by technology scaling. Here we analyse recent data and show that there are increasing gaps between the computational complexity and energy efficiency required by data scientists and the hardware capacity made available by hardware architects. We then discuss various architecture and algorithm innovations that could help to bridge the gaps.
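To make the scale of these gaps concrete, here is a minimal back-of-envelope sketch in Python. It is not taken from the paper: the per-inference MAC counts are commonly cited figures for networks listed in the references below, while the energy-per-MAC and power-budget values are loose assumptions that vary widely across CMOS implementations.

# Back-of-envelope sketch (assumed numbers, not from the paper): test
# whether a given network fits an edge power budget.
ENERGY_PER_MAC_J = 1e-12      # assumed ~1 pJ per multiply-accumulate
EDGE_POWER_BUDGET_W = 1.0     # assumed 1 W embedded power envelope

networks = {                  # approximate multiply-accumulates per inference
    "AlexNet":   0.7e9,
    "VGG-16":    15.5e9,
    "ResNet-50": 3.9e9,
}

for name, macs in networks.items():
    energy_j = macs * ENERGY_PER_MAC_J
    max_fps = EDGE_POWER_BUDGET_W / energy_j   # inferences/s within budget
    print(f"{name:10s} {energy_j * 1e3:6.2f} mJ/inference  ~{max_fps:,.0f} inferences/s")

Under these assumptions VGG-16 costs roughly 15 mJ per inference, illustrating the kind of mismatch with embedded power envelopes that the Perspective quantifies.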


Citations
Posted Content

Meta-Learning in Neural Networks: A Survey

TL;DR: A new taxonomy is proposed that provides a more comprehensive breakdown of the space of meta-learning methods today, and promising applications and successes, including few-shot learning, reinforcement learning and architecture search, are surveyed.
Journal ArticleDOI

Convergence of Edge Computing and Deep Learning: A Comprehensive Survey

TL;DR: By consolidating information scattered across the communication, networking, and DL areas, this survey can help readers to understand the connections between enabling technologies while promoting further discussions on the fusion of edge intelligence and intelligent edge, i.e., Edge DL.
Journal ArticleDOI

A fully integrated reprogrammable memristor–CMOS system for efficient multiply–accumulate operations

TL;DR: A programmable neuromorphic computing chip based on passive memristor crossbar arrays integrated with analogue and digital components and an on-chip processor enables the implementation of neuromorphic and machine learning algorithms.
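The operation such a chip accelerates is easy to sketch in NumPy: store weights as conductances, apply inputs as voltages, and read each column current as an analogue multiply-accumulate. This is an idealized model with assumed conductance and voltage ranges, not the chip's actual programming interface, and it ignores device non-idealities.

import numpy as np

rng = np.random.default_rng(0)
G = rng.uniform(1e-6, 1e-4, size=(64, 32))   # conductances in siemens (assumed range)
V = rng.uniform(0.0, 0.2, size=64)           # input voltages in volts (assumed range)

# Ohm's law + Kirchhoff's current law: column current I_j = sum_i V_i * G[i, j],
# so one analogue read-out yields all 32 dot products at once.
I = V @ G
print(I.shape)   # (32,)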
Journal ArticleDOI

Parallel programming of an ionic floating-gate memory array for scalable neuromorphic computing.

TL;DR: An ionic floating-gate memory array based on a polymer redox transistor connected to a conductive-bridge memory (CBM) is introduced, enabling linear and symmetric weight updates in parallel over an entire crossbar array at megahertz rates over 10^9 write-read cycles.
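One common way to realize such parallel programming is a rank-one outer-product update, where rows are driven with the input vector and columns with the error vector so that every cell updates simultaneously; this only works if each cell's response is linear and symmetric, which is the property the device provides. The NumPy sketch below is an assumed reading of the abstract, not the device's actual drive protocol.

import numpy as np

eta = 0.01                   # learning rate (illustrative)
x = np.random.randn(64)      # row drive, e.g. the layer input
y = np.random.randn(32)      # column drive, e.g. the backpropagated error

W = np.zeros((64, 32))       # crossbar weight state
W += eta * np.outer(x, y)    # one parallel write updates all 64 x 32 cells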
References
Proceedings ArticleDOI

Deep Residual Learning for Image Recognition

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously; the resulting networks won first place in the ILSVRC 2015 classification task.
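The core idea fits in a few lines; below is a minimal PyTorch residual block in the spirit of the paper, with the channel count chosen for illustration. The stacked layers learn a residual F(x) and the identity shortcut adds x back, so extra depth cannot easily hurt.

import torch
import torch.nn as nn

class ResidualBlock(nn.Module):
    def __init__(self, channels: int):
        super().__init__()
        self.conv1 = nn.Conv2d(channels, channels, 3, padding=1, bias=False)
        self.bn1 = nn.BatchNorm2d(channels)
        self.conv2 = nn.Conv2d(channels, channels, 3, padding=1, bias=False)
        self.bn2 = nn.BatchNorm2d(channels)
        self.relu = nn.ReLU(inplace=True)

    def forward(self, x):
        out = self.relu(self.bn1(self.conv1(x)))
        out = self.bn2(self.conv2(out))
        return self.relu(out + x)   # identity shortcut: the block learns H(x) - x

y = ResidualBlock(64)(torch.randn(1, 64, 56, 56))
print(y.shape)   # torch.Size([1, 64, 56, 56])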
Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

TL;DR: A deep convolutional neural network, consisting of five convolutional layers, some of which are followed by max-pooling layers, and three fully connected layers with a final 1000-way softmax, achieved state-of-the-art performance on the ImageNet classification task, as discussed by the authors.
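That topology translates directly into a PyTorch sketch. Filter counts follow the published AlexNet, but this is an illustrative reconstruction, not the authors' exact model, which also used local response normalization and dropout.

import torch.nn as nn

alexnet_like = nn.Sequential(               # expects 3 x 227 x 227 input
    nn.Conv2d(3, 96, 11, stride=4), nn.ReLU(), nn.MaxPool2d(3, 2),
    nn.Conv2d(96, 256, 5, padding=2), nn.ReLU(), nn.MaxPool2d(3, 2),
    nn.Conv2d(256, 384, 3, padding=1), nn.ReLU(),
    nn.Conv2d(384, 384, 3, padding=1), nn.ReLU(),
    nn.Conv2d(384, 256, 3, padding=1), nn.ReLU(), nn.MaxPool2d(3, 2),
    nn.Flatten(),
    nn.Linear(256 * 6 * 6, 4096), nn.ReLU(),
    nn.Linear(4096, 4096), nn.ReLU(),
    nn.Linear(4096, 1000),                  # logits for the 1000-way softmax
)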
Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

TL;DR: This work investigates the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting using an architecture with very small convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers.
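That design point, very deep stacks of very small filters, can be sketched as repeated blocks of 3x3 convolutions: two stacked 3x3 layers cover a 5x5 receptive field with fewer parameters than a single 5x5 layer (2 x 9C^2 versus 25C^2 weights for C channels) plus an extra nonlinearity. Block widths below follow the VGG-16 configuration, but the code is an illustration, not the authors' implementation.

import torch.nn as nn

def vgg_block(in_ch: int, out_ch: int, n_convs: int) -> nn.Sequential:
    layers = []
    for i in range(n_convs):
        layers += [nn.Conv2d(in_ch if i == 0 else out_ch, out_ch, 3, padding=1),
                   nn.ReLU(inplace=True)]
    layers.append(nn.MaxPool2d(2, 2))        # halve the resolution after each block
    return nn.Sequential(*layers)

# The 13-layer convolutional trunk of a VGG-16-style network (adding three
# fully connected layers gives the 16 weight layers mentioned above):
trunk = nn.Sequential(vgg_block(3, 64, 2), vgg_block(64, 128, 2),
                      vgg_block(128, 256, 3), vgg_block(256, 512, 3),
                      vgg_block(512, 512, 3))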
Proceedings ArticleDOI

Going deeper with convolutions

TL;DR: Inception, the deep convolutional neural network architecture described in this paper, achieved a new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC14).
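The module behind that result runs several filter sizes in parallel and concatenates the results channel-wise, with 1x1 convolutions reducing depth before the expensive filters. In this PyTorch sketch the branch widths are illustrative rather than the published GoogLeNet configuration.

import torch
import torch.nn as nn

class InceptionModule(nn.Module):
    def __init__(self, in_ch: int):
        super().__init__()
        self.b1 = nn.Conv2d(in_ch, 64, 1)                             # 1x1 branch
        self.b3 = nn.Sequential(nn.Conv2d(in_ch, 96, 1), nn.ReLU(),
                                nn.Conv2d(96, 128, 3, padding=1))     # 1x1 -> 3x3
        self.b5 = nn.Sequential(nn.Conv2d(in_ch, 16, 1), nn.ReLU(),
                                nn.Conv2d(16, 32, 5, padding=2))      # 1x1 -> 5x5
        self.bp = nn.Sequential(nn.MaxPool2d(3, 1, padding=1),
                                nn.Conv2d(in_ch, 32, 1))              # pool -> 1x1

    def forward(self, x):
        return torch.cat([self.b1(x), self.b3(x), self.b5(x), self.bp(x)], dim=1)

out = InceptionModule(192)(torch.randn(1, 192, 28, 28))
print(out.shape)   # torch.Size([1, 256, 28, 28]) -- 64 + 128 + 32 + 32 channels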