Open Access · Journal Article · DOI

Scalable training of artificial neural networks with adaptive sparse connectivity inspired by network science

TLDR
This article proposes sparse evolutionary training (SET) of artificial neural networks, an algorithm which evolves an initial sparse topology (Erdős–Rényi random graph) connecting two consecutive layers of neurons into a scale-free topology during learning.
Abstract
Through the success of deep learning in various domains, artificial neural networks are currently among the most used artificial intelligence methods. Taking inspiration from the network properties of biological neural networks (e.g. sparsity, scale-freeness), we argue that (contrary to general practice) artificial neural networks, too, should not have fully-connected layers. Here we propose sparse evolutionary training of artificial neural networks, an algorithm which evolves an initial sparse topology (Erdős–Rényi random graph) of two consecutive layers of neurons into a scale-free topology, during learning. Our method replaces artificial neural networks' fully-connected layers with sparse ones before training, quadratically reducing the number of parameters with no decrease in accuracy. We demonstrate our claims on restricted Boltzmann machines, multi-layer perceptrons, and convolutional neural networks for unsupervised and supervised learning on 15 datasets. Our approach has the potential to enable artificial neural networks to scale up beyond what is currently possible.
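
The prune-and-regrow procedure summarized in the abstract is simple enough to sketch directly. The following is a minimal NumPy illustration, not the authors' implementation: it initializes an Erdős–Rényi sparse mask between two layers and, between training epochs, removes the fraction zeta of the active weights closest to zero and regrows the same number of connections at random. The names epsilon and zeta follow the paper's notation; the layer sizes, initialization scale, and hyperparameter values here are arbitrary choices for the sketch.

```python
import numpy as np

rng = np.random.default_rng(0)

def erdos_renyi_mask(n_in, n_out, epsilon=20):
    """Sparse binary mask with density ~ epsilon * (n_in + n_out) / (n_in * n_out)."""
    p = min(1.0, epsilon * (n_in + n_out) / (n_in * n_out))
    return (rng.random((n_in, n_out)) < p).astype(np.float32)

def evolve_connectivity(weights, mask, zeta=0.3):
    """One rewiring step: drop the fraction zeta of the active weights closest to
    zero, then regrow the same number of connections at random positions."""
    active = np.flatnonzero(mask)
    k = int(zeta * active.size)
    # indices of the weakest (smallest-magnitude) active connections
    weakest = active[np.argsort(np.abs(weights.ravel()[active]))[:k]]
    mask.ravel()[weakest] = 0.0
    weights.ravel()[weakest] = 0.0
    # regrow k connections uniformly at random among currently inactive positions
    inactive = np.flatnonzero(mask == 0.0)
    regrow = rng.choice(inactive, size=k, replace=False)
    mask.ravel()[regrow] = 1.0
    weights.ravel()[regrow] = rng.normal(0.0, 0.01, size=k)
    return weights * mask, mask

# Example usage between ordinary training epochs, for one sparse layer:
mask = erdos_renyi_mask(784, 300)
weights = rng.normal(0.0, 0.01, size=(784, 300)).astype(np.float32) * mask
weights, mask = evolve_connectivity(weights, mask)
```

Repeating this step after each epoch is what, according to the abstract, gradually reshapes the initially random topology into a scale-free one while keeping the parameter count a small fraction of the fully-connected layer.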



Citations
Posted Content

The State of Sparsity in Deep Neural Networks

TL;DR: It is shown that unstructured sparse architectures learned through pruning cannot be trained from scratch to the same test set performance as a model trained with joint sparsification and optimization, and the need for large-scale benchmarks in the field of model compression is highlighted.
Posted Content

TabNet: Attentive Interpretable Tabular Learning

TL;DR: It is demonstrated that TabNet outperforms other neural network and decision tree variants on a wide range of non-performance-saturated tabular datasets and yields interpretable feature attributions plus insights into the global model behavior.
Proceedings Article

Pruning neural networks without any data by iteratively conserving synaptic flow

TL;DR: The data-agnostic pruning algorithm challenges the existing paradigm that, at initialization, data must be used to quantify which synapses are important, and consistently competes with or outperforms existing state-of-the-art pruning algorithms at initialization over a range of models, datasets, and sparsity constraints.
Posted Content

Picking Winning Tickets Before Training by Preserving Gradient Flow

TL;DR: This work argues that efficient training requires preserving the gradient flow through the network, and proposes a simple but effective pruning criterion called Gradient Signal Preservation (GraSP), which achieves significantly better performance than the baseline at extreme sparsity levels.
Posted Content

Sparse Networks from Scratch: Faster Training without Losing Performance

TL;DR: This work develops sparse momentum, an algorithm which uses exponentially smoothed gradients (momentum) to identify layers and weights which reduce the error efficiently and shows that the benefits of momentum redistribution and growth increase with the depth and size of the network.
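The last entry above names its growth criterion: exponentially smoothed gradients (momentum) decide where new connections appear. As a rough, hedged sketch of that idea for a single layer (not the cited paper's implementation, which additionally redistributes pruned weights across layers), with `momentum` standing in for the optimizer's momentum buffer:

```python
import numpy as np

rng = np.random.default_rng(0)

def momentum_rewire(weights, mask, momentum, prune_frac=0.2):
    """Prune the active weights closest to zero, then grow the same number of
    currently inactive connections where the momentum magnitude is largest."""
    active = np.flatnonzero(mask)
    k = int(prune_frac * active.size)
    drop = active[np.argsort(np.abs(weights.ravel()[active]))[:k]]
    mask.ravel()[drop] = 0.0
    weights.ravel()[drop] = 0.0
    inactive = np.flatnonzero(mask == 0.0)
    # largest-momentum inactive positions, in descending order
    grow = inactive[np.argsort(np.abs(momentum.ravel()[inactive]))[::-1][:k]]
    mask.ravel()[grow] = 1.0   # grown weights start at zero and are learned from here on
    return weights * mask, mask

# Example usage between epochs (shapes and sparsity level are arbitrary):
m = (rng.random((300, 100)) < 0.1).astype(np.float32)
w = rng.normal(0.0, 0.01, (300, 100)) * m
mom = rng.normal(0.0, 0.001, (300, 100))  # stand-in for the optimizer's momentum buffer
w, m = momentum_rewire(w, m, mom)
```

The contrast with the random regrowth sketched under the abstract is the point: here the gradient history, rather than chance, decides which dormant connections are worth activating.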
References
Journal ArticleDOI

Deep learning

TL;DR: Deep learning is making major advances in solving problems that have resisted the best attempts of the artificial intelligence community for many years, and will have many more successes in the near future because it requires very little engineering by hand and can easily take advantage of increases in the amount of available computation and data.
Journal ArticleDOI

Gradient-based learning applied to document recognition

TL;DR: In this article, gradient-based learning is used to synthesize a complex decision surface that can classify high-dimensional patterns, such as handwritten characters, and a new learning paradigm, graph transformer networks (GTNs), is proposed for globally training multi-module document recognition systems.
Journal ArticleDOI

Collective dynamics of small-world networks

TL;DR: Simple models of networks that can be tuned through the middle ground between regular and random are explored: regular networks 'rewired' to introduce increasing amounts of disorder. These systems can be highly clustered, like regular lattices, yet have small characteristic path lengths, like random graphs.
Book

Deep Learning

TL;DR: Deep learning is a form of machine learning that enables computers to learn from experience and understand the world in terms of a hierarchy of concepts; it is used in many applications such as natural language processing, speech recognition, computer vision, online recommendation systems, bioinformatics, and videogames.
Journal ArticleDOI

Emergence of Scaling in Random Networks

TL;DR: A model based on two ingredients, growth and preferential attachment, reproduces the observed stationary scale-free distributions, which indicates that the development of large networks is governed by robust self-organizing phenomena that go beyond the particulars of the individual systems.
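
The growth-plus-preferential-attachment process in this last reference is the classic route to the scale-free topologies that the evolved sparse layers of this paper are reported to approach. A minimal sketch of that process (assumed standard Barabási–Albert form; the node count and attachment parameter m are arbitrary here):

```python
import numpy as np

rng = np.random.default_rng(0)

def barabasi_albert_degrees(n_nodes=1000, m=2):
    """Degree sequence of a graph grown by preferential attachment."""
    degrees = np.zeros(n_nodes, dtype=np.int64)
    targets = list(range(m))   # the first new node attaches to the m seed nodes
    repeated = []              # every node appears here once per unit of degree
    for new in range(m, n_nodes):
        for t in targets:
            degrees[t] += 1
        degrees[new] += m
        repeated.extend(targets)
        repeated.extend([new] * m)
        # next round: sample m distinct nodes with probability proportional to degree
        chosen = set()
        while len(chosen) < m:
            chosen.add(repeated[rng.integers(len(repeated))])
        targets = list(chosen)
    return degrees

deg = barabasi_albert_degrees()
print("max degree:", deg.max(), "mean degree:", round(float(deg.mean()), 2))
# The heavy tail (a few hubs with very high degree) is the scale-free signature.
```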