Open Access Posted Content

Spectral Pruning: Compressing Deep Neural Networks via Spectral Analysis and its Generalization Error

TLDR
A new theoretical framework for model compression is developed and a new pruning method called spectral pruning is proposed based on this framework, which defines the "degrees of freedom" to quantify the intrinsic dimensionality of a model by using the eigenvalue distribution of the covariance matrix across the internal nodes.
Abstract
Compression techniques for deep neural network models are becoming very important for the efficient execution of high-performance deep learning systems on edge-computing devices. The concept of model compression is also important for analyzing the generalization error of deep learning, known as the compression-based error bound. However, there is still a huge gap between practically effective compression methods and their rigorous grounding in statistical learning theory. To resolve this issue, we develop a new theoretical framework for model compression and propose a new pruning method called "spectral pruning" based on this framework. We define the "degrees of freedom" to quantify the intrinsic dimensionality of a model by using the eigenvalue distribution of the covariance matrix across the internal nodes, and show that the compression ability is essentially controlled by this quantity. Moreover, we present a sharp generalization error bound for the compressed model and characterize the bias--variance tradeoff induced by the compression procedure. We apply our method to several datasets to justify our theoretical analyses and show the superiority of the proposed method.
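To make the "degrees of freedom" idea concrete, the following is a minimal NumPy sketch (not the authors' code), assuming the ridge-style definition df(lambda) = sum_j s_j / (s_j + lambda) over the eigenvalues s_j of the empirical covariance of a layer's activations; the paper's exact definition and pruning objective may differ in details.

```python
# Minimal sketch: estimating the "degrees of freedom" of a hidden layer
# from the eigenvalue spectrum of the empirical covariance of its
# activations. The formula df(lam) = sum_j s_j / (s_j + lam) is an
# assumption based on the standard ridge-regression quantity.
import numpy as np

def degrees_of_freedom(activations: np.ndarray, lam: float) -> float:
    """activations: (n_samples, n_nodes) matrix of internal-node outputs."""
    centered = activations - activations.mean(axis=0, keepdims=True)
    cov = centered.T @ centered / activations.shape[0]  # empirical covariance
    eigvals = np.linalg.eigvalsh(cov)                   # nonnegative spectrum
    return float(np.sum(eigvals / (eigvals + lam)))

# Toy example: a layer with a fast-decaying covariance spectrum has small
# degrees of freedom, suggesting it can be pruned to fewer nodes.
rng = np.random.default_rng(0)
acts = rng.standard_normal((1000, 64)) @ np.diag(np.linspace(1.0, 0.01, 64))
print(degrees_of_freedom(acts, lam=0.1))
```

Roughly speaking, a layer whose covariance spectrum decays quickly has small degrees of freedom, and under this framework it can be compressed to correspondingly few nodes with a controlled bias--variance tradeoff.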


Citations
Posted Content

Compression based bound for non-compressed network: unified generalization error analysis of large compressible deep neural network

TL;DR: A unified framework is given that can convert compression-based bounds into bounds for non-compressed original networks, and a data-dependent generalization error bound is derived that yields a tighter evaluation than data-independent ones.
Proceedings ArticleDOI

Rate-Distortion Theoretic Generalization Bounds for Stochastic Learning Algorithms

TL;DR: This study proves novel generalization bounds through the lens of rate-distortion theory, and explicitly relates the concepts of mutual information, compressibility, and fractal dimensions in a single mathematical framework.
Journal ArticleDOI

Deep neural networks with dependent weights: Gaussian Process mixture limit, heavy tails, sparsity and compressibility

TL;DR: The infinite-width limit of deep feedforward neural networks whose weights are dependent and modelled via a mixture of Gaussian distributions is studied, and it is shown that, in this regime, the weights are compressible and feature learning is possible.
Posted Content

Understanding the Effects of Pre-Training for Object Detectors via Eigenspectrum

TL;DR: In this article, the eigenspectrum dynamics of the covariance matrix of each feature map in object detectors are analyzed in order to understand the effect of pre-training on detector performance.
References
Proceedings ArticleDOI

Deep Residual Learning for Image Recognition

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously; the resulting networks won 1st place on the ILSVRC 2015 classification task.
Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

TL;DR: This work investigates the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting using an architecture with very small convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers.
Proceedings ArticleDOI

ImageNet: A large-scale hierarchical image database

TL;DR: A new database called “ImageNet” is introduced, a large-scale ontology of images built upon the backbone of the WordNet structure, much larger in scale and diversity and much more accurate than the current image datasets.
Proceedings ArticleDOI

Densely Connected Convolutional Networks

TL;DR: DenseNet as mentioned in this paper proposes to connect each layer to every other layer in a feed-forward fashion, which can alleviate the vanishing gradient problem, strengthen feature propagation, encourage feature reuse, and substantially reduce the number of parameters.
Journal ArticleDOI

SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation

TL;DR: Quantitative assessments show that SegNet provides good performance, with competitive inference time and the most efficient inference memory usage compared with other architectures, including FCN and DeconvNet.