Open Access Book Chapter DOI

Mise en abyme with Artificial Intelligence: How to Predict the Accuracy of NN, Applied to Hyper-parameter Tuning.

TL;DR
In this paper, the authors propose a low-cost strategy to predict the final accuracy of a neural network based only on its initial training behaviour, using both curve fitting and support vector machines.
Abstract
In the context of deep learning, the computationally costliest phase is the full training of the learning algorithm. However, this process must be repeated a significant number of times during the design of a new artificial neural network, making it extremely expensive. Here, we propose a low-cost strategy to predict the accuracy of the algorithm based only on its initial behaviour. To do so, we train the network of interest up to convergence several times, modifying its characteristics at each training. The initial and final accuracies observed during this preliminary process are stored in a database. We then use both curve fitting and Support Vector Machine techniques, the latter trained on the created database, to predict the accuracy of the network given its accuracy on the first iterations of its learning. This approach is of particular interest when the space of the network's characteristics is notably large or when its full training is highly time-consuming. The results we obtained are promising and encouraged us to apply this strategy to a topical issue: hyper-parameter optimisation (HO). In particular, we focused on the HO of a convolutional neural network for the classification of the MNIST and CIFAR-10 databases. By using our prediction method, together with an algorithm we implemented for a probabilistic exploration of the hyper-parameter space, we were able to find the hyper-parameter settings corresponding to the optimal accuracies already known in the literature, at quite a low cost.
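The prediction pipeline described above lends itself to a compact illustration. The sketch below is only a minimal rendition of the idea under stated assumptions: the saturating-exponential learning-curve model, the SVR settings, and the toy accuracy values are placeholders for illustration, not the authors' actual implementation.

```python
# A minimal sketch of the accuracy-prediction idea described in the abstract,
# NOT the authors' implementation: the curve model, features and SVM variant
# are assumptions made for illustration.
import numpy as np
from scipy.optimize import curve_fit
from sklearn.svm import SVR

def saturating_curve(t, a, b, c):
    """Assumed learning-curve model: accuracy rises and saturates towards a."""
    return a - b * np.exp(-c * t)

def extrapolate_accuracy(early_accuracies, final_epoch):
    """Fit the curve to the first few epochs and extrapolate to the last one."""
    epochs = np.arange(1, len(early_accuracies) + 1)
    params, _ = curve_fit(saturating_curve, epochs, early_accuracies,
                          p0=[1.0, 1.0, 0.1], maxfev=5000)
    return saturating_curve(final_epoch, *params)

# Database built beforehand: each row holds the accuracies observed on the
# first epochs of one fully trained configuration; y holds the accuracy
# reached at convergence for that same configuration (toy numbers only).
X_early = np.array([[0.35, 0.52, 0.61, 0.66],
                    [0.41, 0.58, 0.67, 0.71],
                    [0.28, 0.44, 0.55, 0.60]])
y_final = np.array([0.78, 0.83, 0.72])

svr = SVR(kernel="rbf", C=10.0, epsilon=0.01)
svr.fit(X_early, y_final)

# Predict the final accuracy of a new configuration from its first epochs,
# combining the curve-fit extrapolation with the SVR prediction.
new_early = np.array([0.38, 0.55, 0.64, 0.69])
curve_pred = extrapolate_accuracy(new_early, final_epoch=50)
svr_pred = svr.predict(new_early.reshape(1, -1))[0]
print(f"curve-fit estimate: {curve_pred:.3f}, SVR estimate: {svr_pred:.3f}")
```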


Citations
Journal Article DOI

Neural architecture search via standard machine learning methodologies

TL;DR: The main contribution of the paper is an automatic machine learning technique that, via a probabilistic exploration of the hyperparameter space, sets the hyperparameters of a CNN so that a measure of its performance can be optimised.
References
Journal Article

Random search for hyper-parameter optimization

TL;DR: This paper shows empirically and theoretically that randomly chosen trials are more efficient for hyper-parameter optimization than trials on a grid, and that random search is a natural baseline against which to judge progress in the development of adaptive (sequential) hyper-parameter optimization algorithms.
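The random-search baseline summarised above is simple enough to sketch directly. The fragment below is illustrative only: the `evaluate` callback, the search ranges, and the trial budget are assumptions standing in for a real training-and-validation routine.

```python
# A minimal sketch of random search over a hyper-parameter space, assuming a
# black-box `evaluate` routine that trains a model and returns its validation
# accuracy; parameter names and ranges are illustrative only.
import random

def random_search(evaluate, n_trials=20, seed=0):
    rng = random.Random(seed)
    best_config, best_score = None, float("-inf")
    for _ in range(n_trials):
        config = {
            "learning_rate": 10 ** rng.uniform(-4, -1),  # log-uniform sampling
            "batch_size": rng.choice([32, 64, 128, 256]),
            "dropout": rng.uniform(0.0, 0.5),
        }
        score = evaluate(config)
        if score > best_score:
            best_config, best_score = config, score
    return best_config, best_score
```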
Journal Article DOI

Taking the Human Out of the Loop: A Review of Bayesian Optimization

TL;DR: This review paper introduces Bayesian optimization, highlights some of its methodological aspects, and showcases a wide range of applications.
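For context, the loop at the heart of Bayesian optimization can be sketched with a Gaussian-process surrogate and an expected-improvement acquisition rule. The snippet below is a minimal, assumption-laden illustration (1-D search space, toy objective, scikit-learn GP), not the formulation given in the review itself.

```python
# A minimal sketch of a Bayesian-optimization loop: GP surrogate + expected
# improvement. Objective, kernel and candidate grid are illustrative assumptions.
import numpy as np
from scipy.stats import norm
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import Matern

def expected_improvement(mu, sigma, best, xi=0.01):
    sigma = np.maximum(sigma, 1e-9)
    z = (mu - best - xi) / sigma
    return (mu - best - xi) * norm.cdf(z) + sigma * norm.pdf(z)

def bayes_opt(objective, bounds=(0.0, 1.0), n_init=3, n_iter=10, seed=0):
    rng = np.random.default_rng(seed)
    X = rng.uniform(*bounds, size=(n_init, 1))          # initial random designs
    y = np.array([objective(x[0]) for x in X])
    candidates = np.linspace(*bounds, 200).reshape(-1, 1)
    gp = GaussianProcessRegressor(kernel=Matern(nu=2.5), normalize_y=True)
    for _ in range(n_iter):
        gp.fit(X, y)
        mu, sigma = gp.predict(candidates, return_std=True)
        x_next = candidates[np.argmax(expected_improvement(mu, sigma, y.max()))]
        X = np.vstack([X, [x_next]])
        y = np.append(y, objective(x_next[0]))
    return X[np.argmax(y)], y.max()

# Toy usage: maximize a 1-D stand-in for "validation accuracy as a function of
# learning rate".
best_x, best_y = bayes_opt(lambda lr: np.exp(-(lr - 0.3) ** 2 / 0.02))
```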
Posted Content

Neural Architecture Search with Reinforcement Learning

Barret Zoph, +1 more
05 Nov 2016
TL;DR: This paper uses a recurrent network to generate the model descriptions of neural networks and trains this RNN with reinforcement learning to maximize the expected accuracy of the generated architectures on a validation set.
Proceedings Article

Algorithms for Hyper-Parameter Optimization

TL;DR: This work contributes novel techniques for making response surface models P(y|x) in which many elements of hyper-parameter assignment (x) are known to be irrelevant given particular values of other elements.
Book Chapter DOI

Sequential model-based optimization for general algorithm configuration

TL;DR: This paper extends the explicit regression models paradigm for the first time to general algorithm configuration problems, allowing many categorical parameters and optimization for sets of instances, and yields state-of-the-art performance.