Selective Layer Tuning and Performance Study of Pre-Trained Models Using Genetic Algorithm

doi:10.3390/electronics11192985

Open Access, Journal Article


Jae Cheol Jeong, +8 more

21 Sep 2022, Electronics, Vol. 11, Iss. 19, p. 2985


Abstract:

Using a pre-trained model means initializing a network with all or part of its pre-trained parameters. Configuring such a model generally demands either practitioner knowledge of the problem or exhaustive trial-and-error experiments for the task at hand. In this paper, we propose using a genetic algorithm to choose which layers of a pre-trained model are trainable when the model is fine-tuned on single-channel image datasets for a classification task. The single-channel datasets comprise grayscale images and audio signals preprocessed into log-Mel spectrograms. The four deep-learning models used in the experimental evaluation were pre-trained on the ImageNet dataset. The proposed genetic algorithm searches, generation by generation, for the layer configuration with the highest fitness, and that configuration determines the selective layer tuning of the pre-trained model. Compared with conventional fine-tuning and with random layer search, respectively, the proposed selective layer search with a genetic algorithm achieves higher average accuracy by 9.7% and 1.88% (MNIST-Fashion), 1.31% and 1.14% (UrbanSound8k), and 2.2% and 0.29% (HospitalAlarmSound). In addition, the search method can be applied to other datasets of the same task without prior knowledge of the dataset of interest.
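The layer search described in the abstract can be illustrated with a minimal sketch. It assumes each candidate is a binary mask over the model's layers (1 = trainable, 0 = frozen) and that fitness is the validation accuracy obtained after fine-tuning with that mask. The abstract does not specify the encoding, operators, or hyperparameters, so everything below is an assumption for illustration: truncation selection, single-point crossover, bit-flip mutation, elitism, and the names `evolve_layer_mask` and `toy_fitness`. The toy fitness function is a hypothetical stand-in for an actual fine-tuning run.

```python
import random


def evolve_layer_mask(fitness, n_layers, pop_size=8, generations=10,
                      mutation_rate=0.1, seed=0):
    """Search for a binary layer mask (1 = trainable, 0 = frozen).

    Assumes n_layers >= 2 and pop_size >= 2. All operators here are
    illustrative assumptions, not the paper's confirmed configuration.
    """
    rng = random.Random(seed)
    # Random initial population of layer masks.
    population = [[rng.randint(0, 1) for _ in range(n_layers)]
                  for _ in range(pop_size)]
    best_mask, best_fit = None, float("-inf")
    for _ in range(generations):
        # Evaluate every mask and rank by fitness, highest first.
        scored = sorted(((fitness(m), m) for m in population),
                        key=lambda t: t[0], reverse=True)
        # Track the highest-fitness mask seen in any generation.
        if scored[0][0] > best_fit:
            best_fit, best_mask = scored[0][0], list(scored[0][1])
        # Truncation selection: the top half become parents.
        parents = [m for _, m in scored[:max(2, pop_size // 2)]]
        # Elitism: carry the generation's best mask over unchanged.
        children = [list(parents[0])]
        while len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            cut = rng.randint(1, n_layers - 1)  # single-point crossover
            child = a[:cut] + b[cut:]
            # Bit-flip mutation toggles a layer between frozen and trainable.
            child = [1 - g if rng.random() < mutation_rate else g
                     for g in child]
            children.append(child)
        population = children
    return best_mask, best_fit


def toy_fitness(mask):
    # Hypothetical stand-in for "validation accuracy after fine-tuning":
    # rewards agreement with a made-up target mask.
    target = [0, 0, 1, 0, 1, 1]
    return sum(int(g == t) for g, t in zip(mask, target)) / len(target)


mask, fit = evolve_layer_mask(toy_fitness, n_layers=6)
print(mask, fit)
```

In a real setting, `fitness` would fine-tune the pre-trained model with only the masked layers trainable and return held-out accuracy, so each evaluation is expensive and the population size and generation count would need to be budgeted accordingly.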



