Literature DB >> 33816989

Multi-objective simulated annealing for hyper-parameter optimization in convolutional neural networks.

Ayla Gülcü1, Zeki Kuş1.   

Abstract

In this study, we model a CNN hyper-parameter optimization problem as a bi-criteria optimization problem, where the first objective being the classification accuracy and the second objective being the computational complexity which is measured in terms of the number of floating point operations. For this bi-criteria optimization problem, we develop a Multi-Objective Simulated Annealing (MOSA) algorithm for obtaining high-quality solutions in terms of both objectives. CIFAR-10 is selected as the benchmark dataset, and the MOSA trade-off fronts obtained for this dataset are compared to the fronts generated by a single-objective Simulated Annealing (SA) algorithm with respect to several front evaluation metrics such as generational distance, spacing and spread. The comparison results suggest that the MOSA algorithm is able to search the objective space more effectively than the SA method. For each of these methods, some front solutions are selected for longer training in order to see their actual performance on the original test set. Again, the results state that the MOSA performs better than the SA under multi-objective setting. The performance of the MOSA configurations are also compared to other search generated and human designed state-of-the-art architectures. It is shown that the network configurations generated by the MOSA are not dominated by those architectures, and the proposed method can be of great use when the computational complexity is as important as the test accuracy.
© 2021 Gülcü and Kuş.

Entities:  

Keywords:  Convolutional neural networks; Hyper-parameter optimization; Multi-objective; Simulated annealing

Year:  2021        PMID: 33816989      PMCID: PMC7924536          DOI: 10.7717/peerj-cs.338

Source DB:  PubMed          Journal:  PeerJ Comput Sci        ISSN: 2376-5992


  4 in total

Review 1.  Multiobjective evolutionary algorithms: analyzing the state-of-the-art.

Authors:  D A Van Veldhuizen; G B Lamont
Journal:  Evol Comput       Date:  2000       Impact factor: 3.277

2.  Comparison of multiobjective evolutionary algorithms: empirical results.

Authors:  E Zitzler; K Deb; L Thiele
Journal:  Evol Comput       Date:  2000       Impact factor: 3.277

3.  Optimization by simulated annealing.

Authors:  S Kirkpatrick; C D Gelatt; M P Vecchi
Journal:  Science       Date:  1983-05-13       Impact factor: 47.728

4.  Completely Automated CNN Architecture Design Based on Blocks.

Authors:  Yanan Sun; Bing Xue; Mengjie Zhang; Gary G Yen
Journal:  IEEE Trans Neural Netw Learn Syst       Date:  2019-06-20       Impact factor: 10.451

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.