A new efficient training strategy for deep neural networks by hybridization of artificial bee colony and limited-memory BFGS optimization algorithms


Badem H., BAŞTÜRK A., ÇALIŞKAN A., YÜKSEL M. E.

NEUROCOMPUTING, vol.266, pp.506-526, 2017 (SCI-Expanded) identifier identifier

  • Publication Type: Article / Article
  • Volume: 266
  • Publication Date: 2017
  • Doi Number: 10.1016/j.neucom.2017.05.061
  • Journal Name: NEUROCOMPUTING
  • Journal Indexes: Science Citation Index Expanded (SCI-EXPANDED), Scopus
  • Page Numbers: pp.506-526
  • Keywords: Training strategy, L-BFGS, Artificial bee colony optimization algorithm, Deep learning, Stacked autoencoder network, Deep neural network, Hybridization, PARAMETERS
  • Erciyes University Affiliated: Yes

Abstract

Working up with deep learning techniques requires profound understanding of the mechanisms underlying the optimization of the internal parameters of complex structures. The major factor limiting this understanding is that there exist only a few optimization methods such as gradient descent and Limited memory Broyden-Fletcher-Goldfarb-Shannon (L-BFGS) to find the best local minima of the problem space for these complex structures such as deep neural network (DNN). Therefore, in this paper, we represent a new training approach named hybrid artificial bee colony based training strategy (HABCbTS) to tune the parameters of a DNN structure, which includes one or more autoencoder layers cascaded to a softmax classification layer. In this strategy, a derivative-free optimization algorithm "ABC" is combined with a derivative-based algorithm "L-BFGS" to construct "HABC", which is used in the HABCbTS. Detailed simulation results supported by statistical analysis show that the proposed training strategy results in better classification performance compared to the DNN classifier trained with the L-BFGS, ABC and modified ABC. The obtained classification results are also compared with the state-of-the-art classifiers, including MLP, SVM, KNN, DT and NB on 15 data sets with different dimensions and sizes. (C) 2017 Elsevier B.V. All rights reserved.