A new efficient training strategy for deep neural networks by hybridization of artificial bee colony and limited-memory BFGS optimization algorithms

Badem, Hasan; BAŞTÜRK, ALPER; ÇALIŞKAN, Abdullah; YÜKSEL, MEHMET

doi:10.1016/j.neucom.2017.05.061

A new efficient training strategy for deep neural networks by hybridization of artificial bee colony and limited-memory BFGS optimization algorithms

Badem H., BAŞTÜRK A., ÇALIŞKAN A., YÜKSEL M. E.

NEUROCOMPUTING, cilt.266, ss.506-526, 2017 (SCI-Expanded, Scopus)

Yayın Türü: Makale / Tam Makale
Cilt numarası: 266
Basım Tarihi: 2017
Doi Numarası: 10.1016/j.neucom.2017.05.061
Dergi Adı: NEUROCOMPUTING
Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus
Sayfa Sayıları: ss.506-526
Anahtar Kelimeler: Training strategy, L-BFGS, Artificial bee colony optimization algorithm, Deep learning, Stacked autoencoder network, Deep neural network, Hybridization
Erciyes Üniversitesi Adresli: Evet

Özet

Working up with deep learning techniques requires profound understanding of the mechanisms underlying the optimization of the internal parameters of complex structures. The major factor limiting this understanding is that there exist only a few optimization methods such as gradient descent and Limited memory Broyden-Fletcher-Goldfarb-Shannon (L-BFGS) to find the best local minima of the problem space for these complex structures such as deep neural network (DNN). Therefore, in this paper, we represent a new training approach named hybrid artificial bee colony based training strategy (HABCbTS) to tune the parameters of a DNN structure, which includes one or more autoencoder layers cascaded to a softmax classification layer. In this strategy, a derivative-free optimization algorithm "ABC" is combined with a derivative-based algorithm "L-BFGS" to construct "HABC", which is used in the HABCbTS. Detailed simulation results supported by statistical analysis show that the proposed training strategy results in better classification performance compared to the DNN classifier trained with the L-BFGS, ABC and modified ABC. The obtained classification results are also compared with the state-of-the-art classifiers, including MLP, SVM, KNN, DT and NB on 15 data sets with different dimensions and sizes. (C) 2017 Elsevier B.V. All rights reserved.