Abstract
A general deep learning (DL) mechanism for a multiple hidden layer feed-forward neural network contains two parts, i.e., 1) an unsupervised greedy layer-wise training and 2) a supervised fine-tuning which is usually an iterative process. Although this mechanism has been demonstrated in many fields to be able to significantly improve the generalization of neural network, there is no clear evidence to show which one of the two parts plays the essential role for the generalization improvement, resulting in an argument within the DL community. Focusing on this argument, this paper proposes a new DL approach to train multilayer feed-forward neural networks. This approach uses restricted Boltzmann machine (RBM) as the layer-wise training and uses the generalized inverse of a matrix as the supervised fine-tuning. Different from the general deep training mechanism like back-propagation (BP), the proposed approach does not need to iteratively tune the weights, and therefore, has many advantages such as quick training, better generalization, and high understandability, etc. Experimentally, the proposed approach demonstrates an excellent performance in comparison with BP-based DL and the traditional training method for multilayer random weight neural networks. To a great extent, this paper demonstrates that the supervised part plays a more important role than the unsupervised part in DL, which provides some new viewpoints for exploring the essence of DL.
Original language | English |
---|---|
Article number | 7931567 |
Pages (from-to) | 1299-1308 |
Number of pages | 10 |
Journal | IEEE Transactions on Systems, Man, and Cybernetics: Systems |
Volume | 49 |
Issue number | 7 |
Early online date | 18 May 2017 |
DOIs | |
Publication status | Published - Jul 2019 |
Externally published | Yes |
Bibliographical note
Publisher Copyright:© 2013 IEEE.
Keywords
- Deep learning (DL)
- generalized inverse of matrix
- random weight neural network (RWNN)
- supervised learning
- training without iteration