The generalization ability of artificial neural networks (ANNs) is greatly dependent on their architectures. Constructive algorithms provide an attractive automatic way of determining a near-optimal ANN architecture for a given problem. Several such algorithms have been proposed in the literature and shown their effectiveness. This paper presents a new constructive algorithm (NCA) in automatically determining ANN architectures. Unlike most previous studies on determining ANN architectures, NCA puts emphasis on architectural adaptation and functional adaptation in its architecture determination process. It uses a constructive approach to determine the number of hidden layers in an ANN and of neurons in each hidden layer. To achieve functional adaptation, NCA trains hidden neurons in the ANN by using different training sets that were created by employing a similar concept used in the boosting algorithm. The purpose of using different training sets is to encourage hidden neurons to learn different parts or aspects of the training data so that the ANN can learn the whole training data in a better way. In this paper, the convergence and computational issues of NCA are analytically studied. The computational complexity of NCA is found to be O(W × Pt × X τ, where W is the number of weights in the ANN, Pt is the number of training examples, and X τ is the number of training epochs. This complexity has the same order as what the backpropagation learning algorithm requires for training a fixed ANN architecture. A set of eight classification and two approximation benchmark problems was used to evaluate the performance of NCA. The experimental results show that NCA can produce ANN architectures with fewer hidden neurons and better generalization ability compared to existing constructive and nonconstructive algorithms. © 2009 IEEE.
|Number of pages
|IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
|Early online date
|10 Jun 2009
|Published - Dec 2009
Bibliographical noteThe work of M. Islam was supported in part by a fellowship grant from the Japanese Society for the Promotion of Science. This paper was recommended by Associate Editor E. Santos, Jr.
- Architectural adaptation
- Artificial neural networks (ANNs)
- Constructive approach
- Functional adaptation
- Generalization ability