Abstract
The extreme learning machine (ELM) is known for its fast learning speed while maintaining acceptable generalisation. Its learning process can be divided into two parts: (1) randomly assigning the input weights and biases of the hidden layer, and (2) analytically determining the output weights using the Moore-Penrose generalised inverse. Through theoretical and experimental analysis, we point out that it is the random weight assignment, rather than the analytical determination with the generalised inverse, that leads to the fast training speed. In fact, computing the generalised inverse of the hidden layer output matrix by singular value decomposition (SVD) is very inefficient, especially on large-scale data, and in some cases cannot be carried out at all. Because this high computational complexity reduces the learning speed of ELM, the conjugate gradient method is introduced as a replacement for the Moore-Penrose generalised inverse, and a conjugate gradient based ELM (CG-ELM) is proposed. Numerical simulations show that, in most cases, CG-ELM is faster than ELM while maintaining similar generalisation. Even when ELM cannot work because of the huge amount of data, CG-ELM attains good performance, which shows experimentally that the Moore-Penrose generalised inverse is not what accounts for the fast learning speed of ELM.
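
The two training stages described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names (`hidden_output`, `solve_pinv`, `solve_cg`), the sigmoid activation, the choice to run conjugate gradient on the normal equations, and the small `ridge` stabiliser are all assumptions made for illustration, since the abstract does not specify these details.

```python
import numpy as np

def hidden_output(X, W, b):
    """Hidden-layer output matrix H for inputs X (sigmoid activation assumed)."""
    return 1.0 / (1.0 + np.exp(-(X @ W + b)))

def solve_pinv(H, T):
    """Output weights via the Moore-Penrose generalised inverse.

    np.linalg.pinv computes the pseudo-inverse through an SVD.
    """
    return np.linalg.pinv(H) @ T

def solve_cg(H, T, ridge=1e-8, tol=1e-10, max_iter=500):
    """Output weights via conjugate gradient on (H^T H + ridge*I) beta = H^T T.

    Running CG on the normal equations and adding a ridge term are
    assumptions of this sketch, not details taken from the abstract.
    """
    A = H.T @ H + ridge * np.eye(H.shape[1])
    B = H.T @ T
    beta = np.zeros_like(B)
    for j in range(B.shape[1]):          # one CG solve per output column
        x = np.zeros(A.shape[0])
        r = B[:, j] - A @ x              # initial residual
        p = r.copy()                     # initial search direction
        rs = r @ r
        for _ in range(max_iter):
            if rs < tol:                 # squared residual norm small enough
                break
            Ap = A @ p
            alpha = rs / (p @ Ap)        # step length along p
            x += alpha * p
            r -= alpha * Ap
            rs_new = r @ r
            p = r + (rs_new / rs) * p    # next A-conjugate direction
            rs = rs_new
        beta[:, j] = x
    return beta

# Stage 1: random, untrained input weights and biases (toy 1-D regression data).
rng = np.random.default_rng(0)
X = np.linspace(-3.0, 3.0, 200).reshape(-1, 1)
T = np.sin(X)
W = rng.normal(size=(1, 20))
b = rng.normal(size=20)
H = hidden_output(X, W, b)

# Stage 2: output weights, computed two ways.
beta_pinv = solve_pinv(H, T)
beta_cg = solve_cg(H, T)
print("training MSE (pinv):", float(np.mean((H @ beta_pinv - T) ** 2)))
print("training MSE (CG):  ", float(np.mean((H @ beta_cg - T) ** 2)))
```

In this sketch the hidden layer is shared by both solvers, so any difference in cost comes only from how the output weights are obtained, which is the comparison the abstract draws between ELM and CG-ELM.
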
Original language | English |
---|---|
Pages (from-to) | 314-322 |
Number of pages | 9 |
Journal | International Journal of Wireless and Mobile Computing |
Volume | 13 |
Issue number | 4 |
Early online date | 17 Jan 2017 |
Publication status | Published - 2017 |
Externally published | Yes |
Bibliographical note
This work is supported by the National Natural Science Foundation of China (Project no. 71371063, 61672205, 61503252), the Science & Technology Bureau of Shenzhen (JCYJ20150324140036825), the Natural Science Foundation of Hebei Province (Project no. A2015210103), Youth Foundation of Hebei Province Department of Education Fund (Project no. QN2016140), and China Postdoctoral Science Foundation (2016T90799).
Keywords
- Conjugate gradient method
- Extreme learning machine
- Generalised inverse
- SVD