An Initial Study on the Relationship between Meta Features of Dataset and the Initialization of NNRW

Weipeng CAO, Muhammed J.A. PATWARY, Pengfei YANG, Xizhao WANG, Zhong MING*

*Corresponding author for this work

Research output: Book Chapters | Papers in Conference ProceedingsConference paper (refereed)Referred Conference Paperpeer-review

20 Citations (Scopus)

Abstract

The initialization of neural networks with random weights (NNRW) has a significant impact on model performance. However, there is no suitable way to solve this problem so far. In this paper, the relationship between meta features of a dataset and the initialization of NNRW is studied. Specifically, we construct seven regression datasets with known attributes' distributions, then initialize NNRW with different distributions and trained them based on the datasets to get the corresponding models respectively. The relationship between the attributes' distributions of the datasets and the initialization of NNRW is analyzed by the performance of the models. Several interesting phenomena are observed: firstly, initializing NNRW with the Gaussian distribution can help the model to have a faster convergence rate than ones with the Gamma and Uniform distribution. Secondly, if one or more attributes in a dataset that follow the Gamma distribution, using Gamma distribution to initialize NNRW may result in a slower convergence rate and easy overfitting. Thirdly, initializing NNRW with a specific distribution with smaller variances can always achieve faster convergence rate and better generalization performance than the one with larger variances. The above experimental results are not sensitive to the activation function and the type of NNRW. Some theoretical analyses about the above observations are also given in the study.

Original languageEnglish
Title of host publicationProceedings of 2019 International Joint Conference on Neural Networks, IJCNN 2019
PublisherIEEE
Pages2092-2099
Number of pages8
ISBN (Electronic)9781728119854
DOIs
Publication statusPublished - Jul 2019
Externally publishedYes
Event2019 International Joint Conference on Neural Networks, IJCNN 2019 - Budapest, Hungary
Duration: 14 Jul 201919 Jul 2019

Publication series

NameInternational Joint Conference on Neural Networks (IJCNN)
Volume2019-July

Conference

Conference2019 International Joint Conference on Neural Networks, IJCNN 2019
Country/TerritoryHungary
CityBudapest
Period14/07/1919/07/19

Bibliographical note

We really thank the editor and anonymous reviewers for their invaluable suggestions to help us improve this paper. This work was supported by the National Natural Science Foundation of China under Grant nos. 61672358 and 61836005.

Keywords

  • extreme learning machine
  • meta feature
  • Neural networks with random weights
  • random vector functional link network

Fingerprint

Dive into the research topics of 'An Initial Study on the Relationship between Meta Features of Dataset and the Initialization of NNRW'. Together they form a unique fingerprint.

Cite this