Abstract
The presence of missing data is a challenging issue in processing real-world datasets. It is necessary to improve the data quality by imputing the missing values so that effective learning from data can be achieved. Recently, deep learning has become the most powerful type of machine learning techniques, which can be used for discovering the hidden knowledge that exists in a large dataset to make accurate predictions. In this paper, we propose an imputation method that involves using a convolutional neural network to impute the missing values. The missing value of each instance is imputed essentially by using a trained kernel. The weights of the kernel are determined by learning from the given data that are arranged spatially in the data matrix. The kernel carries out a weighted sum of neighboring elements in an array for imputing the missing values. In addition, in the absence of the true values with which the missing values are expected to be replaced, a loss function is designed without the need to know the true value. Our method is evaluated on UCI datasets in comparison with state-of-the-art methods. The experimental results show that the proposed approach performs closely to or better than other methods.
Original language | English |
---|---|
Pages (from-to) | 278-293 |
Number of pages | 16 |
Journal | Information Sciences |
Volume | 595 |
Early online date | 1 Mar 2022 |
DOIs | |
Publication status | Published - May 2022 |
Externally published | Yes |
Bibliographical note
Publisher Copyright:© 2022 Elsevier Inc.
Keywords
- Convolutional neural network
- Data imputation
- Fuzzy clustering
- Missing value