Handling missing data through deep convolutional neural network

Hufsa KHAN, Xizhao WANG, Han LIU*

*Corresponding author for this work

Research output: Journal PublicationsJournal Article (refereed)peer-review

24 Citations (Scopus)

Abstract

The presence of missing data is a challenging issue in processing real-world datasets. It is necessary to improve the data quality by imputing the missing values so that effective learning from data can be achieved. Recently, deep learning has become the most powerful type of machine learning techniques, which can be used for discovering the hidden knowledge that exists in a large dataset to make accurate predictions. In this paper, we propose an imputation method that involves using a convolutional neural network to impute the missing values. The missing value of each instance is imputed essentially by using a trained kernel. The weights of the kernel are determined by learning from the given data that are arranged spatially in the data matrix. The kernel carries out a weighted sum of neighboring elements in an array for imputing the missing values. In addition, in the absence of the true values with which the missing values are expected to be replaced, a loss function is designed without the need to know the true value. Our method is evaluated on UCI datasets in comparison with state-of-the-art methods. The experimental results show that the proposed approach performs closely to or better than other methods.

Original languageEnglish
Pages (from-to)278-293
Number of pages16
JournalInformation Sciences
Volume595
Early online date1 Mar 2022
DOIs
Publication statusPublished - May 2022
Externally publishedYes

Bibliographical note

Publisher Copyright:
© 2022 Elsevier Inc.

Keywords

  • Convolutional neural network
  • Data imputation
  • Fuzzy clustering
  • Missing value

Fingerprint

Dive into the research topics of 'Handling missing data through deep convolutional neural network'. Together they form a unique fingerprint.

Cite this