Nearest neighbour approach in the least-squares data imputation algorithms

Ito Wasito, B. Mirkin

Research output: Contribution to journalArticlepeer-review

48 Citations (Scopus)

Abstract

Imputation of missing data is of interest in many areas such as survey data editing, medical documentation maintaining and DNA microarray data analysis. This paper is devoted to experimental analysis of a set of imputation methods developed within the so-called least-squares approximation approach, a non-parametric computationally effective multidimensional technique. First, we review global methods for least-squares data imputation. Then we propose extensions of these algorithms based on the nearest neighbours approach. An experimental study of the algorithms on generated data sets is conducted. It appears that straight algorithms may work rather well on data of simple structure and/or with small number of missing entries. However, in more complex cases, the only winner within the least-squares approximation approach is a method, INI, proposed in this paper as a combination of global and local imputation algorithms.

Original languageEnglish
Pages (from-to)1-25
Number of pages25
JournalInformation Sciences
Volume169
Issue number1-2
DOIs
Publication statusPublished - 6 Jan 2005

Keywords

  • Global-local learning
  • Imputation
  • Least-squares
  • Nearest neighbour
  • Singular value decomposition

Fingerprint

Dive into the research topics of 'Nearest neighbour approach in the least-squares data imputation algorithms'. Together they form a unique fingerprint.

Cite this