Missing values imputation based on fuzzy C-Means algorithm for classification of chronic obstructive pulmonary disease (COPD)

Kiki Aristiawati, Titin Siswantining, Devvi Sarwinda, Saskya Mary Soemartojo

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

8 Citations (Scopus)

Abstract

Chronic Obstructive Pulmonary Disease (COPD) is one of the most causes of death in the world. World Health Organization (WHO) reported that in 2016 COPD was the third leading cause of death worldwide with around 3 million deaths, equivalent to 5.2% of deaths worldwide. For this reason, further research needs to be done on CPOD. Unfortunately, the data collected in the study does not contain all the desired data, is called as a missing value. Missing value is a problem for all types of data analysis. Several ways that can be applied to handle missing value, by filtering data (ignore or remove data) and imputing data. Ignoring or removing data can reduce the amount of information contained in the data and can cause low accuracy to generate from the data analysis process. To overcome this problem, imputation data will be carried out at the preprocessing stage to obtain complete data which is expected to increase the accuracy of the data analysis performed. Many imputations method can be used, such as mean imputation and Fuzzy C-Means (FCM). Fuzzy C-Means is a clustering method that allows one part of the data to belong to two or more groups based on their membership function. The complete dataset was trained with Decision Tree classifier to observe the performance in terms of accuracy for mean and FCM method. The analysis of proposed imputation on classification shows that FCM slightly accurate compare to mean imputation method.

Original languageEnglish
Title of host publicationProceedings of the 8th SEAMS-UGM International Conference on Mathematics and Its Applications 2019
Subtitle of host publicationDeepening Mathematical Concepts for Wider Application through Multidisciplinary Research and Industries Collaborations
EditorsHerni Utami, Fajar Adi Kusumo, Nanang Susyanto, Yeni Susanti
PublisherAmerican Institute of Physics Inc.
ISBN (Electronic)9780735419438
DOIs
Publication statusPublished - 19 Dec 2019
Event8th SEAMS-UGM International Conference on Mathematics and Its Applications 2019: Deepening Mathematical Concepts for Wider Application through Multidisciplinary Research and Industries Collaborations - Yogyakarta, Indonesia
Duration: 29 Jul 20191 Aug 2019

Publication series

NameAIP Conference Proceedings
Volume2192
ISSN (Print)0094-243X
ISSN (Electronic)1551-7616

Conference

Conference8th SEAMS-UGM International Conference on Mathematics and Its Applications 2019: Deepening Mathematical Concepts for Wider Application through Multidisciplinary Research and Industries Collaborations
Country/TerritoryIndonesia
CityYogyakarta
Period29/07/191/08/19

Keywords

  • COPD
  • Decision Tree
  • Fuzzy C-Means
  • Imputation Data
  • Missing Value

Fingerprint

Dive into the research topics of 'Missing values imputation based on fuzzy C-Means algorithm for classification of chronic obstructive pulmonary disease (COPD)'. Together they form a unique fingerprint.

Cite this