BREAST CANCER MOLECULAR SUBTYPES CLASSIFICATION WITH NAÏVE BAYES CLASSIFIER: A CASE STUDY IN INDONESIAN BREAST CANCER PATIENTS

Research output: Contribution to journalArticlepeer-review

Abstract

Naïve Bayes Classifier (NBC) is one of the most popular machine learning methods due to its simplicity yet great overall accuracy. Previous studies stated a number of claims and reasons as to why this method is suitable to apply in analysing medical data. While there is abundant literature which cover the application of NBC on medical data, there is little to none which covers its application on medical data in Indonesia, especially Indonesian breast cancer data. The main goal of this study is to apply NBC in classifying 101 breast cancer patients in private hospital in Indonesia into five classes of molecular subtypes, assess its accuracy, and compare its performance with another popular classification model, Decision Tree (DT). Results showed that NBC outperformed DT, predicting the classes of most patients in the test set correctly by having an overall accuracy of 85.7%. NBC also could identify all patients in the human epidermal growth factor receptor 2 (HER2)-Enriched and Triple Negative subtypes faultlessly. Another important result to be noted is that oestrogen (ER), progesterone (PR), and HER2 statuses are dominant factors in differentiating subtypes between each patient, matching the official guide made by the breast cancer expert panel. Several empirical results were also found regarding NBC: it does not need a large data to produce high overall accuracy and it could still predict the classes of most patients correctly even with the presence of missing and noisy data.

Original languageEnglish
Pages (from-to)1-17
Number of pages17
JournalJournal of Applied Probability and Statistics
Volume16
Issue number3
Publication statusPublished - Dec 2021

Keywords

  • and posterior class probability
  • Bayes rule
  • breast cancer subtypes
  • classification
  • decision tree

Fingerprint

Dive into the research topics of 'BREAST CANCER MOLECULAR SUBTYPES CLASSIFICATION WITH NAÏVE BAYES CLASSIFIER: A CASE STUDY IN INDONESIAN BREAST CANCER PATIENTS'. Together they form a unique fingerprint.

Cite this