Comparing random forest and support vector machines for breast cancer classification

Chelvian Aroef, Yuda Rivan, Zuherman Rustam

Research output: Contribution to journalArticle

Abstract

There are more than 100 types of cancer around the world with different symptoms and difficulty in predicting its appearance in a person due to its random and sudden attack method. However, the appearance of cancer is generally marked by the growth of some abnormal cell. Someone might be diagnosed early and quickly treated, but the cancerous cell most times hides in the body of its victim and reappear, only to kill its sufferer. One of the most common cancers is breast cancer. According to Ministry of Health, in 2018, breast cancer attacked 42 out of every 100.000 people in Indonesia with approximately 17 deaths. In addition, the Ministry recorded a yearly increase in cancer patients. Therefore, there is adequate need to be able to determine those affected by this disease. This study applied the Boruta feature selection to determine the most important features in making a machine learning model. Furthermore, the Random Forest (RF) and Support Vector Machines (SVM) were the machine learning model used, with highest accuracies of 90% and 95% respectively. From the results obtained, the SVM is a better model than random forest in terms of accuracy.

Original languageEnglish
Pages (from-to)815-821
Number of pages7
JournalTelkomnika (Telecommunication Computing Electronics and Control)
Volume18
Issue number2
DOIs
Publication statusPublished - 1 Apr 2020

Keywords

  • Breast cancer
  • Random forest
  • Support vector machines

Fingerprint Dive into the research topics of 'Comparing random forest and support vector machines for breast cancer classification'. Together they form a unique fingerprint.

  • Cite this