Feature optimization using Backward Elimination and Support Vector Machines (SVM) algorithm for diabetes classification

F. Maulidina, Z. Rustam, S. Hartini, V. V.P. Wibowo, I. Wirasati, W. Sadewo

Research output: Contribution to journal › Conference article › peer-review

11 Citations (Scopus)


Diabetes is a disease that occurs when the blood glucose level is higher than normal, and it leads to further health problems. Individuals affected by this disease need early and accurate diagnosis, followed by appropriate treatment to prevent their condition from worsening. Several studies have used machine learning methods to diagnose diabetes. In this study, Backward Elimination and the Support Vector Machine (SVM) algorithm were used to classify the PIMA Indians diabetes dataset, which consists of 268 diabetic and 500 non-diabetic patients with eight attributes. Backward Elimination is a feature selection method that removes irrelevant features based on a linear regression model. Using this method, the right features for the model were expected to be obtained. The method has several advantages, including reducing training time, decreasing complexity, and improving performance and accuracy; as a result, the performance of SVM improved. In the experiments, combining the feature selection algorithm (Backward Elimination) with SVM gave a highest accuracy of 85.71% when 90% of the data was used for training. It was therefore concluded that Backward Elimination combined with the SVM algorithm is an excellent method for classifying diabetes on the PIMA Indians diabetes dataset.
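The abstract does not give implementation details, so the following is only a minimal sketch of backward elimination driven by a linear regression model, written in plain Python. Everything in it is an assumption made for illustration: the synthetic four-feature data stands in for the PIMA attributes, the function names are hypothetical, and the cutoff |t| < 1.96 (roughly a two-sided 5% significance level for large samples) is not a value taken from the paper.

```python
import math
import random


def solve_normal_equations(X, y):
    """Ordinary least squares: return (beta, inverse of X'X) via Gauss-Jordan."""
    p = len(X[0])
    # Augmented matrix [X'X | I]; row-reducing it leaves the inverse on the right.
    A = [[sum(row[i] * row[j] for row in X) for j in range(p)]
         + [1.0 if i == j else 0.0 for j in range(p)] for i in range(p)]
    for col in range(p):
        # Partial pivoting for numerical stability.
        pivot = max(range(col, p), key=lambda r: abs(A[r][col]))
        A[col], A[pivot] = A[pivot], A[col]
        scale = A[col][col]
        A[col] = [v / scale for v in A[col]]
        for r in range(p):
            if r != col:
                factor = A[r][col]
                A[r] = [a - factor * b for a, b in zip(A[r], A[col])]
    inv = [row[p:] for row in A]
    Xty = [sum(row[i] * t for row, t in zip(X, y)) for i in range(p)]
    beta = [sum(inv[i][j] * Xty[j] for j in range(p)) for i in range(p)]
    return beta, inv


def t_statistics(X, y):
    """t-statistic of every coefficient in the regression of y on X."""
    n, p = len(X), len(X[0])
    beta, inv = solve_normal_equations(X, y)
    residuals = [t - sum(b * v for b, v in zip(beta, row)) for row, t in zip(X, y)]
    sigma2 = sum(r * r for r in residuals) / (n - p)  # residual variance
    return [beta[j] / math.sqrt(sigma2 * inv[j][j]) for j in range(p)]


def backward_elimination(features, y, t_threshold=1.96):
    """Drop the weakest feature until every remaining |t| reaches the threshold.

    `features` is a list of rows without an intercept column; the return
    value is the list of column indices that were kept.
    """
    kept = list(range(len(features[0])))
    while kept:
        X = [[1.0] + [row[j] for j in kept] for row in features]
        t = t_statistics(X, y)[1:]  # skip the intercept
        weakest = min(range(len(kept)), key=lambda k: abs(t[k]))
        if abs(t[weakest]) >= t_threshold:
            break
        del kept[weakest]
    return kept


# Synthetic stand-in data (NOT the PIMA dataset): only columns 0 and 1
# influence the binary label; columns 2 and 3 are pure noise.
random.seed(0)
rows, labels = [], []
for _ in range(300):
    x = [random.gauss(0, 1) for _ in range(4)]
    score = 2.0 * x[0] - 2.0 * x[1] + random.gauss(0, 0.5)
    rows.append(x)
    labels.append(1.0 if score > 0 else 0.0)

selected = backward_elimination(rows, labels)
print("kept feature indices:", selected)
```

In a pipeline like the one the abstract describes, the retained columns would then be passed to an SVM classifier (for example scikit-learn's `sklearn.svm.SVC`), which is left out here to keep the sketch dependency-free.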

Original language: English
Article number: 012006
Journal: Journal of Physics: Conference Series
Issue number: 1
Publication status: Published - 29 Mar 2021
Event: 6th International Conference on Mathematics: Pure, Applied and Computation, ICOMPAC 2020 - Surabaya, Virtual, Indonesia
Duration: 24 Oct 2020 → …

