TY - GEN
T1 - The impact of feature selection methods on machine learning-based docking prediction of Indonesian medicinal plant compounds and HIV-1 protease
AU - Pujianto, Rahman
AU - Gultom, Yohanes
AU - Wibisono, Ari
AU - Yanuar, Arry
AU - Suhartanto, Heru
PY - 2019/10
Y1 - 2019/10
N2 - This work evaluates the use of feature selection methods to reduce the number of features required to predict docking results between Indonesian medicinal plant compounds and HIV protease. Two feature selection methods, Recursive Feature Elimination (RFE) and Wrapper Method (WM), are trained with a dataset of 7,330 samples and 667 features from PubChem Bioassay and DUD-E decoys. To evaluate the selected features, a dataset of 368 Indonesian herbal chemical compounds labeled by manually docking to PDB HIV-1 protease is used to benchmark the performance of a linear SVM classifier using different sets of features. Our experiments show that sets of 471 features selected by RFE and 249 features selected by WM reduce classification time by 4.0 and 8.2 seconds, respectively. Although accuracy and sensitivity also increase by 8% and 16%, no meaningful improvement is observed for precision and specificity.
AB - This work evaluates the use of feature selection methods to reduce the number of features required to predict docking results between Indonesian medicinal plant compounds and HIV protease. Two feature selection methods, Recursive Feature Elimination (RFE) and Wrapper Method (WM), are trained with a dataset of 7,330 samples and 667 features from PubChem Bioassay and DUD-E decoys. To evaluate the selected features, a dataset of 368 Indonesian herbal chemical compounds labeled by manually docking to PDB HIV-1 protease is used to benchmark the performance of a linear SVM classifier using different sets of features. Our experiments show that sets of 471 features selected by RFE and 249 features selected by WM reduce classification time by 4.0 and 8.2 seconds, respectively. Although accuracy and sensitivity also increase by 8% and 16%, no meaningful improvement is observed for precision and specificity.
UR - http://www.scopus.com/inward/record.url?scp=85081088919&partnerID=8YFLogxK
U2 - 10.1109/ICACSIS47736.2019.8979672
DO - 10.1109/ICACSIS47736.2019.8979672
M3 - Conference contribution
T3 - 2019 International Conference on Advanced Computer Science and Information Systems, ICACSIS 2019
SP - 181
EP - 186
BT - 2019 International Conference on Advanced Computer Science and Information Systems, ICACSIS 2019
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 11th International Conference on Advanced Computer Science and Information Systems, ICACSIS 2019
Y2 - 12 October 2019 through 13 October 2019
ER -