TY - JOUR
T1 - Tree-Based Ensemble Methods and Their Applications for Predicting Students’ Academic Performance
AU - Ayulani, Indri Dayanah
AU - Yunawan, Agatha Melinda
AU - Prihutaminingsih, Tamara
AU - Sarwinda, Devvi
AU - Ardaneswari, Gianinna
AU - Handari, Bevina Desjwiandra
N1 - Funding Information:
ACKNOWLEDGMENT Universitas Indonesia funded this research, PUTI grant scheme No.: NKB-1977/UN2.RST/HKP.05.00/2020.
Funding Information:
Universitas Indonesia funded this research, PUTI grant scheme No.: NKB-1977/UN2.RST/HKP.05.00/2020.
Publisher Copyright:
© IJASEIT is licensed under a Creative Commons Attribution-Share Alike 4.0 International License.
PY - 2023
Y1 - 2023
N2 - Students’ academic performance is a key aspect of online learning success. Online learning applications known as Learning Management Systems (LMS) store various online learning activities. In this research, students’ academic performances in online course X are predicted such that teachers could identify students who are at risk much sooner. The prediction uses tree-based ensemble methods such as Random Forest, XGBoost (Extreme Gradient Boosting), and LightGBM (Light Gradient Boosting Machine). Random Forest is a bagging method, whereas XGBoost and LightGBM are boosting methods. The data recorded in LMS UI, or EMAS (e-Learning Management Systems) is collected. The data consists of activity data for 232 students (219 passed, 13 failed) in course X. This data is divided into three proportions (80:20, 70:30, and 60:40) and three periods (the first, first two, and first three months of the study period). Data is pre-processed using the SMOTE method to handle imbalanced data and implemented in all categories, with and without feature selection. The prediction results are compared to determine the best time for predicting students’ academic performance and how well each model can predict the number of unsuccessful students. The implementation results show that students’ academic performance can be predicted at the end of the second month, with best prediction rates of 86.8%, 80%, and 75% for the LightGBM, Random Forest, and XGBoost models, respectively, with feature selection. Therefore, with this prediction, students who could fail still have time to improve their academic performance.
AB - Students’ academic performance is a key aspect of online learning success. Online learning applications known as Learning Management Systems (LMS) store various online learning activities. In this research, students’ academic performances in online course X are predicted such that teachers could identify students who are at risk much sooner. The prediction uses tree-based ensemble methods such as Random Forest, XGBoost (Extreme Gradient Boosting), and LightGBM (Light Gradient Boosting Machine). Random Forest is a bagging method, whereas XGBoost and LightGBM are boosting methods. The data recorded in LMS UI, or EMAS (e-Learning Management Systems) is collected. The data consists of activity data for 232 students (219 passed, 13 failed) in course X. This data is divided into three proportions (80:20, 70:30, and 60:40) and three periods (the first, first two, and first three months of the study period). Data is pre-processed using the SMOTE method to handle imbalanced data and implemented in all categories, with and without feature selection. The prediction results are compared to determine the best time for predicting students’ academic performance and how well each model can predict the number of unsuccessful students. The implementation results show that students’ academic performance can be predicted at the end of the second month, with best prediction rates of 86.8%, 80%, and 75% for the LightGBM, Random Forest, and XGBoost models, respectively, with feature selection. Therefore, with this prediction, students who could fail still have time to improve their academic performance.
KW - features selection
KW - learning analytics
KW - learning management systems
KW - LightGBM
KW - machine learning
KW - online learning
KW - Random Forest
KW - Students’ academic performance
KW - tree-based ensemble methods
KW - XGBoost
UR - http://www.scopus.com/inward/record.url?scp=85163348220&partnerID=8YFLogxK
U2 - 10.18517/ijaseit.13.3.16880
DO - 10.18517/ijaseit.13.3.16880
M3 - Article
AN - SCOPUS:85163348220
SN - 2088-5334
VL - 13
SP - 919
EP - 927
JO - International Journal on Advanced Science, Engineering and Information Technology
JF - International Journal on Advanced Science, Engineering and Information Technology
IS - 3
ER -