TY - GEN
T1 - A Comparative Analysis of Cross-Classifiers with Resampled Datasets for Exfiltration Attacks
AU - Hakim, Arif Rahman
AU - Ramli, Kalamullah
AU - Inayah, Kuni
AU - Agustina, Esti Rahmawati
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Imbalanced datasets pose a significant challenge in intrusion detection, often reducing model performance in classifying attacks. This study evaluates the impact of three resampling techniques, BorderlineSMOTE, SMOTEENN, and SMOTETomek, on the performance of three popular classifiers: Random Forest, XGBoost, and SupportVector Machine (SVM). Using a dataset of over 2.5 million attack records generated by the BSSN honeynet targeting government networks, we assess accuracy, precision, recall, F1-score, specificity, and geometric mean. Results reveal that SMOTEENN consistently delivers superior performance across all classifiers, with notable efficiency in balancing data and enhancing detection accuracy. These findings underscore the importance of selecting the proper resampling technique and classifier for improving intrusion detection in imbalanced datasets.
AB - Imbalanced datasets pose a significant challenge in intrusion detection, often reducing model performance in classifying attacks. This study evaluates the impact of three resampling techniques, BorderlineSMOTE, SMOTEENN, and SMOTETomek, on the performance of three popular classifiers: Random Forest, XGBoost, and SupportVector Machine (SVM). Using a dataset of over 2.5 million attack records generated by the BSSN honeynet targeting government networks, we assess accuracy, precision, recall, F1-score, specificity, and geometric mean. Results reveal that SMOTEENN consistently delivers superior performance across all classifiers, with notable efficiency in balancing data and enhancing detection accuracy. These findings underscore the importance of selecting the proper resampling technique and classifier for improving intrusion detection in imbalanced datasets.
KW - exfiltration
KW - honeynet
KW - imbalance
KW - machine learning
KW - resampling
UR - http://www.scopus.com/inward/record.url?scp=105002484221&partnerID=8YFLogxK
U2 - 10.1109/ICITSI65188.2024.10929455
DO - 10.1109/ICITSI65188.2024.10929455
M3 - Conference contribution
AN - SCOPUS:105002484221
T3 - 2024 International Conference on Information Technology Systems and Innovation, ICITSI 2024 - Proceedings
SP - 106
EP - 111
BT - 2024 International Conference on Information Technology Systems and Innovation, ICITSI 2024 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2024 International Conference on Information Technology Systems and Innovation, ICITSI 2024
Y2 - 12 December 2024
ER -