IMPROVING MODEL PERFORMANCE FOR PREDICTING EXFILTRATION ATTACKS THROUGH RESAMPLING STRATEGIES

ARIF RAHMAN HAKIM, KALAMULLAH RAMLI, MUHAMMAD SALMAN, ESTI RAHMAWATI AGUSTINA

Research output: Contribution to journalArticlepeer-review

Abstract

Addressing class imbalance is critical in cybersecurity applications, particularly in scenarios like exfiltration detection, where skewed datasets lead to biased predictions and poor generalization for minority classes. This study investigates five Synthetic Minority Oversampling Technique (SMOTE) variants, including BorderlineSMOTE, KMeansSMOTE, SMOTEENC, SMOTEENN, and SMOTETomek, to mitigate severe imbalance in our customized tactic-labeled dataset with dominant majority class influence and weak class separability class imbalance. We use seven imbalance metrics to assess each SMOTE variant's impact on class distribution stability and separability. Furthermore, we evaluate model performance across five classifiers: Logistic Regression, Naïve Bayes, Support Vector Machine, Random Forest, and XGBoost. Findings reveal that SMOTEENN consistently enhances performance metrics (accuracy, precision, recall, F1-score, and geometric mean) on an average of 99% across most classifiers, establishing itself as the most adaptable variant for handling imbalance. This study provides a comprehensive framework for selecting resampling strategies to enhance classification efficacy in cybersecurity tasks with imbalanced data.

Original languageEnglish
Pages (from-to)420-436
Number of pages17
JournalIIUM Engineering Journal
Volume26
Issue number1
DOIs
Publication statusPublished - 2025

Keywords

  • and Exfiltration
  • Imbalance Data
  • Machine Learning
  • SMOTE

Fingerprint

Dive into the research topics of 'IMPROVING MODEL PERFORMANCE FOR PREDICTING EXFILTRATION ATTACKS THROUGH RESAMPLING STRATEGIES'. Together they form a unique fingerprint.

Cite this