CUSTOMER CHURN PREDICTION USING HISTOGRAM AUGMENTATION TECHNIQUE AND XGBOOST MODEL WITH BAYESIAN OPTIMIZATION

Research output: Contribution to journalArticlepeer-review

Abstract

Customer churn is a significant issue in many sectors, such as the telecom- munication sector. Therefore, telecommunication companies need to recognize churn risk as early as possible. Data from the IBM telco customer churn dataset was selected as a case study. One of the common challenges in classification problems is an imbalanced dataset, which will likely fail to predict the minority class. Oversampling with Histogram Augmentation Technique (HAT) is proposed in this study for handling the imbalanced class data. An ensemble learning of gradient boost machine learning techniques, namely XGBoost, was used in this study. In addition, we used Bayesian Optimization (BO) to find the best hyperparameter of the model. The experimental result shows that the accu- racy of HAT-XGBoost-BO is 0.88 and the F1-score is 0.85, outperforming the XGBoost, HAT-XGBoost, and SMOTE-XGBoost models.

Original languageEnglish
Pages (from-to)87-95
Number of pages9
JournalICIC Express Letters
Volume18
Issue number1
DOIs
Publication statusPublished - Jan 2024

Keywords

  • Augmentation
  • Bayesian optimization
  • Customer churn
  • HAT
  • Imbalanced dataset
  • XGBoost

Fingerprint

Dive into the research topics of 'CUSTOMER CHURN PREDICTION USING HISTOGRAM AUGMENTATION TECHNIQUE AND XGBOOST MODEL WITH BAYESIAN OPTIMIZATION'. Together they form a unique fingerprint.

Cite this