TY - GEN
T1 - Generative oversampling method (GenOMe) for imbalanced data on apnea detection using ECG data
AU - Sanabila, H. R.
AU - Kusuma, Ilham
AU - Jatmiko, Wisnu
N1 - Publisher Copyright:
© 2016 IEEE.
PY - 2017/3/6
Y1 - 2017/3/6
N2 - One of machine learning problem that is difficult but important to be addressed is imbalanced data where particular data is recessive while the others are dominant. Most of classifiers performance significantly degraded when dealing with imbalanced data. The major approaches to tackle imbalanced data are cost sensitive learning which modifies the classifier and resampling which modifies the data distribution. In this research, we employed generated oversampling method (GenOMe) that generate new data point with a particular distribution as a constraint. We examine three distribution functions: Beta, Gamma, and Gaussian distribution. We use Logistic Regression, Support Vector Machine (SVM), and Naive Bayes as classifier to assure the robustness of GenOMe. The experimental results shows that GenOMe outperforms classification using original data and classification using SMOTe (Synthetic Minority Oversampling Technique) data.
AB - One of machine learning problem that is difficult but important to be addressed is imbalanced data where particular data is recessive while the others are dominant. Most of classifiers performance significantly degraded when dealing with imbalanced data. The major approaches to tackle imbalanced data are cost sensitive learning which modifies the classifier and resampling which modifies the data distribution. In this research, we employed generated oversampling method (GenOMe) that generate new data point with a particular distribution as a constraint. We examine three distribution functions: Beta, Gamma, and Gaussian distribution. We use Logistic Regression, Support Vector Machine (SVM), and Naive Bayes as classifier to assure the robustness of GenOMe. The experimental results shows that GenOMe outperforms classification using original data and classification using SMOTe (Synthetic Minority Oversampling Technique) data.
UR - http://www.scopus.com/inward/record.url?scp=85016978630&partnerID=8YFLogxK
U2 - 10.1109/ICACSIS.2016.7872805
DO - 10.1109/ICACSIS.2016.7872805
M3 - Conference contribution
AN - SCOPUS:85016978630
T3 - 2016 International Conference on Advanced Computer Science and Information Systems, ICACSIS 2016
SP - 572
EP - 579
BT - 2016 International Conference on Advanced Computer Science and Information Systems, ICACSIS 2016
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 8th International Conference on Advanced Computer Science and Information Systems, ICACSIS 2016
Y2 - 15 October 2016 through 16 October 2016
ER -