TY - JOUR
T1 - A critical review on adverse effects of concept drift over machine learning classification models
AU - Jameel, Syed Muslim
AU - Hashmani, Manzoor Ahmed
AU - Alhussain, Hitham
AU - Rehman, Mobashar
AU - Budiman, Arif
N1 - Funding Information:
This research study is conducted in Universiti Teknologi PETRONAS (UTP), Malaysia as a part of research project "Correlation between Concept Drift Parameters and Performance of Deep Learning Models: Towards Fully Adaptive Deep Learning Models" under Fundamental Research Grant Scheme (FRGS) Ministry of Education (MoE) Malaysia (Grant Reference: FRGS/1/2018/ICT02/UTP/02/2).
Publisher Copyright:
© 2013 The Science and Information (SAI) Organization.
PY - 2020
Y1 - 2020
N2 - Big Data (BD) is participating in the current computing revolution in a big way. Industries and organizations are utilizing their insights for Business Intelligence using Machine Learning Models (ML-Models). Deep Learning Models (DL-Models) have been proven to be a better selection than Shallow Learning Models (SL-Models). However, the dynamic characteristics of BD introduce many critical issues for DLModels, Concept Drift (CD) is one of them. CD issue frequently appears in Online Supervised Learning environments in which data trends change over time. The problem may even worsen in the BD environment due to veracity and variability factors. Due to the CD issue, the accuracy of classification results degrades in ML-Models, which may make ML-Models not applicable. Therefore, ML-Models need to adapt quickly to changes to maintain the accuracy level of the results. In current solutions, a substantial improvement in accuracy and adaptability is needed to make ML-Models robust in a non-stationary environment. In the existing literature, the consolidated information on this issue is not available. Therefore, in this study, we have carried out a systematic critical literature review to discuss the Concept Drift taxonomy and identify the adverse effects and existing approaches to mitigate CD.
AB - Big Data (BD) is participating in the current computing revolution in a big way. Industries and organizations are utilizing their insights for Business Intelligence using Machine Learning Models (ML-Models). Deep Learning Models (DL-Models) have been proven to be a better selection than Shallow Learning Models (SL-Models). However, the dynamic characteristics of BD introduce many critical issues for DLModels, Concept Drift (CD) is one of them. CD issue frequently appears in Online Supervised Learning environments in which data trends change over time. The problem may even worsen in the BD environment due to veracity and variability factors. Due to the CD issue, the accuracy of classification results degrades in ML-Models, which may make ML-Models not applicable. Therefore, ML-Models need to adapt quickly to changes to maintain the accuracy level of the results. In current solutions, a substantial improvement in accuracy and adaptability is needed to make ML-Models robust in a non-stationary environment. In the existing literature, the consolidated information on this issue is not available. Therefore, in this study, we have carried out a systematic critical literature review to discuss the Concept Drift taxonomy and identify the adverse effects and existing approaches to mitigate CD.
KW - Adaptive convolutional neural network extreme learning machine (ACNNELM)
KW - Big data classification
KW - Concept drift
KW - Concept drift (CD)
KW - Deep learning (DL)
KW - Hybrid drift (HD)
KW - Machine learning
KW - Meta-cognitive online sequential extreme learning machine (MOSELM)
KW - Online sequential extreme learning machine (OSELM)
KW - Online supervised learning
KW - Real drift (RD)
KW - Shallow learning (SL)
KW - Virtual drift (VD)
UR - http://www.scopus.com/inward/record.url?scp=85080111380&partnerID=8YFLogxK
U2 - 10.14569/ijacsa.2020.0110127
DO - 10.14569/ijacsa.2020.0110127
M3 - Article
AN - SCOPUS:85080111380
SN - 2158-107X
VL - 11
SP - 206
EP - 211
JO - International Journal of Advanced Computer Science and Applications
JF - International Journal of Advanced Computer Science and Applications
IS - 1
ER -