TY - GEN
T1 - Classification of Inundation Level using Tweets in Indonesian Language
AU - Felicia Ilona, Kwee
AU - Budi, Indra
N1 - Publisher Copyright:
© 2021 ACM.
PY - 2021/2/23
Y1 - 2021/2/23
N2 - Extreme flood events are expected to occur more frequently as climate change has yet to show signs of improvement. This has the potential to lead to higher rainfall and floods that would come more quickly. Early warning systems may sometimes fail to provide quick information when conditions in the field may not match to what is known in the information center, such as a malfunctioning water pump or a water level that has increased relatively quickly. Therefore, this study aims to provide an alternative source of information that may provide inundation level during flood condition based on tweets from Twitter. The proposed model is expected to provide output in the form of inundation level categories, namely "high", "medium", "low", and "unknown". 10-fold stratified cross validation with seven variations of classifiers were used to evaluate the model. The best relevance classification resulted in 90.6% accuracy (SVM Linear SVC), 89.05% average precision (SVM RBF), and 82.03% average F1-score (SVM Linear SVC) and average recall of 84.10% (Logistic Regression). The best classification results of inundation level resulted in accuracy (82.74%), average precision (85.44%) average recall (68.07%) and average F1-score (71.43%). All of them were obtained by using the SVM Linear SVC.
AB - Extreme flood events are expected to occur more frequently as climate change has yet to show signs of improvement. This has the potential to lead to higher rainfall and floods that would come more quickly. Early warning systems may sometimes fail to provide quick information when conditions in the field may not match to what is known in the information center, such as a malfunctioning water pump or a water level that has increased relatively quickly. Therefore, this study aims to provide an alternative source of information that may provide inundation level during flood condition based on tweets from Twitter. The proposed model is expected to provide output in the form of inundation level categories, namely "high", "medium", "low", and "unknown". 10-fold stratified cross validation with seven variations of classifiers were used to evaluate the model. The best relevance classification resulted in 90.6% accuracy (SVM Linear SVC), 89.05% average precision (SVM RBF), and 82.03% average F1-score (SVM Linear SVC) and average recall of 84.10% (Logistic Regression). The best classification results of inundation level resulted in accuracy (82.74%), average precision (85.44%) average recall (68.07%) and average F1-score (71.43%). All of them were obtained by using the SVM Linear SVC.
KW - Flood
KW - Inundation Level
KW - Text Mining
KW - Twitter
UR - http://www.scopus.com/inward/record.url?scp=85112261884&partnerID=8YFLogxK
U2 - 10.1145/3457784.3457806
DO - 10.1145/3457784.3457806
M3 - Conference contribution
AN - SCOPUS:85112261884
T3 - ACM International Conference Proceeding Series
SP - 137
EP - 143
BT - 2021 10th International Conference on Software and Computer Applications, ICSCA 2021
PB - Association for Computing Machinery
T2 - 10th International Conference on Software and Computer Applications, ICSCA 2021
Y2 - 23 February 2021 through 26 February 2021
ER -