TY - JOUR
T1 - Textual Analysis for Public Sentiment Toward National Police Using CRISP-DM Framework
AU - Z.S. Sudar, Latifa
AU - Imbenay, Joash L.
AU - Budi, Indra
AU - Ramadiah, Amanah
AU - Putra, Prabu K.
AU - Santoso, Aris B.
N1 - Publisher Copyright: © 2024 International Information and Engineering Technology Association. All rights reserved.
PY - 2024/2
Y1 - 2024/2
AB - Public opinion of the National Police (POLRI) is deteriorating. With the explosive growth of social media in Indonesia, opinions on current POLRI-related issues easily go viral on Twitter and influence individuals' sentiments toward Indonesian law enforcement. Negative sentiment may eventually lead to the undervaluation of law enforcement and the failure of the legal system. Sentiment analysis on Twitter is therefore essential for gaining insight into public views and attitudes on POLRI-related topics. This research aims to determine the more effective approach between the Lexicon method, a natural language processing method that relies on a corpus, and machine learning, comprising Naive Bayes, Support Vector Machine (SVM), Random Forest, and Logistic Regression (LR). These approaches differ in classification type: probability and linearity. To organize the research process, the Cross-Industry Standard Process for Data Mining (CRISP-DM) framework, which comprises five data mining activities, was employed. Model performance was measured using the confusion matrix, with Naive Bayes emerging as the best of the tested models. Additionally, topic modeling was used to identify the subjects related to POLRI, generating three topics: street police or police station, police acknowledgment in neighborhood activities, and the activity of contacting the police.
KW - CRISP-DM
KW - Lexicon
KW - machine learning
KW - POLRI
KW - sentiment analysis
KW - topic modeling
UR - http://www.scopus.com/inward/record.url?scp=85187353628&partnerID=8YFLogxK
U2 - 10.18280/ria.380107
DO - 10.18280/ria.380107
M3 - Article
AN - SCOPUS:85187353628
SN - 0992-499X
VL - 38
SP - 63
EP - 72
JO - Revue d'Intelligence Artificielle
JF - Revue d'Intelligence Artificielle
IS - 1
ER -