TY - JOUR
T1 - Hate speech and abusive language detection in Indonesian social media
T2 - Progress and challenges
AU - Ibrohim, Muhammad Okky
AU - Budi, Indra
N1 - Funding Information:
This research was funded by PUTI Q2 Research Grant from Universitas Indonesia with grant number NKB-1475/UN2.RST/HKP.05.00/2020 .
Publisher Copyright:
© 2023
PY - 2023/8
Y1 - 2023/8
N2 - Nowadays Hate Speech and Abusive Language (HSAL) have spread extensively over social media. The easy use of social media allows people to abuse the media to spread HSAL. Hate speech and abusive language in social media must be detected because they can trigger conflict among citizens. Not only in social media, but HSAL also often trigger conflict in real life. In recent years, many scholars have researched HSAL detection in various languages and media. However, there are still many tasks on HSAL detection that need to be done to develop a better HSAL detection system. This paper discusses a summary of Indonesian HSAL detection research, conducted by utilizing the Kitchenham systematic literature review method. Based on our summary, we found that most Indonesian HSAL research still uses the classic machine-learning approach with classic text representation features that experimented on the Twitter text dataset. We also found several challenges and tasks that need to be addressed to build a better HSAL detection system in Indonesian social media that can detect the hate speech target, category, and levels; and the hate speech buzzer, thread starter, and fake account spreader.
AB - Nowadays Hate Speech and Abusive Language (HSAL) have spread extensively over social media. The easy use of social media allows people to abuse the media to spread HSAL. Hate speech and abusive language in social media must be detected because they can trigger conflict among citizens. Not only in social media, but HSAL also often trigger conflict in real life. In recent years, many scholars have researched HSAL detection in various languages and media. However, there are still many tasks on HSAL detection that need to be done to develop a better HSAL detection system. This paper discusses a summary of Indonesian HSAL detection research, conducted by utilizing the Kitchenham systematic literature review method. Based on our summary, we found that most Indonesian HSAL research still uses the classic machine-learning approach with classic text representation features that experimented on the Twitter text dataset. We also found several challenges and tasks that need to be addressed to build a better HSAL detection system in Indonesian social media that can detect the hate speech target, category, and levels; and the hate speech buzzer, thread starter, and fake account spreader.
KW - Abusive language
KW - Hate speech
KW - Indonesian social media
UR - http://www.scopus.com/inward/record.url?scp=85167823669&partnerID=8YFLogxK
U2 - 10.1016/j.heliyon.2023.e18647
DO - 10.1016/j.heliyon.2023.e18647
M3 - Review article
AN - SCOPUS:85167823669
SN - 2405-8440
VL - 9
JO - Heliyon
JF - Heliyon
IS - 8
M1 - e18647
ER -