TY - GEN
T1 - Hate speech identification in text written in Indonesian with recurrent neural network
AU - Sazany, Erryan
AU - Budi, Indra
PY - 2019/10
Y1 - 2019/10
N2 - Some researches had succeeded in doing hate speech identification automatically from text with machine learning and deep learning approaches. However, it was still unclear how adaptive is a deep learning-based model if it is tested on a different set of text data with different domain. To address this issue, this research proposed some deep learning-based methods, using some variants of Recurrent Neural Network to identify hate speech in texts sourced from Twitter, and then used to predict other set of text data sourced from Facebook and Twitter. The experiment was done in order to measure the difference of model performance between training phase and testing phase. Experiment results showed that the proposed method outperformed the machine learning based methods, both in training phase, by GRU algorithm with 85.37% F1-score, and in testing phase, by LSTM algorithm with 76.30% F1-score. Then, in terms of adaptability of model performance, the proposed method gave comparable result against the baseline method.
AB - Some researches had succeeded in doing hate speech identification automatically from text with machine learning and deep learning approaches. However, it was still unclear how adaptive is a deep learning-based model if it is tested on a different set of text data with different domain. To address this issue, this research proposed some deep learning-based methods, using some variants of Recurrent Neural Network to identify hate speech in texts sourced from Twitter, and then used to predict other set of text data sourced from Facebook and Twitter. The experiment was done in order to measure the difference of model performance between training phase and testing phase. Experiment results showed that the proposed method outperformed the machine learning based methods, both in training phase, by GRU algorithm with 85.37% F1-score, and in testing phase, by LSTM algorithm with 76.30% F1-score. Then, in terms of adaptability of model performance, the proposed method gave comparable result against the baseline method.
KW - Adaptability
KW - Deep learning
KW - Hate speech
KW - Recurrent neural network
KW - Text classification
UR - http://www.scopus.com/inward/record.url?scp=85081087711&partnerID=8YFLogxK
U2 - 10.1109/ICACSIS47736.2019.8979959
DO - 10.1109/ICACSIS47736.2019.8979959
M3 - Conference contribution
T3 - 2019 International Conference on Advanced Computer Science and Information Systems, ICACSIS 2019
SP - 211
EP - 216
BT - 2019 International Conference on Advanced Computer Science and Information Systems, ICACSIS 2019
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 11th International Conference on Advanced Computer Science and Information Systems, ICACSIS 2019
Y2 - 12 October 2019 through 13 October 2019
ER -