TY - JOUR
T1 - ABSA of Indonesian customer reviews using IndoBERT
T2 - single-sentence and sentence-pair classification approaches
AU - Yulianti, Evi
AU - Nissa, Nuzulul Khairu
N1 - Publisher Copyright:
© 2024, Institute of Advanced Engineering and Science. All rights reserved.
PY - 2024/10
Y1 - 2024/10
N2 - Aspect-based sentiment analysis (ABSA) task is important to identify user satisfaction from customer reviews by recognizing the sentiments of all aspects discussed in the reviews. This work investigates a novel study on the effectiveness and efficiency of three IndoBERT-based models for solving the ABSA task in Indonesian language. IndoBERT is a state-of-the-art transformer-based model, i.e., bidirectional encoder representations from transformers (BERT), that was pre-trained on Indonesian language. Our first model utilizes IndoBERT in a feature-based mode, paired with the convolutional neural network (CNN) and machine learning models, for single-sentence classification. Next, our second model is obtained by fine-tuning the IndoBERT model for a typical single-sentence classification to build an end-to-end model. At last, our third model also adopts a fine-tuning approach to use IndoBERT, but for sentence-pair classification by utilizing auxiliary sentences. Our results demonstrate that the third model, the fine-tuned IndoBERT for sentence-pair classification, gains the highest effectiveness. It demonstrates significant improvement over deep learning baselines (Word2Vec-CNN-XGBoost) by 23.6% and transformer-based baselines (mBERT-aux-NLIB) by 2.2% in terms of F-1 score. When considering both effectiveness and efficiency, the results show that the best-performing model is our second model, the fine-tuned IndoBERT for single-sentence classification.
AB - Aspect-based sentiment analysis (ABSA) task is important to identify user satisfaction from customer reviews by recognizing the sentiments of all aspects discussed in the reviews. This work investigates a novel study on the effectiveness and efficiency of three IndoBERT-based models for solving the ABSA task in Indonesian language. IndoBERT is a state-of-the-art transformer-based model, i.e., bidirectional encoder representations from transformers (BERT), that was pre-trained on Indonesian language. Our first model utilizes IndoBERT in a feature-based mode, paired with the convolutional neural network (CNN) and machine learning models, for single-sentence classification. Next, our second model is obtained by fine-tuning the IndoBERT model for a typical single-sentence classification to build an end-to-end model. At last, our third model also adopts a fine-tuning approach to use IndoBERT, but for sentence-pair classification by utilizing auxiliary sentences. Our results demonstrate that the third model, the fine-tuned IndoBERT for sentence-pair classification, gains the highest effectiveness. It demonstrates significant improvement over deep learning baselines (Word2Vec-CNN-XGBoost) by 23.6% and transformer-based baselines (mBERT-aux-NLIB) by 2.2% in terms of F-1 score. When considering both effectiveness and efficiency, the results show that the best-performing model is our second model, the fine-tuned IndoBERT for single-sentence classification.
KW - Aspect-based sentiment analysis
KW - Customer review
KW - Indobert
KW - Sentence-pair classification
KW - Single-sentence classification
KW - Transformer
UR - http://www.scopus.com/inward/record.url?scp=85200737833&partnerID=8YFLogxK
U2 - 10.11591/eei.v13i5.8032
DO - 10.11591/eei.v13i5.8032
M3 - Article
AN - SCOPUS:85200737833
SN - 2089-3191
VL - 13
SP - 3579
EP - 3589
JO - Bulletin of Electrical Engineering and Informatics
JF - Bulletin of Electrical Engineering and Informatics
IS - 5
ER -