TY - GEN
T1 - Comparing Classical Distance Measures and Word Embeddings for Automatic Short Answer Grading
AU - Ripmiatin, Endang
AU - Purnamasari, Prima Dewi
AU - Ratna, Anak Agung Putri
N1 - Publisher Copyright:
© 2023 Copyright held by the owner/author(s)
PY - 2023/12/14
Y1 - 2023/12/14
N2 - In the educational process, students' answers to essay questions are one of the cognitive methods to measure students' understanding of a topic being studied. But checking essay answers is certainly much more difficult than multiple-choice answers. Apart from absorbing much energy and time, it may also be biased depending on the human rater's subjectivity. To overcome this, researchers have already started to develop Automatic Short Answer Grading (ASAG) by exploring the field of natural language processing (NLP). However ASAG research specifically for Indonesian is still limited. This research aims to find the right method to improve the accuracy of the ASAG system in Indonesia focusing on the computer science domain, with a deep learning approach. We started our research by combining the feature extraction method Bag of Words and Term Frequency-Inverse Document Frequency (TF-IDF) with linear regression, Support Vector Regression, and Random Forest Regression. And then employing BERT and FastText. Both experiments produced similar performance, with an F1-score on average of 0, 72, which is categorized as low similarity. This opens opportunities for further research in word embeddings, especially the transformer method that becomes state-of-the-art.
AB - In the educational process, students' answers to essay questions are one of the cognitive methods to measure students' understanding of a topic being studied. But checking essay answers is certainly much more difficult than multiple-choice answers. Apart from absorbing much energy and time, it may also be biased depending on the human rater's subjectivity. To overcome this, researchers have already started to develop Automatic Short Answer Grading (ASAG) by exploring the field of natural language processing (NLP). However ASAG research specifically for Indonesian is still limited. This research aims to find the right method to improve the accuracy of the ASAG system in Indonesia focusing on the computer science domain, with a deep learning approach. We started our research by combining the feature extraction method Bag of Words and Term Frequency-Inverse Document Frequency (TF-IDF) with linear regression, Support Vector Regression, and Random Forest Regression. And then employing BERT and FastText. Both experiments produced similar performance, with an F1-score on average of 0, 72, which is categorized as low similarity. This opens opportunities for further research in word embeddings, especially the transformer method that becomes state-of-the-art.
KW - ASAG
KW - BERT
KW - FastText
KW - Regression
KW - TF-IDF
UR - http://www.scopus.com/inward/record.url?scp=85192216628&partnerID=8YFLogxK
U2 - 10.1145/3638884.3638962
DO - 10.1145/3638884.3638962
M3 - Conference contribution
AN - SCOPUS:85192216628
SN - 979-8-4007-0890-9
T3 - ACM International Conference Proceeding Series
SP - 492
EP - 497
BT - ICCIP 2023 - 2023 the 9th International Conference on Communication and Information Processing
PB - Association for Computing Machinery
T2 - 9th International Conference on Communication and Information Processing, ICCIP 2023
Y2 - 14 December 2023 through 16 December 2023
ER -