TY - GEN
T1 - Term Frequency-Inverse Document Frequency Answer Categorization with Support Vector Machine on Automatic Short Essay Grading System with Latent Semantic Analysis for Japanese Language
AU - Putri Ratna, Anak Agung
AU - Kaltsum, Aaliyah
AU - Santiar, Lea
AU - Khairunissa, Hanifah
AU - Ibrahim, Ihsan
AU - Purnamasari, Prima Dewi
N1 - Publisher Copyright:
© 2019 IEEE.
PY - 2019/10
Y1 - 2019/10
N2 - In this paper, conducted a research to increase accuracy of Japanese language automatic short essay grading system. Japanese short answers are processed with a supervised machine learning algorithm; Support Vector Machine (SVM) before entering the system that used Latent Semantic Analysis (LSA). The SVM is used to classify short answers topics that minimize error in assessing the essay. TF-IDF process is done as an input to the SVM to weigh every keyword in a sentence. Then, the result will be processed with LSA. LSA uses Singular Value Decomposition (SVD) as the main process and Frobenius Norm as the final calculation from the result of SVD. Using linear kernel in SVM, the accuracy obtained in classifying short answers topics from Japanese-written short answers is 96.36% with 10.0 to 100.0 penalty values and 0.5 training portion. The accuracy score obtained from LSA is as much as 87.15% average with the input of TDM that shows frequency of a word's occurrence.
AB - In this paper, conducted a research to increase accuracy of Japanese language automatic short essay grading system. Japanese short answers are processed with a supervised machine learning algorithm; Support Vector Machine (SVM) before entering the system that used Latent Semantic Analysis (LSA). The SVM is used to classify short answers topics that minimize error in assessing the essay. TF-IDF process is done as an input to the SVM to weigh every keyword in a sentence. Then, the result will be processed with LSA. LSA uses Singular Value Decomposition (SVD) as the main process and Frobenius Norm as the final calculation from the result of SVD. Using linear kernel in SVM, the accuracy obtained in classifying short answers topics from Japanese-written short answers is 96.36% with 10.0 to 100.0 penalty values and 0.5 training portion. The accuracy score obtained from LSA is as much as 87.15% average with the input of TDM that shows frequency of a word's occurrence.
KW - e-learning
KW - essay grading
KW - Japanese language
KW - latent semantic analysis
KW - support vector machine
KW - term frequency-inverse document frequency
UR - http://www.scopus.com/inward/record.url?scp=85086009974&partnerID=8YFLogxK
U2 - 10.1109/ICECOS47637.2019.8984530
DO - 10.1109/ICECOS47637.2019.8984530
M3 - Conference contribution
AN - SCOPUS:85086009974
T3 - ICECOS 2019 - 3rd International Conference on Electrical Engineering and Computer Science, Proceeding
SP - 293
EP - 298
BT - ICECOS 2019 - 3rd International Conference on Electrical Engineering and Computer Science, Proceeding
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 3rd International Conference on Electrical Engineering and Computer Science, ICECOS 2019
Y2 - 2 October 2019 through 3 October 2019
ER -