Term Frequency-Inverse Document Frequency Answer Categorization with Support Vector Machine on Automatic Short Essay Grading System with Latent Semantic Analysis for Japanese Language

Anak Agung Putri Ratna, Aaliyah Kaltsum, Lea Santiar, Hanifah Khairunissa, Ihsan Ibrahim, Prima Dewi Purnamasari

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

In this paper, conducted a research to increase accuracy of Japanese language automatic short essay grading system. Japanese short answers are processed with a supervised machine learning algorithm; Support Vector Machine (SVM) before entering the system that used Latent Semantic Analysis (LSA). The SVM is used to classify short answers topics that minimize error in assessing the essay. TF-IDF process is done as an input to the SVM to weigh every keyword in a sentence. Then, the result will be processed with LSA. LSA uses Singular Value Decomposition (SVD) as the main process and Frobenius Norm as the final calculation from the result of SVD. Using linear kernel in SVM, the accuracy obtained in classifying short answers topics from Japanese-written short answers is 96.36% with 10.0 to 100.0 penalty values and 0.5 training portion. The accuracy score obtained from LSA is as much as 87.15% average with the input of TDM that shows frequency of a word's occurrence.

Original languageEnglish
Title of host publicationICECOS 2019 - 3rd International Conference on Electrical Engineering and Computer Science, Proceeding
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages293-298
Number of pages6
ISBN (Electronic)9781728147147
DOIs
Publication statusPublished - Oct 2019
Event3rd International Conference on Electrical Engineering and Computer Science, ICECOS 2019 - Batam, Indonesia
Duration: 2 Oct 20193 Oct 2019

Publication series

NameICECOS 2019 - 3rd International Conference on Electrical Engineering and Computer Science, Proceeding

Conference

Conference3rd International Conference on Electrical Engineering and Computer Science, ICECOS 2019
CountryIndonesia
CityBatam
Period2/10/193/10/19

Keywords

  • e-learning
  • essay grading
  • Japanese language
  • latent semantic analysis
  • support vector machine
  • term frequency-inverse document frequency

Fingerprint Dive into the research topics of 'Term Frequency-Inverse Document Frequency Answer Categorization with Support Vector Machine on Automatic Short Essay Grading System with Latent Semantic Analysis for Japanese Language'. Together they form a unique fingerprint.

Cite this