TY - JOUR
T1 - Latent Semantic Analysis Based Cross Language Plagiarism Detection System with Support Vector Machine Classifier
AU - Ratna, Anak Agung Putri
AU - Rismawati, Dian
AU - Ibrahim, Ihsan
AU - Ekadiyanto, F. Astha
AU - Salman, Muhammad
AU - Purnamasari, Prima Dewi
PY - 2017/5/31
Y1 - 2017/5/31
N2 - Department of Electrical Engineering, Universitas Indonesia has developed a cross language plagiarism detection system based on Latent Semantic Analysis (LSA) between Indonesian and English papers. The system will generate Frobenius norm, slice, and pad as the output data. This paper explains and provides analysis on the development of plagiarism detection system, namely by applying the Support Vector Machine (SVM) algorithm. SVM divides the output data into two classes, namely "plagiarism" and "not plagiarism" by using two methods, a combination of input data and output data and the AND method. Several modifications to the program input has been made, including varying the parameters of learning and the output data of the program. Using balance of precision and relevance of the program, the accuracy of the SVM is 63,15%. However, when viewed through the percentage of the amount of data that appropriately classified, the accuracy of the SVM is 97.04%.
AB - Department of Electrical Engineering, Universitas Indonesia has developed a cross language plagiarism detection system based on Latent Semantic Analysis (LSA) between Indonesian and English papers. The system will generate Frobenius norm, slice, and pad as the output data. This paper explains and provides analysis on the development of plagiarism detection system, namely by applying the Support Vector Machine (SVM) algorithm. SVM divides the output data into two classes, namely "plagiarism" and "not plagiarism" by using two methods, a combination of input data and output data and the AND method. Several modifications to the program input has been made, including varying the parameters of learning and the output data of the program. Using balance of precision and relevance of the program, the accuracy of the SVM is 63,15%. However, when viewed through the percentage of the amount of data that appropriately classified, the accuracy of the SVM is 97.04%.
U2 - 10.14257/ijseia.2017.11.5.02
DO - 10.14257/ijseia.2017.11.5.02
M3 - Article
SN - 1738-9984
VL - 11
SP - 13
EP - 26
JO - International Journal of Software Engineering and its Applications
JF - International Journal of Software Engineering and its Applications
ER -