Cross-language automatic plagiarism detector using latent semantic analysis and self-organizing map

Anak Agung Putri Ratna, Paskalis Nandana Yestha Nabhastala, Ihsan Ibrahim, F. Astha Ekadiyanto, Muhammad Salman, Prima Dewi Purnamasari, Muhammad Yusuf Irfan Herusaktiawan

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Citations (Scopus)

Abstract

Computer assisted detection or automatic detection for plagiarism could help human to check whether an author of a paper do plagiarism or not. Department of Electrical Engineering, Universitas Indonesia had been developing cross-language automatic plagiarism detection which test paper is written on Indonesian and reference paper written on English. More accurate automatic detection system is needed to prevent plagiarism act, especially on academic paper. The system is based on Latent Semantic Analysis (LSA) algorithm with addition of Self-Organizing Map (SOM) to do classification of the output from LSA. Some features for SOM are extracted from singular value matrix from LSA, they are Frobenius Norm and Cosine Similarity. Together with percentage of technical term, all of the features are used as the input for SOM to classify into 10, 5, 3, and 2 classes. The use of 5 classes in LSA could give equal accuracy for all classes, with the highest accuracy reach 83.09%. While in LSA-SOM, the best accuracy is 83.53% for training data and 80.47% for testing data, in 2-classes configuration with 3 features, they were percentage of technical term, frobenius norm, and pad.

Original languageEnglish
Title of host publicationAIVR 2018 - 2018 International Conference on Artificial Intelligence and Virtual Reality
PublisherAssociation for Computing Machinery
Pages83-87
Number of pages5
ISBN (Electronic)9781450366410
DOIs
Publication statusPublished - 23 Nov 2018
Event2018 International Conference on Artificial Intelligence and Virtual Reality, AIVR 2018 - Nagoya, Japan
Duration: 23 Nov 201825 Nov 2018

Publication series

NameACM International Conference Proceeding Series

Conference

Conference2018 International Conference on Artificial Intelligence and Virtual Reality, AIVR 2018
Country/TerritoryJapan
CityNagoya
Period23/11/1825/11/18

Keywords

  • Automatic plagiarism detection
  • Cross language
  • Latent semantic analysis
  • Self-organizing map
  • Singular value decomposition

Fingerprint

Dive into the research topics of 'Cross-language automatic plagiarism detector using latent semantic analysis and self-organizing map'. Together they form a unique fingerprint.

Cite this