Automatic external plagiarism detection using passage similarities

Clara Vania, Mirna Adriani

Research output: Contribution to journalConference articlepeer-review

1 Citation (Scopus)


In this paper, we report our approach in detecting external plagiarism. For the pre-processing stage, we identify non-English documents and translate them into English using an online translator tool. Then we index and retrieve the top documents that are similar to the suspicious documents. We divide the retrieved documents into passages where each passage contains twenty sentences. The plagiarism is detected by identifying the number of overlapped words between suspicious and source passages.

Original languageEnglish
JournalCEUR Workshop Proceedings
Publication statusPublished - 2010
Event2010 Cross Language Evaluation Forum Conference, CLEF 2010 - Padua, Italy
Duration: 22 Sept 201023 Sept 2010


  • Overlapping n-grams
  • Passage retrieval
  • Plagiarism detection

Cite this