Fake news identification characteristics using named entity recognition and phrase detection

Herley Shaori Al-Ash, Wahyu Catur Wibowo

Research output: Chapter in Book/Report/Conference proceeding, Conference contribution, peer-review

24 Citations (Scopus)

Abstract

The explosion of information that anyone can generate may lead to the spread of fake news, not only through news channels but also through social media and other outlets. Detecting fake news has become an urgent need because its spread causes unrest in society. Several related studies have addressed news classification, with the aim of deciding whether a news item is fake or original. In that related work, a vector representation of each document is built and then passed to an algorithm for further processing. This study aims to model vectors that capture the characteristics of fake news in the Indonesian language before they are processed further by language algorithms. Fake news and original news are represented according to the vector space model. Vector models built from combinations of term frequency, inverse document frequency, and reversed-frequency weighting are evaluated with 10-fold cross-validation using a support vector machine classifier. Variants with phrase detection and named entity recognition are also used in the vector representation. A vector representation that uses term frequency shows promising performance: it correctly identifies 96.74% of 2516 documents across the phrase detection and named entity recognition variants.
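As a rough illustration of the vector space representation described in the abstract, the following minimal sketch builds term-frequency and TF-IDF vectors after merging known two-word phrases into single tokens. It is not the authors' implementation. The whitespace tokenizer, the hand-written phrase set, the smoothed IDF formula, the toy Indonesian sentences, and the names `tokenize` and `build_vectors` are all assumptions made for this example, and the support vector machine classifier and 10-fold cross-validation are left out.

```python
import math
from collections import Counter

def tokenize(text, phrases=()):
    """Lowercase, split on whitespace, and merge known two-word phrases into one token."""
    words = text.lower().split()
    tokens, i = [], 0
    while i < len(words):
        if i + 1 < len(words) and (words[i], words[i + 1]) in phrases:
            tokens.append(words[i] + "_" + words[i + 1])
            i += 2
        else:
            tokens.append(words[i])
            i += 1
    return tokens

def build_vectors(docs, phrases=()):
    """Return (vocabulary, tf_vectors, tfidf_vectors) for a list of raw documents."""
    tokenized = [tokenize(d, phrases) for d in docs]
    vocab = sorted({t for toks in tokenized for t in toks})
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(t for toks in tokenized for t in set(toks))
    # Smoothed IDF, one common variant (assumed; the abstract gives no formula).
    idf = {t: math.log((1 + n_docs) / (1 + df[t])) + 1 for t in vocab}
    tf_vectors, tfidf_vectors = [], []
    for toks in tokenized:
        counts = Counter(toks)
        tf = [counts[t] for t in vocab]
        tf_vectors.append(tf)
        tfidf_vectors.append([c * idf[t] for c, t in zip(tf, vocab)])
    return vocab, tf_vectors, tfidf_vectors

# Toy documents and a hypothetical phrase list standing in for phrase detection output.
docs = [
    "bank indonesia menaikkan suku bunga",
    "suku bunga naik kata bank indonesia",
]
phrases = {("bank", "indonesia"), ("suku", "bunga")}
vocab, tf, tfidf = build_vectors(docs, phrases)
```

In a fuller pipeline, vectors like `tf` or `tfidf` would be passed to a classifier together with fake/original labels; a real phrase detector or named entity recognizer would replace the hand-written `phrases` set.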

Original language: English
Title of host publication: Proceedings of 2018 10th International Conference on Information Technology and Electrical Engineering
Subtitle of host publication: Smart Technology for Better Society, ICITEE 2018
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 12-17
Number of pages: 6
ISBN (Electronic): 9781538647394
Publication status: Published - 13 Nov 2018
Event: 10th International Conference on Information Technology and Electrical Engineering, ICITEE 2018 - Bali, Indonesia
Duration: 24 Jul 2018 to 26 Jul 2018

Publication series

Name: Proceedings of 2018 10th International Conference on Information Technology and Electrical Engineering: Smart Technology for Better Society, ICITEE 2018

Conference

Conference: 10th International Conference on Information Technology and Electrical Engineering, ICITEE 2018
Country/Territory: Indonesia
City: Bali
Period: 24/07/18 to 26/07/18

Keywords

  • Document vector representation
  • Named entity recognition
  • News identification
  • Phrase detection
