Finding anchor words of separable-nonnegative matrix factorization based on singular value decomposition

Ika Dwi Novitasari, Hendri Murfi, Arie Wibowo

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Topic detection is a process to find topics or subjects of discussion in a collection of documents such as tweets on Twitter. Manual detection of topics on Twitter is difficult because of too many tweets. Therefore, it is necessary to detect topics automatically. One of the automatic methods for topic detection is the Separable-Nonnegative Matrix Factorization (SNMF) method with the AGM algorithm. SNMF is a matrix factorization-based model that can be solved directly using the assumption that each topic has one word, called anchor words, that is not present in other topics. SNMF with AGM algorithm consists of three stages, namely the constructing the co-occurrence matrix, finding the anchor words, and recovering the topics. The common method to find the anchor words is the convex hull-based method. In this paper, we examine the process of finding anchor words based on Singular Value Decomposition (SVD). The results show that by considering all words as anchor word candidates, the SVD-based method gives better results than the convex hull-based method. Meanwhile, when the anchor finding was done by using anchor threshold, the convex hull-based method still gives a better result than the SVD-based method.

Original languageEnglish
Title of host publicationProceedings - 2017 1st International Conference on Informatics and Computational Sciences, ICICoS 2017
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages225-229
Number of pages5
ISBN (Electronic)9781538609033
DOIs
Publication statusPublished - 30 Jan 2018
Event1st International Conference on Informatics and Computational Sciences, ICICoS 2017 - Semarang, Indonesia
Duration: 15 Nov 201716 Nov 2017

Publication series

NameProceedings - 2017 1st International Conference on Informatics and Computational Sciences, ICICoS 2017
Volume2018-January

Conference

Conference1st International Conference on Informatics and Computational Sciences, ICICoS 2017
Country/TerritoryIndonesia
CitySemarang
Period15/11/1716/11/17

Keywords

  • Finding Anchor Words
  • Separable Nonnegative Matrix Factorization
  • Singular Value Decomposition
  • Topic Detection
  • Twitter

Fingerprint

Dive into the research topics of 'Finding anchor words of separable-nonnegative matrix factorization based on singular value decomposition'. Together they form a unique fingerprint.

Cite this