The Use of Text Mining for Classification of Product Selling Content in Social Media Female Daily

Bern Jonathan, Indra Budi

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Female Daily Network is a company engaged in social media. Female Daily has social media to share experiences using beauty products called Female Daily. Female Daily has regulations not to use the Female Daily Platform to promote, sell products and services on social media platforms in Female Daily. However, users on Female Daily sometimes violate these rules in their posts and cause other users to be annoyed about it. Admins at Female Daily have difficulty identifying users who violate these rules and ban their posts containing product sales due to the limited number of admins with the number of posts that enter each day. Text mining can also overcome this problem by determining the classification automatically by creating a system that carries out the learning process from the available post words. Algorithms that can be used to carry out the text mining process in this research are Support Vector Machine (SVM), Naïve Bayes (NB), Decision Tree (DT), and Random Forest (RF). This study uses a combination of feature extraction, contextual features, and data balancing. This study uses research scenarios to analyze feature extraction, contextual feature usage, and data balancing. The best algorithm seen from the recall value in the combination of algorithms and features of this research is the Random Forest TF-IDF Unigram and uses additional contextual features to detect money and selling words with balanced data. The recall value of 88.37% is obtained from the results of the combination of these algorithms and features.

Original languageEnglish
Title of host publication2021 International Conference on Advanced Computer Science and Information Systems, ICACSIS 2021
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781665442640
DOIs
Publication statusPublished - 2021
Event13th International Conference on Advanced Computer Science and Information Systems, ICACSIS 2021 - Depok, Indonesia
Duration: 23 Oct 202126 Oct 2021

Publication series

Name2021 International Conference on Advanced Computer Science and Information Systems, ICACSIS 2021

Conference

Conference13th International Conference on Advanced Computer Science and Information Systems, ICACSIS 2021
Country/TerritoryIndonesia
CityDepok
Period23/10/2126/10/21

Keywords

  • contextual features
  • data balancing
  • recall
  • regex
  • social media
  • text mining

Fingerprint

Dive into the research topics of 'The Use of Text Mining for Classification of Product Selling Content in Social Media Female Daily'. Together they form a unique fingerprint.

Cite this