Mining Indonesia Tourism's Reviews to Evaluate the Services through Multilabel Classification and LDA

Irma Latifatul Laily, Indra Budi, Aris Budi Santoso, Prabu Kresna Putra

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; peer-review

Abstract

The tourism sector is one of the most significant economic contributors in Lamongan, which has two leading tourism destinations, WBL and Mazoola. The tourist experience at a destination can be evaluated using the reviews that visitors provide at the end of their trip. Tourists review various aspects of tourism, such as price, services, and location, and classifying more than one aspect from a single review is a challenging task. Five labels are used: Price, Location, Safety, Services and Facilities, and Environment and Ambiance. This study was conducted to determine the aspects that should be evaluated from the reviews that visitors provide. It uses five algorithms commonly used for multi-label classification: NBSVM, Binary Relevance-Naive Bayes, Binary Relevance-Logistic Regression (BR-LR), Classifier Chains-Naive Bayes, and Multilabel kNN. NBSVM was a robust performer. For the WBL data, BR-LR has the highest accuracy in scenario 1 (92%), whereas NBSVM has the highest accuracy in scenario 2 (91.32%); on the other evaluation measures, however, NBSVM is still superior. Likewise, for the Mazoola data, NBSVM has the highest accuracy in both scenarios: 87.38% and 87.28%. This study also extracts three trending topics for each data set, WBL and Mazoola, to find out which topics are discussed most frequently in each destination's reviews. Topic extraction uses LDA with the Gensim library.
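To make the multi-label setup concrete, the following is a minimal sketch of how a set of aspect labels per review can be split into one binary target per label (the idea behind Binary Relevance) and how two common notions of multi-label accuracy differ. The five label names come from the abstract; the three example annotations, the predictions, and the helper names (`to_indicator`, `binary_relevance_targets`, `hamming_accuracy`, `exact_match_accuracy`) are hypothetical. The abstract does not say which accuracy definition the study reports, so both are shown as assumptions.

```python
# Illustrative sketch only: the annotations and predictions below are invented,
# and the metric definitions are assumptions, not taken from the paper.

LABELS = [
    "Price",
    "Location",
    "Safety",
    "Services and Facilities",
    "Environment and Ambiance",
]


def to_indicator(label_set):
    """Encode a set of aspect labels as a 0/1 vector ordered like LABELS."""
    return [1 if label in label_set else 0 for label in LABELS]


def binary_relevance_targets(label_sets):
    """Split a multi-label target into one binary target column per label."""
    rows = [to_indicator(s) for s in label_sets]
    return {label: [row[i] for row in rows] for i, label in enumerate(LABELS)}


def hamming_accuracy(y_true, y_pred):
    """Fraction of individual label decisions that are correct."""
    total = sum(len(row) for row in y_true)
    correct = sum(
        t == p for rt, rp in zip(y_true, y_pred) for t, p in zip(rt, rp)
    )
    return correct / total


def exact_match_accuracy(y_true, y_pred):
    """Fraction of reviews whose whole label vector is predicted correctly."""
    return sum(rt == rp for rt, rp in zip(y_true, y_pred)) / len(y_true)


# Hypothetical gold annotations and predictions for three reviews.
gold = [
    {"Price", "Location"},
    {"Safety"},
    {"Services and Facilities", "Environment and Ambiance"},
]
pred = [
    {"Price"},
    {"Safety"},
    {"Services and Facilities", "Environment and Ambiance"},
]

y_true = [to_indicator(s) for s in gold]
y_pred = [to_indicator(s) for s in pred]

# One binary problem per label; a binary classifier would be trained on each.
targets = binary_relevance_targets(gold)
print(targets["Price"])
print(hamming_accuracy(y_true, y_pred))
print(exact_match_accuracy(y_true, y_pred))
```

In this toy case one label decision out of fifteen is wrong, so the per-label score is much higher than the exact-match score, where one of the three reviews counts as entirely wrong. This is why the accuracy definition matters when comparing multi-label classifiers.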

Original language: English
Title of host publication: 2020 International Conference on Electrical Engineering and Informatics, ICELTICs 2020
Publisher: Institute of Electrical and Electronics Engineers Inc.
ISBN (Electronic): 9781728181998
Publication status: Published - 27 Oct 2020
Event: 2020 International Conference on Electrical Engineering and Informatics, ICELTICs 2020 - Banda Aceh, Jakarta, Indonesia
Duration: 27 Oct 2020 - 28 Oct 2020

Publication series

Name: Proceedings of the International Conference on Electrical Engineering and Informatics
Volume: 2020-October
ISSN (Print): 2155-6830

Conference

Conference: 2020 International Conference on Electrical Engineering and Informatics, ICELTICs 2020
Country: Indonesia
City: Banda Aceh, Jakarta
Period: 27/10/20 - 28/10/20

Keywords

  • LDA
  • Multilabel Classification
  • Tourism Review
