Corpus development for Indonesian consumer-health question answering system

Abid Nurul Hakim, Rahmad Mahendra, Mima Adriani, Adrianus Saga Ekakristi

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

9 Citations (Scopus)

Abstract

Web-based question answering services facilitate users to seek more personalized health-related information. However, the users sometimes have to wait for a while until their questions to be answered. Automatic question answering research can assist the system to generate or retrieve the answer to users. Our work was a pioneering study on consumer-health question answering for Bahasa Indonesia. We built a corpus of 86,731 consumer-health questions, collected from 5 different websites. As part of annotation, we classify the sub-topics for each question, which corresponds to medical specialization. Question sub-topic classification is completed by two complementary approaches: dictionary-based and machine learning-based.

Original languageEnglish
Title of host publication2017 International Conference on Advanced Computer Science and Information Systems, ICACSIS 2017
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages222-227
Number of pages6
ISBN (Electronic)9781538631720
DOIs
Publication statusPublished - 2 Jul 2017
Event9th International Conference on Advanced Computer Science and Information Systems, ICACSIS 2017 - Jakarta, Indonesia
Duration: 28 Oct 201729 Oct 2017

Publication series

Name2017 International Conference on Advanced Computer Science and Information Systems, ICACSIS 2017
Volume2018-January

Conference

Conference9th International Conference on Advanced Computer Science and Information Systems, ICACSIS 2017
Country/TerritoryIndonesia
CityJakarta
Period28/10/1729/10/17

Fingerprint

Dive into the research topics of 'Corpus development for Indonesian consumer-health question answering system'. Together they form a unique fingerprint.

Cite this