Corpus development for Indonesian consumer-health question answering system

Abid Nurul Hakim, Rahmad Mahendra, Mima Adriani, Adrianus Saga Ekakristi

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution > peer-review

9 Citations (Scopus)


Web-based question answering services help users seek more personalized health-related information. However, users sometimes have to wait a while for their questions to be answered. Automatic question answering can help such systems generate or retrieve answers for users. Our work is a pioneering study on consumer-health question answering for Bahasa Indonesia. We built a corpus of 86,731 consumer-health questions collected from 5 different websites. As part of the annotation, we classify each question by sub-topic, which corresponds to a medical specialization. Sub-topic classification is carried out with two complementary approaches: dictionary-based and machine learning-based.

Original language: English
Title of host publication: 2017 International Conference on Advanced Computer Science and Information Systems, ICACSIS 2017
Publisher: Institute of Electrical and Electronics Engineers Inc.
Number of pages: 6
ISBN (Electronic): 9781538631720
Publication status: Published - 2 Jul 2017
Event: 9th International Conference on Advanced Computer Science and Information Systems, ICACSIS 2017 - Jakarta, Indonesia
Duration: 28 Oct 2017 to 29 Oct 2017

Publication series

Name: 2017 International Conference on Advanced Computer Science and Information Systems, ICACSIS 2017


Conference: 9th International Conference on Advanced Computer Science and Information Systems, ICACSIS 2017

