Building MEDISCO: Indonesian Speech Corpus for Medical Domain

Muhammad Reza Qorib, Mirna Adriani

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Abstract

In this paper, we report our work on building MEDISCO, the Medical Indonesian Speech Corpus. The medical text corpus was collected from five Indonesian online medical consultation websites. From the text corpus, we created a speech corpus that consists of 360 sentences read by 13 speakers. In total, our speech corpus contains 731 medical terms and consists of 4,680 utterances with a total duration of 10 hours.

Original language: English
Title of host publication: Proceedings of the 2018 International Conference on Asian Language Processing, IALP 2018
Editors: Minghui Dong, Fariska Z. Ruskanda, Herry Sujaini, Ade Romadhony, Moch. Bijaksana, Elvira Nurfadhilah, Lyla Ruslana Aini, Arif Bijaksana Putra Negara
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 133-138
Number of pages: 6
ISBN (Electronic): 9781728111766
DOIs: https://doi.org/10.1109/IALP.2018.8629259
Publication status: Published - 28 Jan 2019
Event: 22nd International Conference on Asian Language Processing, IALP 2018 - Bandung, Indonesia
Duration: 15 Nov 2018 - 17 Nov 2018

Publication series

Name: Proceedings of the 2018 International Conference on Asian Language Processing, IALP 2018

Conference

Conference: 22nd International Conference on Asian Language Processing, IALP 2018
Country: Indonesia
City: Bandung
Period: 15/11/18 - 17/11/18

Keywords

  • Indonesian Automatic Speech Recognition
  • Medical Speech Corpus
  • Text Corpus

Cite this

Qorib, M. R., & Adriani, M. (2019). Building MEDISCO: Indonesian Speech Corpus for Medical Domain. In M. Dong, F. Z. Ruskanda, H. Sujaini, A. Romadhony, M. Bijaksana, E. Nurfadhilah, L. R. Aini, & A. B. P. Negara (Eds.), Proceedings of the 2018 International Conference on Asian Language Processing, IALP 2018 (pp. 133-138). [8629259] (Proceedings of the 2018 International Conference on Asian Language Processing, IALP 2018). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/IALP.2018.8629259