Developing Indonesian-English hybrid machine translation system

Evi Yulianti, Indra Budi, Achmad Nizar Hidayanto, Hisar Maruli Manurung, Mirna Adriani

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

8 Citations (Scopus)

Abstract

Rule Based Machine Translation (RBMT) and Statistical Machine translation (SMT) have different approach in performing translation task. RBMT uses linguistic rule between two languages which is built manually by human in general, whereas SMT uses co-occurrence statistic of word in parallel corpora. We combine those different approaches into Indonesian-English Hybrid Machine Translation (HMT) system to get the advantage from both kind of information. Initially, Indonesian text is inputted into RBMT. Then, the output will be edited by SMT to generate the final translation of English text. SMT is capable to do this because on the training process, it uses RBMT's output (English) as source material and real translation (English) as target material. Unavailability of ready to use Indonesian-English RBMT system becomes a challenge to do this research. Our study shows that SMT still outperforms HMT by 8.01% in average.

Original languageEnglish
Title of host publicationICACSIS 2011 - 2011 International Conference on Advanced Computer Science and Information Systems, Proceedings
Pages265-270
Number of pages6
Publication statusPublished - 2011
Event2011 International Conference on Advanced Computer Science and Information Systems, ICACSIS 2011 - Jakarta, Indonesia
Duration: 17 Dec 201118 Dec 2011

Publication series

NameICACSIS 2011 - 2011 International Conference on Advanced Computer Science and Information Systems, Proceedings

Conference

Conference2011 International Conference on Advanced Computer Science and Information Systems, ICACSIS 2011
Country/TerritoryIndonesia
CityJakarta
Period17/12/1118/12/11

Fingerprint

Dive into the research topics of 'Developing Indonesian-English hybrid machine translation system'. Together they form a unique fingerprint.

Cite this