Evaluating language resources for English-Indonesian CLIR

Herika Hayurani, Syandra Sari, Mirna Adriani

Research output: Contribution to journal › Conference article › peer-review

2 Citations (Scopus)

Abstract

We present a report on our participation in the Indonesian-English ad hoc bilingual task of the 2006 Cross-Language Evaluation Forum (CLEF). This year we compare the use of several language resources to translate Indonesian queries into English. We used several machine-readable dictionaries to perform the translation. We also used two machine translation techniques to translate the Indonesian queries. In addition to translating an Indonesian query set into English, we also translated English documents into Indonesian using the machine-readable dictionaries and a commercial machine translation tool. The results show that performing the task by translating the queries is better than translating the documents. Combining several dictionaries produced better results than using only one dictionary. However, the query expansion that we applied to the translated queries using the dictionaries reduced the retrieval effectiveness of the queries.

Original language: English
Journal: CEUR Workshop Proceedings
Volume: 1172
Publication status: Published - 2006
Event: 2006 Cross Language Evaluation Forum Workshop, CLEF 2006, co-located with the 10th European Conference on Digital Libraries, ECDL 2006 - Alicante, Spain
Duration: 20 Sept 2006 - 22 Sept 2006

Keywords

  • Cross-language information retrieval
  • Dictionary translation
  • Machine translation
  • Parallel corpus
