Query and document translation for English-Indonesian cross language IR

Herika Hayurani, Syandra Sari, Mirna Adriani

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

5 Citations (Scopus)

Abstract

We present a report on our participation in the Indonesian-English ad hoc bilingual task of the 2006 Cross-Language Evaluation Forum (CLEF). This year we compare the use of several language resources to translate Indonesian queries into English. We used several readable machine dictionaries to perform the translation. We also used two machine translation techniques to translate the Indonesian queries. In addition to translating an Indonesian query set into English, we also translated English documents into Indonesian using the machine readable dictionaries and a commercial machine translation tool. The results show performing the task by translating the queries is better than translating the documents. Combining several dictionaries produced better result than only using one dictionary. However, the query expansion that we applied to the translated queries using the dictionaries reduced the retrieval effectiveness of the queries.

Original languageEnglish
Title of host publicationEvaluation of Multilingual and Multi-modal Information Retrieval - 7th Workshop of the Cross-Language Evaluation Forum, CLEF 2006, Revised Selected Papers
Pages57-61
Number of pages5
Publication statusPublished - 1 Dec 2007
Event7th Workshop of the Cross-Language Evaluation Forum, CLEF 2006 - Alicante, Spain
Duration: 20 Sep 200622 Sep 2006

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume4730 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference7th Workshop of the Cross-Language Evaluation Forum, CLEF 2006
Country/TerritorySpain
CityAlicante
Period20/09/0622/09/06

Fingerprint

Dive into the research topics of 'Query and document translation for English-Indonesian cross language IR'. Together they form a unique fingerprint.

Cite this