Using mutual information technique in cross-language information retrieval

Syandra Sari, Mirna Adriani

Research output: Chapter in Book/Report/Conference proceeding, Conference contribution, peer-reviewed

1 Citation (Scopus)

Abstract

This paper describes an Indonesian-English cross-language information system and investigates the problem of cross-language document retrieval for Indonesian-English. Our work is based on an Indonesian-English parallel corpus and applies a mutual information technique. The parallel corpus was built using a machine translation tool. Indonesian queries were translated into English by means of a bilingual word list that was created using the mutual information technique. Experimental results demonstrated that translating Indonesian queries into English using this bilingual word list achieved 41.9% of the performance of the monolingual queries.
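The abstract does not give the formula or the pipeline used to build the bilingual word list, so the following is only a minimal sketch of one common way to do it: score each Indonesian-English word pair by pointwise mutual information (PMI) over sentence-aligned pairs and keep the best-scoring English word for each Indonesian word. The function names (`build_word_list`, `translate_query`), the `min_cooccurrence` threshold, the tie-breaking rule, and the toy sentences are all assumptions made for illustration and are not taken from the paper.

```python
import math
from collections import Counter


def build_word_list(pairs, min_cooccurrence=2):
    """Build a source-to-target word list from sentence-aligned pairs.

    pairs: list of (indonesian_sentence, english_sentence) strings.
    Assumption: co-occurrence is counted once per aligned sentence pair.
    """
    n = len(pairs)
    src_count, tgt_count, joint_count = Counter(), Counter(), Counter()
    for src, tgt in pairs:
        src_words = set(src.lower().split())
        tgt_words = set(tgt.lower().split())
        src_count.update(src_words)
        tgt_count.update(tgt_words)
        joint_count.update((s, t) for s in src_words for t in tgt_words)

    best = {}
    for (s, t), c in joint_count.items():
        if c < min_cooccurrence:
            continue  # PMI is unreliable for very rare pairs
        # PMI = log(P(s,t) / (P(s) * P(t))) = log(c * n / (count(s) * count(t)))
        pmi = math.log(c * n / (src_count[s] * tgt_count[t]))
        score = (pmi, c)  # assumption: break PMI ties by co-occurrence count
        if s not in best or score > best[s][0]:
            best[s] = (score, t)
    return {s: t for s, (_, t) in best.items()}


def translate_query(query, word_list):
    """Replace each query word by its listed translation; keep unknown words."""
    return " ".join(word_list.get(w, w) for w in query.lower().split())


# Hypothetical toy corpus, invented for this example.
toy_pairs = [
    ("saya makan nasi", "i eat rice"),
    ("saya minum air", "i drink water"),
    ("kami makan roti", "we eat bread"),
    ("kami minum susu", "we drink milk"),
    ("kami beli nasi", "we buy rice"),
    ("saya beli air", "i buy water"),
]
toy_word_list = build_word_list(toy_pairs)
print(translate_query("saya makan roti", toy_word_list))  # i eat roti
```

In this toy corpus "roti" occurs only once, so it falls below the co-occurrence threshold and is passed through untranslated. A real system would need its own policy for such out-of-vocabulary query terms.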

Original language: English
Title of host publication: Digital Libraries
Subtitle of host publication: Universal and Ubiquitous Access to Information - 11th International Conference on Asian Digital Libraries, ICADL 2008, Proceedings
Pages: 276-284
Number of pages: 9
Publication status: Published - 31 Dec 2008
Event: 11th International Conference on Asian Digital Libraries, ICADL 2008 - Bali, Indonesia
Duration: 2 Dec 2008 - 5 Dec 2008

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 5362 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 11th International Conference on Asian Digital Libraries, ICADL 2008
Country/Territory: Indonesia
City: Bali
Period: 2/12/08 - 5/12/08

Keywords

  • Cross-language information retrieval
  • Mutual information
  • Parallel corpus
