Term similarity-based query expansion for cross-language information retrieval

Mirna Adriani, C. J. Van Rijsbergen

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

35 Citations (Scopus)

Abstract

We propose a query expansion technique which is based on a statistical similarity measure among terms to improve the effectiveness of the dictionary-based cross-language information retrieval (CLIR) method. We employ a term similarity-based sense disambiguation technique proposed in our earlier work to enhance the accuracy of the dictionary-based query translation method. The query expansion technique is then applied to the translation of queries to further improve their retrieval performance. We demonstrate the effectiveness of the two techniques combined using queries in three languages, namely, German, Spanish, and Indonesian, to retrieve English documents from a standard TREC (Text Retrieval Conference) collection. The results of our experiments indicate that the term similarity-based techniques work better when there are more phrases in the queries. In addition, our results also re-emphasize other researchers’ finding that phrase recognition and translation are critical to CLIR’s effectiveness.

Original languageEnglish
Title of host publicationResearch and Advanced Technology for Digital Libraries - 3rd European Conference, ECDL 1999, Proceedings
EditorsSerge Abiteboul, Anne-Marie Vercoustre
PublisherSpringer Verlag
Pages311-322
Number of pages12
ISBN (Print)3540665587, 9783540665588
DOIs
Publication statusPublished - 1999
Event3rd European Conference on Research and Advanced Technology for Digital Libraries, ECDL 1999 - Paris, France
Duration: 22 Sept 199924 Sept 1999

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume1696
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference3rd European Conference on Research and Advanced Technology for Digital Libraries, ECDL 1999
Country/TerritoryFrance
CityParis
Period22/09/9924/09/99

Fingerprint

Dive into the research topics of 'Term similarity-based query expansion for cross-language information retrieval'. Together they form a unique fingerprint.

Cite this