TY - GEN
T1 - Term similarity-based query expansion for cross-language information retrieval
AU - Adriani, Mirna
AU - Van Rijsbergen, C. J.
N1 - Publisher Copyright:
© Springer-Verlag Berlin Heidelberg 1999.
PY - 1999
Y1 - 1999
N2 - We propose a query expansion technique which is based on a statistical similarity measure among terms to improve the effectiveness of the dictionary-based cross-language information retrieval (CLIR) method. We employ a term similarity-based sense disambiguation technique proposed in our earlier work to enhance the accuracy of the dictionary-based query translation method. The query expansion technique is then applied to the translation of queries to further improve their retrieval performance. We demonstrate the effectiveness of the two techniques combined using queries in three languages, namely, German, Spanish, and Indonesian, to retrieve English documents from a standard TREC (Text Retrieval Conference) collection. The results of our experiments indicate that the term similarity-based techniques work better when there are more phrases in the queries. In addition, our results also re-emphasize other researchers’ finding that phrase recognition and translation are critical to CLIR’s effectiveness.
AB - We propose a query expansion technique which is based on a statistical similarity measure among terms to improve the effectiveness of the dictionary-based cross-language information retrieval (CLIR) method. We employ a term similarity-based sense disambiguation technique proposed in our earlier work to enhance the accuracy of the dictionary-based query translation method. The query expansion technique is then applied to the translation of queries to further improve their retrieval performance. We demonstrate the effectiveness of the two techniques combined using queries in three languages, namely, German, Spanish, and Indonesian, to retrieve English documents from a standard TREC (Text Retrieval Conference) collection. The results of our experiments indicate that the term similarity-based techniques work better when there are more phrases in the queries. In addition, our results also re-emphasize other researchers’ finding that phrase recognition and translation are critical to CLIR’s effectiveness.
UR - http://www.scopus.com/inward/record.url?scp=84958034101&partnerID=8YFLogxK
U2 - 10.1007/3-540-48155-9_20
DO - 10.1007/3-540-48155-9_20
M3 - Conference contribution
AN - SCOPUS:84958034101
SN - 3540665587
SN - 9783540665588
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 311
EP - 322
BT - Research and Advanced Technology for Digital Libraries - 3rd European Conference, ECDL 1999, Proceedings
A2 - Abiteboul, Serge
A2 - Vercoustre, Anne-Marie
PB - Springer Verlag
T2 - 3rd European Conference on Research and Advanced Technology for Digital Libraries, ECDL 1999
Y2 - 22 September 1999 through 24 September 1999
ER -