Abstract
This year, we have participated in the Ad-Hoc Robust Multilingual track with the aim of evaluating two important issues in Cross-Lingual Information Retrieval (CLIR) systems. This paper first describes the method applied for query expansion in a multilingual environment by using web search results provided by the Google engine in order to increase retrieval robustness. Unfortunately, the results obtained are disappointing. The second issue reported alludes to the robustness of several common merging algorithms. We have found that 2-step RSV merging algorithms perform better than others algorithms when evaluating using geometric average.
This work has been supported by the Spanish Government (MCYT) with grant TIC2003-07158-C04-04 and the RFC/PP2006/Id_514 granted by the University of Jaén.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Voorhees, E.M.: The TREC Robust Retrieval Track, TREC Report (2005)
Kwok, K.L., Grunfeld, L., Deng, P.: Improving Weak Ad-Hoc Retrieval by Web Assistance and Data Fusion. In: Lee, G.G., Yamada, A., Meng, H., Myaeng, S.-H. (eds.) AIRS 2005. LNCS, vol. 3689, pp. 17–30. Springer, Heidelberg (2005)
Kwok, K.L., Grunfeld, L., Sun, H.L., Deng, P.: TREC 2004 Robust Track Experiments using PIRCS, 2004 (2005)
Grunfeld, L., Kwok, K.L., Dinstl, N., Deng, P.: TREC 2003 Robust, HARD and QA Track Experiments using PIRCS (2003)
Dumais, S.T.: Latent Semantic Indexing (LSI) and TREC-2. In: Harman, D.K. (ed.) Proceedings of TREC’2, Gaithersburg. NIST, vol. 500-215, pp. 105–115 (1994)
Martinez-Santiago, F., Ureña, L.A., Martin, M.: A merging strategy proposal: two step retrieval status value method. Information Retrieval 9(1), 71–93 (2006)
Porter, M.F.: An algorithm for suffix stripping. Program 14, 130–137 (1980)
Robertson, S.E, Walker, S., Beaulieu, M.: Experimentation as a way of life: Okapi at TREC. Information Processing and Management 1, 95–108 (2000)
Savoy, J.: Cross-Language Information Retrieval: experiments based on CLEF 2000 corpora. Information Processing and Management 39, 75–115 (2003)
Llopis, F., Garcia Puigcerver, H., Cano, M., Toral, A., Espi, H.: IR-n System, a Passage Retrieval Architecture. In: Sojka, P., Kopeček, I., Pala, K. (eds.) TSD 2004. LNCS (LNAI), vol. 3206, pp. 57–64. Springer, Heidelberg (2004)
Callan, J.P., Lu, Z., Croft, W.B.: Searching distributed collections with inference networks. In: Proceedings of the 18th International Conference of the ACM SIGIR 1995, pp. 21–28. The ACM Press, New York (1995)
Calve, A., Savoy, J.: Database merging strategy based on logistic regression. Information Processing and Management 36, 341–359 (2000)
Powell, A.L., French, J.C., Callan, J., Connell, M., Viles, C.L.: The impact of database selection on distributed searching. In: Proceedings of the 23rd International Conference of the ACM-SIGIR 2000, pp. 232–239. ACM Press, New York (2000)
Voorhees, E., Gupta, N.K., Johnson-Laird, B.: The collection fusion problem. In: Harman, D.K. (ed.) Proceedings of the 3rd Text Retrieval Conference TREC-3, National Institute of Standards ad Technology, Special Publication, vol. 500-225, pp. 95–104 (1995)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Martínez-Santiago, F., Montejo-Ráez, A., García-Cumbreras, M.Á., Ureña-López, L.A. (2007). SINAI at CLEF 2006 Ad Hoc Robust Multilingual Track: Query Expansion Using the Google Search Engine. In: Peters, C., et al. Evaluation of Multilingual and Multi-modal Information Retrieval. CLEF 2006. Lecture Notes in Computer Science, vol 4730. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74999-8_16
Download citation
DOI: https://doi.org/10.1007/978-3-540-74999-8_16
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-74998-1
Online ISBN: 978-3-540-74999-8
eBook Packages: Computer ScienceComputer Science (R0)