Combination Methods for Crosslingual Web Retrieval
Jaap Kamps, Maarten de Rijke, and Börkur Sigurbjörnsson.
In: Accessing Multilingual Information Repositories: 6th Workshop of the Cross-Language Evaluation Forum, CLEF 2005. Lecture Notes in Computer Science. Volume: 4022. Pages: 856-864. 2006.
Link: springerlink
Abstract
We investigate a range of crosslingual web retrieval tasks using the test suite of the CLEF 2005 WebCLEF track, which features a stream of known-item topics in various languages. Our main findings are: (i) straightforward indexing and retrieval is effective for mixed monolingual web retrieval; (ii) standard machine translation methods are effective for bilingual web retrieval; but (iii) standard combination methods are ineffective for multilingual web retrieval; we analyze the failure and suggest an alternative Z-score normalization that leads to effective multilingual retrieval results.
Bibtex
@inproceedings{kamps2006combination, author = {Jaap Kamps and Maarten de Rijke and B"orkur Sigurbj"ornsson}, title = {Combination Methods for Crosslingual Web Retrieval}, editor = {C. Peters and F.C. Gey and J. Gonzalo and G.J.F. Jones and M. Kluck and B. Magnini and H. Müller and M. de Rijke }, booktitle = {Accessing Multilingual Information Repositories: 6th Workshop of the Cross-Language Evaluation Forum, CLEF 2005}, series = {Lecture Notes in Computer Science}, volume = {4022}, pages = {856--864}, publisher = {Springer-Verlag}, year = {2006}, doi = {http://dx.doi.org/10.1007/11878773_93},}