Using Wikipedia categories and links in entity ranking
conference contribution
posted on 2024-10-30, 22:31authored byAnne-Marie Vercoustre, Jovan Pehcevski, James Thom
This paper describes the participation of the INRIA group in the INEX 2007 XML entity ranking and ad hoc tracks. We developed a system for ranking Wikipedia entities in answer to a query. Our approach utilises the known categories, the link structure of Wikipedia, as well as the link co-occurrences with the examples (when provided) to improve the effectiveness of entity ranking. Our experiments on both the training and the testing data sets demonstrate that the use of categories and the link structure of Wikipedia can significantly improve entity retrieval effectiveness. We also use our system for the ad hoc tasks by inferring target categories from the title of the query. The results were worse than when using a full-text search engine, which confirms our hypothesis that ad hoc retrieval and entity retrieval are two different tasks.
History
Start page
321
End page
335
Total pages
15
Outlet
Focused Access to XML Documents
Editors
N. Fuhr, J. Kamps, M. Lalmas, A. Trotman
Name of conference
6th International Workshop of the Initiative for the Evaluation of XML Retrieval, INEX 2007