posted on 2024-10-31, 15:34 authored by Michael Bendersky, Bruce Croft
We propose to use the search log to study long queries, in order to understand the types of information needs that are behind them, and to design techniques to improve search effectiveness when they are used. Long queries arise in many different applications, such as CQA (community-based question answering) and literature search, and they have been studied to some extent using TREC data. They are also, however, quite common in web search, as can be seen by looking at the distribution of query lengths in a large-scale search log. In this paper we analyze the long queries in the search log with the aim of identifying the characteristics of the most commonly occurring types of queries, and the issues involved with using them effectively in a search engine. In addition, we propose a simple yet effective method for evaluating the performance of the queries in the search log using a combination of the click data in the search log with the existing TREC corpora.