RMIT University
Browse

A concise integer linear programming formulation for implicit search result diversification

conference contribution
posted on 2024-10-31, 21:08 authored by Hai-Tao Yu, Adam Jatowt, Roi Blanco Gonzalez, Hideo Joho, Joemon Jose, Long Chen, Fajie Yuan
To cope with ambiguous and/or underspecified queries, search result diversification (SRD) is a key technique that has attracted a lot of attention. This paper focuses on implicit SRD, where the possible subtopics underlying a query are unknown beforehand. We formulate implicit SRD as a process of selecting and ranking k exemplar documents that utilizes integer linear programming (ILP). Unlike the common practice of relying on approximate methods, this formulation enables us to obtain the optimal solution of the objective function. Based on four benchmark collections, our extensive empirical experiments reveal that: (1) The factors, such as different initial runs, the number of input documents, query types and the ways of computing document similarity significantly affect the performance of diversification models. Careful examinations of these factors are highly recommended in the development of implicit SRD methods. (2) The proposed method can achieve substantially improved performance over the state-of-the-art unsupervised methods for implicit SRD.

History

Related Materials

  1. 1.
    DOI - Is published in 10.1145/3018661.3018710
  2. 2.
    ISBN - Is published in 9781450346757 (urn:isbn:9781450346757)

Start page

191

End page

200

Total pages

10

Outlet

Proceedings of the Tenth ACM International Conference on Web Search and Data Mining (WSDM 2017)

Editors

Maarten de Rijke, Milad Shokouhi

Name of conference

WSDM 2017

Publisher

Association for Computing Machinery

Place published

New York, United States

Start date

2017-02-06

End date

2017-02-10

Language

English

Copyright

Copyright © 2017 Association for Computing Machinery (ACM)

Former Identifier

2006077299

Esploro creation date

2020-06-22

Fedora creation date

2017-08-28

Usage metrics

    Scholarly Works

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC