RMIT University
Browse

Load-balancing in distributed selective search

conference contribution
posted on 2024-10-31, 19:35 authored by Yubin Kim, Jamie Callan, Shane CulpepperShane Culpepper, Alistair Moffat
Simulation and analysis have shown that selective search can reduce the cost of large-scale distributed information retrieval. By partitioning the collection into small topical shards, and then using a resource ranking algorithm to choose a subset of shards to search for each query, fewer postings are evaluated. Here we extend the study of selective search using a fine-grained simulation investigating: selective search efficiency in a parallel query processing environment; the difference in efficiency when term-based and sample-based resource selection algorithms are used; and the effect of two policies for assigning index shards to machines. Results obtained for two large datasets and four large query logs confirm that selective search is significantly more efficient than conventional distributed search. In particular, we show that selective search is capable of both higher throughput and lower latency in a parallel environment than is exhaustive search.

Funding

Beyond keyword search for ranked document retrieval

Australian Research Council

Find out more...

History

Related Materials

  1. 1.
    DOI - Is published in 10.1145/2911451.2914689
  2. 2.
    ISBN - Is published in 9781450340694 (urn:isbn:9781450340694)

Start page

905

End page

908

Total pages

4

Outlet

Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval

Name of conference

SIGIR'16

Publisher

Association for Computing Machinery

Place published

New York, United States

Start date

2016-07-19

End date

2016-07-21

Language

English

Copyright

© 2016 The Author(s)

Former Identifier

2006064236

Esploro creation date

2020-06-22

Fedora creation date

2016-08-17

Usage metrics

    Scholarly Works

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC