RMIT University
Seeing the forest from trees: Blog retrieval by aggregating post similarity scores

conference contribution
posted on 2024-10-31, 10:18 authored by Zhixin Zhou, Xiuzhen Zhang, Philip Vines
Blog retrieval is a new and challenging task. Instead of retrieving individual documents, this task requires retrieving blogs, that is, collections of documents (blog posts). It has recently been shown that the federated model, which uses post entries as retrieval units, is an effective approach to blog retrieval, and that the aggregation of post similarity scores plays an important role in the final ranking of blogs. In this paper, we explore two approaches to aggregation, which describe the depth and the width of the topical relevance relationship between post entries and blogs. We further propose holistic approaches that combine the two. Our experiments show that the sum baseline has the best performance, although the performances of the probabilistic approach and the linear pooling approach are very similar.
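
As a rough illustration of the aggregation step described above, the following minimal sketch ranks blogs by summing the similarity scores of their retrieved posts. This record does not give the paper's formulas, so the sketch assumes that the "sum baseline" simply adds up each blog's post scores; the function name `rank_blogs_by_sum` and the `(blog_id, post_id, score)` input layout are hypothetical and chosen only for this example.

```python
from collections import defaultdict


def rank_blogs_by_sum(post_scores):
    """Rank blogs by the sum of their posts' similarity scores.

    post_scores: iterable of (blog_id, post_id, score) tuples, where
    score is the query-post similarity from some post-level retrieval
    run. This layout is an assumption made for illustration.
    """
    totals = defaultdict(float)
    for blog_id, _post_id, score in post_scores:
        # Assumed sum aggregation: each retrieved post adds its score
        # to the running total of the blog it belongs to.
        totals[blog_id] += score
    # Highest total first; ties are broken by blog_id so that the
    # ordering is deterministic.
    return sorted(totals.items(), key=lambda item: (-item[1], item[0]))


if __name__ == "__main__":
    example = [
        ("blog_a", "post_1", 0.5),
        ("blog_a", "post_2", 0.25),
        ("blog_b", "post_3", 0.625),
    ]
    for blog_id, total in rank_blogs_by_sum(example):
        print(blog_id, total)
```

Under this assumption, a blog with several moderately relevant posts can outrank a blog with a single highly relevant post, which is one way to read the interplay of depth and width that the abstract mentions.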

Related Materials

  1. ISBN - Is published in 9781921426803 (urn:isbn:9781921426803)

Start page

12

End page

19

Total pages

8

Outlet

Proceedings of the 15th Australasian Document Computing Symposium

Editors

Falk Scholer, Andrew Trotman and Andrew Turpin

Name of conference

15th Australasian Document Computing Symposium

Publisher

RMIT University

Place published

Melbourne, Australia

Start date

2010-12-10

End date

2010-12-10

Language

English

Former Identifier

2006022370

Esploro creation date

2020-06-22

Fedora creation date

2011-06-10
