RMIT University
Browse

Novelty detection based on sentence level patterns

conference contribution
posted on 2024-10-31, 10:48 authored by Xiaoyan Li, Bruce Croft
The detection of new information in a document stream is an important component of many potential applications. In this paper, a new novelty detection approach based on the identification of sentence level patterns is proposed. Given a user's information need, some patterns in sentences such as combinations of query words, named entities and phrases, may contain more important and relevant information than single words. Therefore, the proposed novelty detection approach focuses on the identification of previously unseen query-related patterns in sentences. Specifically, a query is preprocessed and represented with patterns that include both query words and required answer types. These patterns are used to retrieve sentences, which are then determined to be novel if it is likely that a new answer is present. An analysis of patterns in sentences was performed with data from the TREC 2002 novelty track and experiments on novelty detection were carried out on data from the TREC 2003 and 2004 novelty tracks. The experimental results show that the proposed pattern-based approach significantly outperforms all three baselines in terms of precision at top ranks.

History

Start page

744

End page

751

Total pages

8

Outlet

Proceedings of the 14th ACM International Conference on Information and Knowledge Management (CIKM 2005)

Editors

Otthein Herzog

Name of conference

14th ACM International Conference on Information and Knowledge Management (CIKM 2005)

Publisher

ACM

Place published

New York, USA

Start date

2005-10-31

End date

2005-11-05

Language

English

Copyright

Copyright 2005 ACM

Former Identifier

2006024198

Esploro creation date

2020-06-22

Fedora creation date

2011-11-25

Usage metrics

    Scholarly Works

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC