RMIT University

Evaluating text reuse discovery on the web

conference contribution
posted on 2024-10-31, 10:40; authored by Stanford Chiu, Ibrahim Uysal, Bruce Croft
Text reuse detection aims to identify duplicates, reformulations or partial rewrites of a given text. Some previous research has focused on determining text reuse instances accurately on local corpora. However, the practical usage of finding text reuse on the web has remained largely untested. In this work, we 1) introduce a novel text reuse searching interface for the web, based on a previously proposed architecture, 2) evaluate its feasibility, and 3) investigate techniques to improve both effectiveness and efficiency. Our results show that exhaustive query submission using n-grams can dramatically reduce the execution time with only small losses in accuracy.
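
As a rough illustration of the n-gram idea mentioned in the abstract, the sketch below splits a passage into word n-grams that could be submitted as quoted phrase queries to a web search engine. The abstract does not say how the n-grams are formed or submitted, so the function name `ngram_queries`, the tokenization, the default `n=3`, and the `step` parameter are all assumptions made for illustration and are not the authors' method.

```python
import re


def ngram_queries(text, n=3, step=None):
    """Return quoted word n-grams from `text` for use as phrase queries.

    Hypothetical helper: the tokenization, n, and step are assumptions,
    not details taken from the paper.
    """
    # Lowercase and keep runs of word characters as tokens.
    words = re.findall(r"\w+", text.lower())
    if not words:
        return []
    # Non-overlapping n-grams by default; step=1 gives a sliding window.
    step = step or n
    if len(words) < n:
        return ['"' + " ".join(words) + '"']
    return [
        '"' + " ".join(words[i:i + n]) + '"'
        for i in range(0, len(words) - n + 1, step)
    ]


if __name__ == "__main__":
    passage = "Text reuse detection aims to identify duplicates"
    for query in ngram_queries(passage, n=3):
        print(query)
```

A smaller `step` yields more queries per passage, and a larger one yields fewer; which setting is appropriate depends on how many queries a search service allows.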

History

Start page

299

End page

303

Total pages

5

Outlet

Proceedings of the 3rd Symposium on Information Interaction in Context (IIiX 2010)

Name of conference

3rd Symposium on Information Interaction in Context (IIiX 2010)

Publisher

ACM

Place published

New York, USA

Start date

2010-08-18

End date

2010-08-21

Language

English

Copyright

Copyright 2010 ACM

Former Identifier

2006024375

Esploro creation date

2020-06-22

Fedora creation date

2013-03-04
