RMIT University
Browse

An effective and efficient truth discovery framework over data streams

conference contribution
posted on 2024-10-31, 20:11 authored by Tianyi Li, Yu Gu, Xiangmin ZhouXiangmin Zhou, Qian Ma, Ge Yu
Truth discovery, a validity assessment method for conflicting data from various sources, has been widely studied in the conventional database community. However, while existing methods for static scenario involve time-consuming iterative processes, those for streams suffer from much sacrifice on accuracy due to the incremental source weight learning. In this paper, we propose a novel framework to conduct truth discovery over streams, which incorporates various iterative methods to effectively estimate the source weights, and decides the frequency of source weight computation adaptively. Specifically, we first capture the characteristics of source weight evolution, based on which a framework is modelled. Then, we define the conditions of source weight evolution for the situations with relatively small unit and cumulative errors, and construct a probabilistic model that estimates the probability of meeting these conditions. Finally, we propose a novel scheme called adaptive source reliability assessment (ASRA), which converts an estimation problem into an optimization problem. We have conducted extensive experiments over real datasets to prove the high effectiveness and efficiency of our framework.

History

Start page

180

End page

191

Total pages

12

Outlet

Proceedings of the 20th International Conference on Extending Database Technology

Editors

V. Markl, S. Orlando, B. Mitschang, P. Andritsos, K. -U. Sattler and S. Bress

Name of conference

International Conference on Extending Database Technology (EDBT)

Publisher

Springer

Place published

Switzerland

Start date

2017-03-21

End date

2017-03-24

Language

English

Copyright

© 2017 The Authors. Creative Commons Attribution Non Commercical No Derivatives 4.0 License

Former Identifier

2006070225

Esploro creation date

2020-06-22

Fedora creation date

2017-11-05

Usage metrics

    Scholarly Works

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC