RMIT University
Browse

Maximum error-bounded Piecewise Linear Representation for online stream approximation

journal contribution
posted on 2024-11-01, 18:38 authored by Qing Xie, Chaoyi Pang, Xiaofang Zhou, Xiangliang Zhang, Ke DengKe Deng
Given a time series data stream, the generation of error-bounded Piecewise Linear Representation (error-bounded PLR) is to construct a number of consecutive line segments to approximate the stream, such that the approximation error does not exceed a prescribed error bound. In this work, we consider the error bound in L∞ norm as approximation criterion, which constrains the approximation error on each corresponding data point, and aim on designing algorithms to generate the minimal number of segments. In the literature, the optimal approximation algorithms are effectively designed based on transformed space other than time-value space, while desirable optimal solutions based on original time domain (i.e., time-value space) are still lacked. In this article, we proposed two linear-time algorithms to construct error-bounded PLR for data stream based on time domain, which are named OptimalPLR and GreedyPLR, respectively. The OptimalPLR is an optimal algorithm that generates minimal number of line segments for the stream approximation, and the GreedyPLR is an alternative solution for the requirements of high efficiency and resource-constrained environment. In order to evaluate the superiority of OptimalPLR, we theoretically analyzed and compared OptimalPLR with the state-of-art optimal solution in transformed space, which also achieves linear complexity. We successfully proved the theoretical equivalence between time-value space and such transformed space, and also discovered the superiority of OptimalPLR on processing efficiency in practice. The extensive results of empirical evaluation support and demonstrate the effectiveness and efficiency of our proposed algorithms.

History

Related Materials

  1. 1.
    DOI - Is published in 10.1007/s00778-014-0355-0
  2. 2.
    ISSN - Is published in 10668888

Journal

The VLDB (Very Large Data Bases) Journal

Volume

23

Issue

6

Start page

915

End page

937

Total pages

23

Publisher

ACM Special Interest Group

Place published

United States

Language

English

Copyright

© Springer-Verlag Berlin Heidelberg 2014

Former Identifier

2006053829

Esploro creation date

2020-06-22

Fedora creation date

2015-06-23

Usage metrics

    Scholarly Works

    Categories

    No categories selected

    Keywords

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC