RMIT University
Browse

State-space optimization of ETL workflows

journal contribution
posted on 2024-11-01, 12:48 authored by A Simitsis, P Vassiliadis, Timoleon Sellis
Extraction-transformation-loading (ETL) tools are pieces of software responsible for the extraction of data from several sources, their cleansing, customization, and insertion into a data warehouse. In this paper, we derive into the logical optimization of ETL processes, modeling it as a state-space search problem. We consider each ETL workflow as a state and fabricate the state space through a set of correct state transitions. Moreover, we provide an exhaustive and two heuristic algorithms toward the minimization of the execution cost of an ETL workflow. The heuristic algorithm with greedy characteristics significantly outperforms the other two algorithms for a large set of experimental case

History

Journal

IEEE Transactions on Knowledge and Data Engineering

Volume

17

Issue

10

Start page

1404

End page

1419

Total pages

16

Publisher

IEEE

Place published

United States

Language

English

Copyright

© 2005 IEEE

Former Identifier

2006035649

Esploro creation date

2020-06-22

Fedora creation date

2012-10-05

Usage metrics

    Scholarly Works

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC