RMIT University

Optimizing ETL processes in data warehouse environments

conference contribution
posted on 2024-10-31, 16:36 authored by A Simitsis, P Vassiliadis, Timoleon Sellis
Extraction-Transformation-Loading (ETL) tools are pieces of software responsible for extracting data from several sources, cleansing and customizing it, and inserting it into a data warehouse. These processes must usually complete within a fixed time window, so their execution time needs to be optimized. In this paper, we address the logical optimization of ETL processes, modeling it as a state-space search problem. We treat each ETL workflow as a state and construct the state space through a set of correct state transitions. We also provide algorithms for minimizing the execution cost of an ETL workflow.
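
The state-space formulation in the abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the workflow is simplified to a linear chain of activities, the only transition is a swap of two adjacent activities, the swap is considered correct when neither activity touches an attribute the other writes, and the cost model is rows processed times a per-row cost. The names `Activity`, `workflow_cost`, `can_swap`, `neighbours` and `exhaustive_search` are hypothetical.

```python
from collections import deque
from dataclasses import dataclass


@dataclass(frozen=True)
class Activity:
    """One ETL step (hypothetical model, not the paper's formalism)."""
    name: str
    selectivity: float            # fraction of input rows passed downstream
    cost_per_row: float           # assumed cost of processing one row
    reads: frozenset = frozenset()   # attributes the step consumes
    writes: frozenset = frozenset()  # attributes the step produces


def workflow_cost(workflow, input_rows):
    """Assumed cost model: sum over steps of rows_in * cost_per_row."""
    total, rows = 0.0, float(input_rows)
    for act in workflow:
        total += rows * act.cost_per_row
        rows *= act.selectivity
    return total


def can_swap(first, second):
    """A swap is treated as correct if neither step depends on the other's output."""
    if first.writes & (second.reads | second.writes):
        return False
    return not (second.writes & first.reads)


def neighbours(workflow):
    """Yield every state reachable by one correct adjacent swap."""
    for i in range(len(workflow) - 1):
        a, b = workflow[i], workflow[i + 1]
        if can_swap(a, b):
            yield workflow[:i] + (b, a) + workflow[i + 2:]


def exhaustive_search(initial, input_rows):
    """Breadth-first walk over the whole state space, keeping the cheapest state."""
    best, best_cost = initial, workflow_cost(initial, input_rows)
    seen, queue = {initial}, deque([initial])
    while queue:
        state = queue.popleft()
        for nxt in neighbours(state):
            if nxt in seen:
                continue
            seen.add(nxt)
            queue.append(nxt)
            cost = workflow_cost(nxt, input_rows)
            if cost < best_cost:
                best, best_cost = nxt, cost
    return best, best_cost


# Illustrative workflow: a conversion followed by two filters.
convert = Activity("convert_currency", 1.0, 2.0,
                   reads=frozenset({"amount"}), writes=frozenset({"amount_eur"}))
filter_amount = Activity("filter_amount_eur", 0.5, 1.0,
                         reads=frozenset({"amount_eur"}))
filter_country = Activity("filter_country", 0.25, 1.0,
                          reads=frozenset({"country"}))

initial = (convert, filter_amount, filter_country)
best, best_cost = exhaustive_search(initial, input_rows=1000)
print([a.name for a in best], best_cost)
```

In this toy case the search moves the selective `filter_country` step ahead of the conversion, while `filter_amount_eur` stays after `convert_currency` because it reads the attribute the conversion writes. Exhaustive enumeration is only feasible for small workflows; a real optimizer would need a richer transition set and heuristics to prune the space.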

History

Related Materials

  1. ISBN - Is published in 0769522858 (urn:isbn:0769522858)

Start page

564

End page

575

Total pages

12

Outlet

Proceedings of the 21st International Conference on Data Engineering (ICDE '05)

Editors

Karl Aberer, Michael J. Franklin, Shojiro Nishio

Name of conference

21st International Conference on Data Engineering (ICDE '05)

Publisher

IEEE

Place published

United States

Start date

2005-04-05

End date

2005-04-08

Language

English

Copyright

IEEE Computer Society Washington, DC, USA ©2005

Former Identifier

2006035784

Esploro creation date

2020-06-22

Fedora creation date

2012-12-04
