Optimizing ETL processes in data warehouse environments
conference contribution
posted on 2024-10-31, 16:36authored byA Simitsis, P Vassiliadis, Timoleon Sellis
Extraction-Transformation-Loading (ETL) tools are pieces of software responsible for the extraction of data from several sources, their cleansing, customization and insertion into a data warehouse. Usually, these processes must be completed in a certain time window; thus, it is necessary to optimize their execution time. In this paper, we delve into the logical optimization of ETL processes, modeling it as a state-space search problem. We consider each ETL workflow as a state and fabricate the state space through a set of correct state transitions. Moreover, we provide algorithms towards the minimization of the execution cost of an ETL workflow.
History
Related Materials
1.
ISBN - Is published in 0769522858 (urn:isbn:0769522858)
Start page
564
End page
575
Total pages
12
Outlet
Proceedings of the 21st International Conference on Data Engineering (ICDE '05)
Editors
Karl Aberer, Michael J. Franklin, Shojiro Nishio
Name of conference
21st International Conference on Data Engineering (ICDE '05)