RMIT University
Browse

Clustering XML documents by structure

conference contribution
posted on 2024-10-31, 16:31 authored by T Dalamagas, T Cheng, K.J Winkel, Timoleon Sellis
This work explores the application of clustering methods for grouping structurally similar XML documents. Modeling the XML documents as rooted ordered labeled trees, we apply clustering algorithms using distances that estimate the similarity between those trees in terms of the hierarchical relationships of their nodes. We suggest the usage of tree structural summaries to improve the performance of the distance calculation and at the same time to maintain or even improve its quality. Experimental results are provided using a prototype testbed.

History

Start page

112

End page

121

Total pages

10

Outlet

Proceedings of the 3rd Hellenic Conference on Artificial Intelligence

Editors

Vouros G.A., Panayiotopoulos T

Name of conference

3rd Hellenic Conference on Artificial Intelligence

Publisher

Springer

Place published

Germany

Start date

2004-05-05

End date

2004-05-08

Language

English

Copyright

© Springer-Verlag

Former Identifier

2006035763

Esploro creation date

2020-06-22

Fedora creation date

2013-02-19

Usage metrics

    Scholarly Works

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC