RMIT University
Browse

On group nearest group query processing

journal contribution
posted on 2024-11-01, 18:40 authored by Ke DengKe Deng, Shazia Sadiq, Xiaofang Zhou, Hu Xu, Gabriel Fung, Yansheng Lu
Given a data point set D, a query point set Q, and an integer k, the Group Nearest Group (GNG) query finds a subset omega (vertical bar omega vertical bar <= k) of points from D such that the total distance from all points in Q to the nearest point in omega is not greater than any other subset omega' (vertical bar omega'vertical bar <= k) of points in D. GNG query is a partition-based clustering problem which can be found in many real applications and is NP-hard. In this paper, Exhaustive Hierarchical Combination (EHC) algorithm and Subset Hierarchial Refinement (SHR) algorithm are developed for GNG query processing. While EHC is capable to provide the optimal solution for k = 2, SHR is an efficient approximate approach that combines database techniques with local search heuristic. The processing focus of our approaches is on minimizing the access and evaluation of subsets of cardinality k in D since the number of such subsets is exponentially greater than vertical bar D vertical bar. To do that, the hierarchical blocks of data points at high level are used to find an intermediate solution and then refined by following the guided search direction at low level so as to prune irrelevant subsets. The comprehensive experiments on both real and synthetic data sets demonstrate the superiority of SHR in terms of efficiency and quality.

History

Journal

IEEE Transactions on Knowledge and Data Engineering

Volume

24

Issue

2

Start page

295

End page

308

Total pages

14

Publisher

Institute of Electrical and Electronics Engineers

Place published

United States

Language

English

Copyright

© 2012 IEEE

Former Identifier

2006053903

Esploro creation date

2020-06-22

Fedora creation date

2015-06-30

Usage metrics

    Scholarly Works

    Categories

    No categories selected

    Keywords

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC