RMIT University
Browse

New Diagonal Bundle Method for Clustering Problems in Large Data Sets

journal contribution
posted on 2024-11-02, 14:37 authored by Napsu Karmitsa, Adil Baghirov, Sona TaheriSona Taheri
Clustering is one of the most important tasks in data mining. Recent developments in computer hardware allow us to store in random access memory (RAM) and repeatedly read data sets with hundreds of thousands and even millions of data points. This makes it possible to use conventional clustering algorithms in such data sets. However, these algorithms may need prohibitively large computational time and fail to produce accurate solutions. Therefore, it is important to develop clustering algorithms which are accurate and can provide real time clustering in large data sets. This paper introduces one of them. Using nonsmooth optimization formulation of the clustering problem the objective function is represented as a difference of two convex (DC) functions. Then a new diagonal bundle algorithm that explicitly uses this structure is designed and combined with an incremental approach to solve this problem. The method is evaluated using real world data sets with both large number of attributes and large number of data points. The proposed method is compared with two other clustering algorithms using numerical results.

Funding

Exploring and exploiting structures in nonsmooth and global optimization problems

Australian Research Council

Find out more...

History

Related Materials

  1. 1.
    DOI - Is published in 10.1016/j.ejor.2017.06.010
  2. 2.
    ISSN - Is published in 03772217

Journal

European Journal of Operational Research

Volume

263

Issue

2

Start page

367

End page

379

Total pages

13

Publisher

Elsevier

Place published

Netherlands

Language

English

Copyright

© 2017 Elsevier B.V. All rights reserved.

Former Identifier

2006101897

Esploro creation date

2020-10-21

Usage metrics

    Scholarly Works

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC