RMIT University

UFSSF - An Efficient Unsupervised Feature Selection for Streaming Features

conference contribution
posted on 2024-10-31, 22:05 authored by Naif Almusallam, Zahir Tari, Jeffrey Chan, Adil Al-Harthi
Streaming-feature applications pose challenges for feature selection. In such dynamic-feature applications: (a) features are generated sequentially and processed one by one upon arrival, while the number of instances/points remains fixed; and (b) the complete feature space is not known in advance. Existing approaches require class labels as a guide to select the representative features. However, in real-world applications most data are not labeled and, moreover, manual labeling is costly. A new algorithm, called Unsupervised Feature Selection for Streaming Features (UFSSF), is proposed in this paper to select representative features in streaming-feature applications without the need to know the features or class labels in advance. UFSSF extends the k-means clustering algorithm to include linearly dependent similarity measures so as to incrementally decide whether to add a newly arrived feature to the existing set of representative features. Features that are not representative are discarded. Experimental results indicate that UFSSF achieves significantly better prediction accuracy and running time than the baseline approaches.
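
The incremental decision described in the abstract can be illustrated with a minimal sketch. This is not the UFSSF algorithm itself: the abstract does not give its details, so the sketch swaps in a simple threshold rule instead of the k-means extension. It assumes absolute Pearson correlation as the linear-dependence similarity measure, and the function names, the `threshold` parameter and the toy stream are all hypothetical.

```python
import numpy as np


def abs_pearson(a, b):
    """Absolute Pearson correlation; returns 0.0 if either vector is constant."""
    a = a - a.mean()
    b = b - b.mean()
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    if denom == 0.0:
        return 0.0
    return abs(float(a @ b) / denom)


def select_streaming_features(feature_stream, threshold=0.9):
    """Process features one at a time and keep a feature only if it is not
    strongly linearly dependent on any feature kept so far.

    Assumption: a simplified threshold rule, not the k-means extension
    used by UFSSF. No class labels are used at any point.
    """
    kept_ids, kept_cols = [], []
    for feature_id, column in feature_stream:
        column = np.asarray(column, dtype=float)
        if column.std() == 0.0:
            continue  # a constant feature carries no information; discard it
        if all(abs_pearson(column, rep) < threshold for rep in kept_cols):
            kept_ids.append(feature_id)
            kept_cols.append(column)
        # otherwise the feature is redundant and is discarded
    return kept_ids


# Toy stream: the number of instances (200) is fixed, features arrive one by one.
rng = np.random.default_rng(0)
x = rng.normal(size=200)
y = rng.normal(size=200)
stream = [("f0", x), ("f1", 2 * x + 1), ("f2", y), ("f3", -y)]
selected = select_streaming_features(stream)
```

In this toy stream, `f1` and `f3` are exact linear functions of `f0` and `f2`, so the rule treats them as redundant. The whole feature space is never held in memory at once: only the kept representatives are compared against each new arrival.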

History

Start page

495

End page

507

Total pages

13

Outlet

Proceedings of the 22nd Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2018) Part II

Editors

Dinh Phung, Vincent S. Tseng, Geoffrey I. Webb, Bao Ho, Mohadeseh Ganji, Lida Rashidi

Name of conference

PAKDD 2018: Lecture Notes in Artificial Intelligence Volume 10938

Publisher

Springer

Place published

Cham, Switzerland

Start date

2018-06-03

End date

2018-06-06

Language

English

Copyright

© Springer International Publishing AG, part of Springer Nature 2018

Former Identifier

2006086902

Esploro creation date

2020-06-22

Fedora creation date

2018-12-10
