RMIT University
Browse

KRNN: k rare-class nearest neighbor classification

journal contribution
posted on 2024-11-02, 02:53 authored by Xiuzhen ZhangXiuzhen Zhang, Yuxuan Li, Ramamohanarao Kotagiri, Lifang Wu, Zahir TariZahir Tari, Mohamed Cheriet
Imbalanced classification is a challenging problem. Re-sampling and cost-sensitive learning are global strategies for generality-oriented algorithms such as the decision tree, targeting inter-class imbalance. We research local strategies for the specificity-oriented learning algorithms like the k Nearest Neighbour (KNN) to address the within-class imbalance issue of positive data sparsity. We propose an algorithm k Rare-class Nearest Neighbour, or KRNN, by directly adjusting the induction bias of KNN. We propose to form dynamic query neighbourhoods, and to further adjust the positive posterior probability estimation to bias classification towards the rare class. We conducted extensive experiments on thirty real-world and artificial datasets to evaluate the performance of KRNN. Our experiments showed that KRNN significantly improved KNN for classification of the rare class, and often outperformed re-sampling and cost-sensitive learning strategies with generality-oriented base learners.

Funding

Data mining complex transactional and criminal networks

Australian Research Council

Find out more...

History

Journal

Pattern Recognition

Volume

62

Start page

33

End page

44

Total pages

12

Publisher

Elsevier

Place published

Netherlands

Language

English

Copyright

© 2016 Elsevier. All rights reserved.

Former Identifier

2006069249

Esploro creation date

2020-06-22

Fedora creation date

2017-01-05

Usage metrics

    Scholarly Works

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC