RMIT University
Browse

A multi-objective genetic algorithm for text feature selection using the relative discriminative criterion

journal contribution
posted on 2024-11-02, 12:57 authored by Mahdieh Labani, Parham Moradi, Mahdi JaliliMahdi Jalili
With exponentially increasing the number of digital documents, text classification has become a major task in data science applications. Selecting discriminative features highly relevant to class labels while having low levels of redundancy is essential to improve the performance of text classification methods. In this paper, we propose a novel multi-objective algorithm for text feature selection, called Multi-Objective Relative Discriminative Criterion (MORDC), which balances minimal redundant features against those maximally relevant to the target class. The proposed method employs a multi-objective evolutionary framework to search through the solution space. The first objective function measures the relevance of the text features to the target class, whereas the second one evaluates the correlation between the features. None of these objectives use learning to evaluate the goodness of the selected features; thus, the proposed method can be classified as a multivariate filter method. In order to assess the effectiveness of the proposed method, several experiments are performed on three real-world datasets. Comparisons with state-of-the-art feature selection methods show that in most cases MORDC results in better classification performance.

History

Related Materials

  1. 1.
    DOI - Is published in 10.1016/j.eswa.2020.113276
  2. 2.
    ISSN - Is published in 09574174

Journal

Expert Systems with Applications

Volume

149

Number

113276

Start page

1

End page

21

Total pages

21

Publisher

Elsevier Ltd

Place published

United Kingdom

Language

English

Copyright

© 2020 Elsevier Ltd. All rights reserved.

Former Identifier

2006099229

Esploro creation date

2020-09-08

Usage metrics

    Scholarly Works

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC