RMIT University
Browse

Learning the naive Bayes classifier with optimization models

journal contribution
posted on 2024-11-02, 13:53 authored by Sona TaheriSona Taheri, Musa Mammadov
Naive Bayes is among the simplest probabilistic classifiers. It often performs surprisingly well in many real world applications, despite the strong assumption that all features are conditionally independent given the class. In the learning process of this classifier with the known structure, class probabilities and conditional probabilities are calculated using training data, and then values of these probabilities are used to classify new observations. In this paper, we introduce three novel optimization models for the naive Bayes classifier where both class probabilities and conditional probabilities are considered as variables. The values of these variables are found by solving the corresponding optimization problems. Numerical experiments are conducted on several real world binary classification data sets, where continuous features are discretized by applying three different methods. The performances of these models are compared with the naive Bayes classifier, tree augmented naive Bayes, the SVM, C4.5 and the nearest neighbor classifier. The obtained results demonstrate that the proposed models can significantly improve the performance of the naive Bayes classifier, yet at the same time maintain its simple structure.

History

Related Materials

  1. 1.
    DOI - Is published in 10.2478/amcs-2013-0059
  2. 2.
    ISSN - Is published in 1641876X

Journal

International Journal of Applied Mathematics and Computer Science

Volume

23

Issue

4

Start page

787

End page

795

Total pages

9

Publisher

University of Zielona Gora Walter de Gruyter

Place published

Poland

Language

English

Copyright

© AMCS 2020 Open Access. The non-commercial use of the article will be governed by the Creative Commons Attribution-NonCommercial-NoDerivs license as currently displayed on http://creativecommons.org/licenses/by-nc-nd/3.0/.

Former Identifier

2006101935

Esploro creation date

2020-10-21

Usage metrics

    Scholarly Works

    Categories

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC