RMIT University
Browse

Fuzzy regression functions with a noise cluster and the impact of outliers on mainstream machine learning methods in the regression setting

journal contribution
posted on 2024-11-02, 13:49 authored by Srinivas Chakravarty, Haydar DemirhanHaydar Demirhan, Furkan Baser
The presence of outliers in the dependent and/or independent features distorts predictions with machine learning techniques and may lead to erroneous conclusions. It is important to implement methods that are robust against the outliers to make reliable predictions and to know the accuracy of the existing methods when data is contaminated with outliers. The focus of this study is to propose a robust fuzzy regression functions (FRFN) approach against the outliers and evaluate the performance of the proposed and several mainstream machine learning approaches in the presence of outliers for the regression problem. The proposed FRFN approach is based on fuzzy k-means clustering with a noise cluster. We compare the accuracy of Artificial Neural Networks (ANN), Support Vector Machines (SVM) and the proposed FRFN approaches with different training algorithms/kernel functions via simulated and real benchmark datasets. In total, accuracies of 36 ANN, SVM, and FRNF implementations with training algorithms and kernel and loss functions have been evaluated and compared to each other with samples containing outliers via a Monte Carlo simulation setting. It is observed in both Monte Carlo simulations and applications with benchmark dataset that FRFN with ANN trained with Bayes regularization algorithm and FRFN with SVM with Gaussian kernel outperforms the classical implementations of ANN and SVMs under the existence of outliers. The proposed noise cluster implementation considerably increases the robustness of fuzzy regression functions against outliers.

History

Journal

Applied Soft Computing Journal

Volume

96

Number

106535

Start page

1

End page

17

Total pages

17

Publisher

Elsevier

Place published

Netherlands

Language

English

Copyright

© 2020 Elsevier B.V. All rights reserved.

Former Identifier

2006101300

Esploro creation date

2023-04-28

Usage metrics

    Scholarly Works

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC