RMIT University
Browse

Robustness of Principal Component Analysis with Spearman’s Rank Matrix

journal contribution
posted on 2024-11-03, 11:22 authored by Kodai Watanabe, Kanta Naito, Ingeborg KochIngeborg Koch
This paper is concerned with robust principal component analysis (PCA) based on spatial sign and spatial rank vectors. The most common PC approach is based on the eigenvectors of the sample covariance matrix; however, this approach is known to be sensitive to outliers. Several robust alternatives based on spatial sign and spatial rank vectors have been discussed including Kendall’s tau or Marden’s rank and Spearman’s rank matrices. Our aims in this paper are to investigate properties of PCA based on Spearman’s rank matrix from theoretical and practical view points and to compare the performance of PCA based on these robust alternatives. A concentration inequality for an estimator of Spearman’s rank matrix is derived, which reveals consistency of the estimator. The influence functions for eigenvalues and eigenvectors of Spearman’s rank matrix are examined which establish the asymptotics for PCA based on Spearman’s rank matrix. Monte Carlo simulations and application to a real dataset show that the performance of PCA using Spearman’s rank matrix is comparable to or sometime better than that of Kendall’s tau and Marden’s rank matrices.

History

Related Materials

  1. 1.
    DOI - Is published in 10.1007/s42519-023-00358-z
  2. 2.
    ISSN - Is published in 15598608

Journal

Journal of Statistical Theory and Practice

Volume

18

Number

6

Issue

1

Start page

1

End page

29

Total pages

29

Publisher

Springer

Place published

Germany

Language

English

Copyright

© Grace Scientific Publishing 2024. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s)

Former Identifier

2006128110

Esploro creation date

2024-02-10

Usage metrics

    Scholarly Works

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC