RMIT University
Browse

On the inter-dataset generalization of machine learning approaches to Parkinson's disease detection from voice

journal contribution
posted on 2024-11-03, 10:40 authored by Mate Hires, Peter Drotar, Nemuel Daniel Pah, Quoc Cuong NgoQuoc Cuong Ngo, Dinesh KumarDinesh Kumar
Background and Objective: Parkinson's disease is the second-most-common neurodegenerative disorder that affects motor skills, cognitive processes, mood, and everyday tasks such as speaking and walking. The voices of people with Parkinson's disease may become weak, breathy, or hoarse and may sound emotionless, with slurred words and mumbling. Algorithms for computerized voice analysis have been proposed and have shown highly accurate results. However, these algorithms were developed on single, limited datasets, with participants possessing similar demographics. Such models are prone to overfitting and are unsuitable for generalization, which is essential in real-world applications. Methods: We evaluated the computerized Parkinson's disease diagnosis performance of various machine learning models and showed that these models degraded rapidly when used on different datasets. We evaluated two mainstream state-of-the-art approaches, one based on deep convolutional neural networks and another based on voice feature extraction followed by a shallow classifier (i.e., extreme gradient boosting (XGBoost)). Results: An investigation with four datasets (CzechPD, PC-GITA, ITA, and RMIT-PD) proved that even if the algorithms yielded excellent performance on a single dataset, the results obtained on new data or even a mix of datasets were very unsatisfactory. Conclusions: More work needs to be done to make computerized voice analysis methods for Parkinson's disease diagnosis suitable for real-world applications.

History

Related Materials

  1. 1.
    DOI - Is published in 10.1016/j.ijmedinf.2023.105237
  2. 2.
    ISSN - Is published in 13865056

Journal

International Journal of Medical Informatics

Volume

179

Number

105237

Start page

1

End page

9

Total pages

9

Publisher

Elsevier

Place published

Ireland

Language

English

Copyright

Crown Copyright © 2023 Published by Elsevier B.V. All rights reserved.

Former Identifier

2006126558

Esploro creation date

2023-11-25

Usage metrics

    Scholarly Works

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC