RMIT University
Browse

Deep convolutional neural network for detection of pathological speech

conference contribution
posted on 2024-11-03, 13:34 authored by Lukas Vavrek, Mate Hires, Dinesh KumarDinesh Kumar, Peter Drotar
This paper describes the investigation of the use of the deep neural networks (DNN) for the detection of pathological speech. The state-of-the-art VGG16 convolutional neural network based transfer learning was the basis of this work and different approaches were trialed. We tested the different architectures using the Saarbrucken Voice database (SVD). To overcome limitations due to language and education, the SVD was limited to/a/,/i/and/u/vowel subsets with sustained natural pitch. The scope of this study was only diseases that classify as organic dysphonia. We utilized multiple simple networks trained separately on different vowel subsets and combined them as a single model ensemble. It was found that model ensemble achieved an accuracy on pathological speech detection of 82 %. Thus, our results show that pre-trained convolutional neural networks can be used for transfer learning when input is the spectrogram representation of the voice signal. This is significant because it overcomes the need for very large data size that is required to train DNN, and is suitable for computerized analysis of the speech without limitation of the language skills of the patients.

History

Related Materials

  1. 1.
    DOI - Is published in 10.1109/SAMI50585.2021.9378656
  2. 2.
    ISBN - Is published in 9781728180533 (urn:isbn:9781728180533)

Number

9378656

Start page

245

End page

249

Total pages

5

Outlet

Proceedings of the IEEE 19th World Symposium on Applied Machine Intelligence and Informatics (SAMI 2021)

Name of conference

SAMI 2021

Publisher

IEEE

Place published

United States

Start date

2021-01-21

End date

2021-01-23

Language

English

Copyright

© 2021 IEEE.

Former Identifier

2006106195

Esploro creation date

2021-05-22

Usage metrics

    Scholarly Works

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC