RMIT University
Browse

Artificial bandwidth extension to improve automatic emotion recognition from narrow-band coded speech

conference contribution
posted on 2024-10-31, 20:40 authored by Abas Albahri, Catherine Sandoval Rodriguez, Margaret LechMargaret Lech
Narrow-band speech coding techniques were previously found to reduce the accuracy of automatic Speech Emotion Recognition (SER), as well as speech and speaker recognition rates. Artificial Bandwidth Extension (ABE) based on spectral folding and spectral envelope estimation has been applied to compressed narrowband speech to test if an improvement in SER can be achieved. The modelling and classification of speech was performed with a benchmark approach based on the GMM classifier and a set of speech acoustic parameters including MFCCs, TEO and glottal parameters. The tests used the Berlin Emotional Speech data base. In general, ABE led to an improvement of SER accuracy; however the amount of improvement varied between different features, genders, and speech compression rates. In all cases, SER accuracy with ABE was at least 10% lower than for uncompressed speech.

History

Related Materials

  1. 1.
    DOI - Is published in 10.1109/ICSPCS.2016.7843305
  2. 2.
    ISBN - Is published in 9781509009411 (urn:isbn:9781509009411)

Start page

1

End page

7

Total pages

7

Outlet

Proceedings of the 10th International Conference on Signal Processing and Communication Systems, (ICSPCS 2016)

Editors

Tadeusz A Wysocki and Beata J Wysocki

Name of conference

ICSPCS 2016

Publisher

IEEE

Place published

United States

Start date

2016-12-19

End date

2016-12-21

Language

English

Copyright

© 2016 IEEE

Former Identifier

2006071049

Esploro creation date

2020-06-22

Fedora creation date

2017-03-06

Usage metrics

    Scholarly Works

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC