RMIT University

Emotional speech synthesis based on prosodic feature modification

journal contribution
posted on 2024-11-01, 14:28 authored by Ling He, Hua Huang, Margaret Lech
The synthesis of emotional speech has wide applications in fields such as human-computer interaction, medicine and industry. In this work, an emotional speech synthesis system is proposed based on prosodic feature modification and the Time Domain Pitch Synchronous OverLap Add (TD-PSOLA) waveform concatenation algorithm. The system produces synthesized speech with four types of emotion: angry, happy, sad and bored. The experimental results show that the proposed emotional speech synthesis system achieves good performance. The produced utterances present clear emotional expression. The subjective test reaches high classification accuracy for the different types of synthesized emotional speech utterances.

Related Materials

  1. DOI - Is published in 10.4236/eng.2013.510B015
  2. ISSN - Is published in 19473931

Journal

Engineering

Volume

5

Start page

73

End page

77

Total pages

5

Publisher

Scientific Research Publishing

Place published

United States

Language

English

Copyright

© 2013 SciRes

Former Identifier

2006043820

Esploro creation date

2020-06-22

Fedora creation date

2014-03-11
