RMIT University
Browse

On optimal modelling of speech spectral transitions

conference contribution
posted on 2024-10-30, 14:28 authored by Chandranath Athaudage, Margaret LechMargaret Lech
In this paper, we propose an optimal spectral transition modelling technique for speech. The proposed technique optimizes the spectral interpolation trajectory by minimizing the mean-square-error of spectral parameters on a frame-by-frame basis. The performance of the proposed techniques is compared with that of two spectral interpolation techniques, namely the linear interpolation and the Gaussian interpolation, reported in literature. Line spectral frequencies are used as the short-term spectral parameter representation of the speech signal. The regions between maximally stable (stationary) frames in the spectral parameter sequence are identified as the regions of spectral transitions. Numerical results show that both linear and Gaussian interpolation techniques have similar modelling performance in terms of average spectral distortion. The proposed optimal technique shows an improved modelling accuracy in terms of average spectral distortion (up to 1 dB improvement), in comparison to that of the linear and Gaussian techniques. The proposed technique can be useful for speech processing applications such as coding and recognition.

History

Start page

1130

End page

1134

Total pages

5

Outlet

4th International Conference on Information, Communications and Signal Processing

Name of conference

ICICS - PCM

Publisher

IEEE

Place published

Singapore

Start date

2003-12-15

End date

2003-12-18

Language

English

Copyright

© 2003 IEEE

Former Identifier

2003001935

Esploro creation date

2020-06-22

Fedora creation date

2010-08-09

Usage metrics

    Scholarly Works

    Keywords

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC