RMIT University

Visual recognition of speech consonants using facial movement features

journal contribution
posted on 2024-11-01, 05:39 authored by Wai Yau, Dinesh Kumar, Sridhar Poosapadi Arjunan
This paper presents a visual speech recognition technique using facial movement video. The acoustic signals of consonants are easily confused in noisy environments. To overcome this shortcoming, the paper focuses on identifying consonants from visual information and investigates the feasibility of using facial movements to identify phonemes. The proposed approach adopts a visual speech model based on the viseme model of the Moving Picture Experts Group 4 (MPEG-4) standard. It is a movement-based system: the facial movements are segmented from the video using an accumulative image subtraction method that yields a 2-D grayscale motion history image (MHI). Features are extracted from the MHI using a combination of the discrete stationary wavelet transform (SWT) and image moments (Hu moments, geometric moments and Zernike moments). Feedforward multilayer perceptron (MLP) neural networks with a backpropagation (BPN) learning algorithm classify the features, which allows the performance of the three moment features to be compared. The experimental results indicate that Zernike moments have better representation ability and provide rotation invariance for the proposed application. The results also demonstrate that the proposed technique can identify consonants reliably using the viseme model of the MPEG-4 standard, with a recognition rate of 85%.
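The accumulative image subtraction step mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `motion_history_image`, the `threshold` value, and the linear recency weighting (later motion appears brighter) are assumptions, chosen because they are a common way to build a grayscale MHI. Frames are plain nested lists of grayscale intensities so the example has no dependencies.

```python
def motion_history_image(frames, threshold=25):
    """Collapse a grayscale frame sequence into one motion history image.

    frames: list of T frames, each a list of rows of pixel intensities.
    threshold: hypothetical cutoff above which a pixel change counts as motion.
    Returns an H x W nested list with values in 0..255.
    """
    if len(frames) < 2:
        raise ValueError("need at least two frames")
    height, width = len(frames[0]), len(frames[0][0])
    mhi = [[0] * width for _ in range(height)]

    for t in range(1, len(frames)):
        prev, curr = frames[t - 1], frames[t]
        for y in range(height):
            for x in range(width):
                # Binarised difference of consecutive frames.
                if abs(curr[y][x] - prev[y][x]) >= threshold:
                    # Stamp the pixel with the time index; because t only
                    # increases, this keeps the most recent motion per pixel.
                    mhi[y][x] = t

    # Rescale time stamps to the 0..255 grayscale range (integer division).
    n_diffs = len(frames) - 1
    return [[(255 * v) // n_diffs for v in row] for row in mhi]


if __name__ == "__main__":
    # Three tiny 2x2 frames: one pixel changes early, another changes late.
    frames = [
        [[0, 0], [0, 0]],
        [[100, 0], [0, 0]],
        [[100, 0], [0, 100]],
    ]
    for row in motion_history_image(frames):
        print(row)
```

In this sketch a pixel that never moves stays black, and pixels that moved later in the utterance are brighter than those that moved earlier, so a single image carries both where and roughly when the face moved. Wavelet and moment features such as those named in the abstract would then be computed from that image.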

History

Journal

Integrated Computer-Aided Engineering

Volume

14

Issue

1

Start page

49

End page

61

Total pages

13

Publisher

IOS Press

Place published

Amsterdam, The Netherlands

Language

English

Copyright

© 2007 - IOS Press and the author(s). All rights reserved.

Former Identifier

2006007388

Esploro creation date

2020-06-22

Fedora creation date

2010-12-06
