RMIT University
Browse

Lip-reading technique using spatio-temporal templates and support vector machines

chapter
posted on 2024-10-30, 15:55 authored by Wai Chee Yau, Dinesh KumarDinesh Kumar, Tharangini Chinnadurai
This paper presents a lip-reading technique to identify the unspoken phones using support vector machines. The proposed system is based on temporal integration of the video data to generate spatio-temporal templates (STT). 64 Zernike moments (ZM) are extracted from each STT. This work proposes a novel feature selection algorithm to reduce the dimensionality of the 64 ZM to 12 features. The proposed technique uses the shape of probability curve as a goodness measure for optimal feature selection. The feature vectors are classified using non-linear support vector machines.Such a system could be invaluable when it is important to communicate without making a sound, such as giving passwords when in public spaces.

History

Related Materials

  1. 1.
    ISBN - Is published in 9783540859192 (urn:isbn:9783540859192)
  2. 2.

Start page

610

End page

617

Total pages

8

Outlet

Progress in Pattern Recognition, Image Analysis Applications

Editors

Jose Ruiz-Schulcloper; Walter G Kropatsch

Publisher

Springer

Place published

Germany

Language

English

Copyright

© Springer-Verlag Berlin Heidelberg 2008

Former Identifier

2006010088

Esploro creation date

2020-06-22

Fedora creation date

2011-08-18

Usage metrics

    Scholarly Works

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC