A support vector machine-based dynamic network for visual speech recognition applications (Q1424532)

From MaRDI portal
scientific article
Language Label Description Also known as
English
A support vector machine-based dynamic network for visual speech recognition applications
scientific article

    Statements

    A support vector machine-based dynamic network for visual speech recognition applications (English)
    0 references
    0 references
    0 references
    16 March 2004
    0 references
    Summary: Visual speech recognition is an emerging research field. In this paper, we examine the suitability of support vector machines for visual speech recognition. Each word is modeled as a temporal sequence of visemes corresponding to the different phones realized. One support vector machine is trained to recognize each viseme and its output is converted to a posterior probability through a sigmoidal mapping. To model the temporal character of speech, the support vector machines are integrated as nodes into a Viterbi lattice. We test the performance of the proposed approach on a small visual speech recognition task, namely the recognition of the first four digits in English. The word recognition rate obtained is at the level of the previons best reported rates.
    0 references
    mouth shape recognition
    0 references
    visemes
    0 references
    support vector machines
    0 references
    Viterbi lattice
    0 references

    Identifiers