Multistream articulatory feature-based models for visual speech recognition
We study the problem of automatic visual speech recognition (VSR) using dynamic Bayesian network (DBN)-based models consisting of multiple sequences of hidden states, each corresponding to an articulatory feature (AF) such as lip opening (LO) or lip rounding (LR). A bank of discriminative articulato...
Bibliographic details
Published in: | IEEE transactions on pattern analysis and machine intelligence. - 1979. - 31(2009), 9, 15 Sept., pages 1700-7
|
Main author: |
Saenko, Kate
(Author) |
Other authors: |
Livescu, Karen,
Glass, James,
Darrell, Trevor |
Format: | Online article
|
Language: | English |
Published: |
2009
|
Access to the parent work: | IEEE transactions on pattern analysis and machine intelligence
|
Subjects: | Evaluation Study
Journal Article
Research Support, Non-U.S. Gov't
Research Support, U.S. Gov't, Non-P.H.S. |