Discriminative analysis of lip motion features for speaker identification and speech-reading

There have been several studies that jointly use audio, lip intensity, and lip geometry information for speaker identification and speech-reading applications. This paper proposes using explicit lip motion information, instead of or in addition to lip intensity and/or geometry information, for speak...

Ausführliche Beschreibung

Bibliographische Detailangaben
Veröffentlicht in:	IEEE transactions on image processing : a publication of the IEEE Signal Processing Society. - 1992. - 15(2006), 10 vom: 08. Okt., Seite 2879-91
1. Verfasser:	Cetingül, H Ertan (VerfasserIn)
Weitere Verfasser:	Yemez, Yücel, Erzin, Engin, Tekalp, A Murat
Format:	Aufsatz
Sprache:	English
Veröffentlicht:	2006
Zugriff auf das übergeordnete Werk:	IEEE transactions on image processing : a publication of the IEEE Signal Processing Society
Schlagworte:	Journal Article Research Support, Non-U.S. Gov't


LEADER	01000naa a22002652 4500
001	NLM165808055
003	DE-627
005	20231223105505.0
007	tu
008	231223s2006 xx \|\|\|\|\| 00\| \|\|eng c
028	5	2	\|a pubmed24n0553.xml
035			\|a (DE-627)NLM165808055
035			\|a (NLM)17022256
040			\|a DE-627 \|b ger \|c DE-627 \|e rakwb
041			\|a eng
100	1		\|a Cetingül, H Ertan \|e verfasserin \|4 aut
245	1	0	\|a Discriminative analysis of lip motion features for speaker identification and speech-reading
264		1	\|c 2006
336			\|a Text \|b txt \|2 rdacontent
337			\|a ohne Hilfsmittel zu benutzen \|b n \|2 rdamedia
338			\|a Band \|b nc \|2 rdacarrier
500			\|a Date Completed 20.11.2006
500			\|a Date Revised 26.10.2019
500			\|a published: Print
500			\|a Citation Status MEDLINE
520			\|a There have been several studies that jointly use audio, lip intensity, and lip geometry information for speaker identification and speech-reading applications. This paper proposes using explicit lip motion information, instead of or in addition to lip intensity and/or geometry information, for speaker identification and speech-reading within a unified feature selection and discrimination analysis framework, and addresses two important issues: 1) Is using explicit lip motion information useful, and, 2) if so, what are the best lip motion features for these two applications? The best lip motion features for speaker identification are considered to be those that result in the highest discrimination of individual speakers in a population, whereas for speech-reading, the best features are those providing the highest phoneme/word/phrase recognition rate. Several lip motion feature candidates have been considered including dense motion features within a bounding box about the lip, lip contour motion features, and combination of these with lip shape features. Furthermore, a novel two-stage, spatial, and temporal discrimination analysis is introduced to select the best lip motion features for speaker identification and speech-reading applications. Experimental results using an hidden-Markov-model-based recognition system indicate that using explicit lip motion information provides additional performance gains in both applications, and lip motion features prove more valuable in the case of speech-reading application
650		4	\|a Journal Article
650		4	\|a Research Support, Non-U.S. Gov't
700	1		\|a Yemez, Yücel \|e verfasserin \|4 aut
700	1		\|a Erzin, Engin \|e verfasserin \|4 aut
700	1		\|a Tekalp, A Murat \|e verfasserin \|4 aut
773	0	8	\|i Enthalten in \|t IEEE transactions on image processing : a publication of the IEEE Signal Processing Society \|d 1992 \|g 15(2006), 10 vom: 08. Okt., Seite 2879-91 \|w (DE-627)NLM09821456X \|x 1941-0042 \|7 nnns
773	1	8	\|g volume:15 \|g year:2006 \|g number:10 \|g day:08 \|g month:10 \|g pages:2879-91
912			\|a GBV_USEFLAG_A
912			\|a SYSFLAG_A
912			\|a GBV_NLM
912			\|a GBV_ILN_350
951			\|a AR
952			\|d 15 \|j 2006 \|e 10 \|b 08 \|c 10 \|h 2879-91