Deep Learning for Visual Speech Analysis : A Survey

Visual speech, referring to the visual domain of speech, has attracted increasing attention due to its wide applications, such as public security, medical treatment, military defense, and film entertainment. As a powerful AI strategy, deep learning techniques have extensively promoted the developmen...

Ausführliche Beschreibung

Bibliographische Detailangaben
Veröffentlicht in:	IEEE transactions on pattern analysis and machine intelligence. - 1979. - 46(2024), 9 vom: 01. Aug., Seite 6001-6022
1. Verfasser:	Sheng, Changchong (VerfasserIn)
Weitere Verfasser:	Kuang, Gangyao, Bai, Liang, Hou, Chenping, Guo, Yulan, Xu, Xin, Pietikainen, Matti, Liu, Li
Format:	Online-Aufsatz
Sprache:	English
Veröffentlicht:	2024
Zugriff auf das übergeordnete Werk:	IEEE transactions on pattern analysis and machine intelligence
Schlagworte:	Journal Article Review


LEADER	01000caa a22002652 4500
001	NLM369681509
003	DE-627
005	20240808232742.0
007	cr uuu---uuuuu
008	240315s2024 xx \|\|\|\|\|o 00\| \|\|eng c
024	7		\|a 10.1109/TPAMI.2024.3376710 \|2 doi
028	5	2	\|a pubmed24n1495.xml
035			\|a (DE-627)NLM369681509
035			\|a (NLM)38478434
040			\|a DE-627 \|b ger \|c DE-627 \|e rakwb
041			\|a eng
100	1		\|a Sheng, Changchong \|e verfasserin \|4 aut
245	1	0	\|a Deep Learning for Visual Speech Analysis \|b A Survey
264		1	\|c 2024
336			\|a Text \|b txt \|2 rdacontent
337			\|a ƒaComputermedien \|b c \|2 rdamedia
338			\|a ƒa Online-Ressource \|b cr \|2 rdacarrier
500			\|a Date Completed 07.08.2024
500			\|a Date Revised 08.08.2024
500			\|a published: Print-Electronic
500			\|a Citation Status MEDLINE
520			\|a Visual speech, referring to the visual domain of speech, has attracted increasing attention due to its wide applications, such as public security, medical treatment, military defense, and film entertainment. As a powerful AI strategy, deep learning techniques have extensively promoted the development of visual speech learning. Over the past five years, numerous deep learning based methods have been proposed to address various problems in this area, especially automatic visual speech recognition and generation. To push forward future research on visual speech, this paper will present a comprehensive review of recent progress in deep learning methods on visual speech analysis. We cover different aspects of visual speech, including fundamental problems, challenges, benchmark datasets, a taxonomy of existing methods, and state-of-the-art performance. Besides, we also identify gaps in current research and discuss inspiring future research directions
650		4	\|a Journal Article
650		4	\|a Review
700	1		\|a Kuang, Gangyao \|e verfasserin \|4 aut
700	1		\|a Bai, Liang \|e verfasserin \|4 aut
700	1		\|a Hou, Chenping \|e verfasserin \|4 aut
700	1		\|a Guo, Yulan \|e verfasserin \|4 aut
700	1		\|a Xu, Xin \|e verfasserin \|4 aut
700	1		\|a Pietikainen, Matti \|e verfasserin \|4 aut
700	1		\|a Liu, Li \|e verfasserin \|4 aut
773	0	8	\|i Enthalten in \|t IEEE transactions on pattern analysis and machine intelligence \|d 1979 \|g 46(2024), 9 vom: 01. Aug., Seite 6001-6022 \|w (DE-627)NLM098212257 \|x 1939-3539 \|7 nnns
773	1	8	\|g volume:46 \|g year:2024 \|g number:9 \|g day:01 \|g month:08 \|g pages:6001-6022
856	4	0	\|u http://dx.doi.org/10.1109/TPAMI.2024.3376710 \|3 Volltext
912			\|a GBV_USEFLAG_A
912			\|a SYSFLAG_A
912			\|a GBV_NLM
912			\|a GBV_ILN_350
951			\|a AR
952			\|d 46 \|j 2024 \|e 9 \|b 01 \|c 08 \|h 6001-6022