Human3.6M : Large Scale Datasets and Predictive Methods for 3D Human Sensing in Natural Environments

We introduce a new dataset, Human3.6M, of 3.6 Million accurate 3D Human poses, acquired by recording the performance of 5 female and 6 male subjects, under 4 different viewpoints, for training realistic human sensing systems and for evaluating the next generation of human pose estimation models and...

Ausführliche Beschreibung

Bibliographische Detailangaben
Veröffentlicht in:	IEEE transactions on pattern analysis and machine intelligence. - 1979. - 36(2014), 7 vom: 01. Juli, Seite 1325-39
1. Verfasser:	Ionescu, Catalin (VerfasserIn)
Weitere Verfasser:	Papava, Dragos, Olaru, Vlad, Sminchisescu, Cristian
Format:	Online-Aufsatz
Sprache:	English
Veröffentlicht:	2014
Zugriff auf das übergeordnete Werk:	IEEE transactions on pattern analysis and machine intelligence
Schlagworte:	Journal Article Research Support, Non-U.S. Gov't


LEADER	01000naa a22002652 4500
001	NLM252591828
003	DE-627
005	20231224164433.0
007	cr uuu---uuuuu
008	231224s2014 xx \|\|\|\|\|o 00\| \|\|eng c
024	7		\|a 10.1109/TPAMI.2013.248 \|2 doi
028	5	2	\|a pubmed24n0842.xml
035			\|a (DE-627)NLM252591828
035			\|a (NLM)26353306
040			\|a DE-627 \|b ger \|c DE-627 \|e rakwb
041			\|a eng
100	1		\|a Ionescu, Catalin \|e verfasserin \|4 aut
245	1	0	\|a Human3.6M \|b Large Scale Datasets and Predictive Methods for 3D Human Sensing in Natural Environments
264		1	\|c 2014
336			\|a Text \|b txt \|2 rdacontent
337			\|a ƒaComputermedien \|b c \|2 rdamedia
338			\|a ƒa Online-Ressource \|b cr \|2 rdacarrier
500			\|a Date Completed 07.03.2016
500			\|a Date Revised 17.03.2022
500			\|a published: Print
500			\|a Citation Status MEDLINE
520			\|a We introduce a new dataset, Human3.6M, of 3.6 Million accurate 3D Human poses, acquired by recording the performance of 5 female and 6 male subjects, under 4 different viewpoints, for training realistic human sensing systems and for evaluating the next generation of human pose estimation models and algorithms. Besides increasing the size of the datasets in the current state-of-the-art by several orders of magnitude, we also aim to complement such datasets with a diverse set of motions and poses encountered as part of typical human activities (taking photos, talking on the phone, posing, greeting, eating, etc.), with additional synchronized image, human motion capture, and time of flight (depth) data, and with accurate 3D body scans of all the subject actors involved. We also provide controlled mixed reality evaluation scenarios where 3D human models are animated using motion capture and inserted using correct 3D geometry, in complex real environments, viewed with moving cameras, and under occlusion. Finally, we provide a set of large-scale statistical models and detailed evaluation baselines for the dataset illustrating its diversity and the scope for improvement by future work in the research community. Our experiments show that our best large-scale model can leverage our full training set to obtain a 20% improvement in performance compared to a training set of the scale of the largest existing public dataset for this problem. Yet the potential for improvement by leveraging higher capacity, more complex models with our large dataset, is substantially vaster and should stimulate future research. The dataset together with code for the associated large-scale learning models, features, visualization tools, as well as the evaluation server, is available online at http://vision.imar.ro/human3.6m
650		4	\|a Journal Article
650		4	\|a Research Support, Non-U.S. Gov't
700	1		\|a Papava, Dragos \|e verfasserin \|4 aut
700	1		\|a Olaru, Vlad \|e verfasserin \|4 aut
700	1		\|a Sminchisescu, Cristian \|e verfasserin \|4 aut
773	0	8	\|i Enthalten in \|t IEEE transactions on pattern analysis and machine intelligence \|d 1979 \|g 36(2014), 7 vom: 01. Juli, Seite 1325-39 \|w (DE-627)NLM098212257 \|x 1939-3539 \|7 nnns
773	1	8	\|g volume:36 \|g year:2014 \|g number:7 \|g day:01 \|g month:07 \|g pages:1325-39
856	4	0	\|u http://dx.doi.org/10.1109/TPAMI.2013.248 \|3 Volltext
912			\|a GBV_USEFLAG_A
912			\|a SYSFLAG_A
912			\|a GBV_NLM
912			\|a GBV_ILN_350
951			\|a AR
952			\|d 36 \|j 2014 \|e 7 \|b 01 \|c 07 \|h 1325-39