Human3.6M : Large Scale Datasets and Predictive Methods for 3D Human Sensing in Natural Environments

We introduce a new dataset, Human3.6M, of 3.6 Million accurate 3D Human poses, acquired by recording the performance of 5 female and 6 male subjects, under 4 different viewpoints, for training realistic human sensing systems and for evaluating the next generation of human pose estimation models and...

Ausführliche Beschreibung

Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on pattern analysis and machine intelligence. - 1979. - 36(2014), 7 vom: 01. Juli, Seite 1325-39
1. Verfasser: Ionescu, Catalin (VerfasserIn)
Weitere Verfasser: Papava, Dragos, Olaru, Vlad, Sminchisescu, Cristian
Format: Online-Aufsatz
Sprache:English
Veröffentlicht: 2014
Zugriff auf das übergeordnete Werk:IEEE transactions on pattern analysis and machine intelligence
Schlagworte:Journal Article Research Support, Non-U.S. Gov't
LEADER 01000naa a22002652 4500
001 NLM252591828
003 DE-627
005 20231224164433.0
007 cr uuu---uuuuu
008 231224s2014 xx |||||o 00| ||eng c
024 7 |a 10.1109/TPAMI.2013.248  |2 doi 
028 5 2 |a pubmed24n0842.xml 
035 |a (DE-627)NLM252591828 
035 |a (NLM)26353306 
040 |a DE-627  |b ger  |c DE-627  |e rakwb 
041 |a eng 
100 1 |a Ionescu, Catalin  |e verfasserin  |4 aut 
245 1 0 |a Human3.6M  |b Large Scale Datasets and Predictive Methods for 3D Human Sensing in Natural Environments 
264 1 |c 2014 
336 |a Text  |b txt  |2 rdacontent 
337 |a ƒaComputermedien  |b c  |2 rdamedia 
338 |a ƒa Online-Ressource  |b cr  |2 rdacarrier 
500 |a Date Completed 07.03.2016 
500 |a Date Revised 17.03.2022 
500 |a published: Print 
500 |a Citation Status MEDLINE 
520 |a We introduce a new dataset, Human3.6M, of 3.6 Million accurate 3D Human poses, acquired by recording the performance of 5 female and 6 male subjects, under 4 different viewpoints, for training realistic human sensing systems and for evaluating the next generation of human pose estimation models and algorithms. Besides increasing the size of the datasets in the current state-of-the-art by several orders of magnitude, we also aim to complement such datasets with a diverse set of motions and poses encountered as part of typical human activities (taking photos, talking on the phone, posing, greeting, eating, etc.), with additional synchronized image, human motion capture, and time of flight (depth) data, and with accurate 3D body scans of all the subject actors involved. We also provide controlled mixed reality evaluation scenarios where 3D human models are animated using motion capture and inserted using correct 3D geometry, in complex real environments, viewed with moving cameras, and under occlusion. Finally, we provide a set of large-scale statistical models and detailed evaluation baselines for the dataset illustrating its diversity and the scope for improvement by future work in the research community. Our experiments show that our best large-scale model can leverage our full training set to obtain a 20% improvement in performance compared to a training set of the scale of the largest existing public dataset for this problem. Yet the potential for improvement by leveraging higher capacity, more complex models with our large dataset, is substantially vaster and should stimulate future research. The dataset together with code for the associated large-scale learning models, features, visualization tools, as well as the evaluation server, is available online at http://vision.imar.ro/human3.6m 
650 4 |a Journal Article 
650 4 |a Research Support, Non-U.S. Gov't 
700 1 |a Papava, Dragos  |e verfasserin  |4 aut 
700 1 |a Olaru, Vlad  |e verfasserin  |4 aut 
700 1 |a Sminchisescu, Cristian  |e verfasserin  |4 aut 
773 0 8 |i Enthalten in  |t IEEE transactions on pattern analysis and machine intelligence  |d 1979  |g 36(2014), 7 vom: 01. Juli, Seite 1325-39  |w (DE-627)NLM098212257  |x 1939-3539  |7 nnns 
773 1 8 |g volume:36  |g year:2014  |g number:7  |g day:01  |g month:07  |g pages:1325-39 
856 4 0 |u http://dx.doi.org/10.1109/TPAMI.2013.248  |3 Volltext 
912 |a GBV_USEFLAG_A 
912 |a SYSFLAG_A 
912 |a GBV_NLM 
912 |a GBV_ILN_350 
951 |a AR 
952 |d 36  |j 2014  |e 7  |b 01  |c 07  |h 1325-39