Self-Supervised Video Representation Learning by Uncovering Spatio-Temporal Statistics

This paper proposes a novel pretext task to address the self-supervised video representation learning problem. Specifically, given an unlabeled video clip, we compute a series of spatio-temporal statistical summaries, such as the spatial location and dominant direction of the largest motion, the spa...

Bibliographic Details
Published in: IEEE transactions on pattern analysis and machine intelligence. - 1979. - 44(2022), 7, 2 July, pages 3791-3806
Main author: Wang, Jiangliu (author)
Other authors: Jiao, Jianbo; Bao, Linchao; He, Shengfeng; Liu, Wei; Liu, Yun-Hui
Format: Online article
Language: English
Published: 2022
Access to the parent work: IEEE transactions on pattern analysis and machine intelligence
Subjects: Journal Article; Research Support, Non-U.S. Gov't