Self-Supervised Video Representation Learning by Uncovering Spatio-Temporal Statistics
This paper proposes a novel pretext task to address the self-supervised video representation learning problem. Specifically, given an unlabeled video clip, we compute a series of spatio-temporal statistical summaries, such as the spatial location and dominant direction of the largest motion, the spa...
Ausführliche Beschreibung
Bibliographische Detailangaben
Veröffentlicht in: | IEEE transactions on pattern analysis and machine intelligence. - 1979. - 44(2022), 7 vom: 02. Juli, Seite 3791-3806
|
1. Verfasser: |
Wang, Jiangliu
(VerfasserIn) |
Weitere Verfasser: |
Jiao, Jianbo,
Bao, Linchao,
He, Shengfeng,
Liu, Wei,
Liu, Yun-Hui |
Format: | Online-Aufsatz
|
Sprache: | English |
Veröffentlicht: |
2022
|
Zugriff auf das übergeordnete Werk: | IEEE transactions on pattern analysis and machine intelligence
|
Schlagworte: | Journal Article
Research Support, Non-U.S. Gov't |