Learning SpatioTemporal and Motion Features in a Unified 2D Network for Action Recognition

Recent methods for action recognition always apply 3D Convolutional Neural Networks (CNNs) to extract spatiotemporal features and introduce optical flows to present motion features. Although achieving state-of-the-art performance, they are expensive in both time and space. In this paper, we propose...

Ausführliche Beschreibung

Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on pattern analysis and machine intelligence. - 1979. - 45(2023), 3 vom: 01. März, Seite 3347-3362
1. Verfasser: Wang, Mengmeng (VerfasserIn)
Weitere Verfasser: Xing, Jiazheng, Su, Jing, Chen, Jun, Liu, Yong
Format: Online-Aufsatz
Sprache:English
Veröffentlicht: 2023
Zugriff auf das übergeordnete Werk:IEEE transactions on pattern analysis and machine intelligence
Schlagworte:Journal Article