TSM : Temporal Shift Module for Efficient and Scalable Video Understanding on Edge Devices

The explosive growth in video streaming requires video understanding at high accuracy and low computation cost. Conventional 2D CNNs are computationally cheap but cannot capture temporal relationships; 3D CNN based methods can achieve good performance but are computationally intensive. In this paper...

Detailed description

Bibliographic details
Published in: IEEE transactions on pattern analysis and machine intelligence. - 1979. - 44(2022), no. 5, 9 May, pages 2760-2774
Main author: Lin, Ji (author)
Other authors: Gan, Chuang; Wang, Kuan; Han, Song
Format: Online article
Language: English
Published: 2022
Access to the parent work: IEEE transactions on pattern analysis and machine intelligence
Subjects: Journal Article