Action-Stage Emphasized Spatio-Temporal VLAD for Video Action Recognition

Despite outstanding performance in image recognition, convolutional neural networks (CNNs) do not yet achieve the same impressive results on action recognition in videos. This is partially due to the inability of CNN for modeling long-range temporal structures especially those involving individual a...

Ausführliche Beschreibung

Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on image processing : a publication of the IEEE Signal Processing Society. - 1992. - (2019) vom: 03. Jan.
1. Verfasser: Tu, Zhigang (VerfasserIn)
Weitere Verfasser: Li, Hongyan, Zhang, Dejun, Dauwels, Justin, Li, Baoxin, Yuan, Junsong
Format: Online-Aufsatz
Sprache:English
Veröffentlicht: 2019
Zugriff auf das übergeordnete Werk:IEEE transactions on image processing : a publication of the IEEE Signal Processing Society
Schlagworte:Journal Article