Multi-Moments in Time : Learning and Interpreting Models for Multi-Action Video Understanding

Videos capture events that typically contain multiple sequential, and simultaneous, actions even in the span of only a few seconds. However, most large-scale datasets built to train models for action recognition in video only provide a single label per video. Consequently, models can be incorrectly...

Ausführliche Beschreibung

Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on pattern analysis and machine intelligence. - 1979. - 44(2022), 12 vom: 09. Dez., Seite 9434-9445
1. Verfasser: Monfort, Mathew (VerfasserIn)
Weitere Verfasser: Pan, Bowen, Ramakrishnan, Kandan, Andonian, Alex, McNamara, Barry A, Lascelles, Alex, Fan, Quanfu, Gutfreund, Dan, Feris, Rogerio Schmidt, Oliva, Aude
Format: Online-Aufsatz
Sprache:English
Veröffentlicht: 2022
Zugriff auf das übergeordnete Werk:IEEE transactions on pattern analysis and machine intelligence
Schlagworte:Journal Article Research Support, Non-U.S. Gov't