Multi-Scale 2D Temporal Adjacency Networks for Moment Localization With Natural Language
We address the problem of retrieving a specific moment from an untrimmed video by natural language. It is a challenging problem because a target moment may take place in the context of other temporal moments in the untrimmed video. Existing methods cannot tackle this challenge well since they do not...
Bibliographic details
Published in: | IEEE transactions on pattern analysis and machine intelligence. - 1979. - 44(2022), issue 12, 1 Dec., pages 9073-9087 |
Main author: | Zhang, Songyang (author) |
Other authors: | Peng, Houwen, Fu, Jianlong, Lu, Yijuan, Luo, Jiebo |
Format: | Online article |
Language: | English |
Published: | 2022 |
Access to the parent work: | IEEE transactions on pattern analysis and machine intelligence |
Subjects: | Journal Article |