Long-Form Video Question Answering via Dynamic Hierarchical Reinforced Networks

Open-ended long-form video question answering is a challenging task in visual information retrieval, which automatically generates a natural language answer from the referenced long-form video contents according to a given question. However, the existing works mainly focus on short-form video questi...

Ausführliche Beschreibung

Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on image processing : a publication of the IEEE Signal Processing Society. - 1992. - 28(2019), 12 vom: 14. Dez., Seite 5939-5952
1. Verfasser: Zhao, Zhou (VerfasserIn)
Weitere Verfasser: Zhang, Zhu, Xiao, Shuwen, Xiao, Zhenxin, Yan, Xiaohui, Yu, Jun, Cai, Deng, Wu, Fei
Format: Online-Aufsatz
Sprache:English
Veröffentlicht: 2019
Zugriff auf das übergeordnete Werk:IEEE transactions on image processing : a publication of the IEEE Signal Processing Society
Schlagworte:Journal Article