Long-Form Video Question Answering via Dynamic Hierarchical Reinforced Networks
Open-ended long-form video question answering is a challenging task in visual information retrieval, which automatically generates a natural language answer from the referenced long-form video contents according to a given question. However, the existing works mainly focus on short-form video questi...
Ausführliche Beschreibung
Bibliographische Detailangaben
Veröffentlicht in: | IEEE transactions on image processing : a publication of the IEEE Signal Processing Society. - 1992. - 28(2019), 12 vom: 14. Dez., Seite 5939-5952
|
1. Verfasser: |
Zhao, Zhou
(VerfasserIn) |
Weitere Verfasser: |
Zhang, Zhu,
Xiao, Shuwen,
Xiao, Zhenxin,
Yan, Xiaohui,
Yu, Jun,
Cai, Deng,
Wu, Fei |
Format: | Online-Aufsatz
|
Sprache: | English |
Veröffentlicht: |
2019
|
Zugriff auf das übergeordnete Werk: | IEEE transactions on image processing : a publication of the IEEE Signal Processing Society
|
Schlagworte: | Journal Article |