Multi-Turn Video Question Answering via Hierarchical Attention Context Reinforced Networks

Multi-turn video question answering is a challenging task in visual information retrieval, which generates the accurate answer from the referenced video contents according to the visual conversation context and given question. However, the existing visual question answering methods mainly tackle the...

Ausführliche Beschreibung

Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on image processing : a publication of the IEEE Signal Processing Society. - 1992. - 28(2019), 8 vom: 27. Aug., Seite 3860-3872
1. Verfasser: Zhao, Zhou (VerfasserIn)
Weitere Verfasser: Zhang, Zhu, Jiang, Xinghua, Cai, Deng
Format: Online-Aufsatz
Sprache:English
Veröffentlicht: 2019
Zugriff auf das übergeordnete Werk:IEEE transactions on image processing : a publication of the IEEE Signal Processing Society
Schlagworte:Journal Article