Multi-Turn Video Question Answering via Hierarchical Attention Context Reinforced Networks

Multi-turn video question answering is a challenging task in visual information retrieval that generates an accurate answer from the referenced video content according to the visual conversation context and the given question. However, the existing visual question answering methods mainly tackle the...

Bibliographic details
Published in: IEEE transactions on image processing : a publication of the IEEE Signal Processing Society. - 1992. - 28(2019), no. 8, 27 Aug., pp. 3860-3872
Main author: Zhao, Zhou (Author)
Other authors: Zhang, Zhu; Jiang, Xinghua; Cai, Deng
Format: Online article
Language: English
Published: 2019
Collection access: IEEE transactions on image processing : a publication of the IEEE Signal Processing Society
Subjects: Journal Article