Open-Ended Video Question Answering via Multi-Modal Conditional Adversarial Networks
As a challenging task in visual information retrieval, open-ended long-form video question answering automatically generates the natural language answer from the referenced video content according to the given question. However, the existing video question answering works mainly focus on the short-f...
Ausführliche Beschreibung
Bibliographische Detailangaben
Veröffentlicht in: | IEEE transactions on image processing : a publication of the IEEE Signal Processing Society. - 1992. - (2020) vom: 29. Jan.
|
1. Verfasser: |
Zhao, Zhou
(VerfasserIn) |
Weitere Verfasser: |
Xiao, Shuwen,
Song, Zehan,
Lu, Chujie,
Xiao, Jun,
Zhuang, Yueting |
Format: | Online-Aufsatz
|
Sprache: | English |
Veröffentlicht: |
2020
|
Zugriff auf das übergeordnete Werk: | IEEE transactions on image processing : a publication of the IEEE Signal Processing Society
|
Schlagworte: | Journal Article |