Focal Visual-Text Attention for Memex Question Answering

Recent insights on language and vision with neural networks have been successfully applied to simple single-image visual question answering. However, to tackle real-life question answering problems on multimedia collections such as personal photo albums, we have to look at whole collections with seq...

Ausführliche Beschreibung

Bibliographische Detailangaben
Veröffentlicht in:	IEEE transactions on pattern analysis and machine intelligence. - 1979. - 41(2019), 8 vom: 09. Aug., Seite 1893-1908
1. Verfasser:	Liang, Junwei (VerfasserIn)
Weitere Verfasser:	Jiang, Lu, Cao, Liangliang, Kalantidis, Yannis, Li, Li-Jia, Hauptmann, Alexander G
Format:	Online-Aufsatz
Sprache:	English
Veröffentlicht:	2019
Zugriff auf das übergeordnete Werk:	IEEE transactions on pattern analysis and machine intelligence
Schlagworte:	Journal Article Research Support, Non-U.S. Gov't Research Support, U.S. Gov't, Non-P.H.S.

Online verfügbar	Volltext