Focal Visual-Text Attention for Memex Question Answering

Recent insights on language and vision with neural networks have been successfully applied to simple single-image visual question answering. However, to tackle real-life question answering problems on multimedia collections such as personal photo albums, we have to look at whole collections with seq...

Ausführliche Beschreibung

Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on pattern analysis and machine intelligence. - 1979. - 41(2019), 8 vom: 09. Aug., Seite 1893-1908
1. Verfasser: Liang, Junwei (VerfasserIn)
Weitere Verfasser: Jiang, Lu, Cao, Liangliang, Kalantidis, Yannis, Li, Li-Jia, Hauptmann, Alexander G
Format: Online-Aufsatz
Sprache:English
Veröffentlicht: 2019
Zugriff auf das übergeordnete Werk:IEEE transactions on pattern analysis and machine intelligence
Schlagworte:Journal Article Research Support, Non-U.S. Gov't Research Support, U.S. Gov't, Non-P.H.S.