Focal Visual-Text Attention for Memex Question Answering
Recent insights on language and vision with neural networks have been successfully applied to simple single-image visual question answering. However, to tackle real-life question answering problems on multimedia collections such as personal photo albums, we have to look at whole collections with seq...
Ausführliche Beschreibung
Bibliographische Detailangaben
Veröffentlicht in: | IEEE transactions on pattern analysis and machine intelligence. - 1979. - 41(2019), 8 vom: 09. Aug., Seite 1893-1908
|
1. Verfasser: |
Liang, Junwei
(VerfasserIn) |
Weitere Verfasser: |
Jiang, Lu,
Cao, Liangliang,
Kalantidis, Yannis,
Li, Li-Jia,
Hauptmann, Alexander G |
Format: | Online-Aufsatz
|
Sprache: | English |
Veröffentlicht: |
2019
|
Zugriff auf das übergeordnete Werk: | IEEE transactions on pattern analysis and machine intelligence
|
Schlagworte: | Journal Article
Research Support, Non-U.S. Gov't
Research Support, U.S. Gov't, Non-P.H.S. |