Aligning Where to See and What to Tell : Image Captioning with Region-Based Attention and Scene-Specific Contexts
Recent progress on automatic generation of image captions has shown that it is possible to describe the most salient information conveyed by images with accurate and meaningful sentences. In this paper, we propose an image captioning system that exploits the parallel structures between images and se...
Ausführliche Beschreibung
Bibliographische Detailangaben
Veröffentlicht in: | IEEE transactions on pattern analysis and machine intelligence. - 1979. - 39(2017), 12 vom: 28. Dez., Seite 2321-2334
|
1. Verfasser: |
Fu, Kun
(VerfasserIn) |
Weitere Verfasser: |
Jin, Junqi,
Cui, Runpeng,
Sha, Fei,
Zhang, Changshui |
Format: | Online-Aufsatz
|
Sprache: | English |
Veröffentlicht: |
2017
|
Zugriff auf das übergeordnete Werk: | IEEE transactions on pattern analysis and machine intelligence
|
Schlagworte: | Journal Article
Research Support, Non-U.S. Gov't
Research Support, U.S. Gov't, Non-P.H.S. |