Image Captioning and Visual Question Answering Based on Attributes and External Knowledge
Much of the recent progress in Vision-to-Language problems has been achieved through a combination of Convolutional Neural Networks (CNNs) and Recurrent Neural Networks (RNNs). This approach does not explicitly represent high-level semantic concepts, but rather seeks to progress directly from image...
Bibliographic Details
Published in: | IEEE transactions on pattern analysis and machine intelligence. - 1979. - 40(2018), 6, dated 02 June, pages 1367-1381
Main Author: | Wu, Qi (Author)
Other Authors: | Shen, Chunhua; Wang, Peng; Dick, Anthony; van den Hengel, Anton
Format: | Online article
Language: | English
Published: | 2018
Access to parent work: | IEEE transactions on pattern analysis and machine intelligence
Subjects: | Journal Article; Research Support, Non-U.S. Gov't