Image Captioning and Visual Question Answering Based on Attributes and External Knowledge

Much of the recent progress in Vision-to-Language problems has been achieved through a combination of Convolutional Neural Networks (CNNs) and Recurrent Neural Networks (RNNs). This approach does not explicitly represent high-level semantic concepts, but rather seeks to progress directly from image...

Ausführliche Beschreibung

Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on pattern analysis and machine intelligence. - 1979. - 40(2018), 6 vom: 02. Juni, Seite 1367-1381
1. Verfasser: Wu, Qi (VerfasserIn)
Weitere Verfasser: Shen, Chunhua, Wang, Peng, Dick, Anthony, van den Hengel, Anton
Format: Online-Aufsatz
Sprache:English
Veröffentlicht: 2018
Zugriff auf das übergeordnete Werk:IEEE transactions on pattern analysis and machine intelligence
Schlagworte:Journal Article Research Support, Non-U.S. Gov't