Auto-Encoding and Distilling Scene Graphs for Image Captioning

We propose a scene graph auto-encoder (SGAE) that incorporates the language inductive bias into the encoder-decoder image captioning framework to produce more human-like captions. Intuitively, we humans use the inductive bias to compose collocations and contextual inferences in discourse. For example, when w...


Bibliographic details
Published in: IEEE Transactions on Pattern Analysis and Machine Intelligence. - 1979. - 44(2022), 5 (1 May), pp. 2313-2327
Main author: Yang, Xu (Author)
Other authors: Zhang, Hanwang; Cai, Jianfei
Format: Online article
Language: English
Published: 2022
Collection access: IEEE Transactions on Pattern Analysis and Machine Intelligence
Subjects: Journal Article; Research Support, Non-U.S. Gov't