Relationship-Embedded Representation Learning for Grounding Referring Expressions

Grounding referring expressions in images aims to locate the object instance in an image described by a referring expression. It involves a joint understanding of natural language and image content, and is essential for a range of visual tasks related to human-computer interaction. As a language-to-...

Description complète

Détails bibliographiques
Publié dans:IEEE transactions on pattern analysis and machine intelligence. - 1979. - 43(2021), 8 vom: 20. Aug., Seite 2765-2779
Auteur principal: Yang, Sibei (Auteur)
Autres auteurs: Li, Guanbin, Yu, Yizhou
Format: Article en ligne
Langue:English
Publié: 2021
Accès à la collection:IEEE transactions on pattern analysis and machine intelligence
Sujets:Journal Article