Revisiting Image-Language Networks for Open-Ended Phrase Detection

Most existing work that grounds natural language phrases in images starts with the assumption that the phrase in question is relevant to the image. In this paper we address a more realistic version of the natural language grounding task where we must both identify whether the phrase is relevant to a...

Description complète

Détails bibliographiques
Publié dans:IEEE transactions on pattern analysis and machine intelligence. - 1979. - 44(2022), 4 vom: 06. Apr., Seite 2155-2167
Auteur principal: Plummer, Bryan A (Auteur)
Autres auteurs: Shih, Kevin J, Li, Yichen, Xu, Ke, Lazebnik, Svetlana, Sclaroff, Stan, Saenko, Kate
Format: Article en ligne
Langue:English
Publié: 2022
Accès à la collection:IEEE transactions on pattern analysis and machine intelligence
Sujets:Journal Article Research Support, U.S. Gov't, Non-P.H.S.