Revisiting Image-Language Networks for Open-Ended Phrase Detection

Most existing work that grounds natural language phrases in images starts with the assumption that the phrase in question is relevant to the image. In this paper we address a more realistic version of the natural language grounding task where we must both identify whether the phrase is relevant to a...

Ausführliche Beschreibung

Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on pattern analysis and machine intelligence. - 1979. - 44(2022), 4 vom: 06. Apr., Seite 2155-2167
1. Verfasser: Plummer, Bryan A (VerfasserIn)
Weitere Verfasser: Shih, Kevin J, Li, Yichen, Xu, Ke, Lazebnik, Svetlana, Sclaroff, Stan, Saenko, Kate
Format: Online-Aufsatz
Sprache:English
Veröffentlicht: 2022
Zugriff auf das übergeordnete Werk:IEEE transactions on pattern analysis and machine intelligence
Schlagworte:Journal Article Research Support, U.S. Gov't, Non-P.H.S.