Most existing work that grounds natural language phrases in images starts with the assumption that the phrase in question is relevant to the image. In this paper we address a more realistic version of the natural language grounding task where we must both identify whether the phrase is relevant to a...
Bibliographic details
| Published in: | IEEE transactions on pattern analysis and machine intelligence. - 1979. - 44(2022), 4, dated 6 Apr., pages 2155-2167
|
| Main author: |
Plummer, Bryan A
(Author) |
| Other authors: |
Shih, Kevin J,
Li, Yichen,
Xu, Ke,
Lazebnik, Svetlana,
Sclaroff, Stan,
Saenko, Kate |
| Format: | Online article
|
| Language: | English |
| Published: |
2022
|
| Collection access: | IEEE transactions on pattern analysis and machine intelligence
|
| Subjects: | Journal Article
Research Support, U.S. Gov't, Non-P.H.S. |