Context Disentangling and Prototype Inheriting for Robust Visual Grounding
Visual grounding (VG) aims to locate a specific target in an image based on a given language query. The discriminative information from context is important for distinguishing the target from other objects, particularly for the targets that have the same category as others. However, most previous me...
Bibliographic details
Published in: | IEEE transactions on pattern analysis and machine intelligence. - 1979. - 46(2024), no. 5 (25 Apr.), pages 3213-3229 |
Main author: | Tang, Wei (author) |
Other authors: | Li, Liang; Liu, Xuejing; Jin, Lu; Tang, Jinhui; Li, Zechao |
Format: | Online article |
Language: | English |
Published: | 2024 |
Access to the parent work: | IEEE transactions on pattern analysis and machine intelligence |
Subjects: | Journal Article |