Multilingual Text-to-Image Person Retrieval via Bidirectional Relation Reasoning and Aligning
Text-to-image person retrieval (TIPR) aims to identify the target person using textual descriptions, facing challenge in modality heterogeneity. Prior works have attempted to address it by developing cross-modal global or local alignment strategies. However, global methods typically overlook fine-gr...
Ausführliche Beschreibung
Bibliographische Detailangaben
| Veröffentlicht in: | IEEE transactions on pattern analysis and machine intelligence. - 1979. - PP(2025) vom: 10. Okt.
|
| 1. Verfasser: |
Cao, Min
(VerfasserIn) |
| Weitere Verfasser: |
Zhou, Xinyu,
Jiang, Ding,
Du, Bo,
Ye, Mang,
Zhang, Min |
| Format: | Online-Aufsatz
|
| Sprache: | English |
| Veröffentlicht: |
2025
|
| Zugriff auf das übergeordnete Werk: | IEEE transactions on pattern analysis and machine intelligence
|
| Schlagworte: | Journal Article |