PSVMA+ : Exploring Multi-Granularity Semantic-Visual Adaption for Generalized Zero-Shot Learning

Generalized zero-shot learning (GZSL) endeavors to identify the unseen categories using knowledge from the seen domain, necessitating the intrinsic interactions between the visual features and attribute semantic features. However, GZSL suffers from insufficient visual-semantic correspondences due to...

Ausführliche Beschreibung

Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on pattern analysis and machine intelligence. - 1979. - PP(2024) vom: 25. Sept.
1. Verfasser: Liu, Man (VerfasserIn)
Weitere Verfasser: Bai, Huihui, Li, Feng, Zhang, Chunjie, Wei, Yunchao, Wang, Meng, Chua, Tat-Seng, Zhao, Yao
Format: Online-Aufsatz
Sprache:English
Veröffentlicht: 2024
Zugriff auf das übergeordnete Werk:IEEE transactions on pattern analysis and machine intelligence
Schlagworte:Journal Article
LEADER 01000naa a22002652 4500
001 NLM378077236
003 DE-627
005 20240926233043.0
007 cr uuu---uuuuu
008 240926s2024 xx |||||o 00| ||eng c
024 7 |a 10.1109/TPAMI.2024.3467229  |2 doi 
028 5 2 |a pubmed24n1549.xml 
035 |a (DE-627)NLM378077236 
035 |a (NLM)39321011 
040 |a DE-627  |b ger  |c DE-627  |e rakwb 
041 |a eng 
100 1 |a Liu, Man  |e verfasserin  |4 aut 
245 1 0 |a PSVMA+  |b Exploring Multi-Granularity Semantic-Visual Adaption for Generalized Zero-Shot Learning 
264 1 |c 2024 
336 |a Text  |b txt  |2 rdacontent 
337 |a ƒaComputermedien  |b c  |2 rdamedia 
338 |a ƒa Online-Ressource  |b cr  |2 rdacarrier 
500 |a Date Revised 25.09.2024 
500 |a published: Print-Electronic 
500 |a Citation Status Publisher 
520 |a Generalized zero-shot learning (GZSL) endeavors to identify the unseen categories using knowledge from the seen domain, necessitating the intrinsic interactions between the visual features and attribute semantic features. However, GZSL suffers from insufficient visual-semantic correspondences due to the attribute diversity and instance diversity. Attribute diversity refers to varying semantic granularity in attribute descriptions, ranging from low-level (specific, directly observable) to high-level (abstract, highly generic) characteristics. This diversity challenges the collection of adequate visual cues for attributes under a uni-granularity. Additionally, diverse visual instances corresponding to the same sharing attributes introduce semantic ambiguity, leading to vague visual patterns. To tackle these problems, we propose a multi-granularity progressive semantic-visual mutual adaption (PSVMA+) network, where sufficient visual elements across granularity levels can be gathered to remedy the granularity inconsistency. PSVMA+ explores semantic-visual interactions at different granularity levels, enabling awareness of multi-granularity in both visual and semantic elements. At each granularity level, the dual semantic-visual transformer module (DSVTM) recasts the sharing attributes into instance-centric attributes and aggregates the semantic-related visual regions, thereby learning unambiguous visual features to accommodate various instances. Given the diverse contributions of different granularities, PSVMA+ employs selective cross-granularity learning to leverage knowledge from reliable granularities and adaptively fuses multi-granularity features for comprehensive representations. Experimental results demonstrate that PSVMA+ consistently outperforms state-of-the-art methods 
650 4 |a Journal Article 
700 1 |a Bai, Huihui  |e verfasserin  |4 aut 
700 1 |a Li, Feng  |e verfasserin  |4 aut 
700 1 |a Zhang, Chunjie  |e verfasserin  |4 aut 
700 1 |a Wei, Yunchao  |e verfasserin  |4 aut 
700 1 |a Wang, Meng  |e verfasserin  |4 aut 
700 1 |a Chua, Tat-Seng  |e verfasserin  |4 aut 
700 1 |a Zhao, Yao  |e verfasserin  |4 aut 
773 0 8 |i Enthalten in  |t IEEE transactions on pattern analysis and machine intelligence  |d 1979  |g PP(2024) vom: 25. Sept.  |w (DE-627)NLM098212257  |x 1939-3539  |7 nnns 
773 1 8 |g volume:PP  |g year:2024  |g day:25  |g month:09 
856 4 0 |u http://dx.doi.org/10.1109/TPAMI.2024.3467229  |3 Volltext 
912 |a GBV_USEFLAG_A 
912 |a SYSFLAG_A 
912 |a GBV_NLM 
912 |a GBV_ILN_350 
951 |a AR 
952 |d PP  |j 2024  |b 25  |c 09