Efficient Masked Autoencoders With Self-Consistency

Inspired by the masked language modeling (MLM) in natural language processing tasks, the masked image modeling (MIM) has been recognized as a strong self-supervised pre-training method in computer vision. However, the high random mask ratio of MIM results in two serious problems: 1) the inadequate d...

Ausführliche Beschreibung

Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on pattern analysis and machine intelligence. - 1979. - 46(2024), 12 vom: 05. Nov., Seite 8743-8757
1. Verfasser: Li, Zhaowen (VerfasserIn)
Weitere Verfasser: Zhu, Yousong, Chen, Zhiyang, Li, Wei, Zhao, Rui, Zhao, Chaoyang, Tang, Ming, Wang, Jinqiao
Format: Online-Aufsatz
Sprache:English
Veröffentlicht: 2024
Zugriff auf das übergeordnete Werk:IEEE transactions on pattern analysis and machine intelligence
Schlagworte:Journal Article