A Survey on Vision Transformer
Transformer, first applied to the field of natural language processing, is a type of deep neural network mainly based on the self-attention mechanism. Thanks to its strong representation capabilities, researchers are looking at ways to apply transformer to computer vision tasks. In a variety of visu...
Ausführliche Beschreibung
Bibliographische Detailangaben
Veröffentlicht in: | IEEE transactions on pattern analysis and machine intelligence. - 1979. - 45(2023), 1 vom: 01. Jan., Seite 87-110
|
1. Verfasser: |
Han, Kai
(VerfasserIn) |
Weitere Verfasser: |
Wang, Yunhe,
Chen, Hanting,
Chen, Xinghao,
Guo, Jianyuan,
Liu, Zhenhua,
Tang, Yehui,
Xiao, An,
Xu, Chunjing,
Xu, Yixing,
Yang, Zhaohui,
Zhang, Yiman,
Tao, Dacheng |
Format: | Online-Aufsatz
|
Sprache: | English |
Veröffentlicht: |
2023
|
Zugriff auf das übergeordnete Werk: | IEEE transactions on pattern analysis and machine intelligence
|
Schlagworte: | Journal Article |