Conformer : Local Features Coupling Global Representations for Recognition and Detection
With convolution operations, Convolutional Neural Networks (CNNs) are good at extracting local features but experience difficulty to capture global representations. With cascaded self-attention modules, vision transformers can capture long-distance feature dependencies but unfortunately deteriorate...
Ausführliche Beschreibung
Bibliographische Detailangaben
Veröffentlicht in: | IEEE transactions on pattern analysis and machine intelligence. - 1979. - 45(2023), 8 vom: 07. Aug., Seite 9454-9468
|
1. Verfasser: |
Peng, Zhiliang
(VerfasserIn) |
Weitere Verfasser: |
Guo, Zonghao,
Huang, Wei,
Wang, Yaowei,
Xie, Lingxi,
Jiao, Jianbin,
Tian, Qi,
Ye, Qixiang |
Format: | Online-Aufsatz
|
Sprache: | English |
Veröffentlicht: |
2023
|
Zugriff auf das übergeordnete Werk: | IEEE transactions on pattern analysis and machine intelligence
|
Schlagworte: | Journal Article |