Zero-Shot Sparse Mixture of Low-Rank Experts Construction From Pre-Trained Foundation Models
Deep model training on extensive datasets is increasingly becoming cost-prohibitive, prompting the widespread adoption of deep model fusion techniques to leverage knowledge from pre-existing models. From simple weight averaging to more sophisticated methods like AdaMerging, model fusion effectively...
Ausführliche Beschreibung
Bibliographische Detailangaben
Veröffentlicht in: | IEEE transactions on pattern analysis and machine intelligence. - 1979. - PP(2025) vom: 22. Sept.
|
1. Verfasser: |
Tang, Anke
(VerfasserIn) |
Weitere Verfasser: |
Shen, Li,
Luo, Yong,
Xie, Shuai,
Hu, Han,
Zhang, Lefei,
Du, Bo,
Tao, Dacheng |
Format: | Online-Aufsatz
|
Sprache: | English |
Veröffentlicht: |
2025
|
Zugriff auf das übergeordnete Werk: | IEEE transactions on pattern analysis and machine intelligence
|
Schlagworte: | Journal Article |