Zero-Shot Sparse Mixture of Low-Rank Experts Construction From Pre-Trained Foundation Models
Deep model training on extensive datasets is increasingly becoming cost-prohibitive, prompting the widespread adoption of deep model fusion techniques to leverage knowledge from pre-existing models. From simple weight averaging to more sophisticated methods like AdaMerging, model fusion effectively...
Description complète
Détails bibliographiques
Publié dans: | IEEE transactions on pattern analysis and machine intelligence. - 1979. - PP(2025) vom: 22. Sept.
|
Auteur principal: |
Tang, Anke
(Auteur) |
Autres auteurs: |
Shen, Li,
Luo, Yong,
Xie, Shuai,
Hu, Han,
Zhang, Lefei,
Du, Bo,
Tao, Dacheng |
Format: | Article en ligne
|
Langue: | English |
Publié: |
2025
|
Accès à la collection: | IEEE transactions on pattern analysis and machine intelligence
|
Sujets: | Journal Article |