Efficient Benchmarking via Bias-Bounded Subset Selection

Evaluating AI systems, particularly large models, is an essential yet computationally expensive task. The use of extensive benchmarks often leads to substantial computational/human costs that may even exceed those of pretraining. The efficiency of AI model evaluation focuses on estimating the model...

Bibliographic details
Published in: IEEE transactions on pattern analysis and machine intelligence. - 1979. - PP(2025), dated 12 Aug.
Main author: Zhuang, Yan (Author)
Other authors: Yu, Junhao; Liu, Qi; Sun, Yuxuan; Li, Jiatong; Huang, Zhenya; Chen, Enhong
Format: Online article
Language: English
Published: 2025
Collection access: IEEE transactions on pattern analysis and machine intelligence
Subjects: Journal Article