Efficient Benchmarking via Bias-Bounded Subset Selection

Evaluating AI systems, particularly large models, is an essential yet computationally expensive task. Extensive benchmarks often incur substantial computational and human costs that may even exceed those of pretraining. The efficiency of AI model evaluation focuses on estimating the model&...

Detailed description

Bibliographic details
Published in:	IEEE transactions on pattern analysis and machine intelligence. - 1979. - PP(2025), 12 Aug.
Main author:	Zhuang, Yan (Author)
Other authors:	Yu, Junhao, Liu, Qi, Sun, Yuxuan, Li, Jiatong, Huang, Zhenya, Chen, Enhong
Format:	Online article
Language:	English
Published:	2025
Access to the parent work:	IEEE transactions on pattern analysis and machine intelligence
Subjects:	Journal Article

Available online	Full text