Efficient Benchmarking via Bias-Bounded Subset Selection

Evaluating AI systems, particularly large models, is an essential yet computationally expensive task. The use of extensive benchmarks often leads to substantial computational/human costs that may even exceed those of pretraining. The efficiency of AI model evaluation focuses on estimating the model&...

Ausführliche Beschreibung

Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on pattern analysis and machine intelligence. - 1979. - PP(2025) vom: 12. Aug.
1. Verfasser: Zhuang, Yan (VerfasserIn)
Weitere Verfasser: Yu, Junhao, Liu, Qi, Sun, Yuxuan, Li, Jiatong, Huang, Zhenya, Chen, Enhong
Format: Online-Aufsatz
Sprache:English
Veröffentlicht: 2025
Zugriff auf das übergeordnete Werk:IEEE transactions on pattern analysis and machine intelligence
Schlagworte:Journal Article