Efficient Benchmarking via Bias-Bounded Subset Selection
Evaluating AI systems, particularly large models, is an essential yet computationally expensive task. The use of extensive benchmarks often leads to substantial computational/human costs that may even exceed those of pretraining. The efficiency of AI model evaluation focuses on estimating the model&...
Ausführliche Beschreibung
Bibliographische Detailangaben
| Veröffentlicht in: | IEEE transactions on pattern analysis and machine intelligence. - 1979. - PP(2025) vom: 12. Aug.
|
| 1. Verfasser: |
Zhuang, Yan
(VerfasserIn) |
| Weitere Verfasser: |
Yu, Junhao,
Liu, Qi,
Sun, Yuxuan,
Li, Jiatong,
Huang, Zhenya,
Chen, Enhong |
| Format: | Online-Aufsatz
|
| Sprache: | English |
| Veröffentlicht: |
2025
|
| Zugriff auf das übergeordnete Werk: | IEEE transactions on pattern analysis and machine intelligence
|
| Schlagworte: | Journal Article |