evalscope_v0.17.0/evalscope.0.17.0/evalscope/benchmarks
hailin 2fb74d4aad first commit 2025-07-08 00:28:09 +00:00
aigc first commit 2025-07-08 00:28:09 +00:00
aime first commit 2025-07-08 00:28:09 +00:00
alpaca_eval first commit 2025-07-08 00:28:09 +00:00
arc first commit 2025-07-08 00:28:09 +00:00
arena_hard first commit 2025-07-08 00:28:09 +00:00
bbh first commit 2025-07-08 00:28:09 +00:00
bfcl first commit 2025-07-08 00:28:09 +00:00
ceval first commit 2025-07-08 00:28:09 +00:00
chinese_simple_qa first commit 2025-07-08 00:28:09 +00:00
cmmlu first commit 2025-07-08 00:28:09 +00:00
competition_math first commit 2025-07-08 00:28:09 +00:00
data_collection first commit 2025-07-08 00:28:09 +00:00
docmath first commit 2025-07-08 00:28:09 +00:00
drop first commit 2025-07-08 00:28:09 +00:00
frames first commit 2025-07-08 00:28:09 +00:00
general_arena first commit 2025-07-08 00:28:09 +00:00
general_mcq first commit 2025-07-08 00:28:09 +00:00
general_qa first commit 2025-07-08 00:28:09 +00:00
gpqa first commit 2025-07-08 00:28:09 +00:00
gsm8k first commit 2025-07-08 00:28:09 +00:00
hellaswag first commit 2025-07-08 00:28:09 +00:00
humaneval first commit 2025-07-08 00:28:09 +00:00
ifeval first commit 2025-07-08 00:28:09 +00:00
iquiz first commit 2025-07-08 00:28:09 +00:00
live_code_bench first commit 2025-07-08 00:28:09 +00:00
maritime_bench first commit 2025-07-08 00:28:09 +00:00
math_500 first commit 2025-07-08 00:28:09 +00:00
mmlu first commit 2025-07-08 00:28:09 +00:00
mmlu_pro first commit 2025-07-08 00:28:09 +00:00
mmlu_redux first commit 2025-07-08 00:28:09 +00:00
musr first commit 2025-07-08 00:28:09 +00:00
needle_haystack first commit 2025-07-08 00:28:09 +00:00
process_bench first commit 2025-07-08 00:28:09 +00:00
race first commit 2025-07-08 00:28:09 +00:00
simple_qa first commit 2025-07-08 00:28:09 +00:00
super_gpqa first commit 2025-07-08 00:28:09 +00:00
tool_bench first commit 2025-07-08 00:28:09 +00:00
trivia_qa first commit 2025-07-08 00:28:09 +00:00
truthful_qa first commit 2025-07-08 00:28:09 +00:00
winogrande first commit 2025-07-08 00:28:09 +00:00
__init__.py first commit 2025-07-08 00:28:09 +00:00
benchmark.py first commit 2025-07-08 00:28:09 +00:00
data_adapter.py first commit 2025-07-08 00:28:09 +00:00
filters.py first commit 2025-07-08 00:28:09 +00:00
utils.py first commit 2025-07-08 00:28:09 +00:00