evalscope/examples/viz/20250117_154856/predictions/Qwen2.5-7B-Instruct
hailin 412c475c88 Start at EvalScope version 0.16.0 2025-05-23 15:32:33 +00:00
arc_ARC-Challenge.jsonl Start at EvalScope version 0.16.0 2025-05-23 15:32:33 +00:00
arc_ARC-Easy.jsonl Start at EvalScope version 0.16.0 2025-05-23 15:32:33 +00:00
ceval_college_programming.jsonl Start at EvalScope version 0.16.0 2025-05-23 15:32:33 +00:00
ceval_computer_architecture.jsonl Start at EvalScope version 0.16.0 2025-05-23 15:32:33 +00:00
ceval_computer_network.jsonl Start at EvalScope version 0.16.0 2025-05-23 15:32:33 +00:00
ceval_operating_system.jsonl Start at EvalScope version 0.16.0 2025-05-23 15:32:33 +00:00
gsm8k_main.jsonl Start at EvalScope version 0.16.0 2025-05-23 15:32:33 +00:00
humaneval_openai_humaneval.jsonl Start at EvalScope version 0.16.0 2025-05-23 15:32:33 +00:00
ifeval_default.jsonl Start at EvalScope version 0.16.0 2025-05-23 15:32:33 +00:00