evalscope_v0.17.0/evalscope.0.17.0/examples/tasks/eval_native.yaml

6 lines
71 B
YAML

model: Qwen/Qwen2.5-0.5B-Instruct
datasets:
- gsm8k
- arc
limit: 5