evalscope/examples/tasks/eval_native.yaml

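# Model to evaluate, given by its model ID
model: Qwen/Qwen2.5-0.5B-Instruct
# Benchmarks to run
datasets:
- gsm8k
- arc
# Cap on the number of samples evaluated, to keep the run short
limit: 5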