evalscope/docs/en/experiments/benchmark/index.md

10 lines
107 B
Markdown

# Benchmarking
Here are the benchmarking results for some models:
:::{toctree}
:maxdepth: 1
mmlu.md
:::