# Benchmarking
Here are the benchmarking results for some models:
:::{toctree}
:maxdepth: 1

mmlu.md
:::