This website requires JavaScript.
Explore
Help
Register
Sign In
hailin
/
evalscope
Watch
1
Star
0
Fork
You've already forked evalscope
0
Code
Issues
Pull Requests
Packages
Projects
Releases
Wiki
Activity
main
evalscope
/
docs
/
zh
/
experiments
/
benchmark
/
index.md
103 B
Raw
Permalink
Blame
History
基准测试
记录了一些模型的基准测试结果:
:::{toctree} :maxdepth: 1
mmlu.md :::