77 lines
2.3 KiB
Markdown
77 lines
2.3 KiB
Markdown

|
|
|
|
<br>
|
|
|
|
# Welcome to the EvalScope Tutorial!
|
|
|
|
## Getting Started
|
|
To help users get up to speed quickly, we recommend the following process:
|
|
- For users who want to use EvalScope, we recommend first reading the [Quick Start](get_started/introduction.md) section to set up the environment and start a mini experiment to familiarize themselves with the process.
|
|
- For some basic usage, we suggest users read the User Guides which include [How to Use Arena Mode for Evaluation](user_guides/arena.md), [How to Use Other Evaluation Backends](user_guides/backend/index.md), and [How to Use the Model Service Stress Testing Tool](user_guides/stress_test/index.md).
|
|
- If you want to customize more modules, such as adding datasets and models, we provide an [Advanced Tutorial](advanced_guides/custom_dataset/index.md).
|
|
- Additionally, we offer [Third-Party Tools](third_party/toolbench.md) to help users quickly evaluate models, such as using ToolBench for evaluation.
|
|
- Finally, we provide [Best Practices](best_practice/swift_integration.md) to assist users in evaluation, such as how to use Swift for evaluation.
|
|
We always welcome PRs and issues from users to improve EvalScope.
|
|
|
|
:::{toctree}
|
|
:hidden:
|
|
:caption: 🚀 Quick Start
|
|
get_started/introduction.md
|
|
get_started/installation.md
|
|
get_started/basic_usage.md
|
|
get_started/visualization.md
|
|
get_started/parameters.md
|
|
get_started/supported_dataset.md
|
|
get_started/faq.md
|
|
:::
|
|
|
|
:::{toctree}
|
|
:hidden:
|
|
:caption: 🔧 User Guides
|
|
user_guides/backend/index.md
|
|
user_guides/stress_test/index.md
|
|
user_guides/aigc/index.md
|
|
user_guides/arena.md
|
|
:::
|
|
|
|
:::{toctree}
|
|
:hidden:
|
|
:caption: 🛠️ Advanced Tutorials
|
|
advanced_guides/collection/index.md
|
|
advanced_guides/custom_dataset/index.md
|
|
advanced_guides/custom_model.md
|
|
advanced_guides/add_benchmark.md
|
|
:::
|
|
|
|
:::{toctree}
|
|
:hidden:
|
|
:caption: 🧰 Third-Party Tools
|
|
third_party/toolbench.md
|
|
third_party/longwriter.md
|
|
:::
|
|
|
|
:::{toctree}
|
|
:hidden:
|
|
:caption: 🧪 Benchmarking Results
|
|
experiments/benchmark/index.md
|
|
experiments/speed_benchmark/index.md
|
|
:::
|
|
|
|
:::{toctree}
|
|
:hidden:
|
|
:caption: 📖 Best Practices
|
|
best_practice/qwen3.md
|
|
best_practice/eval_qwq.md
|
|
best_practice/iquiz.md
|
|
best_practice/deepseek_r1_distill.md
|
|
best_practice/think_eval.md
|
|
best_practice/swift_integration.md
|
|
best_practice/llm_full_stack.md
|
|
:::
|
|
|
|
:::{toctree}
|
|
:hidden:
|
|
:caption: 🌟 Blogs
|
|
blog/index.md
|
|
:::
|