613 B
613 B
Mixed Data Evaluation
This framework supports mixing multiple evaluation datasets for a unified evaluation, aiming to use less data to achieve a more comprehensive assessment of the model's capabilities.
The overall evaluation process is as follows:
- Define a data mixing schema: Specify which datasets to use for evaluation and how the data should be grouped.
- Sample data: The framework will sample from each specified dataset according to the schema.
- Unified evaluation: The sampled data will be used in a unified evaluation process.
:::{toctree} :maxdepth: 2
schema.md sample.md evaluate.md :::