inference/sglang/benchmark/json_schema
hailin 466c38a2d4 first commit 2025-05-27 11:42:20 +08:00
..
README.md first commit 2025-05-27 11:42:20 +08:00
bench_sglang.py first commit 2025-05-27 11:42:20 +08:00

README.md

Run benchmark

Benchmark sglang

Run Llama-8b

python3 -m sglang.launch_server --model-path meta-llama/Llama-3.1-8B-Instruct --port 30000

Benchmark

python3 bench_sglang.py