Files in this directory:

- README.md
- faq.md
- metrics.md
- reproducibility.md
- security.md
- troubleshooting.md
- usage_stats.md
- v1_guide.md
README.md
Using vLLM
vLLM supports the following usage patterns:
- Inference and Serving: Run a single instance of a model (a minimal offline inference sketch follows this list).
- Deployment: Scale up model instances for production.
- Training: Train or fine-tune a model.
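As a quick illustration of the inference pattern, the sketch below uses vLLM's offline Python API (`LLM`, `SamplingParams`, `generate`) to run a single model instance over a small batch of prompts. The model name `facebook/opt-125m` is only a placeholder assumption; any Hugging Face model supported by vLLM can be used instead.

```python
from vllm import LLM, SamplingParams

# Prompts to complete in a single batch.
prompts = [
    "Hello, my name is",
    "The capital of France is",
]

# Placeholder model; swap in any Hugging Face model supported by vLLM.
llm = LLM(model="facebook/opt-125m")
sampling_params = SamplingParams(temperature=0.8, top_p=0.95, max_tokens=64)

# generate() returns one RequestOutput per prompt.
outputs = llm.generate(prompts, sampling_params)
for output in outputs:
    print(f"{output.prompt!r} -> {output.outputs[0].text!r}")
```

For the serving pattern, the same model can instead be exposed behind vLLM's OpenAI-compatible HTTP server with the `vllm serve <model>` command.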