VLLM
2023
Utilizing vLLM for Efficient Language Model Serving
August 20, 2023