United States · Information Technology & Services
Education
Information Technology & Services
Research
vLLM is a fast and easy-to-use library for LLM inference and serving. Originally developed in the Sky Computing Lab at UC Berkeley, vLLM has evolved into a community-driven project with contributions from both academia and industry. It offers state-of-the-art serving throughput, efficient memory management, continuous batching, fast model execution, and support for various hardware and models. vLLM provides seamless integration with popular Hugging Face models, high-throughput serving, and support for diverse hardware plugins.
2023
Founded
Information Technology & Services
Industry
United States
Location
499,718
Ranking
11 employees
Size

Get full access to view complete information
