Avatar

    vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    python

    Apache License 2.0

    Last Updated: 7/27/2024

    Sign up to start tracking your projects!