StarSearch
A high-throughput and memory-efficient inference and serving engine for LLMs
Apache License 2.0
Last Updated: 7/27/2024