GitHub - vllm-project/vllm: A high-throughput and memory-efficient inference and serving engine for LLMs

github.com

GitHub - vllm-project/vllm: A high-throughput and memory-efficient inference and serving engine for LLMs

github.com

luckystarr@feddit.deM to luckystarr@feddit.deEnglish · 1 year ago

A high-throughput and memory-efficient inference and serving engine for LLMs - GitHub - vllm-project/vllm: A high-throughput and memory-efficient inference and serving engine for LLMs

You must log in or register to comment.

Chat