Skip to content
@JiusiServe

JiusiServe

Popular repositories Loading

  1. LongVideoSparseAttention LongVideoSparseAttention Public

    Long Video Sparse Attention

    Python 18 4

  2. vllm vllm Public

    Forked from vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python 7 9

  3. light-llm-simulator light-llm-simulator Public

    A python-based LLM performance simulator with vLLM, notable for its lightweight design, easy scalability.

    Python 5 6

  4. vllm-ascend vllm-ascend Public

    Forked from vllm-project/vllm-ascend

    Community maintained hardware plugin for vLLM on Ascend

    Python 4 7

  5. LM-service LM-service Public

    Python 4 11

  6. Mooncake Mooncake Public

    Forked from kvcache-ai/Mooncake

    Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

    C++ 2

Repositories

Showing 9 of 9 repositories

Top languages

Loading…

Most used topics

Loading…