Popular repositories

  1. openinfer Public

    Pure Rust + CUDA LLM inference engine — no PyTorch, OpenAI-compatible, serves Qwen3 to Kimi-K2

    Rust · 507 stars · 75 forks

  2. website Public

    Documentation website for openinfer — Astro Starlight on Cloudflare Workers

    CSS · 4 stars · 1 fork

  3. DeepGEMM Public

    Forked from deepseek-ai/DeepGEMM

    DeepGEMM: clean and efficient BLAS kernel library on GPU

    Cuda
