mindie

Here are 3 public repositories matching this topic...

A GPU cluster manager that configures and orchestrates inference engines like vLLM and SGLang for high-performance AI model deployment.

cuda inference openai llama maas rocm ascend llm llm-serving vllm genai llm-inference qwen deepseek sglang distributed-inference high-performance-inference mindie

Enterprise-grade LLM automated deployment tool that makes AI servers truly "plug-and-play".

agent transformer ai-server llm llm-serving vllm llm-inference ollama mindie

🚀 Master GPU kernel programming and optimization for high-performance AI systems with this comprehensive learning guide and resource hub.

Add a description, image, and links to the mindie topic page so that developers can more easily learn about it.

To associate your repository with the mindie topic, visit your repo's landing page and select "manage topics."