🎯
Focusing
Popular repositories Loading
-
-
rag-llm-cpu
rag-llm-cpu PublicForked from validatedpatterns-sandbox/rag-llm-cpu
Pattern that installs a cpu-based inference service, multiple RAG db providers, and a demo frontend allowing visibility into the RAG process.
Shell
-
-
llm-cpu-serving
llm-cpu-serving PublicForked from rh-ai-quickstart/llm-cpu-serving
This quickstart will serve a small language model on CPUs, using vLLM inference runtime
Shell
-
-
caveman
caveman PublicForked from JuliusBrussee/caveman
🪨 why use many token when few token do trick — Claude Code skill that cuts 65% of tokens by talking like caveman
Python
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.
