FrugalAI - OpenRouter LLM Proxy

An intelligent LLM proxy that automatically routes requests to the best free model available on OpenRouter. Provides both OpenAI and Anthropic compatible API endpoints.

Features

Automatic Model Selection: Intelligently selects the best free model based on:
- Parameter count (model size)
- Popularity score
- Context length
- Architecture preferences
Dual API Compatibility: Works with both OpenAI and Anthropic client libraries
Smart Caching: Caches model list to reduce API calls
Configurable Constraints: Set minimum parameter counts and popularity thresholds
Streaming Support: Full support for streaming responses

Installation

go install github.com/mosajjal/frugalai/cmd/frugalai@latest

Or build from source:

git clone https://github.com/mosajjal/frugalai.git
cd frugalai
go build -o frugalai ./cmd/frugalai

Usage

Basic Usage

# Using environment variables
export FRUGALAI_API_KEY="your-openrouter-api-key"
frugalai

# Using CLI flags
frugalai -api-key "your-openrouter-api-key"

Configuration Options

Flag	Environment Variable	Default	Description
`-api-key`, `-k`	`FRUGALAI_API_KEY`	required	OpenRouter API key
`-port`, `-p`	`FRUGALAI_PORT`	`8080`	Server port
`-min-params`	`FRUGALAI_MIN_PARAMS`	`0`	Minimum parameter count
`-min-popularity`	`FRUGALAI_MIN_POPULARITY`	`0`	Minimum popularity score
`-enable-openai`	-	`true`	Enable OpenAI-compatible API
`-enable-anthropic`	-	`true`	Enable Anthropic-compatible API
`-openai-path`	-	`/v1`	OpenAI endpoint path
`-anthropic-path`	-	`/v1`	Anthropic endpoint path
`-log-level`	`FRUGALAI_LOG_LEVEL`	`info`	Log level
`-cache-ttl`	`FRUGALAI_CACHE_TTL`	`300`	Model cache TTL (seconds)
`-preferred-arch`	`FRUGALAI_PREFERRED_ARCH`	-	Preferred architectures (comma-separated)

Example Configurations

Only use models with at least 30B parameters:

frugalai -k "$API_KEY" -min-params 30000000000

Prefer transformer-based models:

frugalai -k "$API_KEY" -preferred-arch "transformer,llama"

Run on custom port with debug logging:

frugalai -k "$API_KEY" -p 9000 -log-level debug

API Endpoints

OpenAI-Compatible API

POST http://localhost:8080/v1/chat/completions
GET  http://localhost:8080/v1/models

Anthropic-Compatible API

POST http://localhost:8080/v1/messages

Utility Endpoints

GET http://localhost:8080/health     # Health check
GET http://localhost:8080/model      # Current selected model info

Client Examples

OpenAI Python Client

from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:8080/v1",
    api_key="any-key"  # Not used by proxy
)

response = client.chat.completions.create(
    model="auto",  # Let proxy select best model
    messages=[{"role": "user", "content": "Hello!"}]
)
print(response.choices[0].message.content)

Anthropic Python Client

import anthropic

client = anthropic.Anthropic(
    base_url="http://localhost:8080/v1",
    api_key="any-key"  # Not used by proxy
)

message = client.messages.create(
    model="claude-3-haiku",  # Will be auto-selected
    max_tokens=1024,
    messages=[{"role": "user", "content": "Hello!"}]
)
print(message.content[0].text)

cURL Examples

# OpenAI format
curl http://localhost:8080/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "auto",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

# Anthropic format
curl http://localhost:8080/v1/messages \
  -H "Content-Type: application/json" \
  -H "anthropic-version: 2023-06-01" \
  -d '{
    "model": "claude-3-haiku",
    "max_tokens": 1024,
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

Model Selection Algorithm

The proxy scores models based on several factors:

Popularity (30% weight): Logarithmic scale based on usage
Parameters (40% weight): Larger models get higher scores
Context Length (20% weight): Longer context is preferred
Architecture Bonus (10%): Bonus for preferred architectures
Quality Bonus: Additional bonus for known high-quality model families

Quality bonuses are applied for:

Claude/Anthropic models: +0.15
GPT/OpenAI models: +0.12
Gemini/Google models: +0.10
Mistral/Mixtral: +0.08
Llama/Meta: +0.08

Getting an OpenRouter API Key

Visit OpenRouter.ai
Sign up for a free account
Get your API key from the settings page

Free models on OpenRouter rotate periodically. This proxy automatically selects the best available free model at any given time.

Development

# Run tests
go test ./...

# Run with coverage
go test -cover ./...

# Build
go build -o frugalai ./cmd/frugalai

Docker

Building the Image

docker build -t frugalai .

Running the Container

Basic usage:

docker run -d \
  --name frugalai \
  -p 8080:8080 \
  -e FRUGALAI_API_KEY="your-openrouter-api-key" \
  frugalai

With custom options:

docker run -d \
  --name frugalai \
  -p 9000:9000 \
  -e FRUGALAI_API_KEY="your-openrouter-api-key" \
  -e FRUGALAI_PORT=9000 \
  -e FRUGALAI_MIN_PARAMS=7000000000 \
  -e FRUGALAI_LOG_LEVEL=debug \
  frugalai

Using Docker Compose

Create a docker-compose.yml file:

version: '3.8'

services:
  frugalai:
    image: frugalai
    container_name: frugalai
    ports:
      - "8080:8080"
    environment:
      - FRUGALAI_API_KEY=${FRUGALAI_API_KEY}
      - FRUGALAI_PORT=8080
      - FRUGALAI_MIN_PARAMS=0
      - FRUGALAI_LOG_LEVEL=info
    restart: unless-stopped
    healthcheck:
      test: ["CMD", "wget", "--no-verbose", "--tries=1", "--spider", "http://localhost:8080/health"]
      interval: 30s
      timeout: 3s
      start_period: 5s
      retries: 3

Run with Docker Compose:

# Start the service
docker-compose up -d

# View logs
docker-compose logs -f

# Stop the service
docker-compose down

Volume Mounts

For custom configuration or caching, mount a volume:

docker run -d \
  --name frugalai \
  -p 8080:8080 \
  -v ${PWD}/data:/home/appuser/.local/share/frugalai \
  -e FRUGALAI_API_KEY="your-openrouter-api-key" \
  frugalai

License

MIT License

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
.github/workflows		.github/workflows
cmd/frugalai		cmd/frugalai
internal		internal
AGENTS.md		AGENTS.md
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
go.mod		go.mod
go.sum		go.sum

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

FrugalAI - OpenRouter LLM Proxy

Features

Installation

Usage

Basic Usage

Configuration Options

Example Configurations

API Endpoints

OpenAI-Compatible API

Anthropic-Compatible API

Utility Endpoints

Client Examples

OpenAI Python Client

Anthropic Python Client

cURL Examples

Model Selection Algorithm

Getting an OpenRouter API Key

Development

Docker

Building the Image

Running the Container

Using Docker Compose

Volume Mounts

License

About

Uh oh!

Releases 6

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

FrugalAI - OpenRouter LLM Proxy

Features

Installation

Usage

Basic Usage

Configuration Options

Example Configurations

API Endpoints

OpenAI-Compatible API

Anthropic-Compatible API

Utility Endpoints

Client Examples

OpenAI Python Client

Anthropic Python Client

cURL Examples

Model Selection Algorithm

Getting an OpenRouter API Key

Development

Docker

Building the Image

Running the Container

Using Docker Compose

Volume Mounts

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 6

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages