PathScore

SVG generation benchmark for open-source LLMs via OpenRouter.

Compare any set of models on SVG generation quality using pairwise VLM judging and ELO rankings.

https://github.com/adamholter/pathscore/raw/main/demo.mp4

How it works

Configure — Select models, define prompts, pick a judge model
Run — All SVG generations fire in parallel; pairwise VLM judging starts as generations complete
Results — ELO leaderboard + win-rate heatmap + full comparison browser

Setup

npm install
cp .env.example .env  # Add your OpenRouter API key
node server.cjs

Open http://localhost:7642

Run tests:

npm test

Backend architecture notes: BACKEND_ARCHITECTURE.md

Environment variables

OPENROUTER_API_KEY=sk-or-...
PORT=7642
PATHSCORE_EXTENSION_RUNTIME=legacy
PATHSCORE_INVARIANT_CHECKS=0

Export formats

JSON — Full dataset (configs, SVGs, comparisons, metadata) for reproducibility
HTML — Standalone report page with leaderboard and heatmap
PDF — Browser print-to-PDF

Pairing strategy

For N models and M prompts:

Generates N × M SVGs in parallel
Creates all N × (N-1) / 2 unique pairs per prompt
Randomly assigns A/B positions to eliminate position bias
Runs each pair through the VLM judge (configurable 1-5 runs per pair)

ELO system

Standard ELO starting at 1000 with K=32. Win=1, Tie=0.5, Loss=0.

Tech stack

Backend: Node.js + Express + SQLite (better-sqlite3)
Models: OpenRouter API (400+ models available)
Frontend: Vanilla JS SPA, PathScore brand identity
Streaming: Server-Sent Events for live run updates

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
public		public
src/server		src/server
test		test
.gitignore		.gitignore
BACKEND_ARCHITECTURE.md		BACKEND_ARCHITECTURE.md
FLOW_BUILDER.md		FLOW_BUILDER.md
JUDGE_EVALUATOR.md		JUDGE_EVALUATOR.md
README.md		README.md
cli.cjs		cli.cjs
demo.mp4		demo.mp4
package-lock.json		package-lock.json
package.json		package.json
server.cjs		server.cjs

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PathScore

How it works

Setup

Environment variables

Export formats

Pairing strategy

ELO system

Tech stack

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

PathScore

How it works

Setup

Environment variables

Export formats

Pairing strategy

ELO system

Tech stack

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages