Skip to content

eabait/codebase-onboarding-skill

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Codebase Onboarding Skill

Generate DeepWiki-style onboarding documentation for any codebase — source-linked, diagram-rich, and structured for both humans and LLMs to navigate.

Validated against axios, flask, and express with a mean eval score of 95.3 / 100 across three independent LLM harnesses (Claude, Gemini CLI, OpenAI Codex CLI).


What This Skill Produces

A hierarchical wiki of Markdown pages, each containing:

  • TL;DR — 2–3 sentences summarising the subsystem
  • Architecture diagram — Mermaid flowchart or sequence diagram
  • Relevant Source Files table — every claim links back to real file paths
  • Key Concepts table — structured reference for onboarding engineers
  • Prose with inline citationspath/to/file.ext:L45–L87 on every claim
  • [NEEDS INVESTIGATION] markers — honest flags on anything unverified
wiki/
├── 00-index.md          Table of contents + navigation guide
├── 01-overview.md       High-level architecture + Mermaid diagram
├── 02-request-lifecycle.md
├── 03-middleware.md
└── ...

Example outputs (Claude baseline, SKILL.md v2.2, scored 95.3/100)

axios — Promise-based HTTP client

flask — Python web framework

express — Node.js web framework


Repository Layout

codebase-onboarding-skill/
├── SKILL.md                   ← Skill invocation & workflow instructions
├── scripts/
│   ├── analyze.py             ← Codebase reconnaissance (AST, PageRank, git)
│   ├── eval.py                ← Wiki quality scorer (6 dimensions, max 100)
│   └── requirements.txt       ← Optional Python deps
├── references/
│   ├── page-template.md       ← Required page structure
│   ├── diagram-patterns.md    ← Mermaid diagram templates by scenario
│   └── language-guides.md     ← Language/framework-specific analysis guidance
├── agents/
│   └── openai.yaml            ← Harness metadata for implicit invocation
└── evals/
    ├── run.sh                 ← Unified eval runner (Claude + Gemini + Codex)
    ├── score.py               ← Multi-harness comparison scorer
    ├── parse_gemini.py        ← Splits Gemini's delimited output into .md files
    ├── README.md              ← Eval setup & runbook
    └── results/               ← Committed benchmark artifacts

Getting Started

1. Install the skill

Install via the Vercel Labs Skills framework:

# Install to all agents
npx skills add eabait/codebase-onboarding-skill --all

# Or install to a specific agent
npx skills add eabait/codebase-onboarding-skill -a claude-code

2. Ask your agent to onboard you

Point your agent at a repository and ask:

"Onboard me to this codebase."

That's it. The skill instructs the agent to install dependencies, run the codebase analysis, and generate the full wiki automatically — no manual steps required.

The agent will:

  1. Install Python deps (scripts/requirements.txt) if needed
  2. Run scripts/analyze.py on the repository to produce a structured analysis
  3. Read SKILL.md and the reference docs
  4. Generate a complete wiki and write the pages to wiki/

Manual invocation (without the skills framework)

If you're using an agent directly, pass SKILL.md and the reference docs in context alongside the repo path. See evals/README.md for per-harness examples with Claude Code, Gemini CLI, and Codex CLI.


Evaluation

Run the built-in eval harness to benchmark quality across models:

# All harnesses (Claude + Gemini + Codex), all 3 reference repos
bash evals/run.sh

# Single harness
bash evals/run.sh --harness gemini
bash evals/run.sh --harness codex axios

Benchmark results (March 2026, SKILL.md v2.2)

Harness Model Score Cit/page vs Baseline
Claude claude-sonnet-4-6 95.3 13.7
Codex gpt-5.2 92.2 39.6 −3.1
Gemini gemini-2.5-pro 91.7 6.3 −3.6

Full results:

Scorer dimensions

scripts/eval.py grades six dimensions (max 100 points):

Dimension Max Measures
Structure 25 TL;DR, required headings, source-files table, index page
Citations 30 Citation count, format (file.py:L45), density per page
Diagrams 15 Mermaid blocks, multi-node diagrams
Tables 10 Key Concepts tables, component references
Completeness 10 Page count, word count, topic coverage
Transparency 10 [NEEDS INVESTIGATION] markers
# Score any wiki output directory
python3 scripts/eval.py path/to/runs/ --output report.json

Contributing

Pull requests welcome. When adding support for a new language or framework, update references/language-guides.md. When adding a new eval harness, follow the pattern in evals/run.sh and document it in evals/README.md.


License

MIT — see LICENSE.

About

Agent skill for codebase onboarding and understanding

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors