Code Quality Evaluator

A portable AI coding skill for evidence-based code quality evaluation.

Code Quality Evaluator helps an AI coding agent review a repository across five practical dimensions: readability, architecture, change health, engineering practices, and code smells. It produces a weighted scorecard, score rationale, concrete findings, and a remediation roadmap. It is designed to work as a Codex skill, Claude Code/ClawCode skill or command, OpenClaw rule, Cursor rule, or generic Markdown prompt.

Current version: 0.4.0

What It Includes

File	Purpose
`SKILL.md`	Mini-style active skill body: workflow, decision rules, trigger rules, scoring model, checklist.
`scripts/collect_metrics.py`	Dependency-free project inventory script.
`scripts/render_html_report.py`	Optional renderer for standalone HTML reports from JSON summaries.
`scripts/validate_skill.py`	Skill pack validator for versions, required files, encoding, and report artifacts.
`evals/run_evals.py`	Lightweight regression evals for report quality and localization.
`references/rubric.md`	Detailed scoring calibration for the five dimensions.
`references/nano.md`	Compact fallback rules for tight context.
`references/report-template.md`	Formal report structure with score rationale and roadmap.
`references/html-report.md`	HTML report schema, render command, and fallback guidance.
`references/traceability.md`	Mapping from evaluation dimensions to software engineering references.
`references/platforms.md`	Cross-platform installation and degradation guidance.
`references/versioning.md`	Versioning and release policy.
`references/evolution.md`	Feedback-driven evolution process and anti-regression rules.
`references/dev-loop.md`	Local development, validation, and release loop.
`agents/openai.yaml`	Optional UI metadata for Codex skill lists.
`VERSION`	Current release version.
`CHANGELOG.md`	Release history.

Supported Hosts

Host	Recommended Use
Codex	Copy the whole folder into the Codex skills directory.
Claude Code / ClawCode	Use as a skill folder when supported, or copy `SKILL.md` into a command file.
OpenClaw	Use as a skill/rule folder when supported, or use `SKILL.md` as a prompt-injection rule.
Cursor	Use as a Manual or Agent Requested rule; use `references/nano.md` for compact always-on context.
Generic agent	Paste `SKILL.md` into project instructions or a reusable prompt.

Evaluation Dimensions

Dimension	Weight
Readability and Style	25%
Architecture Design	25%
Change and Refactoring Health	20%
Engineering Practices	15%
Code Smell Inventory	15%

Default Reference Set

The skill is designed around a small default set of references:

Clean Code
A Philosophy of Software Design
Refactoring
Working Effectively with Legacy Code
The Pragmatic Programmer

Additional references can be used by project type:

Project Type	Useful Extra References
Backend service	Clean Architecture; Release It!
Data-intensive system	Designing Data-Intensive Applications
Domain-heavy product	Domain-Driven Design; Domain-Driven Design Distilled; Implementing Domain-Driven Design
Enterprise application	Patterns of Enterprise Application Architecture
Legacy or migration-heavy code	Working Effectively with Legacy Code; Refactoring

Grade Bands

Grade	Score	Meaning
S	90-100	Excellent
A	80-89	Strong
B	70-79	Solid
C	60-69	Mixed
D	40-59	Risky
F	0-39	Critical

Install

Codex

Clone the repository and copy the skill folder into your Codex skills directory:

git clone https://github.com/Johnny-zbb/code-quality-evaluator.git
mkdir -p ~/.codex/skills
cp -r code-quality-evaluator ~/.codex/skills/code-quality-evaluator

Restart Codex so the skill metadata is reloaded.

Claude Code / ClawCode

Use the host's skill directory when available:

mkdir -p ~/.claude/skills
cp -r code-quality-evaluator ~/.claude/skills/code-quality-evaluator

If only command files are supported, copy SKILL.md into the command location and keep the references/ files nearby for manual loading.

OpenClaw / Generic

Copy the whole folder into the host's skill or rule directory when possible. If the host only supports a single instruction file, use SKILL.md and treat scripts/ and references/ as optional supporting material.

See references/platforms.md for host-specific fallback guidance.

Project-Level Prompt Injection

If your tool does not support skill folders, you can still copy the content of SKILL.md into that tool's instruction or command system. This works as a prompt template, but it will not have the same bundled-resource discovery behavior as a real skill folder.

Usage

Ask naturally:

"Evaluate this repo's code quality."
"Score this project."
"Analyze maintainability and technical debt."
"Review the architecture and code smells."
"Prepare a handoff quality report."
"Critique this quality report."

Output

The skill asks the agent to produce:

Project overview and inventory.
Weighted composite score.
Score rationale for each dimension.
Per-dimension findings with file and line evidence.
Top improvement suggestions.
Remediation roadmap.
Real project strengths.
Confidence level and inspection gaps.

Output Modes

Mode	Default	Use When
Markdown	Yes	Any host, normal evaluation, maximum portability.
HTML	No	The user asks for a visual, printable, dashboard-like, or shareable report.
JSON	No	The user wants machine-readable output or an input file for HTML rendering.

HTML reports are optional enhancements. When Python and file writing are available, generate a JSON summary and render it with:

python scripts/render_html_report.py reports/code-quality-report.json reports/code-quality-report.html

Set language in the report JSON to localize HTML labels, for example zh for Chinese or en for English. Human-facing report text should match the user's language.

Notes

This is a skill, not a standalone static analyzer. The included script collects useful metrics, but the final evaluation still depends on agent inspection and reasoning.

Development Validation

Before publishing changes, run:

python scripts/validate_skill.py .
python evals/run_evals.py .

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
evals		evals
examples		examples
platforms		platforms
references		references
scripts		scripts
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
LICENSE		LICENSE
README.md		README.md
README_zh.md		README_zh.md
SKILL.md		SKILL.md
VERSION		VERSION

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Code Quality Evaluator

What It Includes

Supported Hosts

Evaluation Dimensions

Default Reference Set

Grade Bands

Install

Codex

Claude Code / ClawCode

OpenClaw / Generic

Project-Level Prompt Injection

Usage

Output

Output Modes

Notes

Development Validation

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Code Quality Evaluator

What It Includes

Supported Hosts

Evaluation Dimensions

Default Reference Set

Grade Bands

Install

Codex

Claude Code / ClawCode

OpenClaw / Generic

Project-Level Prompt Injection

Usage

Output

Output Modes

Notes

Development Validation

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages