Published Benchmark Artifacts

This directory contains published benchmark artifacts and, during local development, may also contain intermediate run outputs.

Official Public Reports

Use these artifacts as the public source of truth:

Compact tool-call benchmark:
- zcp_mcp_tool_call_benchmark.md
- zcp_mcp_tool_call_benchmark.json

Excel semantic workflow benchmark:
- full_semantic_compare_v5/excel_llm_token_benchmark.md
- full_semantic_compare_v5/excel_llm_token_benchmark.json
- full_semantic_compare_v5/semantic_benchmark_summary.md

Non-Public Or Intermediate Artifacts

Do not treat checkpoint files, smoke runs, or older comparison directories as official release evidence. Those are for local iteration only and should be removed or archived before a public release.
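
As a pre-release check, the split between official and intermediate files can be audited with a short script. The following is a minimal sketch, not a script shipped with this repository: the function name `audit_artifacts` and the decision to ignore README.md are assumptions made for illustration, and the official paths are copied from the list in this README.

```python
from pathlib import Path

# Official artifact paths, relative to this directory, as listed in this README.
OFFICIAL = {
    "zcp_mcp_tool_call_benchmark.md",
    "zcp_mcp_tool_call_benchmark.json",
    "full_semantic_compare_v5/excel_llm_token_benchmark.md",
    "full_semantic_compare_v5/excel_llm_token_benchmark.json",
    "full_semantic_compare_v5/semantic_benchmark_summary.md",
}

# Assumption: README.md belongs in this directory and is not a stray artifact.
IGNORED = {"README.md"}


def audit_artifacts(root):
    """Return (missing_official, unofficial) as sorted relative POSIX paths."""
    root = Path(root)
    # Collect every regular file under root, relative to root.
    present = {
        p.relative_to(root).as_posix()
        for p in root.rglob("*")
        if p.is_file()
    }
    missing = sorted(OFFICIAL - present)
    unofficial = sorted(present - OFFICIAL - IGNORED)
    return missing, unofficial


if __name__ == "__main__":
    missing, unofficial = audit_artifacts(".")
    for path in missing:
        print(f"missing official artifact: {path}")
    for path in unofficial:
        print(f"not an official artifact (remove or archive before release): {path}")
```

Run from this directory, the script prints nothing when only the official reports and README.md are present. Any checkpoint file, smoke run, or older comparison directory shows up in the second list.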

Reproducing Public Benchmarks

Use the helper scripts in the scripts directory or the benchmark entrypoints under the examples directory.
