docs: README v0.11 highlights + honest v1.4 framing + readability reorganization #35
Open
AIMLPM wants to merge 4 commits into
Replaces the stale v0.10 highlights section with v0.11.0 (binary downloads + filters) and v0.11.1 (aggregator URL filter) bullets, keeping the v0.10.1 local-embedder and Tenacity retry items. Adds an honest framing block on the v1.4 leaderboard result: markcrawl 1st on cost but 7th of 7 on answer quality + retrieval MRR. Notes the active improvement work (v0.11.1 aggregator filter targets a measured retrieval failure mode; v0.12 track + bench v1.5 methodology hardening underway). The benchmark table on lines 339-359 was auto-updated by PR #33; this commit only changes the prose highlights.
…oncrete improvement target
…raction, schema table, footer consolidation

Five changes to improve human + LLM readability:

1. Rename '## Installation / Upgrading' (line 22) → '## What's New' to resolve the duplicate-header collision with the canonical '## Installation' section. Anchor links + LLM section parsing now disambiguate cleanly.
2. Add a top-of-README 'Latest: v0.11.1 (2026-05-12)' pointer so version queries don't require triangulating across the file.
3. Add a 'pages.jsonl' schema table in the Quickstart section: 9 rows covering all current fields (url, title, crawled_at, citation, tool, text, downloads, images, screenshot). Lets downstream code-gen and LLM tooling rely on the schema without inferring from one example.
4. Extract Common Recipes (254 lines, 30% of the README) to a new docs/RECIPES.md file with table-of-contents and proper '### <recipe name>' headings (replacing the bash-comments-as-headings pattern). README retains a 14-line teaser pointing to the new file. The benchmark '<details>' comparison block stays in README.
5. Consolidate 4 micro-sections (Contributing, Security, Privacy, License; 14 lines combined) into a single '## Project info' bulleted block.

Net result: 814 → 581 lines (-28%), no duplicate section anchors, all internal links verified to resolve.
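As a rough illustration of how downstream tooling might lean on the schema table from item 3, the sketch below checks that each record in a `pages.jsonl` file carries the nine field names listed above. The field names come from the commit message; the helper name `missing_fields`, the assumption that every field appears in every record, and the decision to ignore value types are all hypothetical and not confirmed by this PR.

```python
import json

# Field names as listed in the commit message. Value types are not stated
# there, so this check only looks at which keys are present.
EXPECTED_FIELDS = {
    "url", "title", "crawled_at", "citation", "tool",
    "text", "downloads", "images", "screenshot",
}


def missing_fields(jsonl_text):
    """Map 1-based line numbers to the sorted field names a record lacks.

    Hypothetical helper: assumes one JSON object per non-blank line and
    that all nine fields are expected on every record.
    """
    problems = {}
    for line_number, line in enumerate(jsonl_text.splitlines(), start=1):
        if not line.strip():
            continue  # tolerate blank lines between records
        record = json.loads(line)
        missing = EXPECTED_FIELDS - set(record)
        if missing:
            problems[line_number] = sorted(missing)
    return problems
```

If some fields turn out to be optional in practice (for example `screenshot` when no screenshot was taken), the expected set would need to be split into required and optional names.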
Pre-existing lint failure on main since PR #34 (v0.11.1 ship) — ruff wants underscore-prefixed names sorted first in the import list. Applied `ruff check --fix`. No semantic change to the tests.
Summary
Three concerns rolled into one README pass so the result is review-ready:
1. v0.11 highlights (replaces stale v0.10 section)
2. Honest v1.4 benchmark framing
Rewritten for zero-context readers. Replaces the original "anchor-bias" jargon with plain-language explanation of:
Plus an explicit goal: 7th → mid-pack on the next benchmark cycle (+0.10 to +0.20 MRR).
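For readers unfamiliar with the metric behind the "+0.10 to +0.20 MRR" goal, here is a minimal sketch of mean reciprocal rank under its standard definition: for each query, take 1 divided by the rank of the first relevant result (0 if none is retrieved), then average over queries. The function name and input shapes are assumptions for illustration; how the benchmark itself computes MRR is not shown in this PR.

```python
def mean_reciprocal_rank(ranked_results, relevant):
    """Standard MRR over a set of queries.

    ranked_results: one ranked list of document ids per query.
    relevant: one set of relevant document ids per query (same order).
    Both input shapes are assumptions made for this sketch.
    """
    if not ranked_results:
        return 0.0
    total = 0.0
    for results, relevant_ids in zip(ranked_results, relevant):
        for rank, doc_id in enumerate(results, start=1):
            if doc_id in relevant_ids:
                total += 1.0 / rank  # only the first relevant hit counts
                break
    return total / len(ranked_results)
```

Because each query contributes at most 1, the score lies between 0 and 1, so a gain of 0.10 to 0.20 is a sizeable share of the available range.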
3. Readability reorganization
The benchmark `<details>` comparison block stays in README.
Net effect
Verification
🤖 Generated with Claude Code