Skip to content

README: state the real generality + scale evidence; retract "near-flat"#267

Merged
Nelson Spence (Fieldnote-Echo) merged 1 commit into
mainfrom
readme/scale-and-generality-evidence
Jun 20, 2026
Merged

README: state the real generality + scale evidence; retract "near-flat"#267
Nelson Spence (Fieldnote-Echo) merged 1 commit into
mainfrom
readme/scale-and-generality-evidence

Conversation

@toadkicker

@toadkicker toadkicker commented Jun 20, 2026

Copy link
Copy Markdown
Collaborator

What

Tightens the README benchmark framing so public claims match the artifacts in this branch, and updates benchmark comments that still described the old flat-scaling story.

Why

The previous README mixed public release evidence with broader research notes, and it overstated the scale story. The public branch currently supports a narrower claim: Harrier-Q8 embeddings on public BEIR data, with the default make benchmark-beir covering scifact and trec-covid and optional harness support for nfcorpus/fiqa.

Changes

  • Frames larger-corpus and alternate-encoder results as active research until checked-in artifacts land.
  • Retracts the flat-scaling per-query wording and describes the compressed scan as O(n) with a smaller working set than exact float brute force.
  • Narrows the HNSW comparison to the committed public trec-covid run.
  • Corrects the footprint/rerank wording: b=2 RankQuant is 256 B/vector, b=4 is 512 B/vector, and the reported sign→rq2 row accounts for both stage-1 sign codes and the RankQuant reranker at 384 B/vector. The public two-stage path is sign/bitmap candidate generation followed by RankQuant b=2 rerank; it does not retain or rescore against the original float corpus.
  • Updates stale benchmark comments to match the O(n) compressed-code scan framing.

Validation

  • cargo fmt --check
  • git diff --check
  • python -m py_compile benchmarks/beir/beir_plot.py

@qodo-code-review

Copy link
Copy Markdown

Qodo reviews are paused for this user.

Troubleshooting steps vary by plan Learn more →

On a Teams plan?
Reviews resume once this user has a paid seat and their Git account is linked in Qodo.
Link Git account →

Using GitHub Enterprise Server, GitLab Self-Managed, or Bitbucket Data Center?
These require an Enterprise plan - Contact us
Contact us →

@chatgpt-codex-connector chatgpt-codex-connector Bot left a comment

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 098b6ac863

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

  • Open a pull request for review
  • Mark a draft as ready
  • Comment "Codex (@codex) review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "Codex (@codex) address that feedback".

Comment thread README.md Outdated
Comment thread README.md Outdated
Comment thread README.md Outdated

@chatgpt-codex-connector chatgpt-codex-connector Bot left a comment

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: c9f784b4d5

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

  • Open a pull request for review
  • Mark a draft as ready
  • Comment "Codex (@codex) review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "Codex (@codex) address that feedback".

Comment thread README.md Outdated
Comment thread README.md Outdated
Comment thread README.md Outdated
@chatgpt-codex-connector

Copy link
Copy Markdown

You have reached your Codex usage limits for code reviews. You can see your limits in the Codex usage dashboard.

Fieldnote-Echo
Fieldnote-Echo previously approved these changes Jun 20, 2026
@chatgpt-codex-connector

Copy link
Copy Markdown

You have reached your Codex usage limits for code reviews. You can see your limits in the Codex usage dashboard.

Tighten README benchmark framing to the evidence available in this public branch.

The README now describes the public Harrier-Q8 BEIR harness and the corpus sizes covered by those artifacts, while treating larger-corpus and alternate-encoder results as active research until public artifacts land.

It also removes near-flat per-query cost language and clarifies the O(n) compressed-code scan story, small/medium corpus latency claims, and footprint wording. The reported two-stage path is sign/bitmap candidate generation followed by RankQuant rerank; it does not retain original floats for library-side reranking.

Signed-off-by: Nelson Spence <nelson@projectnavi.ai>
@Fieldnote-Echo Nelson Spence (Fieldnote-Echo) force-pushed the readme/scale-and-generality-evidence branch from 96c6cca to 92efc66 Compare June 20, 2026 03:03
@chatgpt-codex-connector

Copy link
Copy Markdown

You have reached your Codex usage limits for code reviews. You can see your limits in the Codex usage dashboard.

@Fieldnote-Echo Nelson Spence (Fieldnote-Echo) merged commit 2b13fb4 into main Jun 20, 2026
31 checks passed
@Fieldnote-Echo Nelson Spence (Fieldnote-Echo) deleted the readme/scale-and-generality-evidence branch June 20, 2026 03:07
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants