Add optional Ollama-backed semantic sidecar reranking (lexical-first, additive) by HiveForensicsAI · Pull Request #29 · HiveForensics-AI/knolo-core

HiveForensicsAI · 2026-03-09T10:15:00Z

Motivation

Provide an optional local semantic reranking layer backed by Ollama embeddings while preserving Knolo's deterministic lexical-first retrieval model.
Keep semantic logic additive, explainable, and local-first without introducing a vector DB or changing the .knolo pack format.
Use a sidecar index to enable fast, portable reranking over only lexical top-N candidates and allow gating by lexical confidence.

Description

Added a semantic core under packages/core/src/semantic/ with types.ts, cosine.ts, sidecar.ts, provider.ts, and rerank.ts implementing provider interfaces, normalization/cosine utilities, sidecar serialization/validation, and lexical-topN reranking/blending.
Extended query() API and validation to accept semantic options (sidecar/provider/topN/minLexConfidence/blend/etc.) and attached per-hit evidence fields (retrieval, lexicalScore, semanticScore, blendedScore, modelId) while preserving the default lexical-first behavior.
Implemented an Ollama adapter in a new package @knolo/semantic-ollama that implements EmbeddingProvider with batching, timeouts, and clear errors, and added CLI sidecar commands in the knolo binary: semantic:index, semantic:inspect, and semantic:validate.
Added tests and README docs: sidecar rerank + validation tests, cosine helper tests, CLI semantic validation tests, and README usage/troubleshooting for the sidecar workflow.

Testing

Built core and adapter packages with npm run build --workspace @knolo/core and npm run build --workspace @knolo/semantic-ollama and both builds succeeded.
Ran core unit tests with npm run test --workspace @knolo/core and all tests passed (including testSemanticSidecarRerankAndValidation and testCosineHelpers).
Ran CLI tests with npm run test --workspace @knolo/cli and all CLI tests passed (including semantic:validate success/failure cases).
Verified knolo semantic:index / semantic:inspect / semantic:validate flows via unit tests that exercise sidecar creation/inspection/validation.

Codex Task

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: b8ac727f6d

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-03-09T10:20:31Z

packages/core/src/query.ts

 function shouldRerankWithSemantic(pack: Pack, opts: ResolvedSemanticOpts, confidence: number): boolean {
  if (!opts.enabled || opts.mode !== "rerank") return false;
-  if (!pack.semantic) return false;
+  if (!pack.semantic && !opts.sidecar) return false;


Wire semantic.sidecarPath into rerank eligibility

query() accepts semantic.sidecarPath, but semantic rerank is gated by if (!pack.semantic && !opts.sidecar) return false and the function never resolves sidecarPath into opts.sidecar. In practice, callers that pass only sidecarPath (with semantic.enabled, queryEmbedding, and even force) silently fall back to lexical-only retrieval, so the new option does not actually enable sidecar reranking.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-03-09T10:20:31Z

packages/core/src/query.ts

      namespace: pack.namespaces?.[r.blockId] ?? undefined,
+      evidence: {
+        retrieval: retrievalMode,
+        lexicalScore: r.score,


Preserve pre-rerank score in evidence.lexicalScore

When semantic reranking runs, prelim is replaced with blended/semantic scores, but evidence.lexicalScore is still assigned from r.score here. That means hybrid results report reranked scores as "lexical" scores, which corrupts explainability/telemetry and any downstream logic that uses lexical confidence when semantic rerank is enabled.

Useful? React with 👍 / 👎.

Add optional semantic sidecar reranking with Ollama adapter

b8ac727

HiveForensicsAI added the codex label Mar 9, 2026 — with ChatGPT Codex Connector

chatgpt-codex-connector bot reviewed Mar 9, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add optional Ollama-backed semantic sidecar reranking (lexical-first, additive)#29

Add optional Ollama-backed semantic sidecar reranking (lexical-first, additive)#29
HiveForensicsAI wants to merge 1 commit intomainfrom
codex/implement-optional-local-semantic-reranking

HiveForensicsAI commented Mar 9, 2026

Uh oh!

chatgpt-codex-connector bot left a comment

Uh oh!

chatgpt-codex-connector bot Mar 9, 2026

Uh oh!

chatgpt-codex-connector bot Mar 9, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

HiveForensicsAI commented Mar 9, 2026

Motivation

Description

Testing

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Mar 9, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector bot Mar 9, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant