Need a Knowledge Base Retrieval Tool to Provide Supplemental Grounding

<html><head></head><body><h1>Knowledge Base Retrieval Tool</h1>
<h2>Summary</h2>
<p>A new tool that lets agents retrieve grounding context from a curated knowledge base of semantic terms, taxonomies, and domain definitions. Agents invoke it at query time to pull authoritative chunks, improving terminology consistency and factual accuracy.</p>
<p>No foundation analyzer code — this is purely retrieval.</p>
<hr>
<h2>Components</h2>
<h3>CDK Stack</h3>
<p>Defines the full deployment unit. Creates the Lambda function, wires IAM permissions for Bedrock (embeddings) and OpenSearch Serverless (vector search), and registers the tool with the AgentCore Gateway. Single stack, single <code>cdk deploy</code>.</p>
<h3>Lambda Function</h3>
<p>Entry point for tool invocations. Receives a query from the gateway, calls the embedder, queries the vector index, filters by score threshold, and returns ranked chunks with metadata. ARM64, Node 20, ~512MB memory, 15s timeout.</p>
<h3>Config</h3>
<p>Centralizes tunable defaults: tool name/description, embedding model ID (<code>amazon.titan-embed-text-v2:0</code>), index name, top-k (default 5), score threshold (default 0.55), Lambda sizing. All overridable via stack props.</p>
<h3>Embedder</h3>
<p>Thin client around Bedrock <code>InvokeModel</code>. Takes a query string, returns a normalized vector. Uses Titan Embed Text v2 at 256dimensions.</p>
<h3>Retriever</h3>
<p>Handles OpenSearch Serverless kNN queries with SigV4-signed requests. Supports optional metadata filters (e.g. <code>{ "domain": "finance", "taxonomy": "GL" }</code>). Returns content, score, metadata, and source reference per hit.</p>
<h3>Gateway Registration</h3>
<p>Registers <code>knowledge_base_retrieve</code> as a tool in the AgentCore Gateway with its input schema and Lambda ARN. Agents see the tool name, description, and schema — standard tool-use contract.</p>
<hr>
<h2>Tool Schema (what agents see)</h2>

Parameter | Type | Required | Description
-- | -- | -- | --
query | string | yes | Natural-language query to search the KB with
top_k | number | no | Override number of results (default 5)
filter | object | no | Metadata key-value filter (e.g. domain, taxonomy)


<hr>
<h2>Response Shape</h2>
<p>Each invocation returns a list of results, each containing:</p>
<ul>
<li><strong>content</strong> — the retrieved text chunk</li>
<li><strong>score</strong> — similarity score</li>
<li><strong>metadata</strong> — domain, taxonomy, source doc, chunk ID, etc.</li>
<li><strong>source</strong> — reference back to the originating document</li>
</ul>
<hr>
<h2>Infrastructure Dependencies</h2>
<ul>
<li>OpenSearch Serverless collection (vector index, pre-existing)</li>
<li>Bedrock access for Titan Embed Text v2</li>
<li>AgentCore Gateway (pre-existing, receives tool registration)</li>
</ul></body></html>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Need a Knowledge Base Retrieval Tool to Provide Supplemental Grounding #44

Knowledge Base Retrieval Tool

Summary

Components

CDK Stack

Lambda Function

Config

Embedder

Retriever

Gateway Registration

Tool Schema (what agents see)

Response Shape

Infrastructure Dependencies

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Parameter	Type	Required	Description
query	string	yes	Natural-language query to search the KB with
top_k	number	no	Override number of results (default 5)
filter	object	no	Metadata key-value filter (e.g. domain, taxonomy)

Need a Knowledge Base Retrieval Tool to Provide Supplemental Grounding #44

Description

Knowledge Base Retrieval Tool

Summary

Components

CDK Stack

Lambda Function

Config

Embedder

Retriever

Gateway Registration

Tool Schema (what agents see)

Response Shape

Infrastructure Dependencies

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions