Description
AgeMem (Agentic Memory) trains memory operations—store, retrieve, update, summarize, discard—as callable tools via a three-stage RL pipeline with step-wise GRPO. Crucially, learned policies discover proactive summarization: the agent summarizes intermediate results before the context window fills, not after hitting the limit.
Directly maps to CI-799's zeph-context and zeph-agent-context changes. Current Zeph compaction triggers reactively (budget threshold); AgeMem demonstrates a learned, proactive trigger is significantly more effective.
Relevance to Zeph
Implementation Sketch
- Expose compaction/summarization trigger as a callable tool available to the agent
- Collect trajectory data (context size, task progress, upcoming tool calls)
- Train a lightweight GRPO policy to predict optimal summarization moments
- Alternatively: use a simple learned threshold on context_budget_used × task_horizon
Complexity vs Benefit
- Complexity: High (RL training pipeline) | Benefit: High (proactive compaction prevents context blowout mid-task)
- Near-term: expose trigger as tool (low complexity), defer RL training to post-v1.0
Source
Description
AgeMem (Agentic Memory) trains memory operations—store, retrieve, update, summarize, discard—as callable tools via a three-stage RL pipeline with step-wise GRPO. Crucially, learned policies discover proactive summarization: the agent summarizes intermediate results before the context window fills, not after hitting the limit.
Directly maps to CI-799's zeph-context and zeph-agent-context changes. Current Zeph compaction triggers reactively (budget threshold); AgeMem demonstrates a learned, proactive trigger is significantly more effective.
Relevance to Zeph
zeph-context:manager.rs,assembler.rs— proactive compaction trigger policyzeph-agent-context:summarization/scheduling.rs— could replace heuristic scheduling with RL-informed schedulingImplementation Sketch
Complexity vs Benefit
Source
.local/specs/NNN-agemem-proactive-compaction/spec.mdbefore implementation