From 259484c16114215b315113b6ecfe660352c9230f Mon Sep 17 00:00:00 2001
From: yishuiliunian <yishuiliunian@gmail.com>
Date: Sun, 5 Apr 2026 23:00:49 +0800
Subject: [PATCH] feat(prompt): add concurrency discipline guidance for
 sub-agent spawning
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

LLM agents could spawn excessive parallel sub-agents on complex tasks,
wasting resources and tokens. Rather than hard-coding limits in the
runtime, guide the LLM via system prompt to self-regulate concurrency
based on task complexity — start small, assess before parallelizing,
iterate instead of pre-allocating, and match parallelism to actual
task breadth.
---
 .../prompts/tools/agent-guidelines.md           | 17 ++++++++++++++---
 1 file changed, 14 insertions(+), 3 deletions(-)

diff --git a/crates/loopal-prompt-system/prompts/tools/agent-guidelines.md b/crates/loopal-prompt-system/prompts/tools/agent-guidelines.md
index f08809d..7b6694b 100644
--- a/crates/loopal-prompt-system/prompts/tools/agent-guidelines.md
+++ b/crates/loopal-prompt-system/prompts/tools/agent-guidelines.md
@@ -10,7 +10,7 @@ You can spawn sub-agents to handle tasks autonomously. Each agent runs in its ow
 
 ## When to Spawn
 
-- **Parallel independent tasks**: Multiple searches, analyses, or implementations that don't depend on each other — launch multiple agents in one message
+- **Parallel independent tasks**: Multiple searches, analyses, or implementations that don't depend on each other
 - **Deep codebase exploration**: Use an `explore` agent (read-only, optimized for search) to investigate large or unfamiliar areas
 - **Architecture planning**: Use a `plan` agent (read-only) to design implementation approaches
 - **Protecting context**: Offload research-heavy work to keep your main context focused
@@ -18,8 +18,20 @@ You can spawn sub-agents to handle tasks autonomously. Each agent runs in its ow
 ## When NOT to Spawn
 
 - Trivial tasks you can do in one or two tool calls
-- Tasks that need your accumulated conversation context (sub-agents start fresh or with a directive)
+- Tasks that need your accumulated conversation context (sub-agents start fresh)
 - Sequential single-file changes where continuity matters
+- When you already have enough information to proceed directly
+
+## Concurrency Discipline
+
+**Prefer fewer, focused agents over many parallel ones.** Each agent consumes an OS process, LLM context, and tokens. Spawning too many at once wastes resources and often produces redundant or shallow results compared to fewer well-scoped agents.
+
+Scale concurrency to task complexity:
+- **Start small.** Default to the minimum number of agents that covers the task. A single well-prompted agent often outperforms several vague ones.
+- **Assess before parallelizing.** Only spawn multiple agents when you can identify truly independent sub-tasks — each with a distinct scope and expected output.
+- **Iterate, don't pre-allocate.** Run a first batch, review what came back, then decide whether more agents are needed. Avoid spawning "just in case."
+- **Avoid redundant exploration.** Don't split one search across many explore agents — give one agent a comprehensive, well-scoped prompt instead. Reserve multiple explore agents for genuinely separate areas of the codebase.
+- **Consider the cost.** A complex multi-area refactoring may justify several parallel agents; a focused bug investigation rarely does. Match the parallelism to the real breadth of the work.
 
 ## Agent Types
 
@@ -32,5 +44,4 @@ You can spawn sub-agents to handle tasks autonomously. Each agent runs in its ow
 - Sub-agent results are NOT shown to the user — you must summarize what was found or accomplished.
 - Always include a short description (3-5 words) when spawning.
 - For open-ended research, use `explore` type. For implementation, use default.
-- Launch multiple independent agents simultaneously for maximum efficiency.
 - Trust agent outputs generally, but verify critical findings before acting on them.