๐ English README | ไธญๆๆๆกฃ
Stop AI coding agents from saying โdoneโ while leaving half the task unfinished.
Agent Long Task Skill is a transparent, Markdown-first long-task execution protocol for AI coding agents. It helps agents refine messy requirements, create structured task plans, track progress, recover from failures, verify completion, and generate handoff-ready reports.
AI coding agents are useful, but long tasks expose predictable failure modes:
- They miss requirements during multi-step work.
- They say "done" while leaving bugs, skipped items, or unfinished work.
- Long prompts are messy, token-heavy, repetitive, and easy to misunderstand.
- Context loss makes continuation difficult after interruptions.
- A simple checklist is not enough when tasks require investigation, recovery, verification, and handoff.
This Skill defines a full lifecycle:
messy user request
โ requirement refinement and compression
โ structured execution document
โ task list
โ step-by-step execution
โ failure/blocker recovery attempts
โ final verification pass
โ final user-facing report
The agent should first compress a vague or multi-requirement request into a concise execution document. It should preserve the user's intent while removing repetition, ambiguity, and token-heavy wording. Then it maps every requirement to one or more task IDs, executes tasks one by one, records failures and blockers honestly, performs a final verification audit, and reports the real result.
The first output should be a concise execution document, not a long restatement of the user's entire prompt.
- Do not copy the user's full messy prompt repeatedly.
- Convert vague language into concise executable requirements.
- Keep each task short and testable.
- Keep acceptance criteria specific but not verbose.
- Preserve traceability from original requirement to task ID.
- Prefer concise summaries over long explanations.
- Requirement refinement before implementation.
- A structured task plan with stable task IDs.
- Status tracking for every item.
- Acceptance criteria for each task.
- Failure and blocker recovery attempts within task scope.
- Final verification before claiming completion.
- Handoff notes when work is interrupted, blocked, or too large for one pass.
- Chinese-friendly and multilingual user-facing task output.
Final verification is an audit pass, not a second full execution pass.
During final verification, the agent reviews the task list, changed files, test/build results, acceptance criteria, and visible outcomes. It should not redo every task from the beginning. If verification finds a specific missing, incomplete, or broken item, the agent should perform a targeted fix and update the relevant task status.
Only after every required task is verified may the agent claim the long task is complete. If any task remains pending, in_progress, failed, blocked, or unverified, the agent must not claim full completion.
Agent Long Task Skill is for:
- Claude Code users.
- Codex users.
- OpenCode users.
- AI coding agent users.
- Developers doing multi-file fixes, refactors, productization, bug batches, UI redesigns, continuous delivery cleanup, or long implementation tasks.
The default language is English, but the workflow is useful for Chinese and multilingual teams because user-facing task titles, notes, handoff reports, and final reports can follow the user's language.
This repository is Markdown-first and does not include a graphical UI. Here, task UI support means the generated task plan, status summary, handoff report, and final report can use localized user-facing labels.
If the user prompt is in Chinese, the agent should produce user-facing task titles, notes, summaries, handoff reports, and final reports in Chinese by default. If the prompt is in English, it should use English by default.
Machine-readable task status values stay stable in English:
pending
in_progress
done
failed
blocked
verified
skipped
English display labels:
{
"pending": "โช Pending",
"in_progress": "๐ก In Progress",
"done": "โ
Done",
"failed": "๐ด Failed",
"blocked": "๐ Blocked",
"verified": "๐ต Verified",
"skipped": "๐ฃ Skipped"
}Chinese display labels:
{
"pending": "โช ๅพ
ๅค็",
"in_progress": "๐ก ่ฟ่กไธญ",
"done": "โ
ๅทฒๅฎๆ",
"failed": "๐ด ๅคฑ่ดฅ",
"blocked": "๐ ้ปๅก",
"verified": "๐ต ๅทฒ้ชๆถ",
"skipped": "๐ฃ ๅทฒ่ทณ่ฟ"
}Install as a user-level Claude skill:
Copy this folder to:
~/.claude/skills/agent-long-task-skill/
Install at the project level:
.claude/skills/agent-long-task-skill/
No package manager is required. There is no backend, web app, telemetry, network behavior, install script, or hidden automation.
Ask your coding agent to use the Skill before a long or messy implementation task:
Use this skill for this long task: fix the settings page, clean up the upload flow, verify the build, and write a final report.
You can also be more explicit:
This is a multi-requirement task. Refine it into a task plan first, then execute and verify item by item.
For Chinese tasks:
่ฟๆฏไธไธชๅค้ๆฑ้ฟไปปๅกใๅ
ๆ้ๆฑๅ็ผฉๆด็ๆไปปๅก่ฎกๅ๏ผๅๆไปปๅก ID ้้กนๆง่กใ้ชๆถๅนถ่พๅบๆ็ปๆฅๅใ
When used during an agent run, the Skill may generate:
.agent/task-plan.json
.agent/status.json
.agent/handoff.md
.agent/final-report.md
These files are execution artifacts. They are intentionally plain text or JSON so humans can inspect them.
Use these machine-readable statuses consistently:
pending
in_progress
done
failed
blocked
verified
skipped
pending: not started.in_progress: currently being worked on.done: implementation work appears completed, but final verification may still be pending.verified: checked against acceptance criteria and confirmed complete.failed: attempted and failed after reasonable recovery attempts.blocked: cannot proceed due to missing information, dependency, permission, or technical blocker.skipped: intentionally skipped with a stated reason.
Important distinction: done is not the same as verified. A task should only become verified after the final verification pass or after explicit acceptance checking. The agent must not claim full completion while important tasks are only done but not verified.
When a task fails, the agent should:
- Diagnose the likely cause.
- Try reasonable safe recovery methods within scope.
- Avoid infinite retries.
- Avoid unrelated large rewrites unless necessary.
- Record what was tried.
- If unresolved, mark
failedand provide a recovery path.
When a task is blocked, the agent should:
- Mark
blockedimmediately. - Explain the blocker.
- State what information, dependency, permission, API key, design decision, or user input is needed.
- Continue with other independent tasks when possible.
During execution, the agent keeps a full ordered task board visible in progress updates. This makes long tasks easier to follow because the user can see all tasks, current status, remaining work, failures, blockers, and verification progress at a glance.
This is a Markdown progress board, not a graphical dashboard.
The live task board is different from the final report. It is shown during execution, should stay near the bottom of progress updates when practical, and should be generated from the current task plan state rather than a separate progress list.
Example:
## Live Task Progress
| # | Task | Status |
|---|---|---|
| 1 | Refine requirements and create task plan | ๐ต Verified |
| 2 | Fix homepage course entry points | ๐ก In Progress |
| 3 | Fix mobile back button behavior | โช Pending |
| 4 | Run build verification | โช Pending |
Remaining: 2 | Done: 1 | Verified: 1 | Failed: 0 | Blocked: 0The Skill keeps a full ordered task board visible during long-task execution, so users can quickly see what is done, what is in progress, what is pending, and what is blocked.
After final verification, the agent returns a compact completion report with task status, key changes, verification performed, and recommended local checks.
The final response must include:
- Completed and verified tasks.
- Failed tasks.
- Blocked tasks.
- Skipped tasks.
- Files changed.
- Verification performed.
- Remaining risks.
- Next recommended actions.
If all required tasks are verified, the agent may say:
All required tasks have been completed and verified.
If not all required tasks are verified, the agent must say:
The task is not fully complete yet. The following items remain failed, blocked, pending, or unverified.
After all tasks are attempted and final verification is complete, the agent should return a compact report that is easy to scan. It should not only say "done."
The final report should include:
- Task status table.
- Key changes.
- Verification performed.
- Failed or blocked items.
- Recommended local checks.
The status table is required. It should show at least the task number, task title, and user-facing status label:
## Final Report
| # | Task | Status |
|---|---|---|
| 1 | Fix homepage and course entry points | ๐ต Verified |
| 2 | Run build verification | ๐ต Verified |Final verification must be reflected in the final report. The report should show verified completion status, not just implementation status. Failed, blocked, skipped, pending, or unverified tasks must not be hidden.
Use recommended local checks when the user may need to verify UI, device behavior, build output, runtime behavior, deployment, or account-specific behavior.
Markdown-first keeps the protocol:
- Transparent.
- Auditable.
- Free of hidden runtime behavior.
- Free of network access.
- Easy to adapt.
- Compatible with multiple agents.
The Skill is designed to be understood by humans first and automated later only if that becomes useful.
agent-long-task-skill/
โโ README.md
โโ README.zh-CN.md
โโ SKILL.md
โโ LICENSE
โโ docs/
โ โโ images/
โ โโ live-task-board.png
โ โโ final-report.png
โโ templates/
โ โโ task-plan.md
โ โโ acceptance-checklist.md
โ โโ handoff.md
โ โโ final-report.md
โ โโ live-task-board.md
โ โโ zh-CN/
โ โโ task-plan.md
โ โโ acceptance-checklist.md
โ โโ handoff.md
โ โโ final-report.md
โ โโ live-task-board.md
โโ schemas/
โ โโ agent-task-plan.schema.json
โโ examples/
โโ messy-user-request.md
โโ messy-user-request.zh-CN.md
โโ refined-task-plan.json
โโ refined-task-plan.zh-CN.json
โโ handoff-example.md
โโ handoff-example.zh-CN.md
โโ final-report-example.md
โโ final-report-example.zh-CN.md
โโ live-task-board-example.md
โโ live-task-board-example.zh-CN.md
SKILL.md- Templates
- JSON schema
- Examples
- Stronger schemas
- More examples
AGENTS.md,CLAUDE.md, and Codex instruction templates
- Optional CLI or dashboard integration
- Status viewer
- code-hub compatibility
MIT License.

