Skip to content

colbymchenry/devpit

 
 

Repository files navigation

DevPit

Sequential agent pipeline for AI coding agents

DevPit runs specialized AI agents one at a time on a task. Each agent runs in its own tmux session with full visibility. You can attach and watch any agent work in real-time.

Screen Recording 2026-04-06 at 9 02 44 AM

How It Works

1_15zN8-DF92E1iZJUE_cqBg
dp pipeline "Add a health check endpoint"

DevPit executes a workflow — an ordered list of steps. Each step:

  1. Spawns an AI agent in a tmux session
  2. Sends the task + context from previous steps
  3. Waits for the agent to finish
  4. Captures output and passes it to the next step

Steps can have loop-back conditions (e.g., tester fails → jump back to coder, retry up to 3 times).

The default workflow runs: architect → coder → tester ↔ coder (retry) → reviewer → design-qa ↔ coder (retry)

Custom workflows support arbitrary step sequences, context dependencies, and configurable pass/fail markers.

Installation

Prerequisites

From source

git clone https://github.com/colbymchenry/devpit.git
cd devpit
make install  # builds and installs to ~/.local/bin/dp

From npm

npm install -g devpit

Quick Start

# 1. Create a workflow (one-time)
dp create --default

# 2. Run the pipeline
dp pipeline "Add a health check endpoint"

dp create spawns Claude to interview you about your project, then generates agent files (.claude/agents/*.md) and a workflow (.claude/workflows/default.yaml). Use --default for the standard template or describe a custom workflow.

Commands

dp (no args)

Launch the interactive TUI dashboard. View running and past pipelines, start new runs, create workflows, and edit workflow configs — all from one interface.

dp pipeline "task"

Run a workflow pipeline. Loads the default workflow from .claude/workflows/default.yaml, or specify a custom one with --workflow.

dp pipeline "Fix the login form validation"
dp pipeline "Refactor auth module" --agent gemini
dp pipeline "Optimize performance" --workflow optimize
Flag Default Description
--agent claude AI runtime (claude, gemini, codex, etc.)
--model opus[1m] Model override
--timeout 10m Max time per step
--retries 3 Max loop-back retries
--workflow default Custom workflow name (from .claude/workflows/)

dp create [prompt]

Create a new workflow interactively. Claude scans your project, interviews you, and generates agent files and a workflow YAML.

dp create                                           # TUI create form
dp create --default                                 # Standard template
dp create "benchmark loop that tests and improves"  # Custom workflow

dp pipeline agent <name> "prompt"

Run a single agent interactively — spawns a tmux session and attaches your terminal.

dp pipeline agent architect "Design a caching layer"
dp pipeline agent coder "Implement the plan" --detach

dp pipeline follow "task"

Queue a follow-up task that reuses the same agent sessions with full context.

dp pipeline follow "Make the button blue instead of green"

dp pipeline status

Show running pipeline sessions with working/idle state.

dp pipeline peek <name>

Read an agent's recent terminal output.

dp pipeline peek coder
dp pipeline peek tester -n 200

dp pipeline stop

Stop all running pipeline agent sessions.

TUI Dashboard

Run dp with no arguments to launch the interactive dashboard:

  • Dashboard — view running and past pipeline runs, retry failed ones, kill active sessions
  • New run (n) — launch a pipeline with a task, workflow, and agent selection
  • Create workflow (c) — generate a new workflow with Claude
  • Edit workflow (e) — modify workflow configs: reorder steps, edit fields, add/remove steps
  • History (h) — browse past runs with status and details

Custom Workflows

Workflows are YAML files in .claude/workflows/:

name: optimize
description: Iterative benchmark-and-improve loop
steps:
  - name: baseline
    agent: benchmarker
  - name: analyst
    context: [baseline]
  - name: improver
    agent: coder
    context: [analyst]
    directive: "Implement the improvements proposed by the analyst"
  - name: verifier
    agent: benchmarker
    context: [improver]
    loop:
      goto: analyst
      max: 3
      pass: "PIPELINE_RESULT:PASS"
      fail: "PIPELINE_RESULT:FAIL"

Run with dp pipeline "your task" --workflow optimize.

Edit workflows in the TUI with e from the dashboard, or directly in YAML.

Agent Files

Agents are markdown files in .claude/agents/ with YAML frontmatter:

---
name: architect
description: Plans implementation before code gets written
model: opus
tools: Read, Glob, Grep, Bash
effort: high
---

You are the architect. Analyze the task, identify affected files,
plan the implementation, and flag risks...

dp create generates these based on your project type and preferences.

Multi-Runtime Support

DevPit works with multiple AI CLIs. The --agent flag selects the runtime:

dp pipeline "task" --agent claude    # Claude Code (default)
dp pipeline "task" --agent gemini    # Gemini CLI
dp pipeline "task" --agent codex     # OpenAI Codex
dp pipeline "task" --agent copilot   # GitHub Copilot

Each runtime has its own readiness detection, prompt delivery, and startup dialog handling built into the tmux layer.

License

MIT

About

Your dev crewpit

Resources

License

Stars

Watchers

Forks

Packages

 
 
 

Contributors

Languages

  • Go 98.1%
  • JavaScript 1.1%
  • Other 0.8%