agent-implementation-skill

✓Clean

Multi-model agent implementation workflow for software development. Orchestrates research, evaluation, design baseline, implementation, RCA, structured decomposition, constraint discovery, model selection, and agent-driven Stage 3 codemap exploration across external AI models (GPT, GLM, Claude). Use when implementing features through a structured multi-phase pipeline with worktrees, dynamic scheduling, and SQLite-backed agent coordination.

⭐ 1 stars🍴 0 forks↓ 0 installs📄 MIT

Install Command

npx skills add nestharus/agent-implementation-skill

architecture code-review project-management

Author

nestharus

Repository

nestharus/agent-implementation-skill

Discovered via

github topic

Weekly installs

Quality score

27/100

Last commit

2/24/2026

SKILL.md

---
name: agent-implementation-skill
description: Multi-model agent implementation workflow for software development. Orchestrates research, evaluation, design baseline, implementation, RCA, structured decomposition, constraint discovery, model selection, and agent-driven Stage 3 codemap exploration across external AI models (GPT, GLM, Claude). Use when implementing features through a structured multi-phase pipeline with worktrees, dynamic scheduling, and SQLite-backed agent coordination.
---

# Development Workflow

Single entry point for the full development lifecycle. Read this file,
determine what phase you're in or what the user needs, then read the
relevant sub-file from this directory.

## Paths

Everything lives in this skill folder. WORKFLOW_HOME is: !`dirname "$(grep -rl '^name: agent-implementation-skill' ~/.claude/skills/*/SKILL.md .claude/skills/*/SKILL.md 2>/dev/null | head -1)" 2>/dev/null`

When dispatching scripts or agents, export `WORKFLOW_HOME` with the path
above. Scripts also self-locate via `dirname` as a fallback when invoked
directly.

```
$WORKFLOW_HOME/
  SKILL.md              # this file â entry point
  implement.md          # multi-model implementation pipeline
  research.md           # exploration â alignment â proposal
  rca.md                # root cause analysis
  evaluate.md           # proposal review
  baseline.md           # constraint extraction
  audit.md              # structured task decomposition
  constraints.md        # constraint discovery
  models.md             # model selection guide
  scripts/
    workflow.sh         # schedule driver ([wait]/[run]/[done]/[fail])
    db.sh               # SQLite-backed coordination database
    scan.sh             # Stage 3 coordinator: dispatches agents to explore codespace and build codemap, then per-section file identification
    section-loop.py     # strategic section-loop orchestrator: integration proposals, strategic implementation, cross-section communication, global coordination (Stages 4-5 of implement.md)
  tools/
    extract-docstring-py  # extract Python module docstrings
    extract-summary-md    # extract YAML frontmatter from markdown
    README.md             # tool interface spec (for Opus to write new tools)
  agents/
    orchestrator.md     # event-driven step dispatcher (model: claude-opus)
    monitor.md          # task-level pipeline monitor â detects cycles/stuck (model: glm)
    qa-monitor.md       # deep QA monitor â 26 rules, 5 categories, PAUSE authority (model: claude-opus)
    agent-monitor.md    # per-agent loop detector â watches narration (model: glm)
    state-detector.md   # workspace state reporter (model: claude-opus)
    exception-handler.md # RCA on failed steps (model: claude-opus)
    microstrategy-writer.md # tactical per-file breakdown (model: gpt-5.3-codex-high)
    section-re-explorer.md  # re-explores sections with no related files (model: claude-opus)
    setup-excerpter.md      # extracts section excerpts from globals (model: claude-opus)
    bridge-agent.md         # resolves cross-section interface friction (model: gpt-5.3-codex-xhigh)
  templates/
    implement-proposal.md   # 10-step implementation schedule
    research-cycle.md       # 7-step research schedule
    rca-cycle.md            # 6-step RCA schedule
```

Workspaces live on native filesystem for performance, separate from project:
- **Planspace**: `~/.claude/workspaces/<task-slug>/` â schedule, state, log, artifacts, coordination database
- **Codespace**: project root or worktree â where source code lives

Clean up planspace when workflow is fully complete (`rm -rf` the workspace dir).

## Phase Detection

Check these in order:

1. **User explicitly requested an action** â Read the matching file
2. **Test failures need investigation** â `rca.md`
3. **Proposal exists, not yet evaluated** â `evaluate.md`
4. **Proposal evaluated, no baseline** â `baseline.md`
5. **Baseline exists, implementation needed** â `implement.md`
6. **No proposal exists** â `research.md`
7. **Something feels wrong about a change** â `constraints.md`
8. **Need to pick a model** â `models.md`
9. **Need structured task decomposition** â `audit.md`

## Files

| File | What It Does |
|------|-------------|
| `research.md` | Exploration â alignment â proposal â refinement |
| `evaluate.md` | Proposal alignment review (Accept / Reject / Push Back) |
| `baseline.md` | Atomize proposal into constraints / patterns / tradeoffs |
| `implement.md` | Multi-model implementation with worktrees + dynamic scheduling |
| `rca.md` | Root cause analysis + architectural fix for test failures |
| `audit.md` | General structured task decomposition + delegation |
| `constraints.md` | Surface implicit constraints, validate design principles |
| `models.md` | Model selection guide for multi-model workflows |

## Design Philosophy

These principles govern all pipeline behavior. Violations are alignment
failures.

1. **Alignment over audit** â Check directional coherence between adjacent
   layers ("is it solving the right problem?"), never feature coverage
   against a checklist ("is it done?"). The system is never done.
2. **Strategy over brute force** â Strategy collapses many waves of problems
   in one go. Brute force leads to countless cycles. Fewer tokens, fewer
   cycles, same quality.
3. **Scripts dispatch, agents decide** â Scripts do mechanical coordination
   (dispatch, check, log). Agents do reasoning (explore, understand, decide).
   Strategic decisions (grouping, relatedness, signal interpretation) belong
   to agents, not scripts.
4. **Heuristic exploration, not exhaustive scanning** â Build a routing map
   (codemap), then use it for targeted investigation. Never catalog every
   file. The cost of occasionally routing wrong is far less than exhaustive
   scanning.
5. **Problems, not features** â We decompose problems all the way down, then
   solve tiny problems. Proposals describe strategies, not implementations.
   We never do feature coverage because we generate as we go.
6. **Proposals must solve the same problems** â Alternative proposals are
   valid only if they solve the original problems. An optimization or
   complexity argument is an excuse. Do not introduce constraints the user
   did not specify.

### Terminology Contract

- **"Audit"** only ever means alignment against stated problems and
  constraints â never feature coverage against a checklist.
- **"Alignment"** is directional coherence between adjacent layers:
  does the work solve the problem it claims to solve?
- **"Feature coverage"** is explicitly banned as a verification method.
  Plans describe problems and strategies, not enumerable features.

## The Full Lifecycle

```
Exploration â Alignment â Proposal â Review â Baseline â Implementation â Verification
  (research.md)           (evaluate.md) (baseline.md) (implement.md)    (rca.md)
```

Phases iterate: Review may loop back to Research. Implementation may
trigger tangent research cycles. Verification may reveal architectural
issues requiring RCA.

## Artifact Flow

```
[Raw Idea]
    â
[Exploration Notes]              â research.md Phase A
    â
[Alignment Document]             â research.md Phase B
    â
[Proposal]                       â research.md Phase C
    â
[Evaluation Report]              â evaluate.md (iterate if REJECT/PUSH BACK)
    â
[Design Baseline]                â baseline.md (constraints/, patterns/, TRADEOFFS.md)
    â
[Section Files â Integration Proposals â Strategic Implementation â Code]  â implement.md
    â
[Tests â Debug â Constraint Check â Lint â Commit]   â implement.md + rca.md
```

## Workflow Orchestration

For multi-step workflows, use the orchestration system instead of running
everything from memory.

### Dispatch: All Agents via `uv run agents`

**CRITICAL**: All step dispatch goes through `uv run agents` via Bash.
Never use Claude's Task tool to spawn sub-agents â it causes "sibling"
errors and reliability issues. The agent runner automatically unsets
`CLAUDECODE` so sibling Claude sessions can launch.

```bash
# Sequential dispatch â model directly with prompt file
uv run agents --model <model> --file <planspace>/artifacts/step-N-prompt.md \
  > <planspace>/artifacts/step-N-output.md 2>&1

# Agent file dispatch â agent instructions prepended to prompt
uv run agents --agent-file "$WORKFLOW_HOME/agents/exception-handler.md" \
  --file <planspace>/artifacts/exception-prompt.md

# Parallel dispatch with db.sh coordination
(uv run agents --model gpt-5.3-codex-high --file <prompt-A.md> && \
  bash "$WORKFLOW_HOME/scripts/db.sh" send <planspace>/run.db orchestrator "done:block-A") &
(uv run agents --model gpt-5.3-codex-high --file <prompt-B.md> && \
  bash "$WORKFLOW_HOME/scripts/db.sh" send <planspace>/run.db orchestrator "done:block-B") &
bash "$WORKFLOW_HOME/scripts/db.sh" recv <planspace>/run.db orchestrator
bash "$WORKFLOW_HOME/scripts/db.sh" recv <planspace>/run.db orchestrator

# Codemap exploration dispatch (Opus explores the codespace)
uv run agents --model claude-opus --project <codespace> \
  --file <planspace>/artifacts/scan-logs/codemap-prompt.md \
  > <planspace>/artifacts/codemap.md 2>&1
```

### Schedule Templates

Pre-built schedules in `$WORKFLOW_HOME/templates/`. Each step specifies its model:
```
[wait] 1. step-name | model-name -- description (skill-section-reference)
```
- `implement-proposal.md` â full 10-step implementation pipeline
- `research-cycle.md` â research â evaluate â propose â refine
- `rca-cycle.md` â investigate â plan fix â apply â verify

### Stage 3 Codemap Exploration

Stage 3 dispatches agents to explore and understand the codebase:
1. An Opus agent explores the codespace â reads files, follows its curiosity, builds understanding.
2. The agent writes `<planspace>/artifacts/codemap.md` capturing what it discovered.
3. Per-section Opus agents use the codemap to identify related files for each section.
4. Deep scan dispatches GLM agents to reason about specific file relevance in context.

Control and recovery:
- If `codemap.md` already exists, reuse it only if the codespace
  fingerprint is unchanged or the verifier confirms validity; otherwise
  rebuild.
- If a section already has `## Related Files`, validate the list against
  the current codemap/section content; skip only if unchanged.
- Non-zero codemap exit stops Stage 3 before section exploration.

### Model Roles

| Model | Used For |
|-------|----------|
| `claude-opus` | Section setup (excerpt extraction), alignment checks (shape/direction), decomposition, codemap exploration, per-section file identification |
| `gpt-5.3-codex-high` | Integration proposals, strategic implementation, coordinated fixes, extraction, investigation |
| `gpt-5.3-codex-high2` | Constraint alignment check (same capability, different quota) |
| `gpt-5.3-codex-xhigh` | Deep architectural synthesis, proposal drafting |
| `glm` | Test running, verification, quick commands, deep file analysis, semantic impact analysis, sub-agent exploration during integration proposals |

### Prompt Files

Step agents receive self-contained prompt files (they cannot read
`$WORKFLOW_HOME`). The orchestrator builds each prompt from:
1. **Skill section text** â copied verbatim from the referenced skill file
2. **Planspace path** â so the agent can read/write state and artifacts
3. **Codespace path** â so the agent knows where source code lives
4. **Context** â relevant content from `state.md`
5. **Output contract** â what the agent should return on success/failure

Written to: `<planspace>/artifacts/step-N-prompt.md`

### Workspace Structure

Each workflow gets a planspace at `~/.claude/workspaces/<task-slug>/`:
- `schedule.md` â task queue with status markers (copied from template)
- `state.md` â current position + accumulated facts
- `log.md` â append-only execution log
- `artifacts/` â prompt files, output files, working files for steps
  - `artifacts/sections/` â section excerpts (proposal + alignment excerpts)
  - `artifacts/proposals/` â integration proposals per section
  - `artifacts/snapshots/` â post-completion file snapshots per section
  - `artifacts/notes/` â cross-section consequence notes
  - `artifacts/coordination/` â global coordinator state and fix prompts
  - `artifacts/decisions/` â accumulated parent decisions per section (from pause/resume)
- `run.db` â coordination database (messages, events, agent registry)
- `constraints/` â discovered constraints (promote later)
- `tradeoffs/` â discovered tradeoffs (promote later)

### Coordination System (db.sh)

SQLite-backed coordination for agent messaging. One `run.db` per pipeline
run â messages are claimed (not consumed), history is preserved, and the
database file is the complete audit trail.

```bash
# Initialize the coordination database (idempotent)
bash "$WORKFLOW_HOME/scripts/db.sh" init <planspace>/run.db

# Send a message to an agent
bash "$WORKFLOW_HOME/scripts/db.sh" send <planspace>/run.db <target> [--from <agent>] "message text"

# Block until a message arrives (agent sleeps, no busy-loop)
bash "$WORKFLOW_HOME/scripts/db.sh" recv <planspace>/run.db <name> [timeout_seconds]

# Check pending count (non-blocking)
bash "$WORKFLOW_HOME/scripts/db.sh" check <planspace>/run.db <name>

# Read all pending messages
bash "$WORKFLOW_HOME/scripts/db.sh" drain <planspace>/run.db <name>

# Agent lifecycle
bash "$WORKFLOW_HOME/scripts/db.sh" register <planspace>/run.db <name> [pid]
bash "$WORKFLOW_HOME/scripts/db.sh" unregister <planspace>/run.db <name>
bash "$WORKFLOW_HOME/scripts/db.sh" agents <planspace>/run.db
bash "$WORKFLOW_HOME/scripts/db.sh" cleanup <planspace>/run.db [name]

# Event logging and querying
bash "$WORKFLOW_HOME/scripts/db.sh" log <planspace>/run.db <kind> [tag] [body] [--agent <name>]
bash "$WORKFLOW_HOME/scripts/db.sh" tail <planspace>/run.db [kind] [--since <id>] [--limit <n>]
bash "$WORKFLOW_HOME/scripts/db.sh" query <planspace>/run.db <kind> [--tag <t>] [--agent <a>] [--since <id>] [--limit <n>]
```

**Key patterns**:
- Orchestrator blocks on `recv` waiting for parallel step results
- Step agents send `done:<step>:<summary>` or `fail:<step>:<error>` when finished
- Section-loop sends `summary:setup:`, `summary:proposal:`, `summary:proposal-align:`, `summary:impl:`, `summary:impl-align:`, `status:coordination:` messages; `complete` only on full success; `fail:<num>:coordination_exhausted:<summary>` on coordination timeout
- Mailbox is required for orchestrator/step coordination boundaries
- Codemap exploration is a single Opus agent that explores the codespace directly
- Agents needing user input send `ask:<step>:<question>`, then block on their own mailbox
- User or orchestrator can send `abort` to any agent to trigger graceful shutdown
- `agents` command shows who's registered and who's waiting â detect stuck agents

## Cross-Cutting Tools

- **audit.md** â Structured decomposition + delegation for any large task
- **constraints.md** â Before implementation or when something feels wrong
- **models.md** â Which external model to use for any given task

Similar Skills

multi-agent-code-review✓Clean

Run parallel code reviews with multiple AI agents, then synthesize into one report. Triggers on "review code" or "multi-agent review".

code-review architecture

⭐ 0↓ 0ktaletsk/multi-agent-code-review-skill

npx skills add ktaletsk/multi-agent-code-review-skill

workflow-orchestration✓Clean

Orchestrates agent workflow for non-trivial tasks: plan-first mode, subagent use, self-improvement loops, verification before done, and autonomous bug fixing. Use for any task with 3+ steps, architectural decisions, bug reports, or when the user corrects the agent. Ensures plans go in tasks/todo.md, lessons in tasks/lessons.md, and changes stay minimal and provably correct.

project-management devops architecture

⭐ 0↓ 0dnh33/workflow-orchestration

npx skills add dnh33/workflow-orchestration

swarm-iosm✓Clean

Orchestrate complex development with AUTOMATIC parallel subagent execution, continuous dispatch scheduling, dependency analysis, file conflict detection, and IOSM quality gates. Analyzes task dependencies, builds critical path, launches parallel background workers with lock management, monitors progress, auto-spawns from discoveries. Use for multi-file features, parallel implementation streams, automated task decomposition, brownfield refactoring, or when user mentions "parallel agents", "orchestrate", "swarm", "continuous dispatch", "automatic scheduling", "PRD", "quality gates", "decompose work", "Mixed/brownfield".

project-management devops architecture

⭐ 2↓ 0rokoss21/swarm-iosm

npx skills add rokoss21/swarm-iosm

mastering-langgraph✓Clean

Build stateful AI agents and agentic workflows with LangGraph in Python. Covers tool-using agents with LLM-tool loops, branching workflows, conversation memory, human-in-the-loop oversight, and production monitoring. Use when - (1) building agents that use tools and loop until task complete, (2) creating multi-step workflows with conditional branches, (3) adding persistence/memory across turns with checkpointers, (4) implementing human approval with interrupt(), (5) debugging via time-travel or LangSmith. Covers StateGraph, nodes, edges, add_conditional_edges, MessagesState, thread_id, Command objects, and ToolMessage handling. Examples include chatbots, calculator agents, and structured workflows.

architecture code-review productivity

⭐ 17↓ 0SpillwaveSolutions/mastering-langgraph-agent-skill

npx skills add SpillwaveSolutions/mastering-langgraph-agent-skill