swarm-iosm

✓Clean
Orchestrate complex development with AUTOMATIC parallel subagent execution, continuous dispatch scheduling, dependency analysis, file conflict detection, and IOSM quality gates. Analyzes task dependencies, builds critical path, launches parallel background workers with lock management, monitors progress, auto-spawns from discoveries. Use for multi-file features, parallel implementation streams, automated task decomposition, brownfield refactoring, or when user mentions "parallel agents", "orchestrate", "swarm", "continuous dispatch", "automatic scheduling", "PRD", "quality gates", "decompose work", "Mixed/brownfield".
⭐ 2 stars🍴 0 forks↓ 0 installs📄 MIT
Install Command
npx skills add rokoss21/swarm-iosm
project-management devops architecture
Author
rokoss21
Repository
rokoss21/swarm-iosm
Discovered via
github topic
Weekly installs
Quality score
24/100
Last commit
1/19/2026
SKILL.md

---
name: swarm-iosm
version: 2.1
description: Orchestrate complex development with AUTOMATIC parallel subagent execution, continuous dispatch scheduling, dependency analysis, file conflict detection, and IOSM quality gates. Analyzes task dependencies, builds critical path, launches parallel background workers with lock management, monitors progress, auto-spawns from discoveries. Use for multi-file features, parallel implementation streams, automated task decomposition, brownfield refactoring, or when user mentions "parallel agents", "orchestrate", "swarm", "continuous dispatch", "automatic scheduling", "PRD", "quality gates", "decompose work", "Mixed/brownfield".
user-invocable: true
allowed-tools: Read, Grep, Glob, Bash, Write, Edit, Task, AskUserQuestion, TodoWrite
---

# Swarm Workflow (IOSM)

A structured workflow for complex development tasks that combines PRD-driven planning, parallel subagent execution, and IOSM (ImproveâOptimizeâShrinkâModularize) quality gates.

## Quick Start

**For new features/projects (Greenfield):**
```
/swarm-iosm new-track "Add user authentication with JWT"
```

**For existing codebases (Brownfield):**
```
/swarm-iosm setup
/swarm-iosm new-track "Refactor payment processing module"
```

**Check progress:**
```
/swarm-iosm status
```

## When to Use This Skill

Use Swarm Workflow when:
- Task requires multiple parallel work streams (exploration, implementation, testing, docs)
- Need formal PRD and decomposition for complex features
- Want structured reports and traceability ("who did what and why")
- Brownfield refactoring that needs careful planning and rollback strategy
- Team collaboration requiring artifact-based handoffs
- Quality gates (IOSM) are needed for acceptance

Don't use for:
- Simple single-file changes
- Quick bug fixes
- Exploratory tasks without implementation

## Core Commands

### `/swarm-iosm setup`
Initialize project context for Swarm workflow.

**What it does:**
1. Creates `swarm/` directory structure
2. Generates project context files (product.md, tech-stack.md, workflow.md)
3. Initializes tracks.md registry

**When to use:** First time in a project, or when project context has significantly changed.

### `/swarm-iosm new-track "<description>"`
Create a new feature/task track with PRD and implementation plan.

**What it does:**
1. Requirements gathering (AskUserQuestion for mode/priorities/constraints)
2. Generate PRD (`swarm/tracks/<id>/PRD.md`)
3. Create spec (`spec.md`) and plan (`plan.md`) with phases/tasks/dependencies
4. Identify subagent roles needed
5. Create metadata.json with track info

**Arguments:** Brief description of the feature/task (e.g., "Add OAuth2 authentication")

### `/swarm-iosm implement [track-id]`
Execute the implementation plan using parallel subagents.

**What it does:**
1. Load plan from track
2. Identify parallelizable tasks vs. sequential chains
3. Launch subagents (suggests background for long-running, foreground for interactive)
4. Each subagent produces structured report in `reports/`
5. Monitor progress and collect outputs

**Arguments:** Optional track-id (defaults to most recent track)

### `/swarm-iosm status [track-id]`
Show progress summary for a track.

**What it does:**
1. Parse plan.md for task statuses
2. List completed reports
3. Show blockers and open questions
4. Display dependency chain status

### `/swarm-iosm watch [track-id]`
Open a live monitoring dashboard for a track. (v1.3)

**What it does:**
1. Calculates real-time metrics (velocity, ETA, progress %)
2. Renders an ASCII progress bar
3. Shows status of all tasks in the track
4. Refreshes data from reports and checkpoints

**Example usage:**
```
/swarm-iosm watch
```

### `/swarm-iosm simulate [track-id]`
Run a dry-run simulation of the implementation plan. (v1.3)

**What it does:**
1. Loads implementation plan and resource constraints
2. Simulates dispatch loop with virtual time
3. Identifies bottlenecks and potential conflicts
4. Generates ASCII timeline and simulation report
5. Estimates total parallel execution time vs serial

**Example usage:**
```
/swarm-iosm simulate
/swarm-iosm simulate 2026-01-17-001
```

### `/swarm-iosm resume [track-id]`
Resume an interrupted implementation from the latest checkpoint. (v1.3)

**What it does:**
1. Loads latest checkpoint from `checkpoints/latest.json`
2. Reconciles state by reading all report files in `reports/`
3. Identifies completed vs pending tasks
4. Recalculates the ready queue
5. Shows a summary of progress and next steps

**Example usage:**
```
/swarm-iosm resume
/swarm-iosm resume 2026-01-17-001
```

### `/swarm-iosm retry <task-id> [--foreground] [--reset-brief]`
Retry a failed task with optional mode changes. (v1.2)

**What it does:**
1. Reads error diagnosis from task report using parse_errors.py
2. Shows error diagnosis to user with suggested fixes
3. Asks user to choose: apply fix, manual fix, or skip
4. Regenerates subagent brief with error context
5. Relaunches task using Task tool
6. Tracks retry count (max 3 per task)

**Arguments:**
- `<task-id>`: Task to retry (e.g., T04)
- `--foreground`: Force foreground execution (for interactive debugging)
- `--reset-brief`: Regenerate brief from scratch (vs. reuse existing)

**Error-specific behaviors:**
- **Permission Denied**: Always suggest --foreground
- **MCP Tool Unavailable**: Force foreground mode
- **Import Error**: Suggest pip install before retry
- **Test Failed**: Ask user: "Fix code or update tests?"

**Example usage:**
```
/swarm-iosm retry T04
/swarm-iosm retry T04 --foreground
/swarm-iosm retry T04 --reset-brief
```

### Inter-Agent Communication (v2.0)

Subagents can share knowledge via `shared_context.md`.

**Protocol:**
1. Subagent discovers a pattern (e.g., "Use `schemas.py` for all models").
2. Subagent writes to "Shared Context Updates" in their report.
3. Orchestrator runs `merge_context.py` to update `shared_context.md`.
4. Subsequent subagents read `shared_context.md` in their brief.

**Example Report Update:**
```markdown
## Shared Context Updates
- [Error Handling]: Always wrap API calls in `try/except ApiError`.
```

### `/swarm-iosm integrate <track-id>`
Collect subagent reports and create integration plan.

**What it does:**
1. Read all reports from `swarm/tracks/<id>/reports/`
2. Identify conflicts and resolution strategy
3. Generate integration_report.md with merge order
4. Run IOSM quality gates
5. Create iosm_report.md with gate results and IOSM-Index

### `/swarm-iosm revert-plan <track-id>`
Generate rollback guide for a track (does not execute git revert).

**What it does:**
1. Analyze files touched (from reports)
2. Identify commits/changes to revert
3. Suggest checkpoint/branch strategy
4. Create rollback_guide.md with manual steps

## Advanced Features (v2.0)

### Task Dependencies Visualization (`--graph`)
Generate a Mermaid diagram of the task dependency graph.

**Usage:**
```bash
/swarm-iosm simulate --graph
```
Generates `dependency_graph.mermaid`.

### Anti-Pattern Detection
The planner automatically checks for:
- Monolithic tasks (XL + many touches)
- Low parallelism (<1.2x speedup)
- Missing quality gates
- Circular dependencies

Warnings appear in `simulate` and `validate` output.

### Template Customization
You can override standard templates by placing files in `swarm/templates/`.

**Resolution Order:**
1. `swarm/templates/<name>` (Project-specific)
2. `.claude/skills/swarm-iosm/templates/<name>` (Skill defaults)

**Supported Templates:**
- `prd.md`, `plan.md`, `subagent_brief.md`, `subagent_report.md`

### Resource Constraints & Cost Control
Define limits in `plan.md` or metadata to prevent overload.

**Defaults:**
- Max Parallel Background: 6
- Max Parallel Foreground: 2
- Max Total: 8
- Cost Limit: $10.00

**Model Selection:**
- **Auto-select:** Haiku (read-only), Sonnet (standard), Opus (security/arch).

## Instructions for Claude

---

## ORCHESTRATOR RESPONSIBILITIES

**CRITICAL:** The main agent (Claude) acts as **ORCHESTRATOR ONLY**. You coordinate subagents but DO NOT do implementation work yourself.       

### MANDATORY RULES

#### â ORCHESTRATOR DOES:

1. **Analyze & Plan**
   - Parse `plan.md` and build dependency graph
   - Generate `orchestration_plan.md` with waves/critical path
   - Detect file conflicts and resolve scheduling

2. **Launch Subagents**
   - Create detailed briefs for each subagent (using templates)
   - Launch parallel waves in **single message** (multiple Task tool calls)
   - Default to background mode (unless interactive)
   - Pre-resolve all questions for background tasks

3. **Monitor & Handle Blockers**
   - Use `/bashes` to track background tasks
   - Resume stuck tasks in foreground if needed
   - Apply fallback strategy (retry â resume â recovery task)

4. **Integrate & Gate**
   - Collect all subagent reports
   - Resolve merge conflicts
   - Run IOSM quality gates
   - Generate `integration_report.md` and `iosm_report.md`

5. **Meta-work** (ONLY exception to "no implementation")
   - Update `plan.md` status
   - Fix metadata (`metadata.json`, `tracks.md`)
   - Resolve integration conflicts (merge reports)
   - Generate final reports/docs

#### â ORCHESTRATOR NEVER DOES:

1. **Implementation work:**
   - â Write application code (services, models, API, UI)
   - â Write tests (unit, integration, performance)
   - â Refactor existing code

2. **Analysis work:**
   - â Explore codebase (that's Explorer's job)
   - â Design architecture (that's Architect's job)
   - â Run security scans (that's SecurityAuditor's job)

3. **Specialized work:**
   - â Write documentation (that's DocsWriter's job)
   - â Debug performance (that's PerfAnalyzer's job)

**Exception:** If a task is trivial (<5 min) meta-work (e.g., add entry to `tracks.md`), orchestrator MAY do it. But if it's real logic/code â delegate.

---

### ORCHESTRATION WORKFLOW

```
Phase 0: Requirements Intake
    â
Phase 1: PRD Generation
    â
Phase 2: Decomposition & Planning (create plan.md)
    â
[NEW] Phase 2.5: Orchestration Planning â AUTOMATIC
    â
Phase 3: Subagent Execution (CONTINUOUS DISPATCH) â v1.1
    â
Phase 4: Integration & IOSM Gates
    â
Phase 5: Deployment Prep
```

---

## CONTINUOUS DISPATCH LOOP (v1.1 â MANDATORY)

**ÐÐ»ÑÑÐµÐ²Ð¾Ðµ Ð¸Ð·Ð¼ÐµÐ½ÐµÐ½Ð¸Ðµ v1.1:** ÐÑÐºÐµÑÑÑÐ°ÑÐ¾Ñ ÑÐ°Ð±Ð¾ÑÐ°ÐµÑ Ð² ÑÐµÐ¶Ð¸Ð¼Ðµ **continuous scheduling** â ÐºÐ°Ðº ÑÐ¾Ð»ÑÐºÐ¾ Ð·Ð°Ð´Ð°ÑÐ° ÑÑÐ°Ð½Ð¾Ð²Ð¸ÑÑÑ READY, Ð¾Ð½Ð° Ð·Ð°Ð¿ÑÑÐºÐ°ÐµÑÑÑ Ð½ÐµÐ¼ÐµÐ´Ð»ÐµÐ½Ð½Ð¾, Ð±ÐµÐ· Ð¾Ð¶Ð¸Ð´Ð°Ð½Ð¸Ñ "ÐºÐ¾Ð½ÑÐ° Ð²Ð¾Ð»Ð½Ñ".

### ÐÐ»Ð°Ð²Ð½ÑÐ¹ Ð¿ÑÐ¸Ð½ÑÐ¸Ð¿

> **"Ð Ð°Ð±Ð¾ÑÐ°Ð¹ Ð² ÑÐµÐ¶Ð¸Ð¼Ðµ continuous scheduling: ÐºÐ°Ðº ÑÐ¾Ð»ÑÐºÐ¾ Ð¿Ð¾ÑÐ²Ð»ÑÐµÑÑÑ READY Ð·Ð°Ð´Ð°ÑÐ° Ð±ÐµÐ· ÐºÐ¾Ð½ÑÐ»Ð¸ÐºÑÐ¾Ð² touches Ð¸ Ð±ÐµÐ· needs_user_input â Ð½ÐµÐ¼ÐµÐ´Ð»ÐµÐ½Ð½Ð¾ Ð·Ð°Ð¿ÑÑÐºÐ°Ð¹ ÐµÑ Ð² background, Ð´Ð°Ð¶Ðµ ÐµÑÐ»Ð¸ Ð´ÑÑÐ³Ð¸Ðµ Ð·Ð°Ð´Ð°ÑÐ¸ ÐµÑÑ Ð²ÑÐ¿Ð¾Ð»Ð½ÑÑÑÑÑ. ÐÐ¾ÑÐ»Ðµ ÐºÐ°Ð¶Ð´Ð¾Ð³Ð¾ Ð±Ð°ÑÑÐ° ÑÐ¾Ð±Ð¸ÑÐ°Ð¹ SpawnCandidates Ð¸Ð· Ð¾ÑÑÑÑÐ¾Ð² Ð¸ Ð°Ð²ÑÐ¾Ð¼Ð°ÑÐ¸ÑÐµÑÐºÐ¸ Ð´Ð¾Ð±Ð°Ð²Ð»ÑÐ¹ Ð¸Ñ Ð² backlog. ÐÑÐ¾Ð´Ð¾Ð»Ð¶Ð°Ð¹ ÑÐ¸ÐºÐ», Ð¿Ð¾ÐºÐ° Ð½Ðµ Ð´Ð¾ÑÑÐ¸Ð³Ð½ÑÑÑ Ð·Ð°Ð´Ð°Ð½Ð½ÑÐµ IOSM Gate targets."**   

### Continuous Orchestration Loop

```
LOOP (Ð´Ð¾ Ð´Ð¾ÑÑÐ¸Ð¶ÐµÐ½Ð¸Ñ Gate targets):

  1. CollectReady()
     ââââ Ð¡Ð¾Ð±ÑÐ°ÑÑ Ð·Ð°Ð´Ð°ÑÐ¸, Ñ ÐºÐ¾ÑÐ¾ÑÑÑ deps Ð²ÑÐ¿Ð¾Ð»Ð½ÐµÐ½Ñ

  2. Classify()
     ââââ ÐÐ°Ð¶Ð´Ð¾Ð¹ Ð·Ð°Ð´Ð°ÑÐµ Ð¿ÑÐ¸ÑÐ²Ð¾Ð¸ÑÑ ÑÐµÐ¶Ð¸Ð¼:
        - background: safe, no user input needed
        - foreground: needs user decision
        - blocked_user: needs_user_input=true, Ð½Ðµ Ð¼Ð¾Ð¶ÐµÐ¼ Ð°Ð²ÑÐ¾-ÑÐµÑÐ¸ÑÑ
        - blocked_conflict: touches Ð¿ÐµÑÐµÑÐµÐºÐ°ÑÑÑÑ Ñ running

  3. ConflictCheck()
     ââââ Parallel launch Ð¢ÐÐÐ¬ÐÐ tasks Ð±ÐµÐ· Ð¿ÐµÑÐµÑÐµÑÐµÐ½Ð¸Ñ touches (Ð´Ð»Ñ write)
     ââââ Read-only tasks ÐÐ¡ÐÐÐÐ Ð¼Ð¾Ð¶Ð½Ð¾ Ð¿Ð°ÑÐ°Ð»Ð»ÐµÐ»Ð¸ÑÑ

  4. DispatchBatch()
     ââââ ÐÐ°Ð¿ÑÑÑÐ¸ÑÑ READY tasks ÐÐÐÐÐ Ð¡ÐÐÐÐ©ÐÐÐÐÐ (max 3-6 per batch)
     ââââ ÐÑÐ¸Ð¾ÑÐ¸ÑÐµÑ: critical_path > high_severity_spawn > read-only_fillers
     ââââ ÐÐ°Ð¶Ð´ÑÐ¹ batch Ð¿Ð¾Ð»ÑÑÐ°ÐµÑ batch_id Ð´Ð»Ñ ÑÑÐµÐºÐ¸Ð½Ð³Ð°
     ââââ ÐÐµ Ð¶Ð´Ð°ÑÑ "ÐºÐ¾Ð½ÑÐ° Ð²Ð¾Ð»Ð½Ñ" â dispatch immediately

  5. Monitor()
     ââââ ÐÐµÑÐ¸Ð¾Ð´Ð¸ÑÐµÑÐºÐ¸ ÑÐ¸ÑÐ°ÑÑ outputs background tasks
     ââââ Ð¡Ð¾Ð±Ð¸ÑÐ°ÑÑ SpawnCandidates Ð¸Ð· Ð¾ÑÑÑÑÐ¾Ð²

  6. AutoSpawn()
     ââââ ÐÑÐ»Ð¸ Ð½Ð°Ð¹Ð´ÐµÐ½Ñ SpawnCandidates â ÑÐ¾Ð·Ð´Ð°ÑÑ Ð½Ð¾Ð²ÑÐµ tasks
     ââââ ÐÐ¾Ð±Ð°Ð²Ð¸ÑÑ Ð² backlog Ð¸ Ð²ÐµÑÐ½ÑÑÑÑÑ Ðº ÑÐ°Ð³Ñ 1

  7. GateCheck()
     ââââ ÐÑÐ¾Ð²ÐµÑÐ¸ÑÑ ÑÑÐ»Ð¾Ð²Ð¸Ñ Gate-I/M/O/S
     ââââ ÐÑÐ»Ð¸ Ð´Ð¾ÑÑÐ¸Ð³Ð½ÑÑÑ â Ð¾ÑÑÐ°Ð½Ð¾Ð²Ð¸ÑÑÑÑ + gate-report
     ââââ ÐÑÐ»Ð¸ Ð½ÐµÑ â Ð°Ð²ÑÐ¾-spawn remediation tasks Ð¸ Ð¿ÑÐ¾Ð´Ð¾Ð»Ð¶Ð¸ÑÑ

END LOOP
```

### Task States (Ð²Ð½ÑÑÑÐµÐ½Ð½Ð¸Ð¹ ÑÑÐµÐºÐ¸Ð½Ð³)

| State | ÐÐ¿Ð¸ÑÐ°Ð½Ð¸Ðµ |
|-------|----------|
| `backlog` | ÐÑÐµ Ð¸Ð·Ð²ÐµÑÑÐ½ÑÐµ Ð·Ð°Ð´Ð°ÑÐ¸ |
| `ready` | Deps satisfied, Ð¼Ð¾Ð¶Ð½Ð¾ Ð·Ð°Ð¿ÑÑÐºÐ°ÑÑ |
| `running` | ÐÑÐ¿Ð¾Ð»Ð½ÑÐµÑÑÑ (background Ð¸Ð»Ð¸ foreground) |
| `blocked_user` | needs_user_input=true, Ð¶Ð´ÑÑ ÑÐµÑÐµÐ½Ð¸Ñ |
| `blocked_conflict` | touches Ð·Ð°Ð½ÑÑÑ Ð´ÑÑÐ³Ð¾Ð¹ running task |
| `done` | ÐÐ°Ð²ÐµÑÑÐµÐ½Ð° |

**ÐÑÐ°Ð²Ð¸Ð»Ð¾:** ÐÑÐ»Ð¸ Ð·Ð°Ð´Ð°ÑÐ° ÑÑÐ°Ð»Ð° READY Ð² Ð¼Ð¾Ð¼ÐµÐ½Ñ, ÐºÐ¾Ð³Ð´Ð° Ð´ÑÑÐ³Ð¸Ðµ Ð²ÑÐ¿Ð¾Ð»Ð½ÑÑÑÑÑ â **Ð·Ð°Ð¿ÑÑÐºÐ°ÑÑ ÑÑÐ°Ð·Ñ**, Ð½Ðµ Ð¶Ð´Ð°ÑÑ checkpoint.

### Touches Lock Manager

ÐÐ»Ñ Ð±ÐµÐ·Ð¾Ð¿Ð°ÑÐ½Ð¾Ð³Ð¾ Ð¿Ð°ÑÐ°Ð»Ð»ÐµÐ»Ð¸Ð·Ð¼Ð° Ð¾ÑÐºÐµÑÑÑÐ°ÑÐ¾Ñ Ð´Ð¾Ð»Ð¶ÐµÐ½ Ð¾ÑÑÐ»ÐµÐ¶Ð¸Ð²Ð°ÑÑ "Ð·Ð°Ð½ÑÑÑÐµ" ÑÐ°Ð¹Ð»Ñ:  

```
touches_lock: Set[path] = {}

ÐÑÐ¸ Ð·Ð°Ð¿ÑÑÐºÐµ task:
  1. ÐÑÐ¾Ð²ÐµÑÐ¸ÑÑ: task.touches â© touches_lock == â ?
  2. ÐÑÐ»Ð¸ Ð´Ð° â touches_lock.add(task.touches), Ð·Ð°Ð¿ÑÑÑÐ¸ÑÑ
  3. ÐÑÐ»Ð¸ Ð½ÐµÑ â blocked_conflict, Ð¶Ð´Ð°ÑÑ Ð¾ÑÐ²Ð¾Ð±Ð¾Ð¶Ð´ÐµÐ½Ð¸Ñ

ÐÑÐ¸ Ð·Ð°Ð²ÐµÑÑÐµÐ½Ð¸Ð¸ task:
  1. touches_lock.remove(task.touches)
  2. ÐÐµÑÐµÑÑÐ¸ÑÐ°ÑÑ ready_queue (ÐºÑÐ¾ ÑÐ°Ð·Ð±Ð»Ð¾ÐºÐ¸ÑÐ¾Ð²Ð°Ð»ÑÑ?)
```

**ÐÑÐ°Ð²Ð¸Ð»Ð° ÐºÐ¾Ð½ÑÐ»Ð¸ÐºÑÐ¾Ð²:**
- `read-only` Ð·Ð°Ð´Ð°ÑÐ¸ â **Ð²ÑÐµÐ³Ð´Ð° Ð¿Ð°ÑÐ°Ð»Ð»ÐµÐ»ÑÐ½Ð¾** (Ð½Ðµ Ð±ÐµÑÑÑ lock)
- `write-local` â Ð¿Ð°ÑÐ°Ð»Ð»ÐµÐ»ÑÐ½Ð¾ ÐµÑÐ»Ð¸ touches Ð½Ðµ Ð¿ÐµÑÐµÑÐµÐºÐ°ÑÑÑÑ
- `write-shared` â ÑÑÑÐ¾Ð³Ð¾ Ð¿Ð¾ÑÐ»ÐµÐ´Ð¾Ð²Ð°ÑÐµÐ»ÑÐ½Ð¾

### Lock Granularity (v1.1.1)

**ÐÐµÑÐ°ÑÑÐ¸Ñ ÐºÐ¾Ð½ÑÐ»Ð¸ÐºÑÐ¾Ð²:**

```
Lock Ð¿Ð¾ ÐÐÐÐÐ (core/) ÐºÐ¾Ð½ÑÐ»Ð¸ÐºÑÑÐµÑ:
  âââ Ñ Ð»ÑÐ±ÑÐ¼ lock Ð²Ð½ÑÑÑÐ¸ (core/a.py, core/b.py)
  âââ Ñ lock Ð½Ð° ÑÐ°Ð¼Ñ Ð¿Ð°Ð¿ÐºÑ (core/)

Lock Ð¿Ð¾ Ð¤ÐÐÐÐ£ (core/a.py) ÐºÐ¾Ð½ÑÐ»Ð¸ÐºÑÑÐµÑ:
  âââ ÑÐ¾Ð»ÑÐºÐ¾ Ñ ÑÐµÐ¼ Ð¶Ðµ ÑÐ°Ð¹Ð»Ð¾Ð¼
  âââ Ñ lock Ð½Ð° ÑÐ¾Ð´Ð¸ÑÐµÐ»ÑÑÐºÑÑ Ð¿Ð°Ð¿ÐºÑ (core/)
```

**ÐÐ¾ÑÐ¼Ð°Ð»Ð¸Ð·Ð°ÑÐ¸Ñ Ð¿ÑÑÐµÐ¹:**
- ÐÑÐµÐ³Ð´Ð° Ð¸ÑÐ¿Ð¾Ð»ÑÐ·Ð¾Ð²Ð°ÑÑ `/` (forward slash)
- Ð£Ð±Ð¸ÑÐ°ÑÑ trailing slash (`core/` â `core`)
- ÐÑÐ¸Ð²Ð¾Ð´Ð¸ÑÑ Ðº lowercase (Ð´Ð»Ñ Windows)
- ÐÑÐ¿Ð¾Ð»ÑÐ·Ð¾Ð²Ð°ÑÑ Ð¾ÑÐ½Ð¾ÑÐ¸ÑÐµÐ»ÑÐ½ÑÐµ Ð¿ÑÑÐ¸ Ð¾Ñ ÐºÐ¾ÑÐ½Ñ Ð¿ÑÐ¾ÐµÐºÑÐ°

**ÐÑÐ¸Ð¼ÐµÑ Ð¿ÑÐ¾Ð²ÐµÑÐºÐ¸ ÐºÐ¾Ð½ÑÐ»Ð¸ÐºÑÐ°:**
```python
def conflicts(lock_a: str, lock_b: str) -> bool:
    a, b = normalize(lock_a), normalize(lock_b)
    return a == b or a.startswith(b + '/') or b.startswith(a + '/')
```

### Read-Only Safety Rules

**ÐÑÐ¾Ð±Ð»ÐµÐ¼Ð°:** "read-only" Ð·Ð°Ð´Ð°ÑÐ¸ Ð¼Ð¾Ð³ÑÑ ÑÐ»ÑÑÐ°Ð¹Ð½Ð¾ Ð¿Ð¸ÑÐ°ÑÑ Ð² cache, lockfiles, __pycache__.

**Ð ÐµÑÐµÐ½Ð¸Ðµ:** read-only Ð·Ð°Ð´Ð°ÑÐ¸ ÐÐÐÐÐÐ«:
1. ÐÐ Ð·Ð°Ð¿ÑÑÐºÐ°ÑÑ ÐºÐ¾Ð¼Ð°Ð½Ð´Ñ, Ð¼ÐµÐ½ÑÑÑÐ¸Ðµ ÑÐ°Ð¹Ð»Ñ (`npm install`, `pip install`)
2. ÐÐ¸ÑÐ°ÑÑ Ð²ÑÐµÐ¼ÐµÐ½Ð½ÑÐµ Ð°ÑÑÐµÑÐ°ÐºÑÑ Ð¢ÐÐÐ¬ÐÐ Ð² `swarm/tracks/<id>/scratch/`
3. ÐÑÐ¿Ð¾Ð»ÑÐ·Ð¾Ð²Ð°ÑÑ ÑÐ»Ð°Ð³Ð¸ `--dry-run`, `--check` Ð³Ð´Ðµ Ð²Ð¾Ð·Ð¼Ð¾Ð¶Ð½Ð¾

**scratch_dir Ð¿ÑÐ°Ð²Ð¸Ð»Ð¾:**
```
swarm/tracks/<track-id>/scratch/   â read-only tasks Ð¿Ð¸ÑÑÑ ÑÑÐ´Ð°
  âââ T00_analysis.json
  âââ T03_coverage.xml
  âââ ...
```

ÐÑÐ° Ð¿Ð°Ð¿ÐºÐ° ÐÐ ÑÑÐµÐ±ÑÐµÑ lock Ð¸ ÐÐ ÐºÐ¾Ð½ÑÐ»Ð¸ÐºÑÑÐµÑ Ð½Ð¸ Ñ ÐºÐµÐ¼.

### Auto-Background Classification

ÐÑÐºÐµÑÑÑÐ°ÑÐ¾Ñ Ð°Ð²ÑÐ¾Ð¼Ð°ÑÐ¸ÑÐµÑÐºÐ¸ ÐºÐ»Ð°ÑÑÐ¸ÑÐ¸ÑÐ¸ÑÑÐµÑ Ð·Ð°Ð´Ð°ÑÐ¸:

**Auto-background** (safe, Ð·Ð°Ð¿ÑÑÐºÐ°ÑÑ Ð±ÐµÐ· Ð²Ð¾Ð¿ÑÐ¾ÑÐ¾Ð²):
- Concurrency class = `read-only`
- ÐÐ»Ð¸ `write-local` + `needs_user_input=false` + no policy conflicts
- effort >= M Ð¸ Ð½ÐµÑ choice points

**Auto-foreground** (Ð½ÑÐ¶ÐµÐ½ Ð¿Ð¾Ð»ÑÐ·Ð¾Ð²Ð°ÑÐµÐ»Ñ):
- ÐÐµÐ½ÑÐµÑÑÑ API ÐºÐ¾Ð½ÑÑÐ°ÐºÑ/ÑÐ¾ÑÐ¼Ð°Ñ Ð¾ÑÐ²ÐµÑÐ°
- ÐÑÐ¶Ð½Ð° "Ð¸ÑÑÐ¸Ð½Ð°" (Ð¸ÑÑÐ¾ÑÐ½Ð¸ÐºÐ¸, Ð±Ð¸Ð·Ð½ÐµÑ-Ð»Ð¾Ð³Ð¸ÐºÐ°, Ð°ÑÑÑÐ¾Ð»Ð¾Ð³Ð¸Ñ)
- ÐÐ°Ð´Ð°ÑÑ ÑÐµÑÑÑ Ð¸ Ð½ÑÐ¶Ð½Ð¾ ÑÐµÑÐ¸ÑÑ "ÑÐ¸ÐºÑÐ¸ÑÑ ÐºÐ¾Ð´ Ð¸Ð»Ð¸ ÑÐµÑÑ"
- High-risk Ð¸Ð·Ð¼ÐµÐ½ÐµÐ½Ð¸Ñ Ð±ÐµÐ· ÑÐµÑÑÐ¾Ð²
- needs_user_input=true

### SpawnCandidates Protocol

ÐÐ°Ð¶Ð´ÑÐ¹ ÑÑÐ±Ð°Ð³ÐµÐ½Ñ ÐÐÐ¯ÐÐÐ Ð¿Ð¸ÑÐ°ÑÑ Ð² Ð¾ÑÑÑÑÐµ ÑÐµÐºÑÐ¸Ñ `SpawnCandidates`:

```markdown
## SpawnCandidates

ÐÑÐ¸ ÑÐ°Ð±Ð¾ÑÐµ Ð¾Ð±Ð½Ð°ÑÑÐ¶ÐµÐ½Ñ Ð½Ð¾Ð²ÑÐµ work items:

| ID | Subtask | Touches | Effort | User Input | Severity | Dedup Key | Accept Criteria |
|----|---------|---------|--------|------------|----------|-----------|-----------------|
| SC-01 | Fix missing type annotation in auth.py | `backend/auth.py` | S | false | medium | auth.py|type-annot | mypy passes |
| SC-02 | Clarify API contract for /natal/aspects | `docs/api_spec.yaml` | M | true | high | api_spec|contract | Contract approved |
```

**Dedup Key ÑÐ¾ÑÐ¼Ð°Ñ:** `<primary_touch>|<intent_category>`
- ÐÑÐ¿Ð¾Ð»ÑÐ·ÑÐµÑÑÑ Ð´Ð»Ñ Ð´ÐµÐ´ÑÐ¿Ð»Ð¸ÐºÐ°ÑÐ¸Ð¸ Ð¾Ð´Ð¸Ð½Ð°ÐºÐ¾Ð²ÑÑ ÐºÐ°Ð½Ð´Ð¸Ð´Ð°ÑÐ¾Ð² Ð¾Ñ ÑÐ°Ð·Ð½ÑÑ Ð²Ð¾ÑÐºÐµÑÐ¾Ð²

**ÐÑÐºÐµÑÑÑÐ°ÑÐ¾Ñ Ð¾Ð±ÑÐ·Ð°Ð½:**
1. ÐÐ¾ÑÐ»Ðµ ÐºÐ°Ð¶Ð´Ð¾Ð³Ð¾ task completion â ÑÐ¸ÑÐ°ÑÑ SpawnCandidates
2. **ÐÐµÐ´ÑÐ¿Ð»Ð¸ÑÐ¸ÑÐ¾Ð²Ð°ÑÑ** Ð¿Ð¾ dedup_key (Ð¿ÐµÑÐ²ÑÐ¹ wins)
3. ÐÑÐ»Ð¸ `needs_user_input=false` Ð¸ `severity != critical` â auto-spawn
4. ÐÑÐ»Ð¸ `needs_user_input=true` â Ð´Ð¾Ð±Ð°Ð²Ð¸ÑÑ Ð² blocked_user queue
5. ÐÑÐ¾Ð³Ð½Ð°ÑÑ Ð½Ð¾Ð²ÑÐµ tasks ÑÐµÑÐµÐ· Ð¿Ð»Ð°Ð½Ð½ÐµÑ Ð¸ dispatch

### Spawn Protection (v1.1.1)

**ÐÐ°ÑÐ¸ÑÐ° Ð¾Ñ Ð±ÐµÑÐºÐ¾Ð½ÐµÑÐ½Ð¾Ð³Ð¾ ÑÐ°Ð·Ð¼Ð½Ð¾Ð¶ÐµÐ½Ð¸Ñ Ð·Ð°Ð´Ð°Ñ:**

#### (A) Spawn Budget

Ð `iosm_state.md` Ð¾ÑÑÐ»ÐµÐ¶Ð¸Ð²Ð°ÑÑ:

```markdown
## Spawn Budget
- spawn_budget_total: 20
- spawn_budget_used: 7
- spawn_budget_remaining: 13
- spawn_budget_per_gate:
  - Gate-I: 5 (used: 2)
  - Gate-O: 8 (used: 3)
  - Gate-M: 4 (used: 2)
  - Gate-S: 3 (used: 0)
```

**ÐÑÐ°Ð²Ð¸Ð»Ð°:**
- ÐÑÐ¸ Ð¸ÑÑÐµÑÐ¿Ð°Ð½Ð¸Ð¸ budget â STOP, ÑÐ¿ÑÐ¾ÑÐ¸ÑÑ Ð¿Ð¾Ð»ÑÐ·Ð¾Ð²Ð°ÑÐµÐ»Ñ
- `severity=critical` Ð¸Ð³Ð½Ð¾ÑÐ¸ÑÑÐµÑ budget (Ð²ÑÐµÐ³Ð´Ð° spawn)
- User Ð¼Ð¾Ð¶ÐµÑ ÑÐ²ÐµÐ»Ð¸ÑÐ¸ÑÑ budget ÐºÐ¾Ð¼Ð°Ð½Ð´Ð¾Ð¹

#### (B) Dedup Rules

```python
def dedup_key(candidate) -> str:
    return f"{candidate.touches[0]}|{candidate.intent_category}"

# ÐÑÐºÐµÑÑÑÐ°ÑÐ¾Ñ ÑÑÐ°Ð½Ð¸Ñ:
seen_dedup_keys: Set[str] = set()

# ÐÑÐ¸ Ð¾Ð±ÑÐ°Ð±Ð¾ÑÐºÐµ SpawnCandidate:
if candidate.dedup_key in seen_dedup_keys:
    skip  # Ð´ÑÐ±Ð»Ñ
else:
    seen_dedup_keys.add(candidate.dedup_key)
    process(candidate)
```

#### (C) Severity Threshold

| Severity | Auto-spawn ÑÑÐ»Ð¾Ð²Ð¸Ðµ |
|----------|-------------------|
| `critical` | ÐÐ¡ÐÐÐÐ (Ð´Ð°Ð¶Ðµ ÐµÑÐ»Ð¸ budget=0), STOP loop Ð¸ alert |
| `high` | ÐÑÐ»Ð¸ gate fail ÐÐÐ user Ð·Ð°Ð¿ÑÐ¾ÑÐ¸Ð» |
| `medium` | ÐÑÐ»Ð¸ gate fail Ð budget > 0 |
| `low` | Ð¢Ð¾Ð»ÑÐºÐ¾ Ð¿Ð¾ ÑÐ²Ð½Ð¾Ð¼Ñ Ð·Ð°Ð¿ÑÐ¾ÑÑ user |

#### (D) Anti-Loop Protection

```markdown
## Anti-Loop Metrics (in iosm_state.md)
- loops_without_progress: 0  # ÑÐ±ÑÐ°ÑÑÐ²Ð°ÐµÑÑÑ Ð¿ÑÐ¸ Ð»ÑÐ±Ð¾Ð¼ task completion
- max_loops_without_progress: 3
- total_loop_iterations: 15
- max_total_iterations: 50
```

**ÐÑÐ°Ð²Ð¸Ð»Ð¾:** ÐÑÐ»Ð¸ `loops_without_progress >= 3` â STOP, analyze why stuck

### Model Selection & Cost (v1.2)

**Model Selection Rules:**
- `haiku`: read-only tasks ($0.25/M tokens)
- `sonnet`: standard tasks, background automation ($3.00/M tokens)
- `opus`: security audits, critical architecture, user decisions ($15.00/M tokens)

**Cost Tracking:**
Orchestrator tracks cost in `iosm_state.md`:
- **Estimate:** Calculated from Effort field (S=5k, M=20k, L=50k, XL=100k tokens)
- **Actual:** Sum of tokens reported by subagent (if available) or estimate if not

**Budget Control:**
- Default limit: $10.00 per track
- **Warn @ 80%** ($8.00): Notify user
- **Stop @ 100%** ($10.00): Pause execution, ask user to increase budget or prune tasks

### Gate-Driven Continuation

ÐÑÐºÐµÑÑÑÐ°ÑÐ¾Ñ Ð¿ÑÐ¾Ð´Ð¾Ð»Ð¶Ð°ÐµÑ LOOP Ð¿Ð¾ÐºÐ° Ð½Ðµ Ð´Ð¾ÑÑÐ¸Ð³Ð½ÑÑÑ Gate targets:

**ÐÐ±Ð½Ð¾Ð²Ð»ÑÑÑ `iosm_state.md` Ð¿Ð¾ÑÐ»Ðµ ÐºÐ°Ð¶Ð´Ð¾Ð³Ð¾ Ð±Ð°ÑÑÐ°:**

```markdown
# IOSM State â [Track ID]

**Updated:** 2026-01-17 15:30
**Status:** IN_PROGRESS

## Gate Targets (from plan.md)
- Gate-I: â¥0.75 (current: 0.68) â
- Gate-M: pass (current: pass) â
- Gate-O: tests pass (current: 3 failing) â
- Gate-S: N/A

## Auto-Spawn Queue
Based on gate gaps, auto-spawning:
- T15: "Improve naming clarity in core/calculator.py" (Gate-I gap)
- T16: "Fix 3 failing integration tests" (Gate-O gap)

## Blocking Questions (needs user)
- Q1: Should we fix test_natal_aspects.py or update expected values?

## Next Actions
Waiting for T15, T16 to complete. Then re-evaluate gates.
```

**ÐÑÐ°Ð²Ð¸Ð»Ð° Ð¿ÑÐ¾Ð´Ð¾Ð»Ð¶ÐµÐ½Ð¸Ñ:**
- ÐÑÐ»Ð¸ Gate-I Ð½Ð¸Ð¶Ðµ Ð¿Ð¾ÑÐ¾Ð³Ð° â auto-spawn "Improve clarity / reduce duplication"
- ÐÑÐ»Ð¸ Gate-O Ð½Ðµ pass â auto-spawn "fix failing tests"
- ÐÑÐ»Ð¸ Gate-M Ð½Ðµ pass â auto-spawn "remove circular import / clarify boundaries"
- ÐÑÐ¾Ð´Ð¾Ð»Ð¶Ð°ÑÑ Ð¿Ð¾ÐºÐ° gates Ð½Ðµ Ð´Ð¾ÑÑÐ¸Ð³Ð½ÑÑÑ

### Stop Conditions

ÐÑÐºÐµÑÑÑÐ°ÑÐ¾Ñ ÐÐÐ¯ÐÐÐ Ð¾ÑÑÐ°Ð½Ð¾Ð²Ð¸ÑÑÑÑ Ð¸ ÑÐ¿ÑÐ¾ÑÐ¸ÑÑ Ð¿Ð¾Ð»ÑÐ·Ð¾Ð²Ð°ÑÐµÐ»Ñ ÐµÑÐ»Ð¸:

1. **ÐÑÐµ remaining tasks = needs_user_input=true** â Ð½ÐµÑÐµÐ³Ð¾ Ð´ÐµÐ»Ð°ÑÑ Ð°Ð²ÑÐ¾Ð½Ð¾Ð¼Ð½Ð¾
2. **ÐÑÐ¾ÑÐ¸Ð²Ð¾ÑÐµÑÐ¸Ðµ** â "fix code vs fix tests" Ð±ÐµÐ· Ð¿Ð¾Ð»Ð¸ÑÐ¸ÐºÐ¸
3. **High-risk** â Ð¸Ð·Ð¼ÐµÐ½ÐµÐ½Ð¸Ðµ Ð±Ð¸Ð·Ð½ÐµÑ-Ð»Ð¾Ð³Ð¸ÐºÐ¸ Ð±ÐµÐ· Ð¸ÑÑÐ¾ÑÐ½Ð¸ÐºÐ°/ÑÑÐ°Ð»Ð¾Ð½Ð°
4. **Scope creep** â auto-spawn Ð²ÑÑÐ¾Ð´Ð¸Ñ Ð·Ð° ÑÐ°Ð¼ÐºÐ¸ PRD
5. **Critical severity** â SpawnCandidate Ñ severity=critical

---

## RETRY WORKFLOW (v1.2)

When user invokes `/swarm-iosm retry <task-id>`:

**1. Load error diagnosis:**
```python
from parse_errors import parse_subagent_errors
report_path = Path(f"swarm/tracks/{track_id}/reports/{task_id}.md")
diagnoses = parse_subagent_errors(report_path, task_id)
```

**2. Show diagnosis to user:**
Present each error with:
- Error type (e.g., "Permission Denied")
- Affected file
- Root reason
- Suggested fixes (from error diagnosis)

**3. User chooses action:**
Use AskUserQuestion with options:
- "Apply suggested fix" (if automatic fix available)
- "Manual fix required" (user does it manually)
- "Skip and continue" (mark task as failed)

**4. Regenerate brief:**
Create new brief with:
- All original brief content
- New "Previous Attempt" section:
```markdown
## Previous Attempt (Failed)

This task was attempted before and failed with:

**Error:** Permission Denied
**File:** backend/migrations/001.sql
**Reason:** Database user lacks CREATE TABLE permission

**What was attempted:** Direct migration execution

**What to do differently:**
1. Grant permissions first, OR
2. Run as admin user, OR
3. Break into smaller steps
```
- New "Special Instructions" based on error type
- Error-specific context (files, commands, etc.)

**5. Relaunch:**
```python
Task(
    subagent_type="iosm-engineering-agent",
    prompt=updated_brief,
    run_in_background=(not "--foreground" in user_command)
)
```

**6. Update state:**
- In iosm_state.md, mark task as RETRY_IN_PROGRESS
- Track retry_count in task metadata
- If retry_count >= 3, mark as PERMANENTLY_FAILED

### Retry Limits

- **Max 3 retries** per task
- After 3rd failure: mark as `PERMANENTLY_FAILED`
- Requires manual intervention to proceed

### Error-Specific Retry Strategies

| Error Type | Auto-Fix | Mode | Notes |
|------------|----------|------|-------|
| Permission Denied | No | foreground | User must grant permissions |
| Import Error | Yes (pip install) | background | Try install first |
| Test Failed | No | foreground | User decision: fix code or tests |
| MCP Tool Unavailable | No | foreground | Background can't use MCP |
| File Not Found | Maybe | foreground | Check dependency task |
| Timeout | No | foreground | May need effort increase |

---

### Wave Checkpoints (Ð½Ðµ Ð±Ð°ÑÑÐµÑÑ)

Waves Ð¾ÑÑÐ°ÑÑÑÑ Ð´Ð»Ñ **Ð¾ÑÑÑÑÐ½Ð¾ÑÑÐ¸ Ð¸ checkpoints**, Ð½Ð¾ ÐÐ Ð´Ð»Ñ blocking:

```
Wave 1: [T01, T02] â checkpoint Ð´Ð»Ñ Gate-I review
Wave 2: [T03, T04, T05] â checkpoint Ð´Ð»Ñ Gate-M review
Wave 3: [T06, T07] â checkpoint Ð´Ð»Ñ Gate-O review
```

**ÐÐ¾:** ÐÑÐ»Ð¸ T03 Ð·Ð°Ð²ÐµÑÑÐ¸Ð»ÑÑ ÑÐ°Ð½ÑÑÐµ T02, Ð¸ T04 depends_on T03 â **Ð·Ð°Ð¿ÑÑÐºÐ°ÑÑ T04 ÑÑÐ°Ð·Ñ**, Ð½Ðµ Ð¶Ð´Ð°ÑÑ Wave 2 checkpoint.

---

### PHASE 2.5: ORCHESTRATION PLANNING (AUTOMATIC)

**Goal:** Transform `plan.md` into executable `orchestration_plan.md` with waves, modes, conflict resolution.

**When:** After `plan.md` is created, before launching subagents.

**Steps:**

1. **Validate plan.md has required fields:**
   ```bash
   python .claude/skills/swarm-iosm/scripts/orchestration_planner.py swarm/tracks/<id>/plan.md --validate
   ```

   Check all tasks have:
   - `Touches` (files/folders)
   - `Needs user input` (true/false)
   - `Effort` (S/M/L/XL or minutes)

   **If missing:** Tasks without these fields CANNOT be auto-scheduled. Ask user to add them OR infer from context.

2. **Generate orchestration plan:**
   ```bash
   python .claude/skills/swarm-iosm/scripts/orchestration_planner.py swarm/tracks/<id>/plan.md --generate
   ```

   This creates `swarm/tracks/<id>/orchestration_plan.md` with:
   - Dependency graph
   - Critical path (longest path through dependencies)
   - Execution waves (parallel grouping)
   - File conflict matrix
   - Background readiness checklist
   - Time estimates (serial vs parallel)

3. **Review with user:**
   Show orchestration plan summary:
   ```
   Generated orchestration plan:
   - 5 waves (14 tasks total)
   - Wave 1: 1 task (Explorer, background)
   - Wave 2: 3 tasks parallel (Architects, foreground)
   - Wave 3: 3 tasks parallel (Implementers, background)
   - Wave 4: 3 tasks (Tests, background)
   - Wave 5: 3 tasks (Integration, mixed)

   Estimated time: 27-42h parallel (vs 60-80h serial)
   Speedup: ~1.8x

   Ready to execute? (yes/no)
   ```

4. **Pre-resolve questions for background tasks:**
   For each task marked `needs_user_input: false` but you suspect may need decisions:
   - Use AskUserQuestion NOW (before launching)
   - Document answers in subagent brief

   **Example:**
   ```
   Wave 3 has 3 background implementers.
   Before launching background tasks, let me clarify:

   [AskUserQuestion with 2-3 questions about API design, error handling, testing strategy]

   These answers will be included in subagent briefs so they can work autonomously.
   ```

**Output:** `orchestration_plan.md` ready, all questions resolved, ready for Phase 3 execution.

---

### Phase 1: Requirements Intake (Universal)

When user invokes `/swarm-iosm new-track` or triggers this Skill:

1. **Determine mode** using AskUserQuestion:
   - Greenfield (new feature from scratch)
   - Brownfield (modify existing codebase)

2. **If Brownfield:** Suggest Plan mode first:
   ```
   "I recommend starting in Plan mode (read-only exploration) to safely analyze the codebase before making changes. Shall I proceed with Plan mode first?"
   ```
   - If yes: Use Task tool with Explore agent to map codebase
   - If no: Proceed with caution warnings

3. **Gather requirements** using AskUserQuestion for:
   - **Priority**: Speed / Quality / Cost
   - **Change strictness**: Safe (minimal changes) / Normal / Aggressive refactor
   - **Test strategy**: TDD (tests first) / Post-tests / Smoke only
   - **Permissions**: What tools/operations are allowed

4. **Ask text questions** for:
   - Goal: "What defines 'done' for this task? (1-2 sentences)"
   - Context: "Product/users/environment context?"
   - Constraints: "Tech stack, versions, deadlines, restrictions?"
   - Interfaces: "API/UI/CLI changes needed?"
   - Data: "Data sources, migrations, PII concerns?"
   - Risks: "What could go wrong?"
   - Definition of Done: "Tests? Docs? Deployment?"

5. **Save intake** to `swarm/tracks/<track-id>/intake.md`

### Phase 2: PRD Generation

Using intake data, generate `swarm/tracks/<track-id>/PRD.md` following template:

```markdown
# PRD: <Feature Name>
## 1. Problem
## 2. Goals / Non-goals
## 3. Users & Use-cases
## 4. Scope (MVP / Later)
## 5. Requirements
### Functional
### Non-functional
## 6. UX / API / Data
## 7. Risks & Mitigations
## 8. Acceptance Criteria
## 9. Rollout / Migration plan
## 10. IOSM Targets (Gates + expected index delta)
```

See [templates/prd.md](templates/prd.md) for detailed template.

### Phase 3: Decomposition & Planning

From PRD, create `spec.md` and `plan.md`:

**spec.md** (Conductor-style):
- Context
- What / Why
- Constraints
- Out of scope
- Acceptance tests
- Artifacts to produce
- Rollback assumptions

**plan.md** (WBS with dependencies):
- Phases (0: Intake, 1: Design, 2: Implementation, 3: Verification, 4: Integration)
- Tasks with:
  - owner_role (Explorer/Architect/Implementer/TestRunner/etc)
  - depends_on (task IDs)
  - files_modules (scope)
  - acceptance criteria
  - artifacts (reports/T01.md, etc)
  - iosm_checks (which gates apply)
  - status (TODO/DOING/DONE/BLOCKED)

See [templates/plan.md](templates/plan.md) for structure.

### Phase 3: Subagent Execution

**Goal:** Execute `orchestration_plan.md` using parallel waves of subagents.

**CRITICAL:** Launch subagents in PARALLEL WAVES, not one-by-one.

---

#### Standardized Subagent Roles

Use these predefined roles:

1. **Explorer** (brownfield analysis)
   - Tools: Read, Grep, Glob
   - Output: Architecture map, dependencies, test coverage, code style
   - When: Always for brownfield, before making changes

2. **Architect** (design decisions)
   - Tools: Read, Write (ADRs)
   - Output: ADR documents, interface contracts, API specs
   - When: Complex features, API changes, architectural decisions

3. **Implementer-{A,B,C}** (parallel implementation)
   - Tools: Read, Write, Edit, Bash (tests)
   - Output: Code changes, unit tests, implementation report
   - When: Independent modules that can be developed in parallel

4. **TestRunner** (verification)
   - Tools: Read, Bash, Write
   - Output: Test results, coverage report, failure analysis
   - When: After implementation, before integration

5. **SecurityAuditor** (security review)
   - Tools: Read, Grep, Bash (security scanners)
   - Output: Security findings, remediation suggestions
   - When: Auth/payment features, external APIs, data handling

6. **PerfAnalyzer** (performance review)
   - Tools: Read, Bash (profiling)
   - Output: Performance metrics, bottleneck analysis
   - When: Data processing, APIs, high-traffic features

7. **DocsWriter** (documentation)
   - Tools: Read, Write, Edit
   - Output: README updates, API docs, user guides
   - When: Public APIs, complex features, user-facing changes

**Parallelization Rules:**

â **Parallel (can run simultaneously):**
- Different modules/files with no shared state
- Independent research tasks (Explorer on different subsystems)
- Docs + Implementation (if API is stable)
- Multiple Implementers on separate components

â **Sequential (must run in order):**
- Tasks with dependencies (Architect â Implementer)
- Shared file modifications (two agents editing same file)
- Test â Fix â Re-test cycles

**Background vs Foreground:**

Use **background** (`run_in_background: true` in Task tool) when:
- Long-running operations (tests, builds, analysis)
- No user input needed (all questions resolved upfront)
- Permissions pre-approved
- Can tolerate "fire and forget" mode

Use **foreground** (default) when:
- Need user clarifications during execution
- Interactive debugging/problem-solving
- Permission escalations expected
- Results needed immediately for next step

**IMPORTANT:** Background subagents cannot use AskUserQuestion (tool call will fail). Resolve all questions BEFORE launching background tasks.  

### Background Limitations (CRITICAL)

**Background subagents CANNOT reliably use:**

| Tool/Feature | Status | Reason |
|--------------|--------|--------|
| `AskUserQuestion` | BLOCKED | Auto-denied, no user interaction |
| Permission prompts | BLOCKED | Auto-denied, may fail silently |
| MCP tools | UNSTABLE | May be unavailable in background context |
| External APIs | RISKY | Network errors not recoverable |
| Long git operations | RISKY | May timeout or conflict |

**Rule of thumb:**
- **Background** = autonomous code/tests/read/local-only operations
- **Foreground** = MCP, external integrations, user decisions, risky operations

**Pre-flight checklist for background tasks:**
1. All questions pre-resolved in brief
2. No MCP tools required
3. No external API calls (or wrap with fallback)
4. No interactive permissions needed
5. Touches clearly defined (no surprises)

**If task needs MCP or external calls â force foreground:**
```markdown
- **Needs user input:** true  â even if technically "safe"
- **Note:** Requires MCP/external API, must run foreground
```

---

#### Step 1: Load Orchestration Plan

Read `swarm/tracks/<id>/orchestration_plan.md` to understand:
- How many waves
- Which tasks in each wave
- Which tasks are parallel vs sequential
- Which tasks are background vs foreground

---

#### Step 2: Execute Waves (ONE WAVE AT A TIME)

For each wave in the orchestration plan:

##### A. Prepare Subagent Briefs

For each task in the wave:
1. Generate brief using [templates/subagent_brief.md](templates/subagent_brief.md)
2. Fill in all sections:
   - Goal, Scope, Context
   - Dependencies (what previous tasks delivered)
   - Constraints (technical, performance, security)
   - Output contract (code + tests + report)
   - Verification steps
   - Acceptance criteria
   - **Pre-resolved questions** (for background tasks)
   - IOSM checks to pass

3. Include report template requirement:
   ```
   You MUST save report to: swarm/tracks/<id>/reports/<task-id>.md
   Use template: .claude/skills/swarm-iosm/templates/subagent_report.md
   ```

##### B. Launch Wave (CRITICAL: PARALLEL IN SINGLE MESSAGE)

**For parallel tasks in wave:**

Launch ALL tasks in wave SIMULTANEOUSLY using **single message with multiple Task tool calls**.

**Example (Wave 3: 3 implementers):**

```
I'm launching Wave 3 with 3 parallel implementers (all background):

[Single message with 3 Task tool calls]

Task 1 (T04 - Implementer-A):
- subagent_type: general-purpose
- description: Implement core business logic
- prompt: [Full brief for T04]
- run_in_background: true

Task 2 (T05 - Implementer-B):
- subagent_type: general-purpose
- description: Implement API endpoints
- prompt: [Full brief for T05]
- run_in_background: true

Task 3 (T06 - Implementer-C):
- subagent_type: general-purpose
- description: Implement data access layer
- prompt: [Full brief for T06]
- run_in_background: true

Monitoring: Use /bashes to track progress
Expected completion: 8-12 hours
```

**NEVER launch tasks one-by-one if they can run parallel. ALWAYS use single message.**

##### C. Monitor Progress

While wave is running:

1. **Check background tasks periodically:**
   ```
   /bashes
   ```

2. **Check task output files (if provided):**
   ```bash
   tail -n 50 /path/to/task/output/file
   ```

3. **If task completes:**
   - Verify report exists: `swarm/tracks/<id>/reports/T##.md`
   - Check acceptance criteria met
   - Mark status in `plan.md`: `Status: DONE`

4. **If task blocks/fails:**
   - Apply fallback strategy (see below)

##### D. Fallback Strategy (if subagent fails)

**Scenario 1: Transient error (timeout, network)**
- **Action:** Retry once automatically
- **Command:** Re-launch same brief

**Scenario 2: Permission/question blocker**
- **Action:** Resume in foreground
- **How:** Use TaskOutput to get task_id, then Task tool with resume parameter
- **Example:**
  ```
  Task blocked on permission for "run database migrations"
  â Resume in foreground, approve permission, continue
  ```

**Scenario 3: Logic gap (unclear contract/spec)**
- **Action:** Create recovery task
- **Steps:**
  1. Create new task for Architect: "Clarify [missing requirement]"
  2. Run Architect task (foreground)
  3. Update brief for blocked task
  4. Re-launch subagent

**Scenario 4: Unrecoverable failure**
- **Action:** Mark BLOCKED and continue
- **Steps:**
  1. Update `plan.md`: `Status: BLOCKED(reason: ...)`
  2. Save partial work in `reports/T##-partial.md`
  3. Add to integration report: "T## blocked, manual resolution needed"
  4. Continue with other waves (don't block entire workflow)

---

#### Step 3: Wave Completion Check

Before proceeding to next wave:

- [ ] All tasks in wave completed OR marked BLOCKED
- [ ] All reports saved to `reports/`
- [ ] No merge conflicts detected (if parallel edits)
- [ ] All acceptance criteria met (or exceptions documented)

**If wave has blockers:**
- Document in `orchestration_plan.md` (update Progress section)
- Decide: resolve now OR defer to integration phase

---

#### Step 4: Proceed to Next Wave

Repeat Step 2 for next wave.

**Important:**
- Respect dependencies: Wave N can only start when all Wave N-1 tasks are DONE or BLOCKED
- Update `orchestration_plan.md` with actual completion times (for future estimation)

---

#### Step 5: All Waves Complete

When all waves finished:
- Update `plan.md`: `Status: Integration`
- Proceed to Phase 4 (Integration & IOSM Gates)

---

#### PARALLEL LAUNCH EXAMPLES

**Example 1: Wave 2 (3 foreground tasks)**

```
Launching Wave 2 (Design phase) with 3 tasks:

[Single message with 3 Task calls, all foreground]

These tasks will run interactively (you'll see their prompts).
Expected: ~4-6 hours for slowest task (T01)
```

**Example 2: Wave 3 (3 background tasks)**

```
Launching Wave 3 (Implementation) with 3 background tasks:

[Single message with 3 Task calls, all run_in_background: true]

Monitor with: /bashes
Check outputs in: swarm/tracks/2026-01-17-001/reports/
```

**Example 3: Mixed wave (2 parallel + 1 sequential)**

```
Wave 4a: Launching 2 parallel tasks (T08, T10):

[Single message with 2 Task calls, background]

When T08 completes, I'll launch Wave 4b (T09 depends on T08).
```

### Phase 4: Integration & IOSM Gates

After subagents complete:

1. **Read all reports** from `swarm/tracks/<id>/reports/`
2. **Validate** each report has required sections (see templates/subagent_report.md)
3. **Identify conflicts:**
   - File modification overlaps
   - Contradictory decisions
   - Dependency mismatches
4. **Generate integration_report.md** with:
   - What changed (by task)
   - Conflict resolutions
   - Merge order (respecting dependencies)
   - Final verification checklist
   - Rollback guide

See [templates/integration_report.md](templates/integration_report.md).

#### IOSM Quality Gates Evaluation

After integration_report.md is complete, run IOSM gates on integrated result:

**Gate-I (Improve):**
- Semantic clarity â¥0.95 (clear naming, no magic numbers)
- Code duplication â¤5%
- Invariants documented
- All TODOs tracked

**Gate-O (Optimize):**
- P50/P95/P99 latency measured
- Error budget defined
- Basic chaos/resilience tests passing
- No obvious N+1 queries or memory leaks

**Gate-S (Shrink):**
- API surface reduced â¥20% (or justified growth)
- Dependency count stable or reduced
- Onboarding time â¤15min for new contributor

**Gate-M (Modularize):**
- Clear module contracts
- Change surface â¤20% (localized impact)
- Coupling/cohesion metrics acceptable
- No circular dependencies

**Calculate IOSM-Index:**
```
IOSM-Index = (Gate-I + Gate-O + Gate-S + Gate-M) / 4
```

Target: â¥0.80 for production merge.

Generate `swarm/tracks/<id>/iosm_report.md` with gate results.

See [templates/iosm_gates.md](templates/iosm_gates.md) for detailed criteria.

## File Structure

The Skill creates this structure:

```
.claude/skills/swarm-iosm/     # Skill definition
  SKILL.md                      # This file
  templates/                    # Progressive disclosure templates
  scripts/                      # Validation/analysis scripts

swarm/                          # Project workflow data
  context/                      # Project-wide context
    product.md
    tech-stack.md
    workflow.md
  tracks/                       # Feature/task tracks
    <YYYY-MM-DD-NNN>/          # Track directory
      intake.md                 # Requirements intake
      PRD.md                    # Product requirements
      spec.md                   # Technical spec
      plan.md                   # Implementation plan
      metadata.json             # Track metadata
      reports/                  # Subagent reports
        T01.md
        T02.md
        ...
      integration_report.md     # Integration plan
      iosm_report.md           # Quality gate results
      rollback_guide.md        # Revert instructions (if needed)
  tracks.md                     # Track registry/index
```

## Best Practices

1. **Always resolve questions upfront** - Background subagents can't ask questions
2. **Use Plan mode for brownfield** - Safe exploration before changes
3. **Parallelize research, sequence implementation** - Avoid file conflicts
4. **Demand structured reports** - Traceability and integration depend on it
5. **Run IOSM gates before merge** - Quality enforcement
6. **Create rollback plans** - Safety net for production changes
7. **Use TodoWrite** - Track overall Swarm workflow progress
8. **Monitor background tasks** - Use `/bashes` command

## Common Patterns

### Pattern 1: Greenfield Feature
```
/swarm-iosm new-track "Add email notification system"
â Intake (quick, no repo analysis)
â PRD + Plan generation
â Parallel: Architect (API design) + DocsWriter (email templates)
â Sequential: Implementer (core) â TestRunner â Integration
```

### Pattern 2: Brownfield Refactor
```
/swarm-iosm setup
/swarm-iosm new-track "Refactor payment processing"
â Plan mode: Explorer analyzes payment module
â Architect creates migration plan
â Parallel: Implementer-A (new code) + TestRunner (regression tests)
â Integration with rollback guide
```

### Pattern 3: Large Feature with Many Tasks
```
/swarm-iosm new-track "Multi-tenant architecture"
â Generate plan with 15+ tasks
â Phase 1: Sequential design (Architect â review)
â Phase 2: Parallel implementation (3x Implementer background)
â Phase 3: Sequential integration (merge â test â gates)
```

## Troubleshooting

**Background subagent fails with permission error:**
- Resume in foreground: Find task in `/bashes`, get task ID, resume
- Pre-approve permissions: Use AskUserQuestion before launching

**Reports missing or incomplete:**
- Subagent brief must explicitly require report template
- Validate reports using `scripts/summarize_reports.py`

**File conflicts during integration:**
- Plan should minimize shared file edits
- Use git branches per subagent (advanced)
- Integration report must resolve conflicts manually

**IOSM gates failing:**
- Review gate criteria in templates/iosm_gates.md
- Some gates may be aspirational (document exceptions)
- Iterate: fail â fix â re-check

## Advanced Usage

See additional documentation:
- [templates/](templates/) - All templates with detailed examples
- [scripts/](scripts/) - Helper scripts for validation and analysis

## Dependencies

- Claude Code with Task tool support
- Git (for version control and rollback)
- Project-specific: Python/Node/etc for running tests

## Version

Swarm Workflow (IOSM) v2.1 - 2026-01-19

**v2.1 Changes:**
- Automated State Management (auto-generated `iosm_state.md`)
- Status Sync CLI (`--update-task`)
- Improved Report Conflict Detection

**v2.0 Changes:**
- Inter-Agent Communication (Shared Context)
- Task Dependency Visualization (`--graph`)
- Anti-Pattern Detection
- Template Customization

**v1.3 Changes:**
- Simulation Mode (`/swarm-iosm simulate`) with ASCII Timeline
- Live Monitoring (`/swarm-iosm watch`)
- Checkpointing & Resume (`/swarm-iosm resume`)

**v1.2 Changes:**
- Concurrency Limits (Resource Budgets)
- Cost Tracking & Model Selection (Haiku/Sonnet/Opus)
- Intelligent Error Diagnosis & Retry (`/swarm-iosm retry`)

**v1.1 Changes:**
- Continuous Dispatch Loop (Ð½Ðµ Ð¶Ð´ÑÐ¼ Ð²Ð¾Ð»Ð½Ñ â Ð·Ð°Ð¿ÑÑÐºÐ°ÐµÐ¼ ÑÑÐ°Ð·Ñ Ð¿ÑÐ¸ READY)
- Gate-driven continuation (ÑÐ°Ð±Ð¾ÑÐ°ÐµÐ¼ Ð´Ð¾ Ð´Ð¾ÑÑÐ¸Ð¶ÐµÐ½Ð¸Ñ Gate targets)
- Auto-spawn Ð¸Ð· SpawnCandidates Ð² Ð¾ÑÑÑÑÐ°Ñ
- Touches lock manager (ÐºÐ¾Ð½ÑÐ»Ð¸ÐºÑÑ ÑÐ°Ð¹Ð»Ð¾Ð²)
- iosm_state.md Ð´Ð»Ñ ÑÑÐµÐºÐ¸Ð½Ð³Ð° Ð¿ÑÐ¾Ð³ÑÐµÑÑÐ° Ðº Ð³ÐµÐ¹ÑÐ°Ð¼

**v1.1.1 Changes:**
- Lock Granularity (folder vs file hierarchy, path normalization)
- Read-Only Safety Rules (scratch_dir Ð´Ð»Ñ Ð°ÑÑÐµÑÐ°ÐºÑÐ¾Ð²)
- Spawn Protection (budget, dedup keys, severity threshold)
- Anti-Loop Protection (max iterations, progress tracking)
- Batch Constraints (max 3-6 per batch, priority ordering, batch_id)
- Touched Actual tracking (plan vs actual diff, unplanned touches alert)
- Operational Runbook Ð² QUICKSTART.md
Similar Skills

workflow-orchestration✓Clean
Orchestrates agent workflow for non-trivial tasks: plan-first mode, subagent use, self-improvement loops, verification before done, and autonomous bug fixing. Use for any task with 3+ steps, architectural decisions, bug reports, or when the user corrects the agent. Ensures plans go in tasks/todo.md, lessons in tasks/lessons.md, and changes stay minimal and provably correct.
project-management devops architecture
⭐ 0↓ 0dnh33/workflow-orchestration
npx skills add dnh33/workflow-orchestration
agent-implementation-skill✓Clean
Multi-model agent implementation workflow for software development. Orchestrates research, evaluation, design baseline, implementation, RCA, structured decomposition, constraint discovery, model selection, and agent-driven Stage 3 codemap exploration across external AI models (GPT, GLM, Claude). Use when implementing features through a structured multi-phase pipeline with worktrees, dynamic scheduling, and SQLite-backed agent coordination.
architecture code-review project-management
⭐ 1↓ 0nestharus/agent-implementation-skill
npx skills add nestharus/agent-implementation-skill
autonomous-orchestrator✓Clean
Autonomous meta-orchestrator that continuously discovers work, dispatches agents, reviews results, and manages the full lifecycle across the user's workspace. Use when the user wants hands-off autonomous operation. Triggers on: 'autonomous', 'auto-pilot', 'run continuously', 'take over', 'autopilot'.
project-management devops architecture
⭐ 0↓ 0metyatech/skill-autonomous-orchestrator
npx skills add metyatech/skill-autonomous-orchestrator
Forge✓Clean
Autonomous quality engineering swarm that forges production-ready code through continuous behavioral verification, exhaustive E2E testing, and self-healing fix loops. Combines DDD+ADR+TDD methodology with BDD/Gherkin specifications, 7 quality gates, defect prediction, chaos testing, and cross-context dependency awareness. Architecture-agnostic - works with monoliths, microservices, modular monoliths, and any bounded-context topology.
testing devops architecture
⭐ 7↓ 0ikennaokpala/forge
npx skills add ikennaokpala/forge