Add orchestration loop phase and workflow execution skill

- Implemented Phase 2: Orchestration Loop for CCW Loop, including executor agent spawning, main loop execution, and iteration management.
- Introduced helper functions for action result parsing and user interaction.
- Created new skill: workflow-execute, coordinating agent execution for workflow tasks with automatic session discovery, parallel task processing, and status tracking.
- Defined execution flow, key design principles, and error handling strategies for the workflow execution process.
2026-02-12 02:37:45 +08:00 · 2026-02-06 23:58:18 +08:00
parent 5b48bcff64
commit a4ea817a5f
19 changed files with 1633 additions and 2876 deletions
--- a/.codex/skills/ccw-loop/actions/action-complete.md
+++ b/.codex/skills/ccw-loop/actions/action-complete.md
@@ -0,0 +1,269 @@
+# Action: COMPLETE
+
+Complete CCW Loop session and generate summary report.
+
+## Purpose
+
+- Generate completion report
+- Aggregate all phase results
+- Provide follow-up recommendations
+- Offer expansion to issues
+- Mark status as completed
+
+## Preconditions
+
+- [ ] state.status === 'running'
+- [ ] state.skill_state !== null
+
+## Execution Steps
+
+### Step 1: Verify Control Signals
+
+```javascript
+const state = JSON.parse(Read(`.workflow/.loop/${loopId}.json`))
+
+if (state.status !== 'running') {
+  return {
+    action: 'COMPLETE',
+    status: 'failed',
+    message: `Cannot complete: status is ${state.status}`,
+    next_action: state.status === 'paused' ? 'PAUSED' : 'STOPPED'
+  }
+}
+```
+
+### Step 2: Aggregate Statistics
+
+```javascript
+const stats = {
+  // Time statistics
+  duration: Date.now() - new Date(state.created_at).getTime(),
+  iterations: state.current_iteration,
+
+  // Development statistics
+  develop: {
+    total_tasks: state.skill_state.develop.total,
+    completed_tasks: state.skill_state.develop.completed,
+    completion_rate: state.skill_state.develop.total > 0
+      ? ((state.skill_state.develop.completed / state.skill_state.develop.total) * 100).toFixed(1)
+      : 0
+  },
+
+  // Debug statistics
+  debug: {
+    iterations: state.skill_state.debug.iteration,
+    hypotheses_tested: state.skill_state.debug.hypotheses.length,
+    root_cause_found: state.skill_state.debug.confirmed_hypothesis !== null
+  },
+
+  // Validation statistics
+  validate: {
+    runs: state.skill_state.validate.test_results.length,
+    passed: state.skill_state.validate.passed,
+    coverage: state.skill_state.validate.coverage,
+    failed_tests: state.skill_state.validate.failed_tests.length
+  }
+}
+```
+
+### Step 3: Generate Summary Report
+
+```javascript
+const timestamp = getUtc8ISOString()
+
+const summaryReport = `# CCW Loop Session Summary
+
+**Loop ID**: ${loopId}
+**Task**: ${state.description}
+**Started**: ${state.created_at}
+**Completed**: ${timestamp}
+**Duration**: ${formatDuration(stats.duration)}
+
+---
+
+## Executive Summary
+
+${state.skill_state.validate.passed
+  ? 'All tests passed, validation successful'
+  : state.skill_state.develop.completed === state.skill_state.develop.total
+    ? 'Development complete, validation not passed - needs debugging'
+    : 'Task partially complete - pending items remain'}
+
+---
+
+## Development Phase
+
+| Metric | Value |
+|--------|-------|
+| Total Tasks | ${stats.develop.total_tasks} |
+| Completed | ${stats.develop.completed_tasks} |
+| Completion Rate | ${stats.develop.completion_rate}% |
+
+### Completed Tasks
+
+${state.skill_state.develop.tasks.filter(t => t.status === 'completed').map(t => `
+- ${t.description}
+  - Files: ${t.files_changed?.join(', ') || 'N/A'}
+  - Completed: ${t.completed_at}
+`).join('\n') || '_None_'}
+
+### Pending Tasks
+
+${state.skill_state.develop.tasks.filter(t => t.status !== 'completed').map(t => `
+- ${t.description}
+`).join('\n') || '_None_'}
+
+---
+
+## Debug Phase
+
+| Metric | Value |
+|--------|-------|
+| Iterations | ${stats.debug.iterations} |
+| Hypotheses Tested | ${stats.debug.hypotheses_tested} |
+| Root Cause Found | ${stats.debug.root_cause_found ? 'Yes' : 'No'} |
+
+${stats.debug.root_cause_found ? `
+### Confirmed Root Cause
+
+${state.skill_state.debug.confirmed_hypothesis}: ${state.skill_state.debug.hypotheses.find(h => h.id === state.skill_state.debug.confirmed_hypothesis)?.description}
+` : ''}
+
+---
+
+## Validation Phase
+
+| Metric | Value |
+|--------|-------|
+| Test Runs | ${stats.validate.runs} |
+| Status | ${stats.validate.passed ? 'PASSED' : 'FAILED'} |
+| Coverage | ${stats.validate.coverage ? `${stats.validate.coverage}%` : 'N/A'} |
+| Failed Tests | ${stats.validate.failed_tests} |
+
+---
+
+## Recommendations
+
+${generateRecommendations(stats, state)}
+
+---
+
+## Session Artifacts
+
+| File | Description |
+|------|-------------|
+| \`develop.md\` | Development progress timeline |
+| \`debug.md\` | Debug exploration and learnings |
+| \`validate.md\` | Validation report |
+| \`test-results.json\` | Test execution results |
+
+---
+
+*Generated by CCW Loop at ${timestamp}*
+`
+
+Write(`${progressDir}/summary.md`, summaryReport)
+```
+
+### Step 4: Update State to Completed
+
+```javascript
+state.status = 'completed'
+state.completed_at = timestamp
+state.updated_at = timestamp
+state.skill_state.last_action = 'COMPLETE'
+state.skill_state.summary = stats
+
+Write(`.workflow/.loop/${loopId}.json`, JSON.stringify(state, null, 2))
+```
+
+## Output Format
+
+```
+ACTION_RESULT:
+- action: COMPLETE
+- status: success
+- message: Loop completed. Duration: {duration}, Iterations: {N}
+- state_updates: {
+    "status": "completed",
+    "completed_at": "{timestamp}"
+  }
+
+FILES_UPDATED:
+- .workflow/.loop/{loopId}.json: Status set to completed
+- .workflow/.loop/{loopId}.progress/summary.md: Summary report generated
+
+NEXT_ACTION_NEEDED: COMPLETED
+```
+
+## Expansion Options
+
+After completion, offer expansion to issues:
+
+```
+## Expansion Options
+
+Would you like to create follow-up issues?
+
+1. [test] Add more test cases
+2. [enhance] Feature enhancements
+3. [refactor] Code refactoring
+4. [doc] Documentation updates
+5. [none] No expansion needed
+
+Select options (comma-separated) or 'none':
+```
+
+## Helper Functions
+
+```javascript
+function formatDuration(ms) {
+  const seconds = Math.floor(ms / 1000)
+  const minutes = Math.floor(seconds / 60)
+  const hours = Math.floor(minutes / 60)
+
+  if (hours > 0) {
+    return `${hours}h ${minutes % 60}m`
+  } else if (minutes > 0) {
+    return `${minutes}m ${seconds % 60}s`
+  } else {
+    return `${seconds}s`
+  }
+}
+
+function generateRecommendations(stats, state) {
+  const recommendations = []
+
+  if (stats.develop.completion_rate < 100) {
+    recommendations.push('- Complete remaining development tasks')
+  }
+
+  if (!stats.validate.passed) {
+    recommendations.push('- Fix failing tests')
+  }
+
+  if (stats.validate.coverage && stats.validate.coverage < 80) {
+    recommendations.push(`- Improve test coverage (current: ${stats.validate.coverage}%)`)
+  }
+
+  if (recommendations.length === 0) {
+    recommendations.push('- Consider code review')
+    recommendations.push('- Update documentation')
+    recommendations.push('- Prepare for deployment')
+  }
+
+  return recommendations.join('\n')
+}
+```
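Several steps in these action files call `getUtc8ISOString()`, but the diff never defines it. The sketch below is one possible implementation, assuming the helper is meant to return an ISO 8601 timestamp expressed with a fixed UTC+8 offset. Only the function name comes from the source; the body and the optional `date` parameter are assumptions.

```javascript
// Hypothetical implementation: the name appears in the action files,
// the body and the `date` parameter are assumptions made for illustration.
function getUtc8ISOString(date = new Date()) {
  const offsetMs = 8 * 60 * 60 * 1000
  // Shift the instant forward by 8 hours, render it as UTC,
  // then relabel the trailing "Z" with the +08:00 offset.
  return new Date(date.getTime() + offsetMs).toISOString().replace('Z', '+08:00')
}

const example = getUtc8ISOString(new Date('2026-02-06T15:58:18.000Z'))
console.log(example)
```

Accepting the date as a parameter keeps the helper deterministic in tests while the default preserves the zero-argument call style used in the steps above.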
+
+## Error Handling
+
+| Error Type | Recovery |
+|------------|----------|
+| Report generation failed | Show basic stats, skip file write |
+| Issue creation failed | Log error, continue completion |
+
+## Next Actions
+
+- None (terminal state)
+- To continue: Use `/ccw-loop --loop-id={loopId}` to reopen (will set status back to running)
--- a/.codex/skills/ccw-loop/actions/action-debug.md
+++ b/.codex/skills/ccw-loop/actions/action-debug.md
@@ -0,0 +1,286 @@
+# Action: DEBUG
+
+Hypothesis-driven debugging that documents how the understanding of the bug evolves.
+
+## Purpose
+
+- Locate error source
+- Generate testable hypotheses
+- Add NDJSON instrumentation
+- Analyze log evidence
+- Correct understanding based on evidence
+- Apply fixes
+
+## Preconditions
+
+- [ ] state.status === 'running'
+- [ ] state.skill_state !== null
+
+## Mode Detection
+
+```javascript
+const understandingPath = `${progressDir}/debug.md`
+const debugLogPath = `${progressDir}/debug.log`
+
+const understandingExists = fs.existsSync(understandingPath)
+const logHasContent = fs.existsSync(debugLogPath) && fs.statSync(debugLogPath).size > 0
+
+const debugMode = logHasContent ? 'analyze' : (understandingExists ? 'continue' : 'explore')
+```
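The nested ternary above gives log content priority over the understanding document. A small sketch of the same decision as a pure function makes that precedence easier to check; the function name `detectDebugMode` and its parameter object are hypothetical.

```javascript
// Hypothetical wrapper around the mode-detection rule shown above.
function detectDebugMode({ understandingExists, logHasContent }) {
  // A non-empty debug.log always wins: there is evidence to analyze.
  if (logHasContent) return 'analyze'
  // Without logs, an existing debug.md means a session is already under way.
  return understandingExists ? 'continue' : 'explore'
}

const modes = [
  detectDebugMode({ understandingExists: false, logHasContent: false }),
  detectDebugMode({ understandingExists: true, logHasContent: false }),
  detectDebugMode({ understandingExists: true, logHasContent: true })
]
console.log(modes.join(', '))
```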
+
+## Execution Steps
+
+### Mode: Explore (First Debug)
+
+#### Step E1: Get Bug Description
+
+```javascript
+// From test failures or user input
+const bugDescription = state.skill_state.validate?.failed_tests?.[0]
+  || await getUserInput('Describe the bug:')
+```
+
+#### Step E2: Search Codebase
+
+```javascript
+// Use ACE search_context to find related code
+const searchResults = mcp__ace-tool__search_context({
+  project_root_path: '.',
+  query: `code related to: ${bugDescription}`
+})
+```
+
+#### Step E3: Generate Hypotheses
+
+```javascript
+const hypotheses = [
+  {
+    id: 'H1',
+    description: 'Most likely cause',
+    testable_condition: 'What to check',
+    logging_point: 'file.ts:functionName:42',
+    evidence_criteria: {
+      confirm: 'If we see X, hypothesis confirmed',
+      reject: 'If we see Y, hypothesis rejected'
+    },
+    likelihood: 1,
+    status: 'pending',
+    evidence: null,
+    verdict_reason: null
+  },
+  // H2, H3...
+]
+```
+
+#### Step E4: Create Understanding Document
+
+```javascript
+const initialUnderstanding = `# Understanding Document
+
+**Loop ID**: ${loopId}
+**Bug Description**: ${bugDescription}
+**Started**: ${getUtc8ISOString()}
+
+---
+
+## Exploration Timeline
+
+### Iteration 1 - Initial Exploration (${getUtc8ISOString()})
+
+#### Current Understanding
+
+Based on bug description and code search:
+
+- Error pattern: [identified pattern]
+- Affected areas: [files/modules]
+- Initial hypothesis: [first thoughts]
+
+#### Evidence from Code Search
+
+[Search results summary]
+
+#### Hypotheses
+
+${hypotheses.map(h => `
+**${h.id}**: ${h.description}
+- Testable condition: ${h.testable_condition}
+- Logging point: ${h.logging_point}
+- Likelihood: ${h.likelihood}
+`).join('\n')}
+
+---
+
+## Current Consolidated Understanding
+
+[Summary of what we know so far]
+`
+
+Write(understandingPath, initialUnderstanding)
+Write(`${progressDir}/hypotheses.json`, JSON.stringify({ hypotheses, iteration: 1 }, null, 2))
+```
+
+#### Step E5: Add NDJSON Logging Points
+
+```javascript
+// For each hypothesis, add instrumentation
+for (const hypothesis of hypotheses) {
+  const [file, func, line] = hypothesis.logging_point.split(':')
+
+  const logStatement = `console.log(JSON.stringify({
+    hid: "${hypothesis.id}",
+    ts: Date.now(),
+    func: "${func}",
+    data: { /* relevant context */ }
+  }))`
+
+  // Add to file using Edit tool
+}
+```
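To make the instrumentation format in Step E5 concrete, the sketch below builds a single NDJSON record with the same fields (`hid`, `ts`, `func`, `data`). The helper name `buildDebugLogLine` and the injectable `now` parameter are assumptions, not part of the skill.

```javascript
// Hypothetical helper: builds one NDJSON record for a hypothesis probe.
function buildDebugLogLine(hypothesisId, func, data, now = Date.now()) {
  // JSON.stringify without an indent argument escapes newlines inside strings,
  // so every record occupies exactly one line, as NDJSON requires.
  return JSON.stringify({ hid: hypothesisId, ts: now, func, data }) + '\n'
}

const line = buildDebugLogLine('H1', 'functionName', { userId: 42 }, 1700000000000)
process.stdout.write(line)
```

Keeping one record per line is what lets the Analyze mode split the log on `\n` and parse each entry independently.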
+
+### Mode: Analyze (Has Logs)
+
+#### Step A1: Parse Debug Log
+
+```javascript
+const logContent = Read(debugLogPath)
+const entries = logContent.split('\n')
+  .filter(l => l.trim())
+  .map(l => JSON.parse(l))
+
+// Group by hypothesis ID
+const byHypothesis = entries.reduce((acc, e) => {
+  acc[e.hid] = acc[e.hid] || []
+  acc[e.hid].push(e)
+  return acc
+}, {})
+```
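Step A1 assumes every line of `debug.log` is valid JSON, so a single stray line (for example, unrelated console output) would make `JSON.parse` throw. A more tolerant variant might look like the sketch below; the function name `parseDebugLog` and the `skipped` counter are assumptions added for illustration.

```javascript
// Sketch: parse NDJSON text, skipping lines that are not JSON objects with a string `hid`.
function parseDebugLog(logContent) {
  const byHypothesis = {}
  let skipped = 0

  for (const rawLine of logContent.split('\n')) {
    const line = rawLine.trim()
    if (!line) continue // blank lines are not counted as skipped

    try {
      const entry = JSON.parse(line)
      if (entry && typeof entry.hid === 'string') {
        const bucket = byHypothesis[entry.hid] || (byHypothesis[entry.hid] = [])
        bucket.push(entry)
      } else {
        skipped += 1 // valid JSON, but not a hypothesis record
      }
    } catch {
      skipped += 1 // not JSON at all
    }
  }

  return { byHypothesis, skipped }
}

const sample = 'x\n{"hid":"H1","ts":1}\n\n{"hid":"H1","ts":2}\n{"hid":"H2","ts":3}\n{"ts":4}'
const parsed = parseDebugLog(sample)
console.log(Object.keys(parsed.byHypothesis), parsed.skipped)
```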
+
+#### Step A2: Evaluate Evidence
+
+```javascript
+const hypothesesData = JSON.parse(Read(`${progressDir}/hypotheses.json`))
+
+for (const hypothesis of hypothesesData.hypotheses) {
+  const evidence = byHypothesis[hypothesis.id] || []
+
+  // Evaluate against criteria
+  if (matchesConfirmCriteria(evidence, hypothesis.evidence_criteria.confirm)) {
+    hypothesis.status = 'confirmed'
+    hypothesis.evidence = evidence
+    hypothesis.verdict_reason = 'Evidence matches confirm criteria'
+  } else if (matchesRejectCriteria(evidence, hypothesis.evidence_criteria.reject)) {
+    hypothesis.status = 'rejected'
+    hypothesis.evidence = evidence
+    hypothesis.verdict_reason = 'Evidence matches reject criteria'
+  } else {
+    hypothesis.status = 'inconclusive'
+    hypothesis.evidence = evidence
+    hypothesis.verdict_reason = 'Insufficient evidence'
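+  }
+}
+
+// Derived flags used by the later steps
+const confirmedHypothesis = hypothesesData.hypotheses.find(h => h.status === 'confirmed')
+const allRejected = hypothesesData.hypotheses.every(h => h.status === 'rejected')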
+```
+
+#### Step A3: Update Understanding
+
+```javascript
+const iteration = hypothesesData.iteration + 1
+const timestamp = getUtc8ISOString()
+
+const analysisEntry = `
+### Iteration ${iteration} - Evidence Analysis (${timestamp})
+
+#### Log Analysis Results
+
+${hypothesesData.hypotheses.map(h => `
+**${h.id}**: ${h.status.toUpperCase()}
+- Evidence: ${JSON.stringify(h.evidence?.slice(0, 3))}
+- Reasoning: ${h.verdict_reason}
+`).join('\n')}
+
+#### Corrected Understanding
+
+[Any corrections to previous assumptions]
+
+${confirmedHypothesis ? `
+#### Root Cause Identified
+
+**${confirmedHypothesis.id}**: ${confirmedHypothesis.description}
+` : `
+#### Next Steps
+
+[What to investigate next]
+`}
+
+---
+`
+
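+const existingUnderstanding = Read(understandingPath)
+Write(understandingPath, existingUnderstanding + analysisEntry)
+
+// Persist the updated verdicts and iteration so the next run resumes from them
+Write(`${progressDir}/hypotheses.json`, JSON.stringify({ hypotheses: hypothesesData.hypotheses, iteration }, null, 2))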
+```
+
+### Step: Update State
+
+```javascript
+state.skill_state.debug.active_bug = bugDescription
+state.skill_state.debug.hypotheses = hypothesesData.hypotheses
+state.skill_state.debug.hypotheses_count = hypothesesData.hypotheses.length
+state.skill_state.debug.iteration = iteration
+state.skill_state.debug.last_analysis_at = timestamp
+
+if (confirmedHypothesis) {
+  state.skill_state.debug.confirmed_hypothesis = confirmedHypothesis.id
+}
+
+state.skill_state.last_action = 'DEBUG'
+state.updated_at = timestamp
+Write(`.workflow/.loop/${loopId}.json`, JSON.stringify(state, null, 2))
+```
+
+## Output Format
+
+```
+ACTION_RESULT:
+- action: DEBUG
+- status: success
+- message: {Mode description} - {result summary}
+- state_updates: {
+    "debug.iteration": {N},
+    "debug.confirmed_hypothesis": "{id or null}"
+  }
+
+FILES_UPDATED:
+- .workflow/.loop/{loopId}.progress/debug.md: Understanding updated
+- .workflow/.loop/{loopId}.progress/hypotheses.json: Hypotheses updated
+- [Source files]: Instrumentation added
+
+NEXT_ACTION_NEEDED: {DEBUG | VALIDATE | DEVELOP | WAITING_INPUT | MENU}
+```
+
+## Next Action Selection
+
+```javascript
+if (confirmedHypothesis) {
+  // Root cause found, apply fix and validate
+  return 'VALIDATE'
+} else if (allRejected) {
+  // Generate new hypotheses
+  return 'DEBUG'
+} else {
+  // Need more evidence - prompt user to reproduce bug
+  return 'WAITING_INPUT'  // User needs to trigger bug
+}
+```
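The selection logic above depends on two flags, `confirmedHypothesis` and `allRejected`, that can be derived from the hypothesis statuses. The sketch below shows one way to do so as a self-contained function; the name `selectNextDebugAction` is hypothetical, and treating an empty hypothesis list as "not all rejected" is an assumption.

```javascript
// Hypothetical helper: derives the next action from hypothesis statuses.
function selectNextDebugAction(hypotheses) {
  const confirmed = hypotheses.find(h => h.status === 'confirmed')
  // Array.prototype.every returns true for an empty array, so guard on length.
  const allRejected = hypotheses.length > 0 && hypotheses.every(h => h.status === 'rejected')

  if (confirmed) return 'VALIDATE'   // root cause found
  if (allRejected) return 'DEBUG'    // generate new hypotheses
  return 'WAITING_INPUT'             // more evidence needed from the user
}

console.log(selectNextDebugAction([{ id: 'H1', status: 'rejected' }, { id: 'H2', status: 'inconclusive' }]))
```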
+
+## Error Handling
+
+| Error Type | Recovery |
+|------------|----------|
+| Empty debug.log | Prompt user to reproduce bug |
+| All hypotheses rejected | Generate new hypotheses |
+| >5 iterations | Suggest escalation |
+
+## Next Actions
+
+- Root cause found: `VALIDATE`
+- Need more evidence: `WAITING_INPUT`, then `DEBUG` once the user has reproduced the bug
+- All rejected: `DEBUG` (new hypotheses)
--- a/.codex/skills/ccw-loop/actions/action-develop.md
+++ b/.codex/skills/ccw-loop/actions/action-develop.md
@@ -0,0 +1,183 @@
+# Action: DEVELOP
+
+Execute development task and record progress to develop.md.
+
+## Purpose
+
+- Execute next pending development task
+- Implement code changes
+- Record progress to Markdown file
+- Update task status in state
+
+## Preconditions
+
+- [ ] state.status === 'running'
+- [ ] state.skill_state !== null
+- [ ] state.skill_state.develop.tasks.some(t => t.status === 'pending')
+
+## Execution Steps
+
+### Step 1: Verify Control Signals
+
+```javascript
+const state = JSON.parse(Read(`.workflow/.loop/${loopId}.json`))
+
+if (state.status !== 'running') {
+  return {
+    action: 'DEVELOP',
+    status: 'failed',
+    message: `Cannot develop: status is ${state.status}`,
+    next_action: state.status === 'paused' ? 'PAUSED' : 'STOPPED'
+  }
+}
+```
+
+### Step 2: Find Next Pending Task
+
+```javascript
+const tasks = state.skill_state.develop.tasks
+const currentTask = tasks.find(t => t.status === 'pending')
+
+if (!currentTask) {
+  return {
+    action: 'DEVELOP',
+    status: 'success',
+    message: 'All development tasks completed',
+    next_action: mode === 'auto' ? 'VALIDATE' : 'MENU'
+  }
+}
+
+// Mark as in_progress
+currentTask.status = 'in_progress'
+```
+
+### Step 3: Execute Development Task
+
+```javascript
+console.log(`Executing task: ${currentTask.description}`)
+
+// Use appropriate tools based on task type
+// - ACE search_context for finding patterns
+// - Read for loading files
+// - Edit/Write for making changes
+
+// Record files changed
+const filesChanged = []
+
+// Implementation logic...
+```
+
+### Step 4: Record Changes to Log (NDJSON)
+
+```javascript
+const changesLogPath = `${progressDir}/changes.log`
+const timestamp = getUtc8ISOString()
+
+const changeEntry = {
+  timestamp: timestamp,
+  task_id: currentTask.id,
+  description: currentTask.description,
+  files_changed: filesChanged,
+  result: 'success'
+}
+
+// Append to NDJSON log
+const existingLog = Read(changesLogPath) || ''
+Write(changesLogPath, existingLog + JSON.stringify(changeEntry) + '\n')
+```
+
+### Step 5: Update Progress Document
+
+```javascript
+const progressPath = `${progressDir}/develop.md`
+const iteration = state.skill_state.develop.completed + 1
+
+const progressEntry = `
+### Iteration ${iteration} - ${currentTask.description} (${timestamp})
+
+#### Task Details
+
+- **ID**: ${currentTask.id}
+- **Tool**: ${currentTask.tool}
+- **Mode**: ${currentTask.mode}
+
+#### Implementation Summary
+
+[Implementation description]
+
+#### Files Changed
+
+${filesChanged.map(f => `- \`${f}\``).join('\n') || '- No files changed'}
+
+#### Status: COMPLETED
+
+---
+
+`
+
+const existingProgress = Read(progressPath)
+Write(progressPath, existingProgress + progressEntry)
+```
+
+### Step 6: Update State
+
+```javascript
+currentTask.status = 'completed'
+currentTask.completed_at = timestamp
+currentTask.files_changed = filesChanged
+
+state.skill_state.develop.completed += 1
+state.skill_state.develop.current_task = null
+state.skill_state.develop.last_progress_at = timestamp
+state.skill_state.last_action = 'DEVELOP'
+state.skill_state.completed_actions.push('DEVELOP')
+state.updated_at = timestamp
+
+Write(`.workflow/.loop/${loopId}.json`, JSON.stringify(state, null, 2))
+```
+
+## Output Format
+
+```
+ACTION_RESULT:
+- action: DEVELOP
+- status: success
+- message: Task completed: {task_description}
+- state_updates: {
+    "develop.completed": {N},
+    "develop.last_progress_at": "{timestamp}"
+  }
+
+FILES_UPDATED:
+- .workflow/.loop/{loopId}.json: Task status updated
+- .workflow/.loop/{loopId}.progress/develop.md: Progress entry added
+- .workflow/.loop/{loopId}.progress/changes.log: Change entry added
+
+NEXT_ACTION_NEEDED: {DEVELOP | DEBUG | VALIDATE | MENU}
+```
+
+## Auto Mode Next Action Selection
+
+```javascript
+const pendingTasks = tasks.filter(t => t.status === 'pending')
+
+if (pendingTasks.length > 0) {
+  return 'DEVELOP'  // More tasks to do
+} else {
+  return 'DEBUG'    // All done, check for issues
+}
+```
+
+## Error Handling
+
+| Error Type | Recovery |
+|------------|----------|
+| Task execution failed | Mark task as failed, continue to next |
+| File write failed | Retry once, then report error |
+| All tasks done | Move to DEBUG or VALIDATE |
+
+## Next Actions
+
+- More pending tasks: `DEVELOP`
+- All tasks complete: `DEBUG` (auto) or `MENU` (interactive)
+- Task failed: `DEVELOP` (retry) or `DEBUG` (investigate)
--- a/.codex/skills/ccw-loop/actions/action-init.md
+++ b/.codex/skills/ccw-loop/actions/action-init.md
@@ -0,0 +1,164 @@
+# Action: INIT
+
+Initialize a CCW Loop session: create the directory structure and the initial state.
+
+## Purpose
+
+- Create session directory structure
+- Initialize state file with skill_state
+- Analyze task description to generate development tasks
+- Prepare execution environment
+
+## Preconditions
+
+- [ ] state.status === 'running'
+- [ ] state.skill_state === null
+
+## Execution Steps
+
+### Step 1: Verify Control Signals
+
+```javascript
+const state = JSON.parse(Read(`.workflow/.loop/${loopId}.json`))
+
+if (state.status !== 'running') {
+  return {
+    action: 'INIT',
+    status: 'failed',
+    message: `Cannot init: status is ${state.status}`,
+    next_action: state.status === 'paused' ? 'PAUSED' : 'STOPPED'
+  }
+}
+```
+
+### Step 2: Create Directory Structure
+
+```javascript
+const progressDir = `.workflow/.loop/${loopId}.progress`
+
+// Directories created by orchestrator, verify they exist
+// mkdir -p ${progressDir}
+```
+
+### Step 3: Analyze Task and Generate Tasks
+
+```javascript
+// Analyze task description
+const taskDescription = state.description
+
+// Generate 3-7 development tasks based on analysis
+// Use ACE search or smart_search to find relevant patterns
+
+const tasks = [
+  {
+    id: 'task-001',
+    description: 'Task description based on analysis',
+    tool: 'gemini',
+    mode: 'write',
+    status: 'pending',
+    priority: 1,
+    files: [],
+    created_at: getUtc8ISOString(),
+    completed_at: null
+  }
+  // ... more tasks
+]
+```
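The task list in Step 3 uses zero-padded IDs such as `task-001`. A small factory keeps that shape consistent when several tasks are generated; the sketch below assumes a hypothetical `createTask` helper, uses the array position as the priority, and takes the timestamp as a parameter instead of calling `getUtc8ISOString()`.

```javascript
// Hypothetical factory: field names mirror the task object above,
// while the priority rule and the `now` parameter are assumptions.
function createTask(index, description, now = new Date().toISOString()) {
  return {
    id: `task-${String(index + 1).padStart(3, '0')}`, // task-001, task-002, ...
    description,
    tool: 'gemini',
    mode: 'write',
    status: 'pending',
    priority: index + 1,
    files: [],
    created_at: now,
    completed_at: null
  }
}

const tasks = ['Add endpoint', 'Write tests'].map((description, i) => createTask(i, description))
console.log(tasks.map(t => t.id).join(', '))
```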
+
+### Step 4: Initialize Progress Document
+
+```javascript
+const progressPath = `${progressDir}/develop.md`
+
+const progressInitial = `# Development Progress
+
+**Loop ID**: ${loopId}
+**Task**: ${taskDescription}
+**Started**: ${getUtc8ISOString()}
+
+---
+
+## Task List
+
+${tasks.map((t, i) => `${i + 1}. [ ] ${t.description}`).join('\n')}
+
+---
+
+## Progress Timeline
+
+`
+
+Write(progressPath, progressInitial)
+```
+
+### Step 5: Update State
+
+```javascript
+const skillState = {
+  current_action: 'init',
+  last_action: null,
+  completed_actions: [],
+  mode: mode,
+
+  develop: {
+    total: tasks.length,
+    completed: 0,
+    current_task: null,
+    tasks: tasks,
+    last_progress_at: null
+  },
+
+  debug: {
+    active_bug: null,
+    hypotheses_count: 0,
+    hypotheses: [],
+    confirmed_hypothesis: null,
+    iteration: 0,
+    last_analysis_at: null
+  },
+
+  validate: {
+    pass_rate: 0,
+    coverage: 0,
+    test_results: [],
+    passed: false,
+    failed_tests: [],
+    last_run_at: null
+  },
+
+  errors: []
+}
+
+state.skill_state = skillState
+state.updated_at = getUtc8ISOString()
+Write(`.workflow/.loop/${loopId}.json`, JSON.stringify(state, null, 2))
+```
+
+## Output Format
+
+```
+ACTION_RESULT:
+- action: INIT
+- status: success
+- message: Session initialized with {N} development tasks
+
+FILES_UPDATED:
+- .workflow/.loop/{loopId}.json: skill_state initialized
+- .workflow/.loop/{loopId}.progress/develop.md: Progress document created
+
+NEXT_ACTION_NEEDED: {DEVELOP (auto) | MENU (interactive)}
+```
+
+## Error Handling
+
+| Error Type | Recovery |
+|------------|----------|
+| Directory creation failed | Report error, stop |
+| Task analysis failed | Create single generic task |
+| State write failed | Retry once, then stop |
+
+## Next Actions
+
+- Success (auto mode): `DEVELOP`
+- Success (interactive): `MENU`
+- Failed: Report error
--- a/.codex/skills/ccw-loop/actions/action-menu.md
+++ b/.codex/skills/ccw-loop/actions/action-menu.md
@@ -0,0 +1,205 @@
+# Action: MENU
+
+Display interactive action menu for user selection.
+
+## Purpose
+
+- Show current state summary
+- Display available actions
+- Wait for user selection
+- Return selected action
+
+## Preconditions
+
+- [ ] state.status === 'running'
+- [ ] state.skill_state !== null
+- [ ] mode === 'interactive'
+
+## Execution Steps
+
+### Step 1: Verify Control Signals
+
+```javascript
+const state = JSON.parse(Read(`.workflow/.loop/${loopId}.json`))
+
+if (state.status !== 'running') {
+  return {
+    action: 'MENU',
+    status: 'failed',
+    message: `Cannot show menu: status is ${state.status}`,
+    next_action: state.status === 'paused' ? 'PAUSED' : 'STOPPED'
+  }
+}
+```
+
+### Step 2: Generate Status Summary
+
+```javascript
+// Development progress
+const developProgress = state.skill_state.develop.total > 0
+  ? `${state.skill_state.develop.completed}/${state.skill_state.develop.total} (${((state.skill_state.develop.completed / state.skill_state.develop.total) * 100).toFixed(0)}%)`
+  : 'Not started'
+
+// Debug status
+const debugStatus = state.skill_state.debug.confirmed_hypothesis
+  ? 'Root cause found'
+  : state.skill_state.debug.iteration > 0
+    ? `Iteration ${state.skill_state.debug.iteration}`
+    : 'Not started'
+
+// Validation status
+const validateStatus = state.skill_state.validate.passed
+  ? 'PASSED'
+  : state.skill_state.validate.test_results.length > 0
+    ? `FAILED (${state.skill_state.validate.failed_tests.length} failures)`
+    : 'Not run'
+```
+
+### Step 3: Display Menu
+
+```javascript
+const menuDisplay = `
+================================================================
+  CCW Loop - ${loopId}
+================================================================
+
+  Task: ${state.description}
+  Iteration: ${state.current_iteration}
+
+  +-----------------------------------------------------+
+  |  Phase          |  Status                           |
+  +-----------------------------------------------------+
+  |  Develop        |  ${developProgress.padEnd(35)}|
+  |  Debug          |  ${debugStatus.padEnd(35)}|
+  |  Validate       |  ${validateStatus.padEnd(35)}|
+  +-----------------------------------------------------+
+
+================================================================
+
+MENU_OPTIONS:
+1. [develop] Continue Development - ${state.skill_state.develop.total - state.skill_state.develop.completed} tasks pending
+2. [debug] Start Debugging - ${debugStatus}
+3. [validate] Run Validation - ${validateStatus}
+4. [status] View Detailed Status
+5. [complete] Complete Loop
+6. [exit] Exit (save and quit)
+`
+
+console.log(menuDisplay)
+```
+
+## Output Format
+
+```
+ACTION_RESULT:
+- action: MENU
+- status: success
+- message: {menuDisplay}
+
+MENU_OPTIONS:
+1. [develop] Continue Development - {N} tasks pending
+2. [debug] Start Debugging - {status}
+3. [validate] Run Validation - {status}
+4. [status] View Detailed Status
+5. [complete] Complete Loop
+6. [exit] Exit (save and quit)
+
+NEXT_ACTION_NEEDED: WAITING_INPUT
+```
+
+## User Input Handling
+
+When user provides input, orchestrator sends it back via `send_input`:
+
+```javascript
+// User selects "develop"
+send_input({
+  id: agent,
+  message: `
+## USER INPUT RECEIVED
+
+Action selected: develop
+
+## EXECUTE SELECTED ACTION
+
+Execute DEVELOP action.
+`
+})
+```
+
+## Status Detail View
+
+If user selects "status":
+
+```javascript
+const detailView = `
+## Detailed Status
+
+### Development Progress
+
+${Read(`${progressDir}/develop.md`)?.substring(0, 1000) || 'No progress recorded'}
+
+### Debug Status
+
+${state.skill_state.debug.hypotheses.length > 0
+  ? state.skill_state.debug.hypotheses.map(h => `  ${h.id}: ${h.status} - ${h.description.substring(0, 50)}...`).join('\n')
+  : '  No debugging started'}
+
+### Validation Results
+
+${state.skill_state.validate.test_results.length > 0
+  ? `  Last run: ${state.skill_state.validate.last_run_at}
+  Pass rate: ${state.skill_state.validate.pass_rate}%`
+  : '  No validation run yet'}
+`
+
+console.log(detailView)
+
+// Return to menu
+return {
+  action: 'MENU',
+  status: 'success',
+  message: detailView,
+  next_action: 'MENU'  // Show menu again
+}
+```
+
+## Exit Handling
+
+If user selects "exit":
+
+```javascript
+// Save current state
+state.status = 'user_exit'
+state.updated_at = getUtc8ISOString()
+Write(`.workflow/.loop/${loopId}.json`, JSON.stringify(state, null, 2))
+
+return {
+  action: 'MENU',
+  status: 'success',
+  message: 'Session saved. Use --loop-id to resume.',
+  next_action: 'COMPLETED'
+}
+```
+
+## Action Mapping
+
+| User Selection | Next Action |
+|----------------|-------------|
+| develop | DEVELOP |
+| debug | DEBUG |
+| validate | VALIDATE |
+| status | MENU (after showing details) |
+| complete | COMPLETE |
+| exit | COMPLETED (save and exit) |
+
+## Error Handling
+
+| Error Type | Recovery |
+|------------|----------|
+| Invalid selection | Show menu again |
+| User cancels | Return to menu |
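The action mapping and the "invalid selection" recovery rule above can be combined into one lookup. The sketch below is illustrative only: `ACTION_MAP` and `resolveSelection` are hypothetical names, and normalizing the input by trimming and lowercasing is an assumption.

```javascript
// Mirrors the Action Mapping table; invalid input falls back to MENU.
const ACTION_MAP = {
  develop: 'DEVELOP',
  debug: 'DEBUG',
  validate: 'VALIDATE',
  status: 'MENU',
  complete: 'COMPLETE',
  exit: 'COMPLETED'
}

function resolveSelection(input) {
  const key = String(input).trim().toLowerCase()
  // hasOwnProperty guards against inherited keys such as "constructor".
  return Object.prototype.hasOwnProperty.call(ACTION_MAP, key) ? ACTION_MAP[key] : 'MENU'
}

console.log(resolveSelection(' Develop '), resolveSelection('bogus'))
```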
+
+## Next Actions
+
+Based on user selection - forwarded via `send_input` by orchestrator.
--- a/.codex/skills/ccw-loop/actions/action-validate.md
+++ b/.codex/skills/ccw-loop/actions/action-validate.md
@@ -0,0 +1,250 @@
+# Action: VALIDATE
+
+Run tests, verify the implementation, and record the results in validate.md.
+
+## Purpose
+
+- Run unit tests
+- Run integration tests
+- Check code coverage
+- Generate validation report
+- Determine pass/fail status
+
+## Preconditions
+
+- [ ] state.status === 'running'
+- [ ] state.skill_state !== null
+- [ ] (develop.completed > 0) OR (debug.confirmed_hypothesis !== null)
+
+## Execution Steps
+
+### Step 1: Verify Control Signals
+
+```javascript
+const state = JSON.parse(Read(`.workflow/.loop/${loopId}.json`))
+
+if (state.status !== 'running') {
+  return {
+    action: 'VALIDATE',
+    status: 'failed',
+    message: `Cannot validate: status is ${state.status}`,
+    next_action: state.status === 'paused' ? 'PAUSED' : 'STOPPED'
+  }
+}
+```
+
+### Step 2: Detect Test Framework
+
+```javascript
+const packageJson = JSON.parse(Read('package.json') || '{}')
+// scripts.* holds the script body, not a runnable command, so invoke scripts through npm
+const testScript = 'npm test'
+const coverageScript = packageJson.scripts?.['test:coverage'] ? 'npm run test:coverage' : null
+```
+
+### Step 3: Run Tests
+
+```javascript
+const testResult = await Bash({
+  command: testScript,
+  timeout: 300000  // 5 minutes
+})
+
+// Parse test output based on framework
+const testResults = parseTestOutput(testResult.stdout, testResult.stderr)
+```
+
+### Step 4: Run Coverage (if available)
+
+```javascript
+let coverageData = null
+
+if (coverageScript) {
+  const coverageResult = await Bash({
+    command: coverageScript,
+    timeout: 300000
+  })
+
+  coverageData = parseCoverageReport(coverageResult.stdout)
+  Write(`${progressDir}/coverage.json`, JSON.stringify(coverageData, null, 2))
+}
+```
+
+### Step 5: Generate Validation Report
+
+```javascript
+const timestamp = getUtc8ISOString()
+const iteration = (state.skill_state.validate.test_results?.length || 0) + 1
+
+const validationReport = `# Validation Report
+
+**Loop ID**: ${loopId}
+**Task**: ${state.description}
+**Validated**: ${timestamp}
+
+---
+
+## Iteration ${iteration} - Validation Run
+
+### Test Execution Summary
+
+| Metric | Value |
+|--------|-------|
+| Total Tests | ${testResults.total} |
+| Passed | ${testResults.passed} |
+| Failed | ${testResults.failed} |
+| Skipped | ${testResults.skipped} |
+| Duration | ${testResults.duration_ms}ms |
+| **Pass Rate** | **${testResults.total > 0 ? ((testResults.passed / testResults.total) * 100).toFixed(1) : '0.0'}%** |
+
+### Coverage Report
+
+${coverageData ? `
+| File | Statements | Branches | Functions | Lines |
+|------|------------|----------|-----------|-------|
+${coverageData.files.map(f => `| ${f.path} | ${f.statements}% | ${f.branches}% | ${f.functions}% | ${f.lines}% |`).join('\n')}
+
+**Overall Coverage**: ${coverageData.overall.statements}%
+` : '_No coverage data available_'}
+
+### Failed Tests
+
+${testResults.failed > 0 ? testResults.failures.map(f => `
+#### ${f.test_name}
+
+- **Suite**: ${f.suite}
+- **Error**: ${f.error_message}
+`).join('\n') : '_All tests passed_'}
+
+---
+
+## Validation Decision
+
+**Result**: ${testResults.failed === 0 ? 'PASS' : 'FAIL'}
+
+${testResults.failed > 0 ? `
+### Next Actions
+
+1. Review failed tests
+2. Debug failures using DEBUG action
+3. Fix issues and re-run validation
+` : `
+### Next Actions
+
+1. Consider code review
+2. Complete loop
+`}
+`
+
+Write(`${progressDir}/validate.md`, validationReport)
+```
+
+### Step 6: Save Test Results
+
+```javascript
+const testResultsData = {
+  iteration,
+  timestamp,
+  summary: {
+    total: testResults.total,
+    passed: testResults.passed,
+    failed: testResults.failed,
+    skipped: testResults.skipped,
+    pass_rate: testResults.total > 0 ? ((testResults.passed / testResults.total) * 100).toFixed(1) : '0.0',
+    duration_ms: testResults.duration_ms
+  },
+  tests: testResults.tests,
+  failures: testResults.failures,
+  coverage: coverageData?.overall || null
+}
+
+Write(`${progressDir}/test-results.json`, JSON.stringify(testResultsData, null, 2))
+```
+
+### Step 7: Update State
+
+```javascript
+const validationPassed = testResults.failed === 0 && testResults.passed > 0
+
+state.skill_state.validate.test_results.push(testResultsData)
+state.skill_state.validate.pass_rate = parseFloat(testResultsData.summary.pass_rate)
+state.skill_state.validate.coverage = coverageData?.overall?.statements || 0
+state.skill_state.validate.passed = validationPassed
+state.skill_state.validate.failed_tests = testResults.failures.map(f => f.test_name)
+state.skill_state.validate.last_run_at = timestamp
+
+state.skill_state.last_action = 'VALIDATE'
+state.updated_at = timestamp
+Write(`.workflow/.loop/${loopId}.json`, JSON.stringify(state, null, 2))
+```
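The pass/fail rule in Step 7 (no failures and at least one passing test) has an edge case worth spelling out: a run that executes zero tests should not count as a pass, and its pass rate should not be `NaN`. A minimal sketch, assuming a hypothetical `summarizeRun` helper that takes the same counts as `testResults`:

```javascript
// Hypothetical helper: computes the pass rate and the pass/fail verdict for one run.
function summarizeRun({ passed, failed, total }) {
  // Guard against division by zero when no tests were collected.
  const passRate = total > 0 ? Number(((passed / total) * 100).toFixed(1)) : 0
  // An empty run is treated as not validated, matching `passed > 0` in Step 7.
  const validationPassed = failed === 0 && passed > 0
  return { passRate, validationPassed }
}

const mixedRun = summarizeRun({ passed: 2, failed: 1, total: 3 })
const emptyRun = summarizeRun({ passed: 0, failed: 0, total: 0 })
console.log(mixedRun, emptyRun)
```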
+
+## Output Format
+
+```
+ACTION_RESULT:
+- action: VALIDATE
+- status: success
+- message: Validation {PASSED | FAILED} - {pass_count}/{total_count} tests passed
+- state_updates: {
+    "validate.passed": {true | false},
+    "validate.pass_rate": {N},
+    "validate.failed_tests": [{list}]
+  }
+
+FILES_UPDATED:
+- .workflow/.loop/{loopId}.progress/validate.md: Validation report created
+- .workflow/.loop/{loopId}.progress/test-results.json: Test results saved
+- .workflow/.loop/{loopId}.progress/coverage.json: Coverage data saved (if available)
+
+NEXT_ACTION_NEEDED: {COMPLETE | DEBUG | DEVELOP | MENU}
+```
+
+## Next Action Selection
+
+```javascript
+if (validationPassed) {
+  const pendingTasks = state.skill_state.develop.tasks.filter(t => t.status === 'pending')
+  if (pendingTasks.length === 0) {
+    return 'COMPLETE'
+  } else {
+    return 'DEVELOP'
+  }
+} else {
+  // Tests failed - need debugging
+  return 'DEBUG'
+}
+```
+
+## Test Output Parsers
+
+### Jest/Vitest Parser
+
+```javascript
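+function parseJestOutput(stdout) {
+  // Jest prints failures first, e.g. "Tests: 1 failed, 2 passed, 3 total", and omits zero counts
+  const summary = stdout.match(/Tests:\s+(.+)/)?.[1] ?? ''
+  const count = label => Number(summary.match(new RegExp(`(\\d+)\\s+${label}`))?.[1] ?? 0)
+  const [passed, failed, total] = ['passed', 'failed', 'total'].map(count)
+  // ... implementation
+}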
+```
+
+### Pytest Parser
+
+```javascript
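+function parsePytestOutput(stdout) {
+  // pytest lists "failed" before "passed" (e.g. "1 failed, 2 passed in 0.12s") and omits zero counts
+  const passed = Number(stdout.match(/(\d+)\s+passed/)?.[1] ?? 0)
+  const failed = Number(stdout.match(/(\d+)\s+failed/)?.[1] ?? 0)
+  // ... implementation
+}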
+```
+
+## Error Handling
+
+| Error Type | Recovery |
+|------------|----------|
+| Tests don't run | Check test script config, report error |
+| All tests fail | Suggest DEBUG action |
+| Coverage tool missing | Skip coverage, run tests only |
+| Timeout | Increase timeout or split tests |
+
+## Next Actions
+
+- Validation passed, no pending: `COMPLETE`
+- Validation passed, has pending: `DEVELOP`
+- Validation failed: `DEBUG`