Refactor TDD Task Generation Strategy: Consolidate tasks into single feature-complete units with internal Red-Green-Refactor cycles, enhancing dependency management and reducing task overhead. Update documentation to reflect new task structure, limits, and JSON format for implementation plans and TODO lists.

2026-03-30 20:21:09 +08:00 · 2025-10-17 22:21:17 +08:00
parent b65cd49444
commit 9db53a24cd
3 changed files with 747 additions and 504 deletions
--- a/.claude/commands/workflow/tdd-plan.md
+++ b/.claude/commands/workflow/tdd-plan.md
@@ -95,47 +95,72 @@ TEST_FOCUS: [Test scenarios]
 - Manual: `/workflow:tools:task-generate-tdd --session [sessionId]`
 - Agent: `/workflow:tools:task-generate-tdd --session [sessionId] --agent`
-**Parse**: Extract feature count, chain count, task count
+**Parse**: Extract feature count, task count (not chain count - tasks now contain internal TDD cycles)
 **Validate**:
- IMPL_PLAN.md exists (unified plan with TDD Task Chains section)
+- IMPL_PLAN.md exists (unified plan with TDD Implementation Tasks section)
- TEST-*.json, IMPL-*.json, REFACTOR-*.json exist
+- IMPL-*.json files exist (one per feature, or container + subtasks for complex features)
- TODO_LIST.md exists
+- TODO_LIST.md exists with internal TDD phase indicators
- IMPL tasks include test-fix-cycle configuration
+- Each IMPL task includes:
  - `meta.tdd_workflow: true`
  - `flow_control.implementation_approach` with 3 steps (red/green/refactor)
  - Green phase includes test-fix-cycle configuration
 - IMPL_PLAN.md contains workflow_type: "tdd" in frontmatter
 - Task count ≤10 (compliance with task limit)
 ### Phase 7: TDD Structure Validation & Action Plan Verification (RECOMMENDED)
 **Internal validation first, then recommend external verification**
 **Internal Validation**:
-1. Each feature has TEST → IMPL → REFACTOR chain
+1. Each task contains complete TDD workflow (Red-Green-Refactor internally)
-2. Dependencies: IMPL depends_on TEST, REFACTOR depends_on IMPL
+2. Task structure validation:
-3. Meta fields: tdd_phase correct ("red"/"green"/"refactor")
+   - `meta.tdd_workflow: true` in all IMPL tasks
-4. Agents: TEST uses @code-review-test-agent, IMPL/REFACTOR use @code-developer
+   - `flow_control.implementation_approach` has exactly 3 steps
-5. IMPL tasks contain test-fix-cycle in flow_control for iterative Green phase
+   - Each step has correct `tdd_phase`: "red", "green", "refactor"
 3. Dependency validation:
   - Sequential features: IMPL-N depends_on ["IMPL-(N-1)"] if needed
   - Complex features: IMPL-N.M depends_on ["IMPL-N.(M-1)"] for subtasks
 4. Agent assignment: All IMPL tasks use @code-developer
 5. Test-fix cycle: Green phase step includes test-fix-cycle logic with max_iterations
 6. Task count: Total tasks ≤10 (simple + subtasks)
 **Return Summary**:
 ```
 TDD Planning complete for session: [sessionId]
 Features analyzed: [N]
-TDD chains generated: [N]
+Total tasks: [M] (1 task per simple feature + subtasks for complex features)
-Total tasks: [3N]
+
 Task breakdown:
 - Simple features: [K] tasks (IMPL-1 to IMPL-K)
 - Complex features: [L] features with [P] subtasks
 - Total task count: [M] (within 10-task limit ✅)
 Structure:
- Feature 1: TEST-1.1 → IMPL-1.1 → REFACTOR-1.1
+- IMPL-1: {Feature 1 Name} (Internal: 🔴 Red → 🟢 Green → 🔵 Refactor)
 - IMPL-2: {Feature 2 Name} (Internal: 🔴 Red → 🟢 Green → 🔵 Refactor)
 - IMPL-3: {Complex Feature} (Container)
  - IMPL-3.1: {Sub-feature A} (Internal: 🔴 Red → 🟢 Green → 🔵 Refactor)
  - IMPL-3.2: {Sub-feature B} (Internal: 🔴 Red → 🟢 Green → 🔵 Refactor)
 [...]
-Plan:
+Plans generated:
 - Unified Implementation Plan: .workflow/[sessionId]/IMPL_PLAN.md
-  (includes TDD Task Chains section with workflow_type: "tdd")
+  (includes TDD Implementation Tasks section with workflow_type: "tdd")
 - Task List: .workflow/[sessionId]/TODO_LIST.md
  (with internal TDD phase indicators)
 TDD Configuration:
 - Each task contains complete Red-Green-Refactor cycle
 - Green phase includes test-fix cycle (max 3 iterations)
 - Auto-revert on max iterations reached
 ✅ Recommended Next Steps:
-1. /workflow:action-plan-verify --session [sessionId]  # Verify TDD plan quality
+1. /workflow:action-plan-verify --session [sessionId]  # Verify TDD plan quality and dependencies
 2. /workflow:execute --session [sessionId]  # Start TDD execution
 3. /workflow:tdd-verify [sessionId]  # Post-execution TDD compliance check
-⚠️ Quality Gate: Consider running /workflow:action-plan-verify to validate TDD task dependencies
+⚠️ Quality Gate: Consider running /workflow:action-plan-verify to validate TDD task structure and dependencies
 ```
 ## TodoWrite Pattern
@@ -232,14 +257,18 @@ Supports action-planning-agent for more autonomous TDD planning with:
 ### Workflow Comparison
-| Aspect | Previous | Current |
+| Aspect | Previous | Current (Optimized) |
-|--------|----------|---------|
+|--------|----------|---------------------|
-| **Phases** | 5 | 6 (test coverage analysis) |
+| **Phases** | 6 (with test coverage) | 7 (added concept verification) |
 | **Context** | Greenfield assumption | Existing codebase aware |
 | **Task Structure** | 1 feature = 3 tasks (TEST/IMPL/REFACTOR) | 1 feature = 1 task (internal TDD cycle) |
 | **Task Count** | 5 features = 15 tasks | 5 features = 5 tasks (70% reduction) |
 | **Green Phase** | Single implementation | Iterative with fix cycle |
 | **Failure Handling** | Manual intervention | Auto-diagnose + fix + revert |
 | **Test Analysis** | None | Deep coverage analysis |
 | **Feedback Loop** | Post-execution | During Green phase |
 | **Task Management** | High overhead (15 tasks) | Low overhead (5 tasks) |
 | **Execution Efficiency** | Frequent context switching | Continuous context per feature |
 ### Migration Notes
@@ -251,19 +280,26 @@ Supports action-planning-agent for more autonomous TDD planning with:
 **Session Structure**:
 ```
 .workflow/WFS-xxx/
-├── IMPL_PLAN.md (unified plan with TDD Task Chains section)
+├── IMPL_PLAN.md (unified plan with TDD Implementation Tasks section)
-├── TODO_LIST.md
+├── TODO_LIST.md (with internal TDD phase indicators)
 ├── .process/
 │   ├── context-package.json
 │   ├── test-context-package.json
 │   ├── ANALYSIS_RESULTS.md (enhanced with TDD breakdown)
-│   └── green-fix-iteration-*.md (fix logs)
+│   └── green-fix-iteration-*.md (fix logs from Green phase cycles)
 └── .task/
-    ├── TEST-*.json (Red phase)
+    ├── IMPL-1.json (Complete TDD task: Red-Green-Refactor internally)
-    ├── IMPL-*.json (Green phase with test-fix-cycle)
+    ├── IMPL-2.json (Complete TDD task)
-    └── REFACTOR-*.json (Refactor phase)
+    ├── IMPL-3.json (Complex feature container, if needed)
    ├── IMPL-3.1.json (Complex feature subtask, if needed)
    └── IMPL-3.2.json (Complex feature subtask, if needed)
 ```
 **File Count Comparison**:
 - **Old structure**: 5 features = 15 task files (TEST/IMPL/REFACTOR × 5)
 - **New structure**: 5 features = 5 task files (IMPL-N × 5)
 - **Complex features**: Add container + subtasks only when necessary
 **Configuration Options** (in IMPL tasks):
 - `meta.max_iterations`: Fix attempts (default: 3)
 - `meta.use_codex`: Auto-fix mode (default: false)
--- a/.claude/commands/workflow/test-gen.md
+++ b/.claude/commands/workflow/test-gen.md
@@ -0,0 +1,337 @@
 ---
 name: test-gen
 description: Create independent test-fix workflow session by analyzing completed implementation
 argument-hint: "[--use-codex] [--cli-execute] source-session-id"
 allowed-tools: SlashCommand(*), TodoWrite(*), Read(*), Bash(*)
 ---
 # Workflow Test Generation Command (/workflow:test-gen)
 ## Coordinator Role
 **This command is a pure orchestrator**: Creates an independent test-fix workflow session for validating a completed implementation. It reuses the standard planning toolchain with automatic cross-session context gathering.
 **Core Principles**:
 - **Session Isolation**: Creates new `WFS-test-[source]` session to keep verification separate from implementation
 - **Context-First**: Prioritizes gathering code changes and summaries from source session
 - **Format Reuse**: Creates standard `IMPL-*.json` task, using `meta.type: "test-fix"` for agent assignment
 - **Parameter Simplification**: Tools auto-detect test session type via metadata, no manual cross-session parameters needed
 - **Manual First**: Default to manual fixes, use `--use-codex` flag for automated Codex fix application
 **Execution Flow**:
 1. Initialize TodoWrite → Create test session → Parse session ID
 2. Gather cross-session context (automatic) → Parse context path
 3. Analyze implementation with concept-enhanced → Parse ANALYSIS_RESULTS.md
 4. Generate test task from analysis → Return summary
 **⚠️ Command Scope**: This command ONLY prepares test workflow artifacts. It does NOT execute tests or implementation. Task execution requires separate user action.
 ## Core Rules
 1. **Start Immediately**: First action is TodoWrite initialization, second action is Phase 1 test session creation
 2. **No Preliminary Analysis**: Do not read files or analyze before Phase 1
 3. **Parse Every Output**: Extract required data from each phase for next phase
 4. **Sequential Execution**: Each phase depends on previous phase's output
 5. **Complete All Phases**: Do not return to user until Phase 5 completes (summary returned)
 6. **Track Progress**: Update TodoWrite after every phase completion
 7. **Automatic Detection**: context-gather auto-detects test session and gathers source session context
 8. **Parse --use-codex Flag**: Extract flag from arguments and pass to Phase 4 (test-task-generate)
 9. **⚠️ Command Boundary**: This command ends at Phase 5 summary. Test execution is NOT part of this command.
 ## 5-Phase Execution
 ### Phase 1: Create Test Session
 **Command**: `SlashCommand(command="/workflow:session:start --new \"Test validation for [sourceSessionId]\"")`
 **Input**: `sourceSessionId` from user argument (e.g., `WFS-user-auth`)
 **Expected Behavior**:
 - Creates new session with pattern `WFS-test-[source-slug]` (e.g., `WFS-test-user-auth`)
 - Writes metadata to `workflow-session.json`:
  - `workflow_type: "test_session"`
  - `source_session_id: "[sourceSessionId]"`
 - Returns new session ID for subsequent phases
 **Parse Output**:
 - Extract: new test session ID (store as `testSessionId`)
 - Pattern: `WFS-test-[slug]`
 **Validation**:
 - Source session `.workflow/[sourceSessionId]/` exists
 - Source session has completed IMPL tasks (`.summaries/IMPL-*-summary.md`)
 - New test session directory created
 - Metadata includes `workflow_type` and `source_session_id`
 **TodoWrite**: Mark phase 1 completed, phase 2 in_progress
 ---
 ### Phase 2: Gather Test Context
 **Command**: `SlashCommand(command="/workflow:tools:test-context-gather --session [testSessionId]")`
 **Input**: `testSessionId` from Phase 1 (e.g., `WFS-test-user-auth`)
 **Expected Behavior**:
 - Load source session implementation context and summaries
 - Analyze test coverage using MCP tools (find existing tests)
 - Identify files requiring tests (coverage gaps)
 - Detect test framework and conventions
 - Generate `test-context-package.json`
 **Parse Output**:
 - Extract: test context package path (store as `testContextPath`)
 - Pattern: `.workflow/[testSessionId]/.process/test-context-package.json`
 **Validation**:
 - Test context package created
 - Contains source session summaries
 - Includes coverage gap analysis
 - Test framework detected
 - Test conventions documented
 **TodoWrite**: Mark phase 2 completed, phase 3 in_progress
 ---
 ### Phase 3: Test Generation Analysis
 **Command**: `SlashCommand(command="/workflow:tools:test-concept-enhanced --session [testSessionId] --context [testContextPath]")`
 **Input**:
 - `testSessionId` from Phase 1
 - `testContextPath` from Phase 2
 **Expected Behavior**:
 - Use Gemini to analyze coverage gaps and implementation context
 - Study existing test patterns and conventions
 - Generate test requirements for each missing test file
 - Design test generation strategy
 - Generate `TEST_ANALYSIS_RESULTS.md`
 **Parse Output**:
 - Verify `.workflow/[testSessionId]/.process/TEST_ANALYSIS_RESULTS.md` created
 - Contains test requirements and generation strategy
 - Lists test files to create with specifications
 **Validation**:
 - TEST_ANALYSIS_RESULTS.md exists with complete sections:
  - Coverage Assessment
  - Test Framework & Conventions
  - Test Requirements by File
  - Test Generation Strategy
  - Implementation Targets (test files to create)
  - Success Criteria
 **TodoWrite**: Mark phase 3 completed, phase 4 in_progress
 ---
 ### Phase 4: Generate Test Tasks
 **Command**: `SlashCommand(command="/workflow:tools:test-task-generate [--use-codex] [--cli-execute] --session [testSessionId]")`
 **Input**:
 - `testSessionId` from Phase 1
 - `--use-codex` flag (if present in original command) - Controls IMPL-002 fix mode
 - `--cli-execute` flag (if present in original command) - Controls IMPL-001 generation mode
 **Expected Behavior**:
 - Parse TEST_ANALYSIS_RESULTS.md from Phase 3
 - Extract test requirements and generation strategy
 - Generate **TWO task JSON files**:
  - **IMPL-001.json**: Test Generation task (calls @code-developer)
  - **IMPL-002.json**: Test Execution and Fix Cycle task (calls @test-fix-agent)
 - Generate IMPL_PLAN.md with test generation and execution strategy
 - Generate TODO_LIST.md with both tasks
 **Parse Output**:
 - Verify `.workflow/[testSessionId]/.task/IMPL-001.json` exists (test generation)
 - Verify `.workflow/[testSessionId]/.task/IMPL-002.json` exists (test execution & fix)
 - Verify `.workflow/[testSessionId]/IMPL_PLAN.md` created
 - Verify `.workflow/[testSessionId]/TODO_LIST.md` created
 **Validation - IMPL-001.json (Test Generation)**:
 - Task ID: `IMPL-001`
 - `meta.type: "test-gen"`
 - `meta.agent: "@code-developer"`
 - `context.requirements`: Generate tests based on TEST_ANALYSIS_RESULTS.md
 - `flow_control.pre_analysis`: Load TEST_ANALYSIS_RESULTS.md and test context
 - `flow_control.implementation_approach`: Test generation steps
 - `flow_control.target_files`: Test files to create from analysis section 5
 **Validation - IMPL-002.json (Test Execution & Fix)**:
 - Task ID: `IMPL-002`
 - `meta.type: "test-fix"`
 - `meta.agent: "@test-fix-agent"`
 - `meta.use_codex: true|false` (based on --use-codex flag)
 - `context.depends_on: ["IMPL-001"]`
 - `context.requirements`: Execute and fix tests
 - `flow_control.implementation_approach.test_fix_cycle`: Complete cycle specification
  - **Cycle pattern**: test → gemini_diagnose → manual_fix (or codex if --use-codex) → retest
  - **Tools configuration**: Gemini for analysis with bug-fix template, manual or Codex for fixes
  - **Exit conditions**: Success (all pass) or failure (max iterations)
 - `flow_control.implementation_approach.modification_points`: 3-phase execution flow
  - Phase 1: Initial test execution
  - Phase 2: Iterative Gemini diagnosis + manual/Codex fixes (based on flag)
  - Phase 3: Final validation and certification
 **TodoWrite**: Mark phase 4 completed, phase 5 in_progress
 ---
 ### Phase 5: Return Summary (⚠️ Command Ends Here)
 **⚠️ Important**: This is the final phase of `/workflow:test-gen`. The command completes and returns control to the user. No automatic execution occurs.
 **Return to User**:
 ```
 ✅ Test workflow preparation complete!
 Source Session: [sourceSessionId]
 Test Session: [testSessionId]
 Artifacts Created:
 - Test context analysis
 - Test generation strategy
 - Task definitions (IMPL-001, IMPL-002)
 - Implementation plan
 Test Framework: [detected framework]
 Test Files to Generate: [count]
 Fix Mode: [Manual|Codex Automated] (based on --use-codex flag)
 📋 Review Generated Artifacts:
 - Test plan: .workflow/[testSessionId]/IMPL_PLAN.md
 - Task list: .workflow/[testSessionId]/TODO_LIST.md
 - Analysis: .workflow/[testSessionId]/.process/TEST_ANALYSIS_RESULTS.md
 ⚠️ Ready for execution. Use appropriate workflow commands to proceed.
 ```
 **TodoWrite**: Mark phase 5 completed
 **⚠️ Command Boundary**: After this phase, the command terminates and returns to user prompt.
 ---
 ## TodoWrite Pattern
 Track progress through 5 phases:
 ```javascript
 TodoWrite({todos: [
  {"content": "Create independent test session", "status": "in_progress|completed", "activeForm": "Creating test session"},
  {"content": "Gather test coverage context", "status": "pending|in_progress|completed", "activeForm": "Gathering test coverage context"},
  {"content": "Analyze test requirements with Gemini", "status": "pending|in_progress|completed", "activeForm": "Analyzing test requirements"},
  {"content": "Generate test generation and execution tasks", "status": "pending|in_progress|completed", "activeForm": "Generating test tasks"},
  {"content": "Return workflow summary", "status": "pending|in_progress|completed", "activeForm": "Returning workflow summary"}
 ]})
 ```
 Update status to `in_progress` when starting each phase, mark `completed` when done.
 ## Data Flow
 ```
 ┌─────────────────────────────────────────────────────────┐
 │ /workflow:test-gen WFS-user-auth                        │
 ├─────────────────────────────────────────────────────────┤
 │ Phase 1: session-start → WFS-test-user-auth            │
 │   ↓                                                     │
 │ Phase 2: test-context-gather → test-context-package.json│
 │   ↓                                                     │
 │ Phase 3: test-concept-enhanced → TEST_ANALYSIS_RESULTS.md│
 │   ↓                                                     │
 │ Phase 4: test-task-generate → IMPL-001.json + IMPL-002.json│
 │   ↓                                                     │
 │ Phase 5: Return summary                                │
 └─────────────────────────────────────────────────────────┘
         ⚠️ COMMAND ENDS - Control returns to user
 Artifacts Created:
 ├── .workflow/WFS-test-[session]/
 │   ├── workflow-session.json
 │   ├── IMPL_PLAN.md
 │   ├── TODO_LIST.md
 │   ├── .task/
 │   │   ├── IMPL-001.json (test generation task)
 │   │   └── IMPL-002.json (test execution task)
 │   └── .process/
 │       ├── test-context-package.json
 │       └── TEST_ANALYSIS_RESULTS.md
 ```
 ## Session Metadata
 Test session includes `workflow_type: "test_session"` and `source_session_id` for automatic context gathering.
 ## Task Output
 Generates two task definition files:
 - **IMPL-001.json**: Test generation task specification
  - Agent: @code-developer
  - Input: TEST_ANALYSIS_RESULTS.md
  - Output: Test files based on analysis
 - **IMPL-002.json**: Test execution and fix cycle specification
  - Agent: @test-fix-agent
  - Dependency: IMPL-001 must complete first
  - Max iterations: 5
  - Fix mode: Manual or Codex (based on --use-codex flag)
 See `/workflow:tools:test-task-generate` for complete task JSON schemas.
 ## Error Handling
 | Phase | Error | Action |
 |-------|-------|--------|
 | 1 | Source session not found | Return error with source session ID |
 | 1 | No completed IMPL tasks | Return error, source incomplete |
 | 2 | Context gathering failed | Return error, check source artifacts |
 | 3 | Analysis failed | Return error, check context package |
 | 4 | Task generation failed | Retry once, then error with details |
 ## Output Files
 Created in `.workflow/WFS-test-[session]/`:
 - `workflow-session.json` - Session metadata
 - `.process/test-context-package.json` - Coverage analysis
 - `.process/TEST_ANALYSIS_RESULTS.md` - Test requirements
 - `.task/IMPL-001.json` - Test generation task
 - `.task/IMPL-002.json` - Test execution & fix task
 - `IMPL_PLAN.md` - Test plan
 - `TODO_LIST.md` - Task checklist
 ## Task Specifications
 **IMPL-001.json Structure**:
 - `meta.type: "test-gen"`
 - `meta.agent: "@code-developer"`
 - `context.requirements`: Generate tests based on TEST_ANALYSIS_RESULTS.md
 - `flow_control.target_files`: Test files to create
 - `flow_control.implementation_approach`: Test generation strategy
 **IMPL-002.json Structure**:
 - `meta.type: "test-fix"`
 - `meta.agent: "@test-fix-agent"`
 - `meta.use_codex`: true/false (based on --use-codex flag)
 - `context.depends_on: ["IMPL-001"]`
 - `flow_control.implementation_approach.test_fix_cycle`: Complete cycle specification
  - Gemini diagnosis template
  - Fix application mode (manual/codex)
  - Max iterations: 5
 - `flow_control.implementation_approach.modification_points`: 3-phase flow
 See `/workflow:tools:test-task-generate` for complete JSON schemas.
 ## Best Practices
 1. **Prerequisites**: Ensure source session has completed IMPL tasks with summaries
 2. **Clean State**: Commit implementation changes before running test-gen
 3. **Review Artifacts**: Check generated IMPL_PLAN.md and TODO_LIST.md before proceeding
 4. **Understand Scope**: This command only prepares artifacts; it does not execute tests
 ## Related Commands
 - `/workflow:tools:test-context-gather` - Phase 2 (coverage analysis)
 - `/workflow:tools:test-concept-enhanced` - Phase 3 (Gemini test analysis)
 - `/workflow:tools:test-task-generate` - Phase 4 (task generation)
 - `/workflow:execute` - Execute workflow
 - `/workflow:status` - Check progress
--- a/.claude/commands/workflow/tools/task-generate-tdd.md
+++ b/.claude/commands/workflow/tools/task-generate-tdd.md