feat: Implement workflow phases for test generation and execution

- Added Phase 1: Session Start to detect input mode and create test workflow session. - Added Phase 2: Test Context Gather to gather test context via coverage analysis or codebase scan. - Added Phase 3: Test Concept Enhanced to analyze test requirements using Gemini and generate multi-layered test requirements. - Added Phase 4: Test Task Generate to create test-specific tasks based on analysis results. - Added Phase 5: Test Cycle Execute to manage iterative test execution and fix cycles with adaptive strategies. - Introduced BottomPanel component for terminal dashboard with Queue and Inspector tabs.
2026-02-15 02:42:45 +08:00 · 2026-02-14 21:35:55 +08:00
parent 0d805efe87
commit d535ab4749
27 changed files with 2004 additions and 2363 deletions
--- a/.claude/skills/workflow-test-fix/phases/01-session-start.md
+++ b/.claude/skills/workflow-test-fix/phases/01-session-start.md
@@ -0,0 +1,60 @@
+# Phase 1: Session Start (session-start)
+
+Detect input mode and create test workflow session.
+
+## Objective
+
+- Detect input mode (session ID vs description)
+- Create test workflow session with appropriate metadata
+
+## Execution
+
+### Step 1.0: Detect Input Mode
+
+```
+// Automatic mode detection based on input pattern
+if (input.startsWith("WFS-")) {
+  MODE = "session"
+  // Load source session to preserve original task description
+  Read(".workflow/active/[sourceSessionId]/workflow-session.json")
+} else {
+  MODE = "prompt"
+}
+```
+
+### Step 1.1: Create Test Session
+
+```
+// Session Mode - preserve original task description
+Skill(skill="workflow:session:start", args="--type test --new \"Test validation for [sourceSessionId]: [originalTaskDescription]\"")
+
+// Prompt Mode - use user's description directly
+Skill(skill="workflow:session:start", args="--type test --new \"Test generation for: [description]\"")
+```
+
+**Parse Output**:
+- Extract: `SESSION_ID: WFS-test-[slug]` (store as `testSessionId`)
+
+**Validation**:
+- Session Mode: Source session `.workflow/active/[sourceSessionId]/` exists with completed IMPL tasks
+- Both Modes: New test session directory created with metadata
+
+**TodoWrite**: Mark step 1.1 completed, step 1.2 in_progress
+
+### Session Metadata
+
+**File**: `workflow-session.json`
+
+| Mode | Fields |
+|------|--------|
+| **Session** | `type: "test"`, `source_session_id: "[sourceId]"` |
+| **Prompt** | `type: "test"` (no source_session_id) |
+
+## Output
+
+- **Variable**: `testSessionId` (WFS-test-xxx)
+- **Variable**: `MODE` (session | prompt)
+
+## Next Phase
+
+Continue to [Phase 2: Test Context Gather](02-test-context-gather.md).
--- a/.claude/skills/workflow-test-fix/phases/01-test-fix-gen.md
+++ b/.claude/skills/workflow-test-fix/phases/01-test-fix-gen.md
@@ -1,309 +0,0 @@
-# Phase 1: Test Fix Generation (test-fix-gen)
-
-Create test-fix workflow session with progressive test layers (L0-L3), AI code validation, and test task generation. This phase runs 5 internal steps sequentially, calling existing sub-commands via Skill().
-
-## Objective
-
- Detect input mode (session ID vs description)
- Create test workflow session
- Gather test context (coverage analysis or codebase scan)
- Analyze test requirements with Gemini (L0-L3 layers)
- Generate test task JSONs via action-planning-agent
-
-## Test Strategy Overview
-
-This workflow generates tests using **Progressive Test Layers (L0-L3)**:
-
-| Layer | Name | Focus |
-|-------|------|-------|
-| **L0** | Static Analysis | Compilation, imports, types, AI code issues |
-| **L1** | Unit Tests | Function/class behavior (happy/negative/edge cases) |
-| **L2** | Integration Tests | Component interactions, API contracts, failure modes |
-| **L3** | E2E Tests | User journeys, critical paths (optional) |
-
-**Key Features**:
- **AI Code Issue Detection** - Validates against common AI-generated code problems (hallucinated imports, placeholder code, mock leakage, etc.)
- **Project Type Detection** - Applies appropriate test templates (React, Node API, CLI, Library, etc.)
- **Quality Gates** - IMPL-001.3 (code validation) and IMPL-001.5 (test quality) ensure high standards
-
-**Detailed specifications**: See `/workflow:tools:test-task-generate` for complete L0-L3 requirements and quality thresholds.
-
-## Execution
-
-### Step 1.0: Detect Input Mode
-
-```
-// Automatic mode detection based on input pattern
-if (input.startsWith("WFS-")) {
-  MODE = "session"
-  // Load source session to preserve original task description
-  Read(".workflow/active/[sourceSessionId]/workflow-session.json")
-} else {
-  MODE = "prompt"
-}
-```
-
-### Step 1.1: Create Test Session
-
-```
-// Session Mode - preserve original task description
-Skill(skill="workflow:session:start", args="--type test --new \"Test validation for [sourceSessionId]: [originalTaskDescription]\"")
-
-// Prompt Mode - use user's description directly
-Skill(skill="workflow:session:start", args="--type test --new \"Test generation for: [description]\"")
-```
-
-**Parse Output**:
- Extract: `SESSION_ID: WFS-test-[slug]` (store as `testSessionId`)
-
-**Validation**:
- Session Mode: Source session `.workflow/active/[sourceSessionId]/` exists with completed IMPL tasks
- Both Modes: New test session directory created with metadata
-
-**TodoWrite**: Mark step 1.1 completed, step 1.2 in_progress
-
-### Step 1.2: Gather Test Context
-
-```
-// Session Mode - gather from source session
-Skill(skill="workflow:tools:test-context-gather", args="--session [testSessionId]")
-
-// Prompt Mode - gather from codebase
-Skill(skill="workflow:tools:context-gather", args="--session [testSessionId] \"[task_description]\"")
-```
-
-**Input**: `testSessionId` from Step 1.1
-
-**Parse Output**:
- Extract: context package path (store as `contextPath`)
- Pattern: `.workflow/active/[testSessionId]/.process/[test-]context-package.json`
-
-**Validation**:
- Context package file exists and is valid JSON
- Contains coverage analysis (session mode) or codebase analysis (prompt mode)
- Test framework detected
-
-**TodoWrite Update (tasks attached)**:
-```json
-[
-  {"content": "Phase 1: Test Generation", "status": "in_progress"},
-  {"content": "  → Create test session", "status": "completed"},
-  {"content": "  → Gather test context", "status": "in_progress"},
-  {"content": "    → Load source/codebase context", "status": "in_progress"},
-  {"content": "    → Analyze test coverage", "status": "pending"},
-  {"content": "    → Generate context package", "status": "pending"},
-  {"content": "  → Test analysis (Gemini)", "status": "pending"},
-  {"content": "  → Generate test tasks", "status": "pending"},
-  {"content": "Phase 2: Test Cycle Execution", "status": "pending"}
-]
-```
-
-**TodoWrite Update (tasks collapsed)**:
-```json
-[
-  {"content": "Phase 1: Test Generation", "status": "in_progress"},
-  {"content": "  → Create test session", "status": "completed"},
-  {"content": "  → Gather test context", "status": "completed"},
-  {"content": "  → Test analysis (Gemini)", "status": "pending"},
-  {"content": "  → Generate test tasks", "status": "pending"},
-  {"content": "Phase 2: Test Cycle Execution", "status": "pending"}
-]
-```
-
-### Step 1.3: Test Generation Analysis
-
-```
-Skill(skill="workflow:tools:test-concept-enhanced", args="--session [testSessionId] --context [contextPath]")
-```
-
-**Input**:
- `testSessionId` from Step 1.1
- `contextPath` from Step 1.2
-
-**Expected Behavior**:
- Use Gemini to analyze coverage gaps
- Detect project type and apply appropriate test templates
- Generate **multi-layered test requirements** (L0-L3)
- Scan for AI code issues
- Generate `TEST_ANALYSIS_RESULTS.md`
-
-**Output**: `.workflow/[testSessionId]/.process/TEST_ANALYSIS_RESULTS.md`
-
-**Validation** - TEST_ANALYSIS_RESULTS.md must include:
- Project Type Detection (with confidence)
- Coverage Assessment (current vs target)
- Test Framework & Conventions
- Multi-Layered Test Plan (L0-L3)
- AI Issue Scan Results
- Test Requirements by File (with layer annotations)
- Quality Assurance Criteria
- Success Criteria
-
-**Note**: Detailed specifications for project types, L0-L3 layers, and AI issue detection are defined in `/workflow:tools:test-concept-enhanced`.
-
-### Step 1.4: Generate Test Tasks
-
-```
-Skill(skill="workflow:tools:test-task-generate", args="--session [testSessionId]")
-```
-
-**Input**: `testSessionId` from Step 1.1
-
-**Note**: test-task-generate invokes action-planning-agent to generate test-specific IMPL_PLAN.md and task JSONs based on TEST_ANALYSIS_RESULTS.md.
-
-**Expected Output** (minimum 4 tasks):
-
-| Task | Type | Agent | Purpose |
-|------|------|-------|---------|
-| IMPL-001 | test-gen | @code-developer | Test understanding & generation (L1-L3) |
-| IMPL-001.3 | code-validation | @test-fix-agent | Code validation gate (L0 + AI issues) |
-| IMPL-001.5 | test-quality-review | @test-fix-agent | Test quality gate |
-| IMPL-002 | test-fix | @test-fix-agent | Test execution & fix cycle |
-
-**Validation**:
- `.workflow/active/[testSessionId]/.task/IMPL-001.json` exists
- `.workflow/active/[testSessionId]/.task/IMPL-001.3-validation.json` exists
- `.workflow/active/[testSessionId]/.task/IMPL-001.5-review.json` exists
- `.workflow/active/[testSessionId]/.task/IMPL-002.json` exists
- `.workflow/active/[testSessionId]/IMPL_PLAN.md` exists
- `.workflow/active/[testSessionId]/TODO_LIST.md` exists
-
-### Step 1.5: Return Summary
-
-**Return to Orchestrator**:
-```
-Test-fix workflow created successfully!
-
-Input: [original input]
-Mode: [Session|Prompt]
-Test Session: [testSessionId]
-
-Tasks Created:
- IMPL-001: Test Understanding & Generation (@code-developer)
- IMPL-001.3: Code Validation Gate - AI Error Detection (@test-fix-agent)
- IMPL-001.5: Test Quality Gate - Static Analysis & Coverage (@test-fix-agent)
- IMPL-002: Test Execution & Fix Cycle (@test-fix-agent)
-
-Quality Thresholds:
- Code Validation: Zero CRITICAL issues, zero compilation errors
- Minimum Coverage: 80% line, 70% branch
- Static Analysis: Zero critical anti-patterns
- Max Fix Iterations: 5
-
-Review artifacts:
- Test plan: .workflow/[testSessionId]/IMPL_PLAN.md
- Task list: .workflow/[testSessionId]/TODO_LIST.md
- Analysis: .workflow/[testSessionId]/.process/TEST_ANALYSIS_RESULTS.md
-```
-
-**CRITICAL - Next Step**: Auto-continue to Phase 2: Test Cycle Execution.
-Pass `testSessionId` to Phase 2 for test execution pipeline. Do NOT wait for user confirmation — the unified pipeline continues automatically.
-
-## Execution Flow Diagram
-
-```
-User triggers: /workflow:test-fix-gen "Test user authentication"
-  ↓
-[Input Detection] → MODE: prompt
-  ↓
-[TodoWrite Init] Phase 1 sub-steps + Phase 2
-  ↓
-Step 1.1: Create Test Session
-  → /workflow:session:start --type test
-  → testSessionId extracted (WFS-test-user-auth)
-  ↓
-Step 1.2: Gather Test Context (Skill executed)
-  → ATTACH 3 sub-tasks: ← ATTACHED
-    - → Load codebase context
-    - → Analyze test coverage
-    - → Generate context package
-  → Execute sub-tasks sequentially
-  → COLLAPSE tasks ← COLLAPSED
-  → contextPath extracted
-  ↓
-Step 1.3: Test Generation Analysis (Skill executed)
-  → ATTACH 3 sub-tasks: ← ATTACHED
-    - → Analyze coverage gaps with Gemini
-    - → Detect AI code issues (L0.5)
-    - → Generate L0-L3 test requirements
-  → Execute sub-tasks sequentially
-  → COLLAPSE tasks ← COLLAPSED
-  → TEST_ANALYSIS_RESULTS.md created
-  ↓
-Step 1.4: Generate Test Tasks (Skill executed)
-  → Single agent task (test-task-generate → action-planning-agent)
-  → Agent autonomously generates:
-    - IMPL-001.json (test generation)
-    - IMPL-001.3-validation.json (code validation)
-    - IMPL-001.5-review.json (test quality)
-    - IMPL-002.json (test execution)
-    - IMPL_PLAN.md
-    - TODO_LIST.md
-  ↓
-Step 1.5: Return Summary
-  → Display summary
-  → Phase 1 complete
-```
-
-## Output Artifacts
-
-### Directory Structure
-
-```
-.workflow/active/WFS-test-[session]/
-├── workflow-session.json              # Session metadata
-├── IMPL_PLAN.md                       # Test generation and execution strategy
-├── TODO_LIST.md                       # Task checklist
-├── .task/
-│   ├── IMPL-001.json                  # Test understanding & generation
-│   ├── IMPL-001.3-validation.json     # Code validation gate
-│   ├── IMPL-001.5-review.json         # Test quality gate
-│   ├── IMPL-002.json                  # Test execution & fix cycle
-│   └── IMPL-*.json                    # Additional tasks (if applicable)
-└── .process/
-    ├── [test-]context-package.json    # Context and coverage analysis
-    └── TEST_ANALYSIS_RESULTS.md       # Test requirements and strategy (L0-L3)
-```
-
-### Session Metadata
-
-**File**: `workflow-session.json`
-
-| Mode | Fields |
-|------|--------|
-| **Session** | `type: "test"`, `source_session_id: "[sourceId]"` |
-| **Prompt** | `type: "test"` (no source_session_id) |
-
-## Error Handling
-
-| Step | Error Condition | Action |
-|------|----------------|--------|
-| 1.1 | Source session not found (session mode) | Return error with session ID |
-| 1.1 | No completed IMPL tasks (session mode) | Return error, source incomplete |
-| 1.2 | Context gathering failed | Return error, check source artifacts |
-| 1.3 | Gemini analysis failed | Return error, check context package |
-| 1.4 | Task generation failed | Retry once, then return error |
-
-## Usage Examples
-
-```bash
-# Session Mode - test validation for completed implementation
-/workflow:test-fix-gen WFS-user-auth-v2
-
-# Prompt Mode - text description
-/workflow:test-fix-gen "Test the user authentication API endpoints in src/auth/api.ts"
-
-# Prompt Mode - file reference
-/workflow:test-fix-gen ./docs/api-requirements.md
-```
-
-## Output
-
- **Variable**: `testSessionId` (WFS-test-xxx)
- **Variable**: `contextPath` (context-package.json path)
- **Files**: IMPL_PLAN.md, IMPL-*.json (4+), TODO_LIST.md, TEST_ANALYSIS_RESULTS.md
- **TodoWrite**: Mark Phase 1 completed, Phase 2 in_progress
-
-## Next Phase
-
-Return to orchestrator, then auto-continue to [Phase 2: Test Cycle Execution](02-test-cycle-execute.md).
--- a/.claude/skills/workflow-test-fix/phases/02-test-context-gather.md
+++ b/.claude/skills/workflow-test-fix/phases/02-test-context-gather.md
@@ -0,0 +1,66 @@
+# Phase 2: Test Context Gather (test-context-gather)
+
+Gather test context via coverage analysis or codebase scan.
+
+## Objective
+
+- Gather test context (coverage analysis or codebase scan)
+- Generate context package for downstream analysis
+
+## Execution
+
+### Step 1.2: Gather Test Context
+
+```
+// Session Mode - gather from source session
+Skill(skill="workflow:tools:test-context-gather", args="--session [testSessionId]")
+
+// Prompt Mode - gather from codebase
+Skill(skill="workflow:tools:context-gather", args="--session [testSessionId] \"[task_description]\"")
+```
+
+**Input**: `testSessionId` from Phase 1
+
+**Parse Output**:
+- Extract: context package path (store as `contextPath`)
+- Pattern: `.workflow/active/[testSessionId]/.process/[test-]context-package.json`
+
+**Validation**:
+- Context package file exists and is valid JSON
+- Contains coverage analysis (session mode) or codebase analysis (prompt mode)
+- Test framework detected
+
+**TodoWrite Update (tasks attached)**:
+```json
+[
+  {"content": "Phase 1: Test Generation", "status": "in_progress"},
+  {"content": "  → Create test session", "status": "completed"},
+  {"content": "  → Gather test context", "status": "in_progress"},
+  {"content": "    → Load source/codebase context", "status": "in_progress"},
+  {"content": "    → Analyze test coverage", "status": "pending"},
+  {"content": "    → Generate context package", "status": "pending"},
+  {"content": "  → Test analysis (Gemini)", "status": "pending"},
+  {"content": "  → Generate test tasks", "status": "pending"},
+  {"content": "Phase 2: Test Cycle Execution", "status": "pending"}
+]
+```
+
+**TodoWrite Update (tasks collapsed)**:
+```json
+[
+  {"content": "Phase 1: Test Generation", "status": "in_progress"},
+  {"content": "  → Create test session", "status": "completed"},
+  {"content": "  → Gather test context", "status": "completed"},
+  {"content": "  → Test analysis (Gemini)", "status": "pending"},
+  {"content": "  → Generate test tasks", "status": "pending"},
+  {"content": "Phase 2: Test Cycle Execution", "status": "pending"}
+]
+```
+
+## Output
+
+- **Variable**: `contextPath` (context-package.json path)
+
+## Next Phase
+
+Continue to [Phase 3: Test Concept Enhanced](03-test-concept-enhanced.md).
--- a/.claude/skills/workflow-test-fix/phases/03-test-concept-enhanced.md
+++ b/.claude/skills/workflow-test-fix/phases/03-test-concept-enhanced.md
@@ -0,0 +1,51 @@
+# Phase 3: Test Concept Enhanced (test-concept-enhanced)
+
+Analyze test requirements with Gemini using progressive L0-L3 test layers.
+
+## Objective
+
+- Use Gemini to analyze coverage gaps
+- Detect project type and apply appropriate test templates
+- Generate multi-layered test requirements (L0-L3)
+- Scan for AI code issues
+
+## Execution
+
+### Step 1.3: Test Generation Analysis
+
+```
+Skill(skill="workflow:tools:test-concept-enhanced", args="--session [testSessionId] --context [contextPath]")
+```
+
+**Input**:
+- `testSessionId` from Phase 1
+- `contextPath` from Phase 2
+
+**Expected Behavior**:
+- Use Gemini to analyze coverage gaps
+- Detect project type and apply appropriate test templates
+- Generate **multi-layered test requirements** (L0-L3)
+- Scan for AI code issues
+- Generate `TEST_ANALYSIS_RESULTS.md`
+
+**Output**: `.workflow/[testSessionId]/.process/TEST_ANALYSIS_RESULTS.md`
+
+**Validation** - TEST_ANALYSIS_RESULTS.md must include:
+- Project Type Detection (with confidence)
+- Coverage Assessment (current vs target)
+- Test Framework & Conventions
+- Multi-Layered Test Plan (L0-L3)
+- AI Issue Scan Results
+- Test Requirements by File (with layer annotations)
+- Quality Assurance Criteria
+- Success Criteria
+
+**Note**: Detailed specifications for project types, L0-L3 layers, and AI issue detection are defined in `/workflow:tools:test-concept-enhanced`.
+
+## Output
+
+- **File**: `.workflow/[testSessionId]/.process/TEST_ANALYSIS_RESULTS.md`
+
+## Next Phase
+
+Continue to [Phase 4: Test Task Generate](04-test-task-generate.md).
--- a/.claude/skills/workflow-test-fix/phases/04-test-task-generate.md
+++ b/.claude/skills/workflow-test-fix/phases/04-test-task-generate.md
@@ -0,0 +1,46 @@
+# Phase 4: Test Task Generate (test-task-generate)
+
+Generate test task JSONs via action-planning-agent.
+
+## Objective
+
+- Generate test-specific IMPL_PLAN.md and task JSONs based on TEST_ANALYSIS_RESULTS.md
+- Create minimum 4 tasks covering test generation, code validation, quality review, and test execution
+
+## Execution
+
+### Step 1.4: Generate Test Tasks
+
+```
+Skill(skill="workflow:tools:test-task-generate", args="--session [testSessionId]")
+```
+
+**Input**: `testSessionId` from Phase 1
+
+**Note**: test-task-generate invokes action-planning-agent to generate test-specific IMPL_PLAN.md and task JSONs based on TEST_ANALYSIS_RESULTS.md.
+
+**Expected Output** (minimum 4 tasks):
+
+| Task | Type | Agent | Purpose |
+|------|------|-------|---------|
+| IMPL-001 | test-gen | @code-developer | Test understanding & generation (L1-L3) |
+| IMPL-001.3 | code-validation | @test-fix-agent | Code validation gate (L0 + AI issues) |
+| IMPL-001.5 | test-quality-review | @test-fix-agent | Test quality gate |
+| IMPL-002 | test-fix | @test-fix-agent | Test execution & fix cycle |
+
+**Validation**:
+- `.workflow/active/[testSessionId]/.task/IMPL-001.json` exists
+- `.workflow/active/[testSessionId]/.task/IMPL-001.3-validation.json` exists
+- `.workflow/active/[testSessionId]/.task/IMPL-001.5-review.json` exists
+- `.workflow/active/[testSessionId]/.task/IMPL-002.json` exists
+- `.workflow/active/[testSessionId]/IMPL_PLAN.md` exists
+- `.workflow/active/[testSessionId]/TODO_LIST.md` exists
+
+## Output
+
+- **Files**: IMPL_PLAN.md, IMPL-*.json (4+), TODO_LIST.md
+- **TodoWrite**: Mark Phase 1-4 completed, Phase 5 in_progress
+
+## Next Phase
+
+Return to orchestrator for summary output, then auto-continue to [Phase 5: Test Cycle Execute](05-test-cycle-execute.md).
--- a/.claude/skills/workflow-test-fix/phases/05-test-cycle-execute.md
+++ b/.claude/skills/workflow-test-fix/phases/05-test-cycle-execute.md
@@ -53,7 +53,7 @@ Load session, tasks, and iteration state.
   └─ Load session, tasks, iteration state
 ```

-**For full-pipeline entry (from Phase 1)**: Use `testSessionId` passed from Phase 1.
+**For full-pipeline entry (from Phase 1-4)**: Use `testSessionId` passed from Phase 4.

 **For direct entry (/workflow:test-cycle-execute)**:
 - `--resume-session="WFS-xxx"` → Use specified session
@@ -526,7 +526,7 @@ The orchestrator automatically creates git commits at key checkpoints to enable
 - **Variable**: `finalPassRate` (percentage)
 - **File**: `test-results.json` (final results)
 - **File**: `iteration-state.json` (full iteration history)
- **TodoWrite**: Mark Phase 2 completed
+- **TodoWrite**: Mark Phase 5 completed

 ## Next Phase