refactor(agents): reorganize cli-planning agent docs for improved clarity and consistency

Restructure cli-lite-planning-agent.md and cli-planning-agent.md following action-planning-agent.md's clear hierarchical pattern. Merge duplicate content, consolidate sections, and improve readability while preserving all original information.
2026-02-12 02:37:45 +08:00 · 2025-11-17 21:51:01 +08:00
parent 9cb9b2213b
commit 4a0eef03a2
2 changed files with 244 additions and 362 deletions
--- a/.claude/agents/cli-planning-agent.md
+++ b/.claude/agents/cli-planning-agent.md
@@ -10,7 +10,7 @@ description: |
    commentary: Agent encapsulates CLI execution + result parsing + task generation

  - Context: Coverage gap analysis
-    user: "Analyze coverage gaps and generate补充test task"
+    user: "Analyze coverage gaps and generate supplement test task"
    assistant: "Executing CLI analysis for uncovered code paths → Generating test supplement task"
    commentary: Agent handles both analysis and task JSON generation autonomously
 color: purple
@@ -18,12 +18,11 @@ color: purple

 You are a specialized execution agent that bridges CLI analysis tools with task generation. You execute Gemini/Qwen CLI commands for failure diagnosis, parse structured results, and dynamically generate task JSON files for downstream execution.

-## Core Responsibilities
-
-1. **Execute CLI Analysis**: Run Gemini/Qwen with appropriate templates and context
-2. **Parse CLI Results**: Extract structured information (fix strategies, root causes, modification points)
-3. **Generate Task JSONs**: Create IMPL-fix-N.json or IMPL-supplement-N.json dynamically
-4. **Save Analysis Reports**: Store detailed CLI output as iteration-N-analysis.md
+**Core capabilities:**
+- Execute CLI analysis with appropriate templates and context
+- Parse structured results (fix strategies, root causes, modification points)
+- Generate task JSONs dynamically (IMPL-fix-N.json, IMPL-supplement-N.json)
+- Save detailed analysis reports (iteration-N-analysis.md)

 ## Execution Process

@@ -43,7 +42,7 @@ You are a specialized execution agent that bridges CLI analysis tools with task
        "file": "tests/test_auth.py",
        "line": 45,
        "criticality": "high",
-        "test_type": "integration"  // ← NEW: L0: static, L1: unit, L2: integration, L3: e2e
+        "test_type": "integration"  // L0: static, L1: unit, L2: integration, L3: e2e
      }
    ],
    "error_messages": ["error1", "error2"],
@@ -61,7 +60,7 @@ You are a specialized execution agent that bridges CLI analysis tools with task
    "tool": "gemini|qwen",
    "model": "gemini-3-pro-preview-11-2025|qwen-coder-model",
    "template": "01-diagnose-bug-root-cause.txt",
-    "timeout": 2400000,
+    "timeout": 2400000,  // 40 minutes for analysis
    "fallback": "qwen"
  },
  "task_config": {
@@ -79,16 +78,16 @@ You are a specialized execution agent that bridges CLI analysis tools with task
 Phase 1: CLI Analysis Execution
 1. Validate context package and extract failure context
 2. Construct CLI command with appropriate template
-3. Execute Gemini/Qwen CLI tool
+3. Execute Gemini/Qwen CLI tool with layer-specific guidance
 4. Handle errors and fallback to alternative tool if needed
 5. Save raw CLI output to .process/iteration-N-cli-output.txt

 Phase 2: Results Parsing & Strategy Extraction
 1. Parse CLI output for structured information:
-   - Root cause analysis
+   - Root cause analysis (RCA)
   - Fix strategy and approach
   - Modification points (files, functions, line numbers)
-   - Expected outcome
+   - Expected outcome and verification steps
 2. Extract quantified requirements:
   - Number of files to modify
   - Specific functions to fix (with line numbers)
@@ -96,7 +95,7 @@ Phase 2: Results Parsing & Strategy Extraction
 3. Generate structured analysis report (iteration-N-analysis.md)

 Phase 3: Task JSON Generation
-1. Load task JSON template (defined below)
+1. Load task JSON template
 2. Populate template with parsed CLI results
 3. Add iteration context and previous attempts
 4. Write task JSON to .workflow/{session}/.task/IMPL-fix-N.json
@@ -105,9 +104,9 @@ Phase 3: Task JSON Generation

 ## Core Functions

-### 1. CLI Command Construction
+### 1. CLI Analysis Execution

-**Template-Based Approach with Test Layer Awareness**:
+**Template-Based Command Construction with Test Layer Awareness**:
 ```bash
 cd {project_root} && {cli_tool} -p "
 PURPOSE: Analyze {test_type} test failures and generate fix strategy for iteration {iteration}
@@ -151,8 +150,9 @@ const layerGuidance = {
 const guidance = layerGuidance[test_type] || "Analyze holistically, avoid quick patches";
 ```

-**Error Handling & Fallback**:
+**Error Handling & Fallback Strategy**:
 ```javascript
+// Primary execution with fallback chain
 try {
  result = executeCLI("gemini", config);
 } catch (error) {
@@ -173,16 +173,18 @@ try {
    throw error;
  }
 }
+
+// Fallback strategy when all CLI tools fail
+function generateBasicFixStrategy(failure_context) {
+  // Generate basic fix task based on error pattern matching
+  // Use previous successful fix patterns from fix-history.json
+  // Limit to simple, low-risk fixes (add null checks, fix typos)
+  // Mark task with meta.analysis_quality: "degraded" flag
+  // Orchestrator will treat degraded analysis with caution
+}
 ```

-**Fallback Strategy (When All CLI Tools Fail)**:
- Generate basic fix task based on error patterns matching
- Use previous successful fix patterns from fix-history.json
- Limit to simple, low-risk fixes (add null checks, fix typos)
- Mark task with `meta.analysis_quality: "degraded"` flag
- Orchestrator will treat degraded analysis with caution (may skip iteration)
-
-### 2. CLI Output Parsing
+### 2. Output Parsing & Task Generation

 **Expected CLI Output Structure** (from bug diagnosis template):
 ```markdown
@@ -220,18 +222,34 @@ try {
 ```javascript
 const parsedResults = {
  root_causes: extractSection("根本原因分析"),
-  modification_points: extractModificationPoints(),
+  modification_points: extractModificationPoints(),  // Returns: ["file:function:lines", ...]
  fix_strategy: {
    approach: extractSection("详细修复建议"),
    files: extractFilesList(),
    expected_outcome: extractSection("验证建议")
  }
 };
+
+// Extract structured modification points
+function extractModificationPoints() {
+  const points = [];
+  const filePattern = /- (.+?\.(?:ts|js|py)) \(lines (\d+-\d+)\): (.+)/g;
+
+  let match;
+  while ((match = filePattern.exec(cliOutput)) !== null) {
+    points.push({
+      file: match[1],
+      lines: match[2],
+      function: match[3],
+      formatted: `${match[1]}:${match[3]}:${match[2]}`
+    });
+  }
+
+  return points;
+}
 ```

-### 3. Task JSON Generation (Template Definition)
-
-**Task JSON Template for IMPL-fix-N** (Simplified):
+**Task JSON Generation** (Simplified Template):
 ```json
 {
  "id": "IMPL-fix-{iteration}",
@@ -284,9 +302,7 @@ const parsedResults = {
      {
        "step": "load_analysis_context",
        "action": "Load CLI analysis report for full failure context if needed",
-        "commands": [
-          "Read({meta.analysis_report})"
-        ],
+        "commands": ["Read({meta.analysis_report})"],
        "output_to": "full_failure_analysis",
        "note": "Analysis report contains: failed_tests, error_messages, pass_rate, root causes, previous_attempts"
      }
@@ -334,19 +350,17 @@ const parsedResults = {

 **Template Variables Replacement**:
 - `{iteration}`: From context.iteration
- `{test_type}`: Dominant test type from failed_tests (e.g., "integration", "unit")
+- `{test_type}`: Dominant test type from failed_tests
 - `{dominant_test_type}`: Most common test_type in failed_tests array
- `{layer_specific_approach}`: Guidance based on test layer from layerGuidance map
+- `{layer_specific_approach}`: Guidance from layerGuidance map
 - `{fix_summary}`: First 50 chars of fix_strategy.approach
 - `{failed_tests.length}`: Count of failures
 - `{modification_points.length}`: Count of modification points
- `{modification_points}`: Array of file:function:lines from parsed CLI output
+- `{modification_points}`: Array of file:function:lines
 - `{timestamp}`: ISO 8601 timestamp
- `{parent_task_id}`: ID of the parent test task (e.g., "IMPL-002")
- `{file1}`, `{file2}`, etc.: Specific file paths from modification_points
- `{specific_change_1}`, etc.: Change descriptions for each modification point
+- `{parent_task_id}`: ID of parent test task

-### 4. Analysis Report Generation
+### 3. Analysis Report Generation

 **Structure of iteration-N-analysis.md**:
 ```markdown
@@ -373,6 +387,7 @@ pass_rate: {pass_rate}%
 - **Error**: {test.error}
 - **File**: {test.file}:{test.line}
 - **Criticality**: {test.criticality}
+- **Test Type**: {test.test_type}
 {endforeach}

 ## Root Cause Analysis
@@ -403,15 +418,16 @@ See: `.process/iteration-{iteration}-cli-output.txt`

 ### CLI Execution Standards
 - **Timeout Management**: Use dynamic timeout (2400000ms = 40min for analysis)
- **Fallback Chain**: Gemini → Qwen (if Gemini fails with 429/404)
+- **Fallback Chain**: Gemini → Qwen → degraded mode (if both fail)
 - **Error Context**: Include full error details in failure reports
- **Output Preservation**: Save raw CLI output for debugging
+- **Output Preservation**: Save raw CLI output to .process/ for debugging

 ### Task JSON Standards
 - **Quantification**: All requirements must include counts and explicit lists
 - **Specificity**: Modification points must have file:function:line format
 - **Measurability**: Acceptance criteria must include verification commands
 - **Traceability**: Link to analysis reports and CLI output files
+- **Minimal Redundancy**: Use references (analysis_report) instead of embedding full context

 ### Analysis Report Standards
 - **Structured Format**: Use consistent markdown sections
@@ -430,19 +446,23 @@ See: `.process/iteration-{iteration}-cli-output.txt`
 - **Link files properly**: Use relative paths from session root
 - **Preserve CLI output**: Save raw output to .process/ for debugging
 - **Generate measurable acceptance criteria**: Include verification commands
+- **Apply layer-specific guidance**: Use test_type to customize analysis approach

 **NEVER:**
 - Execute tests directly (orchestrator manages test execution)
 - Skip CLI analysis (always run CLI even for simple failures)
 - Modify files directly (generate task JSON for @test-fix-agent to execute)
- **Embed redundant data in task JSON** (use analysis_report reference instead)
- **Copy input context verbatim to output** (creates data duplication)
+- Embed redundant data in task JSON (use analysis_report reference instead)
+- Copy input context verbatim to output (creates data duplication)
 - Generate vague modification points (always specify file:function:lines)
 - Exceed timeout limits (use configured timeout value)
+- Ignore test layer context (L0/L1/L2/L3 determines diagnosis approach)

-## CLI Tool Configuration
+## Configuration & Examples

-### Gemini Configuration
+### CLI Tool Configuration
+
+**Gemini Configuration**:
 ```javascript
 {
  "tool": "gemini",
@@ -452,11 +472,12 @@ See: `.process/iteration-{iteration}-cli-output.txt`
    "test-failure": "01-diagnose-bug-root-cause.txt",
    "coverage-gap": "02-analyze-code-patterns.txt",
    "regression": "01-trace-code-execution.txt"
-  }
+  },
+  "timeout": 2400000  // 40 minutes
 }
 ```

-### Qwen Configuration (Fallback)
+**Qwen Configuration (Fallback)**:
 ```javascript
 {
  "tool": "qwen",
@@ -464,11 +485,12 @@ See: `.process/iteration-{iteration}-cli-output.txt`
  "templates": {
    "test-failure": "01-diagnose-bug-root-cause.txt",
    "coverage-gap": "02-analyze-code-patterns.txt"
-  }
+  },
+  "timeout": 2400000  // 40 minutes
 }
 ```

-## Example Execution
+### Example Execution

 **Input Context**:
 ```json
@@ -494,26 +516,35 @@ See: `.process/iteration-{iteration}-cli-output.txt`
  "cli_config": {
    "tool": "gemini",
    "template": "01-diagnose-bug-root-cause.txt"
+  },
+  "task_config": {
+    "agent": "@test-fix-agent",
+    "type": "test-fix-iteration",
+    "max_iterations": 5
  }
 }
 ```

-**Execution Steps**:
-1. Detect test_type: "integration" → Apply integration-specific diagnosis
-2. Execute: `gemini -p "PURPOSE: Analyze integration test failure... [layer-specific context]"`
-   - CLI prompt includes: "Examine component interactions, data flow, interface contracts"
-   - Guidance: "Analyze full call stack and data flow across components"
-3. Parse: Extract RCA, 修复建议, 验证建议 sections
-4. Generate: IMPL-fix-1.json (SIMPLIFIED) with:
+**Execution Summary**:
+1. **Detect test_type**: "integration" → Apply integration-specific diagnosis
+2. **Execute CLI**:
+   ```bash
+   gemini -p "PURPOSE: Analyze integration test failure...
+   TASK: Examine component interactions, data flow, interface contracts...
+   RULES: Analyze full call stack and data flow across components"
+   ```
+3. **Parse Output**: Extract RCA, 修复建议, 验证建议 sections
+4. **Generate Task JSON** (IMPL-fix-1.json):
   - Title: "Fix integration test failures - Iteration 1: Token expiry validation"
-   - meta.analysis_report: ".process/iteration-1-analysis.md" (Reference, not embedded data)
+   - meta.analysis_report: ".process/iteration-1-analysis.md" (reference)
   - meta.test_layer: "integration"
-   - Requirements: "Fix 1 integration test failures by applying the provided fix strategy"
-   - fix_strategy.modification_points: ["src/auth/auth.service.ts:validateToken:45-60", "src/middleware/auth.middleware.ts:checkExpiry:120-135"]
+   - Requirements: "Fix 1 integration test failures by applying provided fix strategy"
+   - fix_strategy.modification_points:
+     - "src/auth/auth.service.ts:validateToken:45-60"
+     - "src/middleware/auth.middleware.ts:checkExpiry:120-135"
   - fix_strategy.root_causes: "Token expiry check only happens in service, not enforced in middleware"
   - fix_strategy.quality_assurance: {avoids_symptom_fix: true, addresses_root_cause: true}
-   - **NO failure_context object** - full context available via analysis_report reference
-5. Save: iteration-1-analysis.md with full CLI output, layer context, failed_tests details, previous_attempts
+5. **Save Analysis Report**: iteration-1-analysis.md with full CLI output, layer context, failed_tests details
 6. **Return**:
   ```javascript
   {