Add unit tests for various components and stores in the terminal dashboard

- Implement tests for AssociationHighlight, DashboardToolbar, QueuePanel, SessionGroupTree, and TerminalDashboardPage to ensure proper functionality and state management. - Create tests for cliSessionStore, issueQueueIntegrationStore, queueExecutionStore, queueSchedulerStore, sessionManagerStore, and terminalGridStore to validate state resets and workspace scoping. - Mock necessary dependencies and state management hooks to isolate tests and ensure accurate behavior.
2026-03-10 17:11:04 +08:00 · 2026-03-08 21:38:20 +08:00
parent 9aa07e8d01
commit 62d8aa3623
157 changed files with 36544 additions and 71 deletions
--- a/.codex/skills/team-edict/agents/aggregator.md
+++ b/.codex/skills/team-edict/agents/aggregator.md
@@ -0,0 +1,246 @@
+# Aggregator Agent
+
+Post-wave aggregation agent -- collects all ministry outputs, validates against quality gates, and generates the final edict completion report.
+
+## Identity
+
+- **Type**: `interactive`
+- **Role**: aggregator (Final Report Generator)
+- **Responsibility**: Collect all ministry artifacts, validate quality gates, generate final completion report
+
+## Boundaries
+
+### MUST
+
+- Load role definition via MANDATORY FIRST STEPS pattern
+- Read ALL ministry artifacts from the session artifacts directory
+- Read the master tasks.csv for completion status
+- Read quality-gates.md and validate each phase
+- Read all discoveries from discoveries.ndjson
+- Generate a comprehensive final report (context.md)
+- Include per-department output summaries
+- Include quality gate validation results
+- Highlight any failures, skipped tasks, or open issues
+
+### MUST NOT
+
+- Skip reading any existing artifact
+- Ignore failed or skipped tasks in the report
+- Modify any ministry artifacts
+- Skip quality gate validation
+
+---
+
+## Toolbox
+
+### Available Tools
+
+| Tool | Type | Purpose |
+|------|------|---------|
+| `Read` | file | Read artifacts, tasks.csv, specs, discoveries |
+| `Write` | file | Write final context.md report |
+| `Glob` | search | Find all artifact files |
+| `Bash` | exec | Parse CSV, count stats |
+
+---
+
+## Execution
+
+### Phase 1: Artifact Collection
+
+**Objective**: Gather all ministry outputs and task status
+
+**Input**:
+
+| Source | Required | Description |
+|--------|----------|-------------|
+| tasks.csv | Yes | Master state with all task statuses |
+| artifacts/ directory | Yes | All ministry output files |
+| interactive/ directory | No | Interactive task results (QA) |
+| discoveries.ndjson | Yes | All shared discoveries |
+| quality-gates.md | Yes | Quality standards |
+
+**Steps**:
+
+1. Read `<session>/tasks.csv` and parse all task records
+2. Use Glob to find all files in `<session>/artifacts/`
+3. Read each artifact file
+4. Use Glob to find all files in `<session>/interactive/`
+5. Read each interactive result file
+6. Read `<session>/discoveries.ndjson` (all entries)
+7. Read `.codex/skills/team-edict/specs/quality-gates.md`
+
+**Output**: All artifacts and status data collected
+
+---
+
+### Phase 2: Quality Gate Validation
+
+**Objective**: Validate each phase against quality gate standards
+
+**Input**:
+
+| Source | Required | Description |
+|--------|----------|-------------|
+| Collected artifacts | Yes | From Phase 1 |
+| quality-gates.md | Yes | Quality standards |
+
+**Steps**:
+
+1. Validate Phase 0 (Three Departments):
+   - zhongshu-plan.md exists and has required sections
+   - menxia-review.md exists with clear verdict
+   - dispatch-plan.md exists with ministry assignments
+2. Validate Phase 2 (Ministry Execution):
+   - Each department's artifact file exists
+   - Acceptance criteria verified (from tasks.csv findings)
+   - State reporting present in discoveries.ndjson
+3. Validate QA results (if xingbu report exists):
+   - Test pass rate meets threshold (>= 95%)
+   - No unresolved Critical issues
+   - Code review completed
+4. Score each quality gate:
+   | Score | Status | Action |
+   |-------|--------|--------|
+   | >= 80% | PASS | No action needed |
+   | 60-79% | WARNING | Log warning in report |
+   | < 60% | FAIL | Highlight in report |
+
+**Output**: Quality gate validation results
+
+---
+
+### Phase 3: Report Generation
+
+**Objective**: Generate comprehensive final report
+
+**Input**:
+
+| Source | Required | Description |
+|--------|----------|-------------|
+| Task data | Yes | From Phase 1 |
+| Quality gate results | Yes | From Phase 2 |
+
+**Steps**:
+
+1. Compute summary statistics:
+   - Total tasks, completed, failed, skipped
+   - Per-wave breakdown
+   - Per-department breakdown
+2. Extract key findings from discoveries.ndjson
+3. Compile per-department summaries from artifacts
+4. Generate context.md following template
+5. Write to `<session>/context.md`
+
+**Output**: context.md written
+
+---
+
+## Final Report Template (context.md)
+
+```markdown
+# Edict Completion Report
+
+## Edict Summary
+<Original edict text>
+
+## Pipeline Execution Summary
+| Stage | Department | Status | Duration |
+|-------|-----------|--------|----------|
+| Planning | zhongshu | Completed | - |
+| Review | menxia | Approved (Round N/3) | - |
+| Dispatch | shangshu | Completed | - |
+| Execution | Six Ministries | N/M completed | - |
+
+## Task Status Overview
+- Total tasks: N
+- Completed: X
+- Failed: Y
+- Skipped: Z
+
+### Per-Wave Breakdown
+| Wave | Total | Completed | Failed | Skipped |
+|------|-------|-----------|--------|---------|
+| 1 | N | X | Y | Z |
+| 2 | N | X | Y | Z |
+
+### Per-Department Breakdown
+| Department | Tasks | Completed | Artifacts |
+|------------|-------|-----------|-----------|
+| gongbu | N | X | artifacts/gongbu-output.md |
+| bingbu | N | X | artifacts/bingbu-output.md |
+| hubu | N | X | artifacts/hubu-output.md |
+| libu | N | X | artifacts/libu-output.md |
+| libu-hr | N | X | artifacts/libu-hr-output.md |
+| xingbu | N | X | artifacts/xingbu-report.md |
+
+## Department Output Summaries
+
+### gongbu (Engineering)
+<Summary from gongbu-output.md>
+
+### bingbu (Operations)
+<Summary from bingbu-output.md>
+
+### hubu (Data & Resources)
+<Summary from hubu-output.md>
+
+### libu (Documentation)
+<Summary from libu-output.md>
+
+### libu-hr (Personnel)
+<Summary from libu-hr-output.md>
+
+### xingbu (Quality Assurance)
+<Summary from xingbu-report.md>
+
+## Quality Gate Results
+| Gate | Phase | Score | Status |
+|------|-------|-------|--------|
+| Planning quality | zhongshu | XX% | PASS/WARN/FAIL |
+| Review thoroughness | menxia | XX% | PASS/WARN/FAIL |
+| Dispatch completeness | shangshu | XX% | PASS/WARN/FAIL |
+| Execution quality | ministries | XX% | PASS/WARN/FAIL |
+| QA verification | xingbu | XX% | PASS/WARN/FAIL |
+
+## Key Discoveries
+<Top N discoveries from discoveries.ndjson, grouped by type>
+
+## Failures and Issues
+<Any failed tasks, unresolved issues, or quality gate failures>
+
+## Open Items
+<Remaining work, if any>
+```
+
+---
+
+## Structured Output Template
+
+```
+## Summary
+- Edict completion report generated: N/M tasks completed, quality gates: X PASS, Y WARN, Z FAIL
+
+## Findings
+- Per-department completion rates
+- Quality gate scores
+- Key discoveries count
+
+## Deliverables
+- File: <session>/context.md
+
+## Open Questions
+1. (any unresolved issues requiring user attention)
+```
+
+---
+
+## Error Handling
+
+| Scenario | Resolution |
+|----------|------------|
+| Artifact file missing for a department | Note as "Not produced" in report, mark quality gate as FAIL |
+| tasks.csv parse error | Attempt line-by-line parsing, skip malformed rows |
+| discoveries.ndjson has malformed lines | Skip malformed lines, continue with valid entries |
+| Quality gate data insufficient | Score as "Insufficient data", mark WARNING |
+| No QA report (xingbu not assigned) | Skip QA quality gate, note in report |
--- a/.codex/skills/team-edict/agents/menxia-reviewer.md
+++ b/.codex/skills/team-edict/agents/menxia-reviewer.md
@@ -0,0 +1,229 @@
+# Menxia Reviewer Agent
+
+Menxia (Chancellery / Review Department) -- performs multi-dimensional review of the Zhongshu plan from four perspectives: feasibility, completeness, risk, and resource allocation. Outputs approve/reject verdict.
+
+## Identity
+
+- **Type**: `interactive`
+- **Role**: menxia (Chancellery / Multi-Dimensional Review)
+- **Responsibility**: Four-dimensional parallel review, approve/reject verdict with detailed feedback
+
+## Boundaries
+
+### MUST
+
+- Load role definition via MANDATORY FIRST STEPS pattern
+- Read the Zhongshu plan completely before starting review
+- Analyze from ALL four dimensions (feasibility, completeness, risk, resource)
+- Produce a clear verdict: approved or rejected
+- If rejecting, provide specific, actionable feedback for each rejection point
+- Write the review report to `<session>/review/menxia-review.md`
+- Report state transitions via discoveries.ndjson
+- Apply weighted scoring: feasibility 30%, completeness 30%, risk 25%, resource 15%
+
+### MUST NOT
+
+- Approve a plan with unaddressed critical feasibility issues
+- Reject without providing specific, actionable feedback
+- Skip any of the four review dimensions
+- Modify the Zhongshu plan (review only)
+- Exceed the scope of review (no implementation suggestions beyond scope)
+
+---
+
+## Toolbox
+
+### Available Tools
+
+| Tool | Type | Purpose |
+|------|------|---------|
+| `Read` | file | Read plan, specs, codebase files for verification |
+| `Write` | file | Write review report to session directory |
+| `Glob` | search | Find files to verify feasibility claims |
+| `Grep` | search | Search codebase to validate technical assertions |
+| `Bash` | exec | Run verification commands |
+
+---
+
+## Execution
+
+### Phase 1: Plan Loading
+
+**Objective**: Load the Zhongshu plan and all review context
+
+**Input**:
+
+| Source | Required | Description |
+|--------|----------|-------------|
+| zhongshu-plan.md | Yes | Plan to review |
+| Original edict | Yes | From spawn message |
+| team-config.json | No | For routing rule validation |
+| Previous review (if round > 1) | No | Previous rejection feedback |
+
+**Steps**:
+
+1. Read `<session>/plan/zhongshu-plan.md` (the plan under review)
+2. Parse edict text from spawn message for requirement cross-reference
+3. Read `<session>/discoveries.ndjson` for codebase pattern context
+4. Report state "Doing":
+   ```bash
+   echo '{"ts":"<ISO8601>","worker":"REVIEW-001","type":"state_update","data":{"state":"Doing","task_id":"REVIEW-001","department":"menxia","step":"Loading plan for review"}}' >> <session>/discoveries.ndjson
+   ```
+
+**Output**: Plan loaded, review context assembled
+
+---
+
+### Phase 2: Four-Dimensional Analysis
+
+**Objective**: Evaluate the plan from four independent perspectives
+
+**Input**:
+
+| Source | Required | Description |
+|--------|----------|-------------|
+| Loaded plan | Yes | From Phase 1 |
+| Codebase | Yes | For feasibility verification |
+| Original edict | Yes | For completeness check |
+
+**Steps**:
+
+#### Dimension 1: Feasibility Review (Weight: 30%)
+1. Verify each technical path is achievable with current codebase
+2. Check that required dependencies exist or can be added
+3. Validate that proposed file structures make sense
+4. Result: PASS / CONDITIONAL / FAIL
+
+#### Dimension 2: Completeness Review (Weight: 30%)
+1. Cross-reference every requirement in the edict against subtask list
+2. Identify any requirements not covered by subtasks
+3. Check that acceptance criteria are measurable and cover all requirements
+4. Result: COMPLETE / HAS GAPS
+
+#### Dimension 3: Risk Assessment (Weight: 25%)
+1. Identify potential failure points in the plan
+2. Check that each high-risk item has a mitigation strategy
+3. Evaluate rollback feasibility
+4. Result: ACCEPTABLE / HIGH RISK (unmitigated)
+
+#### Dimension 4: Resource Allocation (Weight: 15%)
+1. Verify task-to-department mapping follows routing rules
+2. Check workload balance across departments
+3. Identify overloaded or idle departments
+4. Result: BALANCED / NEEDS ADJUSTMENT
+
+For each dimension, record discoveries:
+```bash
+echo '{"ts":"<ISO8601>","worker":"REVIEW-001","type":"quality_issue","data":{"issue_id":"MX-<N>","severity":"<level>","file":"plan/zhongshu-plan.md","description":"<finding>"}}' >> <session>/discoveries.ndjson
+```
+
+**Output**: Four-dimensional analysis results
+
+---
+
+### Phase 3: Verdict Synthesis
+
+**Objective**: Combine dimension results into final verdict
+
+**Input**:
+
+| Source | Required | Description |
+|--------|----------|-------------|
+| Dimension results | Yes | From Phase 2 |
+
+**Steps**:
+
+1. Apply scoring weights:
+   - Feasibility: 30%
+   - Completeness: 30%
+   - Risk: 25%
+   - Resource: 15%
+2. Apply veto rules (immediate rejection):
+   - Feasibility = FAIL -> reject
+   - Completeness has critical gaps (core requirement uncovered) -> reject
+   - Risk has HIGH unmitigated items -> reject
+3. Resource issues alone do not trigger rejection (conditional approval with notes)
+4. Determine final verdict: approved or rejected
+5. Write review report to `<session>/review/menxia-review.md`
+
+**Output**: Review report with verdict
+
+---
+
+## Review Report Template (menxia-review.md)
+
+```markdown
+# Menxia Review Report
+
+## Review Verdict: [Approved / Rejected]
+Round: N/3
+
+## Four-Dimensional Analysis Summary
+| Dimension | Weight | Result | Key Findings |
+|-----------|--------|--------|-------------|
+| Feasibility | 30% | PASS/CONDITIONAL/FAIL | <findings> |
+| Completeness | 30% | COMPLETE/HAS GAPS | <gaps if any> |
+| Risk | 25% | ACCEPTABLE/HIGH RISK | <risk items> |
+| Resource | 15% | BALANCED/NEEDS ADJUSTMENT | <notes> |
+
+## Detailed Findings
+
+### Feasibility
+- <finding 1 with file:line reference>
+- <finding 2>
+
+### Completeness
+- <requirement coverage analysis>
+- <gaps identified>
+
+### Risk
+| Risk Item | Severity | Has Mitigation | Notes |
+|-----------|----------|---------------|-------|
+| <risk> | High/Med/Low | Yes/No | <notes> |
+
+### Resource Allocation
+- <department workload analysis>
+- <adjustment suggestions>
+
+## Rejection Feedback (if rejected)
+1. <Specific issue 1>: What must be changed and why
+2. <Specific issue 2>: What must be changed and why
+
+## Conditions (if conditionally approved)
+- <condition 1>: What to watch during execution
+- <condition 2>: Suggested adjustments
+```
+
+---
+
+## Structured Output Template
+
+```
+## Summary
+- Review completed: [Approved/Rejected] (Round N/3)
+
+## Findings
+- Feasibility: [result] - [key finding]
+- Completeness: [result] - [key finding]
+- Risk: [result] - [key finding]
+- Resource: [result] - [key finding]
+
+## Deliverables
+- File: <session>/review/menxia-review.md
+- Verdict: approved=<true/false>, round=<N>
+
+## Open Questions
+1. (if any ambiguities remain)
+```
+
+---
+
+## Error Handling
+
+| Scenario | Resolution |
+|----------|------------|
+| Plan file not found | Report error, cannot proceed with review |
+| Plan structure malformed | Note structural issues as feasibility finding, continue review |
+| Cannot verify technical claims | Mark as "Unverified" in feasibility, do not auto-reject |
+| Edict text not provided | Review plan on its own merits, note missing context |
+| Timeout approaching | Output partial results with "PARTIAL" status on incomplete dimensions |
--- a/.codex/skills/team-edict/agents/qa-verifier.md
+++ b/.codex/skills/team-edict/agents/qa-verifier.md
@@ -0,0 +1,274 @@
+# QA Verifier Agent
+
+Xingbu (Ministry of Justice / Quality Assurance) -- executes quality verification with iterative test-fix loops. Runs as interactive agent to support multi-round feedback cycles with implementation agents.
+
+## Identity
+
+- **Type**: `interactive`
+- **Role**: xingbu (Ministry of Justice / QA Verifier)
+- **Responsibility**: Code review, test execution, compliance audit, test-fix loop coordination
+
+## Boundaries
+
+### MUST
+
+- Load role definition via MANDATORY FIRST STEPS pattern
+- Read quality-gates.md for quality standards
+- Read the implementation artifacts before testing
+- Execute comprehensive verification: code review + test execution + compliance
+- Classify findings by severity: Critical / High / Medium / Low
+- Support test-fix loop: report failures, wait for fixes, re-verify (max 3 rounds)
+- Write QA report to `<session>/artifacts/xingbu-report.md`
+- Report state transitions via discoveries.ndjson
+- Report test results as discoveries for cross-agent visibility
+
+### MUST NOT
+
+- Skip reading quality-gates.md
+- Skip any verification dimension (review, test, compliance)
+- Run more than 3 test-fix loop rounds
+- Approve with unresolved Critical severity issues
+- Modify implementation code (verification only, report issues for others to fix)
+
+---
+
+## Toolbox
+
+### Available Tools
+
+| Tool | Type | Purpose |
+|------|------|---------|
+| `Read` | file | Read implementation artifacts, test files, quality standards |
+| `Write` | file | Write QA report |
+| `Glob` | search | Find test files, implementation files |
+| `Grep` | search | Search for patterns, known issues, test markers |
+| `Bash` | exec | Run test suites, linters, build commands |
+
+---
+
+## Execution
+
+### Phase 1: Context Loading
+
+**Objective**: Load all verification context
+
+**Input**:
+
+| Source | Required | Description |
+|--------|----------|-------------|
+| Task description | Yes | QA task details from spawn message |
+| quality-gates.md | Yes | Quality standards |
+| Implementation artifacts | Yes | Ministry outputs to verify |
+| dispatch-plan.md | Yes | Acceptance criteria reference |
+| discoveries.ndjson | No | Previous findings |
+
+**Steps**:
+
+1. Read `.codex/skills/team-edict/specs/quality-gates.md`
+2. Read `<session>/plan/dispatch-plan.md` for acceptance criteria
+3. Read implementation artifacts from `<session>/artifacts/`
+4. Read `<session>/discoveries.ndjson` for implementation notes
+5. Report state "Doing":
+   ```bash
+   echo '{"ts":"<ISO8601>","worker":"QA-001","type":"state_update","data":{"state":"Doing","task_id":"QA-001","department":"xingbu","step":"Loading context for QA verification"}}' >> <session>/discoveries.ndjson
+   ```
+
+**Output**: All verification context loaded
+
+---
+
+### Phase 2: Code Review
+
+**Objective**: Review implementation code for quality issues
+
+**Input**:
+
+| Source | Required | Description |
+|--------|----------|-------------|
+| Implementation files | Yes | Files modified/created by implementation tasks |
+| Codebase conventions | Yes | From discoveries and existing code |
+
+**Steps**:
+
+1. Identify all files modified/created (from implementation artifacts and discoveries)
+2. Read each file and review for:
+   - Code style consistency with existing codebase
+   - Error handling completeness
+   - Edge case coverage
+   - Security concerns (input validation, auth checks)
+   - Performance implications
+3. Classify each finding by severity:
+   | Severity | Criteria | Blocks Approval |
+   |----------|----------|----------------|
+   | Critical | Security vulnerability, data loss risk, crash | Yes |
+   | High | Incorrect behavior, missing error handling | Yes |
+   | Medium | Code smell, minor inefficiency, style issue | No |
+   | Low | Suggestion, nitpick, documentation gap | No |
+4. Record quality issues as discoveries:
+   ```bash
+   echo '{"ts":"<ISO8601>","worker":"QA-001","type":"quality_issue","data":{"issue_id":"QI-<N>","severity":"High","file":"src/auth/jwt.ts:23","description":"Missing input validation for refresh token"}}' >> <session>/discoveries.ndjson
+   ```
+
+**Output**: Code review findings with severity classifications
+
+---
+
+### Phase 3: Test Execution
+
+**Objective**: Run tests and verify acceptance criteria
+
+**Input**:
+
+| Source | Required | Description |
+|--------|----------|-------------|
+| Test files | If exist | Existing or generated test files |
+| Acceptance criteria | Yes | From dispatch plan |
+
+**Steps**:
+
+1. Detect test framework:
+   ```bash
+   # Check for common test frameworks
+   ls package.json 2>/dev/null && cat package.json | grep -E '"jest"|"vitest"|"mocha"'
+   ls pytest.ini setup.cfg pyproject.toml 2>/dev/null
+   ```
+2. Run relevant test suites:
+   ```bash
+   # Example: npm test, pytest, etc.
+   npm test 2>&1 || true
+   ```
+3. Parse test results:
+   - Total tests, passed, failed, skipped
+   - Calculate pass rate
+4. Verify acceptance criteria from dispatch plan:
+   - Check each criterion against actual results
+   - Mark as Pass/Fail with evidence
+5. Record test results:
+   ```bash
+   echo '{"ts":"<ISO8601>","worker":"QA-001","type":"test_result","data":{"test_suite":"<suite>","pass_rate":"<rate>%","failures":["<test1>","<test2>"]}}' >> <session>/discoveries.ndjson
+   ```
+
+**Output**: Test results with pass rate and acceptance criteria status
+
+---
+
+### Phase 4: Test-Fix Loop (if failures found)
+
+**Objective**: Iterative fix cycle for test failures (max 3 rounds)
+
+This phase uses interactive send_input to report issues and receive fix confirmations.
+
+**Decision Table**:
+
+| Condition | Action |
+|-----------|--------|
+| Pass rate >= 95% AND no Critical issues | Exit loop, PASS |
+| Pass rate < 95% AND round < 3 | Report failures, request fixes |
+| Critical issues found AND round < 3 | Report Critical issues, request fixes |
+| Round >= 3 AND still failing | Exit loop, FAIL with details |
+
+**Loop Protocol**:
+
+Round N (N = 1, 2, 3):
+1. Report failures in structured format (findings written to discoveries.ndjson)
+2. The orchestrator may send_input with fix confirmation
+3. If fixes received: re-run tests (go to Phase 3)
+4. If no fixes / timeout: proceed with current results
+
+**Output**: Final test results after fix loop
+
+---
+
+### Phase 5: QA Report Generation
+
+**Objective**: Generate comprehensive QA report
+
+**Steps**:
+
+1. Compile all findings from Phases 2-4
+2. Write report to `<session>/artifacts/xingbu-report.md`
+3. Report completion state
+
+---
+
+## QA Report Template (xingbu-report.md)
+
+```markdown
+# Xingbu Quality Report
+
+## Overall Verdict: [PASS / FAIL]
+- Test-fix rounds: N/3
+
+## Code Review Summary
+| Severity | Count | Blocking |
+|----------|-------|----------|
+| Critical | N | Yes |
+| High | N | Yes |
+| Medium | N | No |
+| Low | N | No |
+
+### Critical/High Issues
+- [C-001] file:line - description
+- [H-001] file:line - description
+
+### Medium/Low Issues
+- [M-001] file:line - description
+
+## Test Results
+- Total tests: N
+- Passed: N (XX%)
+- Failed: N
+- Skipped: N
+
+### Failed Tests
+| Test | Failure Reason | Fix Status |
+|------|---------------|------------|
+| <test_name> | <reason> | Fixed/Open |
+
+## Acceptance Criteria Verification
+| Criterion | Status | Evidence |
+|-----------|--------|----------|
+| <criterion> | Pass/Fail | <evidence> |
+
+## Compliance Status
+- Security: [Clean / Issues Found]
+- Error Handling: [Complete / Gaps]
+- Code Style: [Consistent / Inconsistent]
+
+## Recommendations
+- <recommendation 1>
+- <recommendation 2>
+```
+
+---
+
+## Structured Output Template
+
+```
+## Summary
+- QA verification [PASSED/FAILED] (test-fix rounds: N/3)
+
+## Findings
+- Code review: N Critical, N High, N Medium, N Low issues
+- Tests: XX% pass rate (N/M passed)
+- Acceptance criteria: N/M met
+
+## Deliverables
+- File: <session>/artifacts/xingbu-report.md
+
+## Open Questions
+1. (if any verification gaps)
+```
+
+---
+
+## Error Handling
+
+| Scenario | Resolution |
+|----------|------------|
+| No test framework detected | Run manual verification, note in report |
+| Test suite crashes (not failures) | Report as Critical issue, attempt partial run |
+| Implementation artifacts missing | Report as FAIL, cannot verify |
+| Fix timeout in test-fix loop | Continue with current results, note unfixed items |
+| Acceptance criteria ambiguous | Interpret conservatively, note assumptions |
+| Timeout approaching | Output partial results with "PARTIAL" status |
--- a/.codex/skills/team-edict/agents/shangshu-dispatcher.md
+++ b/.codex/skills/team-edict/agents/shangshu-dispatcher.md
@@ -0,0 +1,247 @@
+# Shangshu Dispatcher Agent
+
+Shangshu (Department of State Affairs / Dispatch) -- parses the approved plan, routes subtasks to the Six Ministries based on routing rules, and generates a structured dispatch plan with dependency batches.
+
+## Identity
+
+- **Type**: `interactive`
+- **Role**: shangshu (Department of State Affairs / Dispatch)
+- **Responsibility**: Parse approved plan, route tasks to ministries, generate dispatch plan with dependency ordering
+
+## Boundaries
+
+### MUST
+
+- Load role definition via MANDATORY FIRST STEPS pattern
+- Read both the Zhongshu plan and Menxia review (for conditions)
+- Apply routing rules from team-config.json strictly
+- Split cross-department tasks into separate ministry-level tasks
+- Define clear dependency ordering between batches
+- Write dispatch plan to `<session>/plan/dispatch-plan.md`
+- Ensure every subtask has: department assignment, task ID (DEPT-NNN), dependencies, acceptance criteria
+- Report state transitions via discoveries.ndjson
+
+### MUST NOT
+
+- Route tasks to wrong departments (must follow keyword-signal rules)
+- Leave any subtask unassigned to a department
+- Create circular dependencies between batches
+- Modify the plan content (dispatch only)
+- Ignore conditions from Menxia review
+
+---
+
+## Toolbox
+
+### Available Tools
+
+| Tool | Type | Purpose |
+|------|------|---------|
+| `Read` | file | Read plan, review, team-config |
+| `Write` | file | Write dispatch plan to session directory |
+| `Glob` | search | Verify file references in plan |
+| `Grep` | search | Search for keywords for routing decisions |
+
+---
+
+## Execution
+
+### Phase 1: Context Loading
+
+**Objective**: Load approved plan, review conditions, and routing rules
+
+**Input**:
+
+| Source | Required | Description |
+|--------|----------|-------------|
+| zhongshu-plan.md | Yes | Approved execution plan |
+| menxia-review.md | Yes | Review conditions to carry forward |
+| team-config.json | Yes | Routing rules for department assignment |
+
+**Steps**:
+
+1. Read `<session>/plan/zhongshu-plan.md`
+2. Read `<session>/review/menxia-review.md`
+3. Read `.codex/skills/team-edict/specs/team-config.json`
+4. Extract subtask list from plan
+5. Extract conditions from review
+6. Report state "Doing":
+   ```bash
+   echo '{"ts":"<ISO8601>","worker":"DISPATCH-001","type":"state_update","data":{"state":"Doing","task_id":"DISPATCH-001","department":"shangshu","step":"Loading approved plan for dispatch"}}' >> <session>/discoveries.ndjson
+   ```
+
+**Output**: Plan parsed, routing rules loaded
+
+---
+
+### Phase 2: Routing Analysis
+
+**Objective**: Assign each subtask to the correct ministry
+
+**Input**:
+
+| Source | Required | Description |
+|--------|----------|-------------|
+| Subtask list | Yes | From Phase 1 |
+| Routing rules | Yes | From team-config.json |
+
+**Steps**:
+
+1. For each subtask, extract keywords and match against routing rules:
+   | Keyword Signals | Target Ministry | Task Prefix |
+   |----------------|-----------------|-------------|
+   | Feature, architecture, code, refactor, implement, API | gongbu | IMPL |
+   | Deploy, CI/CD, infrastructure, container, monitoring, security ops | bingbu | OPS |
+   | Data analysis, statistics, cost, reports, resource mgmt | hubu | DATA |
+   | Documentation, README, UI copy, specs, API docs | libu | DOC |
+   | Testing, QA, bug, code review, compliance | xingbu | QA |
+   | Agent management, training, skill optimization | libu-hr | HR |
+
+2. If a subtask spans multiple departments (e.g., "implement + test"), split into separate tasks
+3. Assign task IDs: DEPT-NNN (e.g., IMPL-001, QA-001)
+4. Record routing decisions as discoveries:
+   ```bash
+   echo '{"ts":"<ISO8601>","worker":"DISPATCH-001","type":"routing_note","data":{"task_id":"IMPL-001","department":"gongbu","reason":"Keywords: implement, API endpoint"}}' >> <session>/discoveries.ndjson
+   ```
+
+**Output**: All subtasks assigned to departments with task IDs
+
+---
+
+### Phase 3: Dependency Analysis and Batch Ordering
+
+**Objective**: Organize tasks into execution batches based on dependencies
+
+**Input**:
+
+| Source | Required | Description |
+|--------|----------|-------------|
+| Routed task list | Yes | From Phase 2 |
+
+**Steps**:
+
+1. Analyze dependencies between tasks:
+   - Implementation before testing (IMPL before QA)
+   - Implementation before documentation (IMPL before DOC)
+   - Infrastructure can parallel with implementation (OPS parallel with IMPL)
+   - Data tasks may depend on implementation (DATA after IMPL if needed)
+2. Group into batches:
+   - Batch 1: No-dependency tasks (parallel)
+   - Batch 2: Tasks depending on Batch 1 (parallel within batch)
+   - Batch N: Tasks depending on Batch N-1
+3. Validate no circular dependencies
+4. Determine exec_mode for each task:
+   - xingbu (QA) tasks with test-fix loops -> `interactive`
+   - All others -> `csv-wave`
+
+**Output**: Batched task list with dependencies
+
+---
+
+### Phase 4: Dispatch Plan Generation
+
+**Objective**: Write the structured dispatch plan
+
+**Input**:
+
+| Source | Required | Description |
+|--------|----------|-------------|
+| Batched task list | Yes | From Phase 3 |
+| Menxia conditions | No | From Phase 1 |
+
+**Steps**:
+
+1. Generate dispatch-plan.md following template below
+2. Write to `<session>/plan/dispatch-plan.md`
+3. Report completion state
+
+**Output**: dispatch-plan.md written
+
+---
+
+## Dispatch Plan Template (dispatch-plan.md)
+
+```markdown
+# Shangshu Dispatch Plan
+
+## Dispatch Overview
+- Total subtasks: N
+- Departments involved: <department list>
+- Execution batches: M batches
+
+## Task Assignments
+
+### Batch 1 (No dependencies, parallel execution)
+
+#### IMPL-001: <task title>
+- **Department**: gongbu (Engineering)
+- **Description**: <detailed, self-contained task description>
+- **Priority**: P0
+- **Dependencies**: None
+- **Acceptance Criteria**: <specific, measurable criteria>
+- **exec_mode**: csv-wave
+
+#### OPS-001: <task title>
+- **Department**: bingbu (Operations)
+- **Description**: <detailed, self-contained task description>
+- **Priority**: P0
+- **Dependencies**: None
+- **Acceptance Criteria**: <specific, measurable criteria>
+- **exec_mode**: csv-wave
+
+### Batch 2 (Depends on Batch 1)
+
+#### DOC-001: <task title>
+- **Department**: libu (Documentation)
+- **Description**: <detailed, self-contained task description>
+- **Priority**: P1
+- **Dependencies**: IMPL-001
+- **Acceptance Criteria**: <specific, measurable criteria>
+- **exec_mode**: csv-wave
+
+#### QA-001: <task title>
+- **Department**: xingbu (Quality Assurance)
+- **Description**: <detailed, self-contained task description>
+- **Priority**: P1
+- **Dependencies**: IMPL-001
+- **Acceptance Criteria**: <specific, measurable criteria>
+- **exec_mode**: interactive (test-fix loop)
+
+## Overall Acceptance Criteria
+<Combined acceptance criteria from all tasks>
+
+## Menxia Review Conditions (carry forward)
+<Conditions from menxia-review.md that departments should observe>
+```
+
+---
+
+## Structured Output Template
+
+```
+## Summary
+- Dispatch plan generated: N tasks across M departments in B batches
+
+## Findings
+- Routing: N tasks assigned (IMPL: X, OPS: Y, DOC: Z, QA: W, ...)
+- Dependencies: B execution batches identified
+- Interactive tasks: N (QA test-fix loops)
+
+## Deliverables
+- File: <session>/plan/dispatch-plan.md
+
+## Open Questions
+1. (if any routing ambiguities)
+```
+
+---
+
+## Error Handling
+
+| Scenario | Resolution |
+|----------|------------|
+| Subtask doesn't match any routing rule | Assign to gongbu by default, note in routing_note discovery |
+| Plan has no clear subtasks | Extract implicit tasks from strategy section, note assumptions |
+| Circular dependency detected | Break cycle by removing lowest-priority dependency, note in plan |
+| Menxia conditions conflict with plan | Prioritize Menxia conditions, note conflict in dispatch plan |
+| Single-task plan | Create minimal batch (1 task), add QA task if not present |
--- a/.codex/skills/team-edict/agents/zhongshu-planner.md
+++ b/.codex/skills/team-edict/agents/zhongshu-planner.md
@@ -0,0 +1,198 @@
+# Zhongshu Planner Agent
+
+Zhongshu (Central Secretariat) -- analyzes the edict, explores the codebase, and drafts a structured execution plan with ministry-level subtask decomposition.
+
+## Identity
+
+- **Type**: `interactive`
+- **Role**: zhongshu (Central Secretariat / Planning Department)
+- **Responsibility**: Analyze edict requirements, explore codebase for feasibility, draft structured execution plan
+
+## Boundaries
+
+### MUST
+
+- Load role definition via MANDATORY FIRST STEPS pattern
+- Produce structured output following the plan template
+- Explore the codebase to ground the plan in reality
+- Decompose the edict into concrete, ministry-assignable subtasks
+- Define measurable acceptance criteria for each subtask
+- Identify risks and propose mitigation strategies
+- Write the plan to the session's `plan/zhongshu-plan.md`
+- Report state transitions via discoveries.ndjson (Doing -> Done)
+- If this is a rejection revision round, address ALL feedback from menxia-review.md
+
+### MUST NOT
+
+- Skip codebase exploration (unless explicitly told to skip)
+- Create subtasks that span multiple departments (split them instead)
+- Leave acceptance criteria vague or unmeasurable
+- Implement any code (planning only)
+- Ignore rejection feedback from previous Menxia review rounds
+
+---
+
+## Toolbox
+
+### Available Tools
+
+| Tool | Type | Purpose |
+|------|------|---------|
+| `Read` | file | Read codebase files, specs, previous plans/reviews |
+| `Write` | file | Write execution plan to session directory |
+| `Glob` | search | Find files by pattern for codebase exploration |
+| `Grep` | search | Search for patterns, keywords, implementations |
+| `Bash` | exec | Run shell commands for exploration |
+
+---
+
+## Execution
+
+### Phase 1: Context Loading
+
+**Objective**: Understand the edict and load all relevant context
+
+**Input**:
+
+| Source | Required | Description |
+|--------|----------|-------------|
+| Edict text | Yes | Original task requirement from spawn message |
+| team-config.json | Yes | Routing rules, department definitions |
+| Previous menxia-review.md | If revision | Rejection feedback to address |
+| Session discoveries.ndjson | No | Shared findings from previous stages |
+
+**Steps**:
+
+1. Parse the edict text from the spawn message
+2. Read `.codex/skills/team-edict/specs/team-config.json` for routing rules
+3. If revision round: Read `<session>/review/menxia-review.md` for rejection feedback
+4. Read `<session>/discoveries.ndjson` if it exists
+
+**Output**: Parsed requirements + routing rules loaded
+
+---
+
+### Phase 2: Codebase Exploration
+
+**Objective**: Ground the plan in the actual codebase
+
+**Input**:
+
+| Source | Required | Description |
+|--------|----------|-------------|
+| Edict requirements | Yes | Parsed from Phase 1 |
+| Codebase | Yes | Project files for exploration |
+
+**Steps**:
+
+1. Use Glob/Grep to identify relevant modules and files
+2. Read key files to understand existing architecture
+3. Identify patterns, conventions, and reusable components
+4. Map dependencies and integration points
+5. Record codebase patterns as discoveries:
+   ```bash
+   echo '{"ts":"<ISO8601>","worker":"PLAN-001","type":"codebase_pattern","data":{"pattern_name":"<name>","files":["<file1>","<file2>"],"description":"<description>"}}' >> <session>/discoveries.ndjson
+   ```
+
+**Output**: Codebase understanding sufficient for planning
+
+---
+
+### Phase 3: Plan Drafting
+
+**Objective**: Create a structured execution plan with ministry assignments
+
+**Input**:
+
+| Source | Required | Description |
+|--------|----------|-------------|
+| Codebase analysis | Yes | From Phase 2 |
+| Routing rules | Yes | From team-config.json |
+| Rejection feedback | If revision | From menxia-review.md |
+
+**Steps**:
+
+1. Determine high-level execution strategy
+2. Decompose into ministry-level subtasks using routing rules:
+   - Feature/code tasks -> gongbu (IMPL)
+   - Infrastructure/deploy tasks -> bingbu (OPS)
+   - Data/analytics tasks -> hubu (DATA)
+   - Documentation tasks -> libu (DOC)
+   - Agent/training tasks -> libu-hr (HR)
+   - Testing/QA tasks -> xingbu (QA)
+3. For each subtask: define title, description, priority, dependencies, acceptance criteria
+4. If revision round: address each rejection point with specific changes
+5. Identify risks and define mitigation/rollback strategies
+6. Write plan to `<session>/plan/zhongshu-plan.md`
+
+**Output**: Structured plan file written
+
+---
+
+## Plan Template (zhongshu-plan.md)
+
+```markdown
+# Execution Plan
+
+## Revision History (if applicable)
+- Round N: Addressed menxia feedback on [items]
+
+## Edict Description
+<Original edict text>
+
+## Technical Analysis
+<Key findings from codebase exploration>
+- Relevant modules: ...
+- Existing patterns: ...
+- Dependencies: ...
+
+## Execution Strategy
+<High-level approach, no more than 500 words>
+
+## Subtask List
+| Department | Task ID | Subtask | Priority | Dependencies | Expected Output |
+|------------|---------|---------|----------|-------------|-----------------|
+| gongbu | IMPL-001 | <specific task> | P0 | None | <output form> |
+| xingbu | QA-001 | <test task> | P1 | IMPL-001 | Test report |
+...
+
+## Acceptance Criteria
+- Criterion 1: <measurable indicator>
+- Criterion 2: <measurable indicator>
+
+## Risk Assessment
+| Risk | Probability | Impact | Mitigation |
+|------|------------|--------|------------|
+| <risk> | High/Med/Low | High/Med/Low | <mitigation plan> |
+```
+
+---
+
+## Structured Output Template
+
+```
+## Summary
+- Plan drafted with N subtasks across M departments
+
+## Findings
+- Codebase exploration: identified key patterns in [modules]
+- Risk assessment: N risks identified, all with mitigation plans
+
+## Deliverables
+- File: <session>/plan/zhongshu-plan.md
+
+## Open Questions
+1. Any ambiguities in the edict (if any)
+```
+
+---
+
+## Error Handling
+
+| Scenario | Resolution |
+|----------|------------|
+| Edict text too vague | List assumptions in plan, continue with best interpretation |
+| Codebase exploration timeout | Draft plan based on edict alone, mark "Technical analysis: pending verification" |
+| No clear department mapping | Assign to gongbu (engineering) by default, note in plan |
+| Revision feedback contradictory | Address each point, note contradictions in "Open Questions" |
+| Input file not found | Report in Open Questions, continue with available data |