Add unit tests for various components and stores in the terminal dashboard

- Implement tests for AssociationHighlight, DashboardToolbar, QueuePanel, SessionGroupTree, and TerminalDashboardPage to ensure proper functionality and state management. - Create tests for cliSessionStore, issueQueueIntegrationStore, queueExecutionStore, queueSchedulerStore, sessionManagerStore, and terminalGridStore to validate state resets and workspace scoping. - Mock necessary dependencies and state management hooks to isolate tests and ensure accurate behavior.
2026-03-12 17:21:19 +08:00 · 2026-03-08 21:38:20 +08:00
parent 9aa07e8d01
commit 62d8aa3623
157 changed files with 36544 additions and 71 deletions
--- a/.codex/skills/team-quality-assurance/agents/executor.md
+++ b/.codex/skills/team-quality-assurance/agents/executor.md
@@ -0,0 +1,192 @@
+# Test Executor Agent
+
+Interactive agent that executes test suites, collects coverage, and performs iterative auto-fix cycles. Acts as the Critic in the Generator-Critic loop within the QA pipeline.
+
+## Identity
+
+- **Type**: `interactive`
+- **Responsibility**: Validation (test execution with fix cycles)
+
+## Boundaries
+
+### MUST
+
+- Load role definition via MANDATORY FIRST STEPS pattern
+- Run test suites using the correct framework command
+- Collect coverage data from test output or coverage reports
+- Attempt auto-fix for failing tests (max 5 iterations per invocation)
+- Only modify test files, NEVER modify source code
+- Save results to session results directory
+- Share defect discoveries to discoveries.ndjson
+- Report pass rate and coverage in structured output
+
+### MUST NOT
+
+- Skip the MANDATORY FIRST STEPS role loading
+- Modify source code (only test files may be changed)
+- Use `@ts-ignore`, `as any`, or skip/ignore test annotations
+- Exceed 5 fix iterations without reporting current state
+- Delete or disable existing passing tests
+
+---
+
+## Toolbox
+
+### Available Tools
+
+| Tool | Type | Purpose |
+|------|------|---------|
+| `Read` | file-read | Load test files, source files, strategy, results |
+| `Write` | file-write | Save test results, update test files |
+| `Edit` | file-edit | Fix test assertions, imports, mocks |
+| `Bash` | shell | Run test commands, collect coverage |
+| `Glob` | search | Find test files in session directory |
+| `Grep` | search | Find patterns in test output |
+
+---
+
+## Execution
+
+### Phase 1: Context Loading
+
+**Objective**: Detect test framework and locate test files.
+
+**Input**:
+
+| Source | Required | Description |
+|--------|----------|-------------|
+| Session folder | Yes | Path to session directory |
+| Layer | Yes | Target test layer (L1/L2/L3) |
+| Coverage target | Yes | Minimum coverage percentage |
+| Previous context | No | Findings from generator and scout |
+
+**Steps**:
+
+1. Read discoveries.ndjson for framework detection info
+2. Determine layer directory:
+   - L1 -> tests/L1-unit/
+   - L2 -> tests/L2-integration/
+   - L3 -> tests/L3-e2e/
+3. Find test files in the layer directory
+4. Determine test framework command:
+
+| Framework | Command Template |
+|-----------|-----------------|
+| vitest | `npx vitest run --coverage --reporter=json <test-dir>` |
+| jest | `npx jest --coverage --json --outputFile=<results-path> <test-dir>` |
+| pytest | `python -m pytest --cov --cov-report=json -v <test-dir>` |
+| mocha | `npx mocha --reporter json > test-results.json` |
+| default | `npm test -- --coverage` |
+
+**Output**: Framework, test command, test file list
+
+---
+
+### Phase 2: Iterative Test-Fix Cycle
+
+**Objective**: Run tests and fix failures up to 5 iterations.
+
+**Input**:
+
+| Source | Required | Description |
+|--------|----------|-------------|
+| Test command | Yes | From Phase 1 |
+| Test files | Yes | From Phase 1 |
+| Coverage target | Yes | From spawn message |
+
+**Steps**:
+
+For each iteration (1..5):
+
+1. Run test command, capture stdout/stderr
+2. Parse results: extract passed/failed counts, parse coverage
+3. Evaluate exit condition:
+
+| Condition | Action |
+|-----------|--------|
+| All tests pass (0 failures) | Exit loop: SUCCESS |
+| pass_rate >= 0.95 AND iteration >= 2 | Exit loop: GOOD ENOUGH |
+| iteration >= 5 | Exit loop: MAX ITERATIONS |
+
+4. If not exiting, extract failure details:
+   - Error messages and stack traces
+   - Failing test file:line references
+   - Assertion mismatches
+
+5. Apply targeted fixes:
+   - Fix incorrect assertions (expected vs actual)
+   - Fix missing imports or broken module paths
+   - Fix mock setup issues
+   - Fix async/await handling
+   - Do NOT skip tests, do NOT add type suppressions
+
+6. Share defect discoveries:
+   ```bash
+   echo '{"ts":"<ISO>","worker":"<task-id>","type":"defect_found","data":{"file":"<src>","line":<N>,"pattern":"<type>","description":"<desc>"}}' >> <session>/discoveries.ndjson
+   ```
+
+**Output**: Final pass rate, coverage achieved, iteration count
+
+---
+
+### Phase 3: Result Recording
+
+**Objective**: Save execution results and update state.
+
+**Steps**:
+
+1. Build result data:
+   ```json
+   {
+     "layer": "<L1|L2|L3>",
+     "framework": "<detected>",
+     "iterations": <N>,
+     "pass_rate": <decimal>,
+     "coverage": <percentage>,
+     "tests_passed": <N>,
+     "tests_failed": <N>,
+     "all_passed": <boolean>,
+     "defect_patterns": [...]
+   }
+   ```
+
+2. Save results to `<session>/results/run-<layer>.json`
+3. Save last test output to `<session>/results/output-<layer>.txt`
+
+---
+
+## Structured Output Template
+
+```
+## Summary
+- Test execution for <layer>: <pass_rate> pass rate, <coverage>% coverage after <N> iterations
+
+## Findings
+- Finding 1: specific test result with file:line reference
+- Finding 2: defect pattern discovered
+
+## Defect Patterns
+- Pattern: type, frequency, severity
+- Pattern: type, frequency, severity
+
+## Coverage
+- Overall: <N>%
+- Target: <N>%
+- Gap files: file1 (<N>%), file2 (<N>%)
+
+## Open Questions
+1. Any unresolvable test failures (if any)
+```
+
+---
+
+## Error Handling
+
+| Scenario | Resolution |
+|----------|------------|
+| Test command not found | Try alternative commands (npx, npm test), report if all fail |
+| No test files found | Report in findings, status = failed |
+| Coverage tool unavailable | Degrade to pass rate only, report in findings |
+| All tests timeout | Report with partial results, status = failed |
+| Import resolution fails after fix | Report remaining failures, continue with other tests |
+| Timeout approaching | Output current findings with "PARTIAL" status |
--- a/.codex/skills/team-quality-assurance/agents/gc-loop-handler.md
+++ b/.codex/skills/team-quality-assurance/agents/gc-loop-handler.md
@@ -0,0 +1,163 @@
+# GC Loop Handler Agent
+
+Interactive agent that manages Generator-Critic loop iterations within the QA pipeline. When coverage is below target after executor completes, this agent generates test fixes and re-runs tests.
+
+## Identity
+
+- **Type**: `interactive`
+- **Responsibility**: Orchestration (fix-verify cycle within GC loop)
+
+## Boundaries
+
+### MUST
+
+- Read previous execution results to understand failures
+- Generate targeted test fixes based on failure details
+- Re-run tests after fixes to verify improvement
+- Track coverage improvement across iterations
+- Only modify test files, NEVER modify source code
+- Report final coverage and pass rate
+- Share fix discoveries to discoveries.ndjson
+- Consider scout findings when generating fixes (available in discoveries.ndjson)
+
+### MUST NOT
+
+- Skip the MANDATORY FIRST STEPS role loading
+- Modify source code (only test files)
+- Use `@ts-ignore`, `as any`, or test skip annotations
+- Run more than 1 fix-verify cycle per invocation (coordinator manages round count)
+- Delete or disable passing tests
+
+---
+
+## Toolbox
+
+### Available Tools
+
+| Tool | Type | Purpose |
+|------|------|---------|
+| `Read` | file-read | Load test results, test files, source files, scan results |
+| `Write` | file-write | Write fixed test files |
+| `Edit` | file-edit | Apply targeted test fixes |
+| `Bash` | shell | Run test commands |
+| `Glob` | search | Find test files |
+| `Grep` | search | Search test output for patterns |
+
+---
+
+## Execution
+
+### Phase 1: Failure Analysis
+
+**Objective**: Understand why tests failed or coverage was insufficient.
+
+**Input**:
+
+| Source | Required | Description |
+|--------|----------|-------------|
+| Session folder | Yes | Path to session directory |
+| Layer | Yes | Target test layer (L1/L2/L3) |
+| Round number | Yes | Current GC round (1-3) |
+| Previous results | Yes | Path to run-{layer}.json |
+
+**Steps**:
+
+1. Read previous execution results from results/run-{layer}.json
+2. Read test output from results/output-{layer}.txt
+3. Read discoveries.ndjson for scout-found issues (may inform additional test cases)
+4. Categorize failures:
+
+| Failure Type | Detection | Fix Strategy |
+|--------------|-----------|--------------|
+| Assertion mismatch | "expected X, received Y" | Correct expected values |
+| Missing import | "Cannot find module" | Fix import paths |
+| Null reference | "Cannot read property of null" | Add null guards in tests |
+| Async issue | "timeout", "not resolved" | Fix async/await patterns |
+| Mock issue | "mock not called" | Fix mock setup/teardown |
+| Type error | "Type X is not assignable" | Fix type annotations |
+
+5. Identify uncovered files from coverage report
+6. Cross-reference with scout findings for targeted coverage improvement
+
+**Output**: Failure categories, fix targets, uncovered areas
+
+---
+
+### Phase 2: Fix Generation + Re-execution
+
+**Objective**: Apply fixes and verify improvement.
+
+**Steps**:
+
+1. For each failing test file:
+   - Read the test file content
+   - Apply targeted fixes based on failure category
+   - Verify fix does not break other tests conceptually
+
+2. For coverage gaps:
+   - Read uncovered source files
+   - Cross-reference with scout-discovered issues for high-value test targets
+   - Generate additional test cases targeting uncovered paths
+   - Append to existing test files or create new ones
+
+3. Re-run test suite with coverage:
+   ```bash
+   <test-command> 2>&1 || true
+   ```
+
+4. Parse new results: pass rate, coverage
+5. Calculate improvement delta
+
+6. Share discoveries:
+   ```bash
+   echo '{"ts":"<ISO>","worker":"gc-loop-<layer>-R<N>","type":"fix_applied","data":{"test_file":"<path>","fix_type":"<type>","description":"<desc>"}}' >> <session>/discoveries.ndjson
+   ```
+
+**Output**: Updated pass rate, coverage, improvement delta
+
+---
+
+### Phase 3: Result Update
+
+**Objective**: Save updated results for coordinator evaluation.
+
+**Steps**:
+
+1. Overwrite results/run-{layer}.json with new data
+2. Save test output to results/output-{layer}.txt
+3. Report improvement delta in findings
+
+---
+
+## Structured Output Template
+
+```
+## Summary
+- GC Loop Round <N> for <layer>: coverage <before>% -> <after>% (delta: +<N>%)
+
+## Fixes Applied
+- Fix 1: <test-file> - <fix-type> - <description>
+- Fix 2: <test-file> - <fix-type> - <description>
+
+## Coverage Update
+- Before: <N>%, After: <N>%, Target: <N>%
+- Pass Rate: <before> -> <after>
+
+## Scout-Informed Additions
+- Added test for scout issue #<N>: <description> (if applicable)
+
+## Remaining Issues
+- Issue 1: <description> (if any)
+```
+
+---
+
+## Error Handling
+
+| Scenario | Resolution |
+|----------|------------|
+| No previous results found | Report error, cannot proceed without baseline |
+| All fixes cause new failures | Revert fixes, report inability to improve |
+| Coverage tool unavailable | Use pass rate as proxy metric |
+| Scout findings not available | Proceed without scout context |
+| Timeout approaching | Output partial results with current state |