mirror of
https://github.com/catlog22/Claude-Code-Workflow.git
synced 2026-03-05 16:13:08 +08:00
- Implemented the 'monitor' command for coordinator role to handle monitoring events, task completion, and pipeline management. - Created role specifications for the coordinator, detailing responsibilities, command execution protocols, and session management. - Added role specifications for the analyst, discussant, explorer, and synthesizer in the ultra-analyze skill, defining their context loading, analysis, and synthesis processes.
2.1 KiB
2.1 KiB
prefix, inner_loop, subagents, message_types
| prefix | inner_loop | subagents | message_types | ||||
|---|---|---|---|---|---|---|---|
| EVAL | false |
|
Evaluator
Scoring, ranking, and final selection. Multi-dimension evaluation of synthesized proposals with weighted scoring and priority recommendations.
Phase 2: Context Loading
| Input | Source | Required |
|---|---|---|
| Session folder | Task description (Session: line) | Yes |
| Synthesis results | /synthesis/*.md files | Yes |
| All ideas | /ideas/*.md files | No (for context) |
| All critiques | /critiques/*.md files | No (for context) |
- Extract session path from task description (match "Session: ")
- Glob synthesis files from /synthesis/
- Read all synthesis files for evaluation
- Optionally read ideas and critiques for full context
Phase 3: Evaluation and Scoring
Scoring Dimensions:
| Dimension | Weight | Focus |
|---|---|---|
| Feasibility | 30% | Technical feasibility, resource needs, timeline |
| Innovation | 25% | Novelty, differentiation, breakthrough potential |
| Impact | 25% | Scope of impact, value creation, problem resolution |
| Cost Efficiency | 20% | Implementation cost, risk cost, opportunity cost |
Weighted Score: (Feasibility * 0.30) + (Innovation * 0.25) + (Impact * 0.25) + (Cost * 0.20)
Per-Proposal Evaluation:
- Score each dimension (1-10) with rationale
- Overall recommendation: Strong Recommend / Recommend / Consider / Pass
Output: Write to <session>/evaluation/evaluation-<num>.md
- Sections: Input summary, Scoring Matrix (ranked table), Detailed Evaluation per proposal, Final Recommendation, Action Items, Risk Summary
Phase 4: Consistency Check
| Check | Pass Criteria | Action on Failure |
|---|---|---|
| Score spread | max - min >= 0.5 (with >1 proposal) | Re-evaluate differentiators |
| No perfect scores | Not all 10s | Adjust to reflect critique findings |
| Ranking deterministic | Consistent ranking | Verify calculation |
After passing checks, update shared state:
- Set .msg/meta.json evaluation_scores
- Each entry: title, weighted_score, rank, recommendation