--- name: layout-extract description: Extract structural layout information from reference images, URLs, or text prompts using Claude analysis argument-hint: [--design-id ] [--session ] [--images ""] [--urls ""] [--prompt ""] [--targets ""] [--variants ] [--device-type ] [--interactive] allowed-tools: TodoWrite(*), Read(*), Write(*), Glob(*), Bash(*), AskUserQuestion(*), Task(ui-design-agent), mcp__exa__web_search_exa(*) --- # Layout Extraction Command ## Overview Extract structural layout information from reference images, URLs, or text prompts using AI analysis. This command separates the "scaffolding" (HTML structure and CSS layout) from the "paint" (visual tokens handled by `style-extract`). **Strategy**: AI-Driven Structural Analysis - **Agent-Powered**: Uses `ui-design-agent` for deep structural analysis - **Behavior**: Generate N layout concepts → User multi-select → Generate selected templates - **Output**: `layout-templates.json` with DOM structure, component hierarchy, and CSS layout rules - **Device-Aware**: Optimized for specific device types (desktop, mobile, tablet, responsive) - **Token-Based**: CSS uses `var()` placeholders for spacing and breakpoints ## Phase 0: Setup & Input Validation ### Step 1: Detect Input, Mode & Targets ```bash # Detect input source # Priority: --urls + --images → hybrid | --urls → url | --images → image | --prompt → text # Parse URLs if provided (format: "target:url,target:url,...") IF --urls: url_list = [] FOR pair IN split(--urls, ","): IF ":" IN pair: target, url = pair.split(":", 1) url_list.append({target: target.strip(), url: url.strip()}) ELSE: # Single URL without target url_list.append({target: "page", url: pair.strip()}) has_urls = true ELSE: has_urls = false url_list = [] # Set variants count (default: 3, range: 1-5) # Behavior: Generate N layout concepts per target → User multi-select → Generate selected templates variants_count = --variants OR 3 VALIDATE: 1 <= variants_count <= 5 # Resolve targets # Priority: --targets → url_list targets → prompt analysis → default ["page"] IF --targets: targets = split(--targets, ",") ELSE IF has_urls: targets = [url_info.target for url_info in url_list] ELSE IF --prompt: # Extract targets from prompt using pattern matching # Looks for keywords: "page names", target descriptors (login, dashboard, etc.) # Returns lowercase, hyphenated strings (e.g., ["login", "dashboard"]) targets = extract_from_prompt(--prompt) ELSE: targets = ["page"] # Resolve device type device_type = --device-type OR "responsive" # desktop|mobile|tablet|responsive # Determine base path with priority: --design-id > --session > auto-detect if [ -n "$DESIGN_ID" ]; then # Exact match by design ID relative_path=$(find .workflow -name "${DESIGN_ID}" -type d -print -quit) elif [ -n "$SESSION_ID" ]; then # Latest in session relative_path=$(find .workflow/WFS-$SESSION_ID -name "design-run-*" -type d -printf "%T@ %p\n" 2>/dev/null | sort -nr | head -1 | cut -d' ' -f2) else # Latest globally relative_path=$(find .workflow -name "design-run-*" -type d -printf "%T@ %p\n" 2>/dev/null | sort -nr | head -1 | cut -d' ' -f2) fi # Validate and convert to absolute path if [ -z "$relative_path" ] || [ ! -d "$relative_path" ]; then echo "❌ ERROR: Design run not found" echo "💡 HINT: Run '/workflow:ui-design:list' to see available design runs" exit 1 fi base_path=$(cd "$relative_path" && pwd) bash(echo "✓ Base path: $base_path") ``` ### Step 2: Load Inputs & Create Directories ```bash # For image mode bash(ls {images_pattern}) # Expand glob pattern Read({image_path}) # Load each image # For URL mode # Parse URL list format: "target:url,target:url" # Validate URLs are accessible # For text mode # Validate --prompt is non-empty # Create output directory bash(mkdir -p {base_path}/layout-extraction) ``` ### Step 2.5: Extract DOM Structure (URL Mode - Auto-Trigger) ```bash # AUTO-TRIGGER: If URLs are available (from --urls parameter), automatically extract real DOM structure # This provides accurate layout data to supplement visual analysis # Check if URLs provided via --urls parameter IF --urls AND url_list: REPORT: "🔍 Auto-triggering URL mode: Extracting DOM structure" bash(mkdir -p {base_path}/.intermediates/layout-analysis) # For each URL in url_list: FOR url_info IN url_list: target = url_info.target url = url_info.url IF mcp_chrome_devtools_available: REPORT: " Processing: {target} ({url})" # Read extraction script script_content = Read(~/.claude/scripts/extract-layout-structure.js) # Open page in Chrome DevTools mcp__chrome-devtools__navigate_page(url=url) # Execute layout extraction script result = mcp__chrome-devtools__evaluate_script(function=script_content) # Save DOM structure for this target (intermediate file) Write({base_path}/.intermediates/layout-analysis/dom-structure-{target}.json, result) REPORT: " ✅ DOM structure extracted for '{target}'" ELSE: REPORT: " ⚠️ Chrome DevTools MCP not available, falling back to visual analysis" BREAK dom_structure_available = mcp_chrome_devtools_available ELSE: dom_structure_available = false ``` **Extraction Script Reference**: `~/.claude/scripts/extract-layout-structure.js` **Usage**: Read the script file and use content directly in `mcp__chrome-devtools__evaluate_script()` **Script returns**: - `metadata`: Extraction timestamp, URL, method, version - `patterns`: Layout pattern statistics (flexColumn, flexRow, grid counts) - `structure`: Hierarchical DOM tree with layout properties - `exploration`: (Optional) Progressive exploration results when standard selectors fail **Benefits**: - ✅ Real flex/grid configuration (justifyContent, alignItems, gap, etc.) - ✅ Accurate element bounds (x, y, width, height) - ✅ Structural hierarchy with depth control - ✅ Layout pattern identification (flex-row, flex-column, grid-NCol) - ✅ Progressive exploration: Auto-discovers missing selectors **Progressive Exploration Strategy** (v2.2.0+): When script finds <3 main containers, it automatically: 1. **Scans** all large visible containers (≥500×300px) 2. **Extracts** class patterns matching: `main|content|wrapper|container|page|layout|app` 3. **Suggests** new selectors to add to script 4. **Returns** exploration data in `result.exploration`: ```json { "triggered": true, "discoveredCandidates": [{classes, bounds, display}], "suggestedSelectors": [".wrapper", ".page-index"], "recommendation": ".wrapper, .page-index, .app-container" } ``` **Using Exploration Results**: ```javascript // After extraction, check for suggestions IF result.exploration?.triggered: REPORT: result.exploration.warning REPORT: "Suggested selectors: " + result.exploration.recommendation // Update script by adding to commonClassSelectors array // Then re-run extraction for better coverage ``` **Selector Update Workflow**: 1. Run extraction on unfamiliar site 2. Check `result.exploration.suggestedSelectors` 3. Add relevant selectors to script's `commonClassSelectors` 4. Re-run extraction → improved container detection ### Step 3: Memory Check ```bash # 1. Check if inputs cached in session memory IF session_has_inputs: SKIP Step 2 file reading # 2. Check if output already exists bash(find {base_path}/layout-extraction -name "layout-*.json" -print -quit | grep -q . && echo "exists") IF exists: SKIP to completion ``` --- **Phase 0 Output**: `input_mode`, `base_path`, `variants_count`, `targets[]`, `device_type`, loaded inputs ## Phase 1: Layout Concept Generation ### Step 1: Generate Layout Concept Options (Agent Task 1) **Executor**: `Task(ui-design-agent)` Launch agent to generate `variants_count` layout concept options for each target: ```javascript Task(ui-design-agent): ` [LAYOUT_CONCEPT_GENERATION_TASK] Generate {variants_count} structurally distinct layout concepts for each target SESSION: {session_id} | MODE: explore | BASE_PATH: {base_path} TARGETS: {targets} | DEVICE_TYPE: {device_type} ## Input Analysis - Targets: {targets.join(", ")} - Device type: {device_type} - Visual references: {loaded_images if available} ${dom_structure_available ? "- DOM Structure: Read from .intermediates/layout-analysis/dom-structure-*.json" : ""} ## Analysis Rules - For EACH target, generate {variants_count} structurally DIFFERENT layout concepts - Concepts must differ in: grid structure, component arrangement, visual hierarchy - Each concept should have distinct navigation pattern, content flow, and responsive behavior ## Generate for EACH Target For target in {targets}: For concept_index in 1..{variants_count}: 1. **Concept Definition**: - concept_name (descriptive, e.g., "Classic Three-Column Holy Grail") - design_philosophy (1-2 sentences explaining the structural approach) - layout_pattern (e.g., "grid-3col", "flex-row", "single-column", "asymmetric-grid") - key_components (array of main layout regions) - structural_features (list of distinguishing characteristics) 2. **Wireframe Preview** (simple text representation): - ascii_art (simple ASCII box diagram showing layout structure) - Example: ┌─────────────────┐ │ HEADER │ ├──┬─────────┬────┤ │ L│ MAIN │ R │ └──┴─────────┴────┘ ## Output Write single JSON file: {base_path}/.intermediates/layout-analysis/analysis-options.json Use schema from INTERACTIVE-DATA-SPEC.md (Layout Extract: analysis-options.json) CRITICAL: Use Write() tool immediately after generating complete JSON ` ``` ### Step 2: Verify Options File Created ```bash bash(test -f {base_path}/.intermediates/layout-analysis/analysis-options.json && echo "created") # Quick validation bash(cat {base_path}/.intermediates/layout-analysis/analysis-options.json | grep -q "layout_concepts" && echo "valid") ``` **Output**: `analysis-options.json` with layout concept options for all targets --- ## Phase 1.5: User Confirmation (Optional - Triggered by --interactive) **Purpose**: Allow user to select preferred layout concept(s) for each target before generating detailed templates **Trigger Condition**: Execute this phase ONLY if `--interactive` flag is present ### Step 1: Check Interactive Flag ```bash # Skip this entire phase if --interactive flag is not present IF NOT --interactive: SKIP to Phase 2 # Interactive mode enabled REPORT: "🎯 Interactive mode: User selection required for {targets.length} target(s)" ``` ### Step 2: Load and Present Options ```bash # Read options file options = Read({base_path}/.intermediates/layout-analysis/analysis-options.json) # Parse layout concepts layout_concepts = options.layout_concepts ``` ### Step 2: Present Options to User (Per Target) For each target, present layout concept options and capture selection: ``` 📋 Layout Concept Options for Target: {target} We've generated {variants_count} structurally different layout concepts for review. Please select your preferred concept for this target. {FOR each concept in layout_concepts[target]: ═══════════════════════════════════════════════════ Concept {concept.index}: {concept.concept_name} ═══════════════════════════════════════════════════ Philosophy: {concept.design_philosophy} Pattern: {concept.layout_pattern} Components: {FOR each component in concept.key_components: • {component} } Features: {FOR each feature in concept.structural_features: • {feature} } Wireframe: {concept.wireframe_preview.ascii_art} ═══════════════════════════════════════════════════ } ``` ### Step 3: Capture User Selection and Update Options File (Per Target) **Interaction Strategy**: If total concepts > 4 OR any target has > 3 concepts, use batch text format: ``` 【目标[N] - [target]】选择布局方案 [key]) Concept [index]: [concept_name] [design_philosophy] [key]) Concept [index]: [concept_name] [design_philosophy] ... 请回答 (格式: 1a 2b 或 1a,b 2c 多选)： User input: "[N][key] [N][key] ..." → Single selection per target "[N][key1,key2] [N][key3] ..." → Multi-selection per target ``` Otherwise, use `AskUserQuestion` below. ```javascript // Use AskUserQuestion tool for each target (multi-select enabled) FOR each target: AskUserQuestion({ questions: [{ question: "Which layout concept(s) do you prefer for '{target}'?", header: "Layout for " + target, multiSelect: true, // Multi-selection enabled (default behavior) options: [ {FOR each concept in layout_concepts[target]: label: "Concept {concept.index}: {concept.concept_name}", description: "{concept.design_philosophy}" } ] }] }) // Parse user response (array of selections) selected_options = user_answer // Check for user cancellation IF selected_options == null OR selected_options.length == 0: REPORT: "⚠️ User canceled selection. Workflow terminated." EXIT workflow // Extract concept indices from array selected_indices = [] FOR each selected_option_text IN selected_options: match = selected_option_text.match(/Concept (\d+):/) IF match: selected_indices.push(parseInt(match[1])) ELSE: ERROR: "Invalid selection format. Expected 'Concept N: ...' format" EXIT workflow // Store selections for this target (array of indices) selections[target] = { selected_indices: selected_indices, // Array of selected indices concept_names: selected_indices.map(i => layout_concepts[target][i-1].concept_name) } REPORT: "✅ Selected {selected_indices.length} layout(s) for {target}" // Calculate total selections across all targets total_selections = sum([len(selections[t].selected_indices) for t in targets]) // Update analysis-options.json with user selection (embedded in same file) options_file = Read({base_path}/.intermediates/layout-analysis/analysis-options.json) options_file.user_selection = { "selected_at": "{current_timestamp}", "selection_type": "per_target_multi", "session_id": "{session_id}", "total_selections": total_selections, "selected_variants": selections // {target: {selected_indices: [...], concept_names: [...]}} } // Write updated file back Write({base_path}/.intermediates/layout-analysis/analysis-options.json, JSON.stringify(options_file, indent=2)) ``` ### Step 4: Confirmation Message ``` ✅ Selections recorded! Total: {total_selections} layout(s) {FOR each target, selection in selections: • {target}: {selection.selected_indices.length} layout(s) selected {FOR each index IN selection.selected_indices: - Concept {index}: {layout_concepts[target][index-1].concept_name} } } Proceeding to generate {total_selections} detailed layout template(s)... ``` **Output**: `analysis-options.json` updated with embedded `user_selection` field ## Phase 2: Layout Template Generation (Agent Task 2) **Executor**: `Task(ui-design-agent)` × `Total_Selected_Templates` in **parallel** ### Step 1: Load User Selections or Default to All ```bash # Read analysis-options.json which may contain user_selection options = Read({base_path}/.intermediates/layout-analysis/analysis-options.json) layout_concepts = options.layout_concepts # Check if user_selection field exists (interactive mode) IF options.user_selection AND options.user_selection.selected_variants: # Interactive mode: Use user-selected variants selections_per_target = options.user_selection.selected_variants total_selections = options.user_selection.total_selections ELSE: # Non-interactive mode: Generate ALL variants for ALL targets (default behavior) selections_per_target = {} total_selections = 0 FOR each target in targets: selections_per_target[target] = { "selected_indices": [1, 2, ..., variants_count], # All indices "concept_names": [] # Will be filled from options } total_selections += variants_count # Build task list for all selected concepts across all targets task_list = [] FOR each target in targets: selected_indices = selections_per_target[target].selected_indices # Array concept_names = selections_per_target[target].concept_names # Array FOR i in range(len(selected_indices)): idx = selected_indices[i] concept = layout_concepts[target][idx - 1] # 0-indexed array variant_id = i + 1 # 1-based variant numbering task_list.push({ target: target, variant_id: variant_id, concept: concept, output_file: "{base_path}/layout-extraction/layout-{target}-{variant_id}.json" }) total_tasks = task_list.length REPORT: "Generating {total_tasks} layout templates across {targets.length} targets" ``` ### Step 2: Launch Parallel Agent Tasks Generate layout templates for ALL selected concepts in parallel: ```javascript FOR each task in task_list: Task(ui-design-agent): ` [LAYOUT_TEMPLATE_GENERATION_TASK #{task.variant_id} for {task.target}] Generate detailed layout template based on user-selected concept. Focus ONLY on structure and layout. DO NOT concern with visual style (colors, fonts, etc.). SESSION: {session_id} | BASE_PATH: {base_path} TARGET: {task.target} | VARIANT: {task.variant_id} DEVICE_TYPE: {device_type} USER SELECTION: - Selected Concept: {task.concept.concept_name} - Philosophy: {task.concept.design_philosophy} - Pattern: {task.concept.layout_pattern} - Key Components: {task.concept.key_components.join(", ")} - Structural Features: {task.concept.structural_features.join(", ")} ## Input Analysis - Target: {task.target} - Device type: {device_type} - Visual references: {loaded_images if available} ${dom_structure_available ? "- DOM Structure Data: Read from .intermediates/layout-analysis/dom-structure-{task.target}.json - USE THIS for accurate layout properties" : ""} ## Generation Rules - Develop the user-selected layout concept into a detailed template - Use the selected concept's key_components as foundation - Apply the selected layout_pattern (grid-3col, flex-row, etc.) - Honor the structural_features defined in the concept - Expand the concept with complete DOM structure and CSS layout rules ${dom_structure_available ? ` - IMPORTANT: You have access to real DOM structure data with accurate flex/grid properties - Use DOM data as primary source for layout properties - Extract real flex/grid configurations (display, flexDirection, justifyContent, alignItems, gap) - Use actual element bounds for responsive breakpoint decisions - Preserve identified patterns from DOM structure ` : ""} ## Template Generation 1. **DOM Structure**: - Semantic HTML5 tags:

,