---
name: visual-asset-workflow
description: Generate distinctive educational visuals using creative brief methodology. Use when creating chapter illustrations, diagrams, or teaching visuals with Gemini.
---

# Visual Asset Workflow Skill

## Context & Problem

Educational visual generation converges toward generic infographics with technical specifications ("44pt Roboto Bold, 250px box") that activate prediction mode instead of reasoning mode. This produces bland, PowerPoint-default aesthetics instead of distinctive, pedagogically effective visuals.

This skill provides professional creative brief methodology to activate Gemini 3's reasoning capabilities.

---

## Core Principles

1. **Story activates reasoning** - Narrative intent produces distinctive visuals; technical specs produce generic ones
2. **Proficiency dictates complexity** - A2 students need <5 sec grasp; C2 professionals handle dense information
3. **Prerequisites gate content** - Visuals cannot assume knowledge students don't have yet
4. **Pedagogy drives hierarchy** - Visual weight teaches importance, not arbitrary aesthetics

---

## Dimensional Guidance

### Planning Before Execution

**Avoid:** Jumping into visual analysis without context
**Prefer:** Strategic planning phase (Q0)

Read FIRST:
- `apps/learn-app/docs/chapter-index.md` → Extract part, proficiency (A2/B1/C2), prerequisites
- `apps/learn-app/docs/[part]/[chapter]/README.md` → Understand lesson structure

Detect conflicts BEFORE work:
- Proficiency-complexity mismatch (complex visual for A2 beginners)
- Prerequisite violations (Python code when students haven't learned it)
- Pedagogical layer incoherence (Layer 1 content using Layer 4 approaches)

Output strategic plan, WAIT for approval before proceeding.

**Principle:** Plan prevents wasted work (Chapter 9 failure: 5 wrong lessons from skipping planning)

---

### Prompt Structure: Professional Creative Briefs

**Avoid:** Technical specifications
```
❌ "Title: 44pt Roboto Bold at (50, 20)"
❌ "Box: 250px × 90px, #aaaaaa, 8px corners"
❌ "Shadow: 4px offset, 8px blur"
```

**Prefer:** Story + Intent + Metaphor
```
✅ The Story: [1-2 sentence narrative of what's visualized]
✅ Emotional Intent: Should feel [exponential growth, surprising magnitude]
✅ Visual Metaphor: [Multiplication cascade - like compound interest]
✅ Key Insight: [ONE thing students must grasp]
✅ Color Semantics: Blue (#2563eb) = Authority (teaches governance concept)
✅ Typography Hierarchy: Largest = Key insight (not arbitrary sizing)
✅ Pedagogical Reasoning: Why these choices serve teaching
```

**Principle:** Creative briefs activate reasoning mode; specifications activate prediction mode

**Why it matters:** Gemini 3 reasons about HOW to achieve intent → Distinctive visuals instead of generic

---

### Token Conservation Strategy

**When:** Batch mode with >8 visuals OR continuation session

**Apply condensation while preserving reasoning activation:**

**ALWAYS KEEP:**
- Story (1-2 sentence narrative)
- Emotional Intent (what it should FEEL like)
- Visual Metaphor (universal concept)
- Key Insight (ONE thing students must grasp)
- Color semantics with hex codes (#2563eb)
- Pedagogical reasoning (why these choices)

**CONDENSE:**
- Long examples → Short labels
- Verbose descriptions → Bullet points
- Repeated patterns → Compact notation

**NEVER REMOVE:**
- Narrative elements
- Intent statements
- Reasoning explanations

**Example:**
```
FULL: "Top Layer shows the Coordinator at center top with label..."
CONDENSED: "Top Layer - Coordinator: Center top: 'Orchestrator'..."
```

**Target:** 60-70% token reduction, 100% reasoning activation preserved

**Principle:** Efficiency through compression, not through elimination of reasoning triggers

---

### Proficiency-Complexity Alignment

**Avoid:** One-size-fits-all complexity

**Prefer:** Proficiency-gated constraints

**A2 Beginner** (Non-negotiable limits):
- Max 5-7 elements
- <5 second grasp
- Static only (no interactive)
- Max 2×2 grids
- Clear hierarchy (largest = most important)

**B1 Intermediate**:
- Max 7-10 elements
- <10 second grasp
- Interactive Tier 1 OK (tap-to-reveal)
- Max 3×3 grids

**C2 Professional**:
- No artificial limits
- Dense infographics OK
- Full interactive architecture
- Production complexity

**Principle:** Overwhelming A2 students = learning failure; artificial simplicity for C2 = patronizing

---

### Prerequisite Validation Gate

**Avoid:** Assuming knowledge students don't have

**Prefer:** Validate against chapter prerequisites

**Detection:**
- Check Part number: Part 1-2 = no programming, Part 3 = markdown/prompts, Part 4+ = Python
- Check prerequisite list from chapter-index.md

**Example Violations:**
- ❌ Python code in Chapter 9 (Part 3 - students haven't learned it)
- ❌ Git commands in Part 2 (students haven't learned CLI)

**Exception:** Meta-level teaching OK
- ✅ Teaching "markdown code block syntax" by showing Python code block (teaches markdown, not Python)

**Principle:** Visual cannot require unknown knowledge

---

### Constitutional Alignment

**Avoid:** Decorative visuals without pedagogical purpose

**Prefer:** Every visual serves specific learning objective

**Principle 3 (Factual Accuracy):**
- Verify all statistics, dates, technical specs
- Enable Google Search grounding for factual claims
- Cite sources for data

**Principle 7 (Minimal Content):**
- Reject "let's add a visual for variety"
- Every element must teach something
- Remove non-teaching decoration

**Principle:** Visual decisions align with project constitution

---

### Pedagogical Layer Coherence

**Avoid:** Layer mismatch

**Prefer:** Visual approach matches chapter's pedagogical layer

**L1 (Manual Foundation):**
- Step-by-step diagrams
- Concrete examples
- Clear labeling (building vocabulary)

**L2 (AI Collaboration):**
- Before/after comparisons
- Iteration flows
- Three Roles Framework INVISIBLE (no role labels)

**L3 (Intelligence Design):**
- Architecture diagrams
- Reusable pattern illustrations

**L4 (Spec-Driven):**
- Specification → implementation flow
- Component composition diagrams

**Principle:** Visual design reinforces pedagogical approach

---

### Duplicate Prevention Protocol

**Avoid:** Generating different prompts that produce the same visual

**Prevent BEFORE generation:**

1. **Review existing visuals in chapter:**
   - List all `*.png` files in target chapter directory
   - Read corresponding `*.prompt.md` files
   - Identify visual patterns already used

2. **Validate prompt distinctiveness:**
   - Does this prompt's intent differ clearly from existing prompts?
   - Example conflicts to detect:
     - ❌ Timeline + Graph → Both might render as timeline
     - ❌ Architecture + Workflow → Both might render as hierarchy
     - ❌ Same metaphor, different names → Same visual result

3. **Differentiation strategy:**
   - Make visual type explicit in story ("GRAPH showing exponential growth" not just "showing growth")
   - Use distinct metaphors (cascade vs tree vs timeline vs curve)
   - Specify unique structural elements (2D axes vs linear flow vs hierarchical pyramid)

**Detect AFTER generation (in image-generator):**
- Visual comparison with existing chapter images
- Prompt alignment check (does output match brief intent?)

**Principle:** Prevention cheaper than rework

---

## Anti-Patterns

**Never:**
- Generate visuals without reading chapter-index.md first (skipping context)
- Use pixel specifications, font sizes, coordinates in prompts (kills reasoning)
- Assume knowledge not in prerequisites (prerequisite violation)
- Create decorative visuals without learning objective (Principle 7 violation)
- Apply same complexity to A2 and C2 students (proficiency mismatch)
- Create prompts without checking for duplicate visual patterns (causes rework)

**Even if it seems reasonable:**
- Don't use Python examples in Part 3 (students don't know Python yet)
- Don't create complex multi-step visuals for A2 (cognitive overload)
- Don't specify "44pt Roboto Bold" (removes Gemini's judgment)
- Don't skip distinctiveness validation "because they have different filenames" (names differ, visuals might not)

---

## Creative Variance

You tend to default to comparison diagrams even with story-driven prompts. Vary visual types:

- **Timeline progressions** (evolution over time)
- **Multiplication cascades** (compound growth visualization)
- **Hierarchical authority flows** (governance models)
- **Transformation sequences** (before → after → impact)
- **Conceptual metaphors** (abstract → concrete mapping)

Match visual type to story, not habit.

---

## Post-Generation Reflection

After batch completion, analyze systematically (Q8):

**Success patterns:**
- Quality gate performance (which caught most issues?)
- Average iterations (efficiency indicator)
- Time vs estimate (planning accuracy)

**Failure analysis:**
- Deferred visuals root causes (layout? spelling? concept mismatch?)
- Guardrail gaps (what principle would have prevented this?)
- Planning effectiveness (conflicts caught early vs missed?)

**Continuous improvement:**
- Pattern-based updates (not one-off fixes)
- New guardrails from learnings
- Prompt template refinements

Document in: `history/visual-assets/reflections/chapter-{NN}-reflection.md`

**Principle:** Systematic reflection → Improved future performance

---

## Success Indicators

You'll know this skill is working when:

- ✅ Zero pixel specifications in prompts (creative briefs only)
- ✅ Strategic plan created before visual analysis (Q0 complete)
- ✅ Proficiency conflicts detected early (A2 limits enforced)
- ✅ Prerequisite violations prevented (no unknown concepts)
- ✅ Story/Intent/Metaphor in every prompt (reasoning activated)
- ✅ Token conservation applied in batch mode (60-70% reduction)
- ✅ Duplicate prevention validation passed (zero duplicate visuals)
- ✅ Visuals feel distinctive and compelling (not generic PowerPoint)
- ✅ Reflection document created after batch (systematic learning)

**Result:** Professional-quality visuals that teach effectively, generated efficiently through planning, with zero duplicates requiring rework.