--- name: picture description: Generate images using Google Imagen. Use when user needs 'hero image', 'presentation visual', 'conceptual illustration', or 'generate an image'. Composes with itv-styling for brand-constrained outputs. (user) allowed-tools: - Bash - Read --- # Image Generation Generate AI images using Google Imagen via the Gemini API. ## When to Use - Presentation images (hero visuals, section dividers) - Conceptual illustrations (visual metaphors, abstract concepts) - Photo-realistic images (product mockups, scenarios) - Visual explanations that benefit from AI generation ## When NOT to Use - **Precise diagrams or charts** — use `diagramming` skill (editable SVG, exact data) - **Screenshots** — use `screenshot` skill - **Simple icons** — often faster to find stock or use emoji **Overlap with diagramming:** There's fuzzy boundary. A "chart for a presentation" could go either way: - Need precise data, editability → diagramming - Need striking visual punch → image-generation - Use judgement; ask if unclear. ## Workflow ### 1. Understand the Need Clarify with user: - **Purpose** — presentation, concept illustration, visual metaphor? - **Style** — photorealistic, illustration, abstract? - **Brand** — does it need ITV styling? (if so, read itv-styling skill) ### 2. Draft with Flash Use the fast model for initial iterations: ```bash ~/.claude/skills/picture/imagen.sh "prompt" --model gemini-2.5-flash-image ``` ### 3. Review and Refine Open the image, assess, iterate: ```bash # Edit mode: refine previous output ~/.claude/skills/picture/imagen.sh "make it warmer, add more contrast" --input ./images/previous.png ``` ### 4. Final Render with Pro For client-facing or final deliverables: ```bash ~/.claude/skills/picture/imagen.sh "prompt" --model gemini-3-pro-image-preview ``` ## Command Reference ```bash # Basic generation imagen.sh "prompt" [--output ./images] [--model MODEL] # Edit existing image imagen.sh "refinement prompt" --input previous.png # Models --model gemini-2.5-flash-image # Fast, cheap (default) --model gemini-3-pro-image-preview # Higher quality --model imagen-4.0-generate-preview-06-06 # Imagen 4 ``` Output: Saves to `./images/` with timestamped filename, prints path. ## Prompting Framework Based on Max Woolf's Nano Banana research. ### Structure ``` [Specific object description with exact requirements in CAPS] Aspects that MUST be followed EXACTLY: - [Compositional rule 1] - [Compositional rule 2] [Publication/camera details for style elevation] Do not include [unwanted elements]. ``` ### Key Techniques | Technique | Example | |-----------|---------| | Structured bullets | Requirements as dashed list, not prose | | ALL CAPS constraints | "MUST", "EXACTLY" increases adherence | | Hex colors | `#9F2B68` more precise than "magenta" | | Composition rules | "rule of thirds", "negative space", "depth of field" | | Style elevators | "Pulitzer Prize-winning cover photo for NYT" | | Camera specs | "Canon EOS 90D DSLR camera" | | Publication targets | "Vanity Fair cover profile" | | Negative constraints | "Do not include text, watermarks, or line overlays" | ### Example Prompt ``` A professional headshot of a confident business executive. Aspects that MUST be followed EXACTLY: - Shot from shoulders up, rule of thirds composition - Neutral background with soft gradient #E8E8E8 to #FFFFFF - Natural 3PM diffuse lighting from left - Sharp focus on eyes, slight bokeh on background Pulitzer Prize-winning portrait, Canon EOS R5, 85mm f/1.4. Do not include any text, logos, or watermarks. ``` ## Composing with Brand Skills ### With itv-styling When creating ITV-branded images: 1. Read `itv-styling` for color palette and principles 2. Bake brand constraints into prompt: ``` Corporate presentation image for ITV. Aspects that MUST be followed EXACTLY: - Dark background #0F2323 (ITV dark green) - Accent elements in #E8E557 (ITV yellow) or #4ECDC4 (ITV teal) - Clean, modern, professional aesthetic - No busy patterns or off-brand colors Professional corporate photography style. ``` ### With diagramming For hybrid needs (visual + precise data): - Generate AI background/illustration with image-generation - Overlay precise elements with diagramming - Composite manually if needed ## Limitations | Limitation | Workaround | |------------|------------| | Style transfer fails ("Studio Ghibli style") | Use structural descriptions instead | | Text generation imperfect | Add text as overlay after generation | | Exact positioning difficult | Iterate with refinement prompts | | Rate limits | Use Flash for drafts, Pro only for finals | ## Output Location Images save to `./images/` in the project directory: - Created on first use - Timestamped filenames for uniqueness - Stays with project for easy reference ## Anti-Patterns | Pattern | Problem | Fix | |---------|---------|-----| | Skip brand check | Inconsistent styling | Load itv-styling first when brand applies | | Vague prompts | Poor results | Use specific, concrete descriptions | | Wrong tool for data | Inaccurate charts | Use diagram skill for precise data | ## See Also - `references/prompting.md` — Extended prompting reference - `diagramming` skill — For precise diagrams and charts - `itv-styling` skill — For brand-constrained outputs