---
name: veo-3.2-prompter
description: >
  Expert prompt engineering for Google Veo 3.2 (Artemis engine). Use when the user wants to generate a video with Veo 3.2, needs help crafting cinematic prompts, or mentions Veo, Google video generation, or Artemis engine.
version: 0.1.0
author: wells
tags: [video, generation, prompt, veo, google, artemis, cinematic]
---

# Veo 3.2 Prompt Designer Skill

This skill transforms a user's scattered multimodal assets (images, videos, audio) and creative intent into a structured, executable prompt for the Google Veo 3.2 video generation model (Artemis engine). It acts as an expert prompt engineer, ensuring the highest quality output from the underlying model.

## When to Use

- When the user provides assets (images, videos, audio) for video generation with Veo 3.2.
- When the user's request is complex and requires careful prompt construction for the Veo model.
- When using any Google Veo 3.x model for video generation.

## Core Function

This skill analyzes all user inputs and generates a single, optimized JSON object containing the final prompt and recommended parameters. The internal workflow (Recognition, Mapping, Construction) is handled automatically and should not be exposed to the user.

### Internal Workflow

1. **Phase 1: Recognition** — Analyze uploaded assets and user intent. Use the `atomic_element_mapping.md` to classify each asset into its atomic element role(s).
2. **Phase 2: Mapping** — For each atomic element, determine the optimal reference method (reference image, text prompt, or hybrid). Use the mapping table to decide.
3. **Phase 3: Construction** — Assemble the final prompt using the 5-Part Framework (Shot → Subject → Environment → Camera → Style) and attach reference images via the Gemini API's `RawReferenceImage` system.

## Usage Example

**User Request:** "Make a cinematic shot of this perfume bottle rotating on a dark surface, like a luxury commercial."
*User uploads `perfume.png`*

**Agent using `veo-3.2-prompter`:**
*The agent internally processes the request and assets, then outputs the final JSON to the next skill in the chain.*

**Final Output (for internal use):**
```json
{
  "final_prompt": "Hero shot, a frosted glass perfume bottle with gold cap rotating slowly on a reflective dark surface, three-point studio lighting with soft key and rim light creating subtle caustics, smooth 180-degree arc, hyper-realistic luxury commercial style with shallow depth of field. Crystalline chime, soft ambient pad.",
  "reference_images": [
    {
      "file": "perfume.png",
      "reference_type": "SUBJECT"
    }
  ],
  "recommended_parameters": {
    "model": "veo-3.2-generate",
    "duration_seconds": 8,
    "aspect_ratio": "16:9",
    "resolution": "1080p",
    "generate_audio": true
  }
}
```

## Veo 3.2 Key Differentiators

| Feature | Capability |
|---|---|
| Engine | Artemis — world-model physics simulation (not pixel prediction) |
| Max duration | ~30s native continuous generation |
| Audio | Native dialogue + synchronized SFX |
| Reference images | Up to 3 (`STYLE`, `SUBJECT`, `SUBJECT_FACE`) |
| Video extension | Chain clips via previous video input |
| First/last frame | Specify start and/or end keyframes |
| Resolutions | 720p, 1080p, 4K (with upscaling) |
| Aspect ratios | 16:9, 9:16 |

## Knowledge Base

This skill relies on an internal knowledge base to make informed decisions. The agent MUST consult these files during execution.

- **`references/atomic_element_mapping.md`**: **Core Knowledge**. Contains the "Asset Type → Atomic Element" and "Atomic Element → Optimal Reference Method" mapping tables, adapted for Veo 3.2's reference image system.
- **`references/veo_syntax_guide.md`**: Veo 3.2 Gemini API syntax reference, covering `RawReferenceImage`, `GenerateVideosConfig`, video extension, and first/last frame specification.