---
name: image-generation
description: Generate images from text prompts using Google's Gemini model. Creates photorealistic images, illustrations, concept art, and more via the generate_image tool.
emoji: "\U0001F3A8"
---

# Image Generation

Generate images from text descriptions using Google's Gemini model via the `generate_image` tool.

## Setup

1. Enable image generation in Settings UI or set `NOMOS_IMAGE_GENERATION=true`
2. Get a Gemini API key from https://aistudio.google.com/apikey
3. Set `GEMINI_API_KEY` in Settings or environment

## Tool: `generate_image`

**Parameters:**

- `prompt` (required) — Detailed description of the image to generate
- `output_path` (optional) — File path to save the image (defaults to temp directory)

## Writing Effective Prompts

Good image prompts are specific about:

- **Subject**: What is in the image (person, object, scene)
- **Style**: Photorealistic, watercolor, oil painting, digital art, pencil sketch, etc.
- **Composition**: Close-up, wide shot, aerial view, symmetrical, rule of thirds
- **Lighting**: Golden hour, studio lighting, dramatic shadows, soft diffused light
- **Colors**: Warm palette, monochrome, vibrant, muted pastels
- **Mood**: Serene, dramatic, playful, mysterious

### Prompt Examples

**Simple:**

> A golden retriever sitting in a field of sunflowers at sunset

**Detailed:**

> A photorealistic close-up of a steaming cup of coffee on a weathered wooden table, morning light streaming through a window, shallow depth of field, warm color palette

**Artistic:**

> An Art Nouveau illustration of a woman surrounded by flowing botanical patterns, muted earth tones with gold accents, decorative border, inspired by Alphonse Mucha's style

**Technical:**

> An isometric 3D rendering of a modern smart home cutaway showing interior rooms, clean minimal style, soft shadows, pastel color scheme on white background

## Capabilities

- Photorealistic images
- Illustrations and concept art
- Logos and icons
- Diagrams and infographics (with text)
- Image editing via text description (describe modifications)
- Multiple art styles (watercolor, oil, digital, pixel art, etc.)

## Output

- Images are saved as PNG (default), JPEG, or WebP
- Default save location is the system temp directory
- Specify `output_path` to save to a specific location (e.g., current working directory)

## Tips

- More detail in prompts produces better results
- Specify image dimensions or aspect ratio in the prompt if needed
- For consistent style across multiple images, include the same style descriptors
- The model may also return text alongside the image with notes about what it generated