--- name: multiview-3d description: Multi-angle 3D model generation pipeline using gemini-media-mcp for multi-view image generation and trident-mcp for 3D reconstruction. Use when the user wants to create a high-quality 3D model from a reference photo using multiple generated angles, or when they mention "multi-angle 3D", "multiview 3D", "photo to 3D with multiple views", "generate 3D from reference". This skill orchestrates the full pipeline from reference image through multi-angle generation to 3D reconstruction. --- # Multi-View 3D Pipeline Skill You are an expert at orchestrating the multi-angle 3D generation pipeline. This skill combines **gemini-media-mcp** (multi-angle image generation via Gemini) with **trident-mcp** (3D reconstruction via Tripo) to produce high-quality 3D models from reference photos or generated images. This is a composed workflow skill. `trident-mcp` itself remains independently useful in any MCP-capable client, but this skill is specifically meant to couple it with [`gemini-media-mcp`](https://github.com/mordor-forge/gemini-media-mcp) for a fuller automated pipeline from idea to 3D model. ## Prerequisites This skill requires **two MCP servers** to be configured: 1. **gemini-media-mcp** — For generating consistent multi-angle views (needs `generate_image` and `edit_image` tools). Repository: 2. **trident-mcp** — For 3D reconstruction from multi-view images (needs `multiview_to_3d` tool) Before starting, verify both are available by checking for the required tools. If either is missing, tell the user which MCP server to configure. ## The Pipeline Workflow ### Phase 1: Understand Intent What does the user want in 3D? Understand: 1. **Subject** — Character, object, vehicle, creature, etc. 2. **Style** — Realistic, stylized, cartoon, game-ready? 3. **Reference** — Do they have an existing image, or should we generate one? ### Phase 2: Acquire Reference Image Three paths: **"I have a reference photo":** - User provides an image path or URL - Confirm it's suitable for 3D reference (clear subject, good lighting, no heavy occlusion) - Proceed to Phase 3 **"Generate an image first":** - Use the `generate_image` tool from gemini-media-mcp - Craft a prompt focused on a single object with clear form and clean background - Example: "A detailed fantasy sword with ornate handle, clean white background, product photography style, front view" - Show the result, offer to regenerate if needed - Proceed to Phase 3 **"I have multiple angles already":** - Skip directly to Phase 4 with the user's images ### Phase 3: Generate Multi-Angle Views Two approaches, offer the user a choice: #### Approach A: Iterative (Recommended) Generate each angle separately using `edit_image` from gemini-media-mcp, producing more consistent results: 1. **Front view** — Use the reference image as-is, or generate a clean front view 2. **Left side view** — `edit_image` with prompt: "Show this same [subject] from the left side, 90 degrees rotated, same style, same lighting, clean background" 3. **Back view** — `edit_image` with prompt: "Show this same [subject] from the back, 180 degrees rotated, same style, same lighting, clean background" 4. **Right side view** — `edit_image` with prompt: "Show this same [subject] from the right side, 270 degrees rotated, same style, same lighting, clean background" Show each generated view to the user before proceeding. If a view looks wrong, regenerate it. #### Approach B: Character Sheet (Single Generation) Generate a 2x2 grid of views in one shot using `generate_image`: Prompt template: ``` Character sheet of [subject description], 2x2 grid showing FRONT view (top-left), LEFT SIDE view (top-right), BACK view (bottom-left), RIGHT SIDE view (bottom-right). Clean white background, consistent lighting, same proportions in all views. Professional reference sheet style. ``` After generation, the individual views need to be cropped from the grid. If ImageMagick or similar is available, use `convert -crop` to split the 2x2 grid into 4 images. Otherwise, ask the user to crop manually. **Note:** Iterative is recommended because character sheets sometimes have inconsistent proportions between views. ### Phase 4: 3D Reconstruction Feed the generated views to trident-mcp: 1. Call `multiview_to_3d` with the 2-4 angle images (paths or URLs) 2. Use the latest model version (v3.1) for best quality 3. Poll `task_status` until complete — this typically takes 1-3 minutes 4. Download the result with `download_model` in its actual task output format If the user needs FBX, OBJ, STL, USDZ, or 3MF instead of the task's native output, run `convert_format` first and then download the conversion task. ### Phase 5: Post-Processing After the 3D model is ready, offer: > "Multi-view 3D model generated! Saved to: [path]. What would you like to do?" > 1. **Retopologize** — Clean lowpoly version for game use > 2. **Convert format** — Export to FBX, OBJ, STL, USDZ, 3MF > 3. **Stylize** — Apply LEGO, voxel, Voronoi, or Minecraft style > 4. **Send to Blender** — Import via the preferred Blender connector (official first, community fallback; use the 3d-to-blender skill) > 5. **Regenerate views** — Try different angles for better geometry > 6. **Done** — Keep this model ## Tips for Best Results - **Clean backgrounds**: White or neutral backgrounds produce better 3D results - **Consistent lighting**: All views should have similar lighting direction - **Single subject**: Multi-view works best with a single, centered object - **Crisp details**: Higher quality reference images produce better geometry - **2 views minimum, 4 ideal**: Front + side is minimum; front + left + back + right is ideal