--- name: video-production-ledger description: Comprehensive video production master ledger system that transforms content outlines into frame-by-frame production documents detailing all visual, audio, timing, and technical specifications. This skill should be used when creating detailed production plans for videos, breaking down scripts into shot-by-shot specifications with A-roll/B-roll alternatives, sound design, visual effects, timing coordinates, and complete editorial guidance following the four-pillar engagement framework (visual variety, visual continuity, immersive audio, healthy pacing). --- # Video Production Master Ledger Transform content outlines and scripts into comprehensive, frame-by-frame production master ledgers that serve as the complete blueprint for video production and post-production. ## Purpose and Overview The Video Production Master Ledger system converts scripts and outlines into detailed production documents that specify every aspect of a video's creation. Unlike simple shot lists or rough storyboards, master ledgers provide: - **Frame-accurate timing** for every visual and audio element - **Multiple alternatives** for A-roll, B-roll, and visual treatments - **Complete audio specification** including music, SFX, risers, hits, and drones - **Viewer attention mapping** showing exactly where focus should be directed - **Engagement mechanics** tied to the four-pillar framework - **Technical specifications** for effects, transitions, color grading, and motion ## When to Use This Skill Use this skill when: - Converting video scripts or outlines into production-ready specifications - Planning video content that requires precise engagement optimization - Creating detailed shot lists with timing, alternatives, and technical specs - Breaking down educational, entertainment, or marketing videos into actionable production plans - Documenting complete visual and audio workflows for video editors - Establishing production standards for consistent video quality ## Core Framework: The Four Pillars of Addictive Editing All production decisions in the master ledger align with these engagement principles: ### Pillar 1: Healthy Variety of Visuals **Principle:** Change what viewers see every few seconds to maintain attention without causing visual confusion. **Implementation:** - Alternate between A-roll (subject on camera), B-roll (supplementary footage), motion graphics, and text overlays - Maintain 3-10 second shot durations for engaging content - Use longer durations (10+ seconds) only when footage inherently holds attention - Specify exact visual type and duration for each segment ### Pillar 2: Visual Continuity **Principle:** Create seamless visual flow where every element blends naturally into the next. **Implementation:** - All graphics must animate into frame (never magically appear) - Maintain consistent focal points across cuts to avoid viewer eye jumping - Use transitions strategically to mark topic changes - Specify animation style (slide, fade, scale) for every graphic element ### Pillar 3: Immersive Audio **Principle:** Layer sound design to create double stimulation and emotional guidance. **Implementation:** - Base music layer establishing mood for each segment - Movement sounds (whoosh) for all animated elements - Emphasis sounds (hits, highlights) for important moments - Emotional manipulation through risers (build tension), hits (release tension), and drones (create mystery) - Music synchronization with visual beats and topic transitions ### Pillar 4: Healthy Pacing **Principle:** Balance stimulation with comprehension, adjusting density to content type and audience expectation. **Implementation:** - Entertainment content: Higher cut frequency, constant stimulation - Educational content: Moderate pacing with strategic emphasis - Authentic/vlog content: Minimal cuts, preserving natural flow - Specify exact pacing strategy for each segment ## Master Ledger Structure ### Document Format Each master ledger is organized into these sections: 1. **Project Overview** - High-level video metadata and strategy 2. **Segment Breakdown** - Frame-by-frame specification of every moment 3. **Audio Architecture** - Complete sound design mapping 4. **Visual Asset Requirements** - Comprehensive list of needed footage/graphics 5. **Technical Specifications** - Software settings, export specs, quality standards ### Segment Specification Template For each segment in the video, specify: ``` SEGMENT [Number]: [Title/Description] ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ TIMING ├─ Start: [MM:SS.ms] ├─ End: [MM:SS.ms] └─ Duration: [seconds] SCRIPT/VOICEOVER [Exact words spoken, with emphasis markers] VISUAL PRIMARY ├─ Type: [A-roll / B-roll / Motion Graphic / Text / Hybrid] ├─ Description: [Detailed description of what viewer sees] ├─ Focal Point: [Where viewer eye should focus - coordinates or description] ├─ Duration on Screen: [seconds] └─ Technical Notes: [Camera angle, framing, specific visual requirements] VISUAL ALTERNATIVES (minimum 3) Alt 1: [Type] - [Description] - [Why this works] Alt 2: [Type] - [Description] - [Why this works] Alt 3: [Type] - [Description] - [Why this works] VISUAL EFFECTS & TREATMENTS ├─ Color Grade: [Mood/style - e.g., "Warm, slightly desaturated"] ├─ Motion: [Scale/position animation - e.g., "Slow push-in, 100% to 105% over 3s"] ├─ Overlays: [Captions, graphics, vignettes - with timing] ├─ Transitions: [Type, duration, and justification] └─ Focus Direction: [Methods - darken/blur/highlight/circle/arrow/animate] AUDIO LAYERS ├─ Music: [Track name/mood] at [volume %] │ ├─ Sync Points: [Where music hits match visual beats] │ └─ Fades: [In/out timing and duration] ├─ Dialogue: [Clean/processed, volume %, any effects] ├─ SFX Movement: [Whoosh/swipe sounds for animations] @ [timing] ├─ SFX Emphasis: [Hit/pop sounds for highlights] @ [timing] ├─ SFX Emotional: [Risers/drones] @ [timing] for [purpose] └─ Ambient: [Background atmosphere if needed] ENGAGEMENT MECHANICS ├─ Attention Hook: [What grabs viewer in first 0.5s] ├─ Viewer Emotion: [Target emotional state] ├─ Pillar Focus: [Which of 4 pillars is prioritized] └─ Retention Strategy: [How this moment prevents drop-off] EDITOR NOTES [Any specific guidance, common pitfalls, or creative direction] ``` ## Production Workflow ### Phase 1: Script Analysis and Strategic Planning **Input Required:** - Complete video script or detailed outline - Target audience profile - Video purpose (entertainment/education/marketing/vlog) - Desired video length - Any existing brand guidelines or style preferences **Process:** 1. Read the complete script/outline 2. Identify natural segment boundaries (topic shifts, tone changes) 3. Determine primary engagement strategy based on content type 4. Map emotional arc across the entire video 5. Establish pacing rhythm (where to intensify, where to breathe) **Output:** - Project Overview document with strategy summary - Segment list with high-level descriptions - Emotional/energy graph showing intensity over time ### Phase 2: Detailed Segment Specification For each segment identified in Phase 1: **Step 1: Timing Allocation** - Assign precise start/end timestamps based on script reading speed - Account for natural pauses and emphasis - Build in breathing room between segments **Step 2: Visual Planning** - Define primary visual type (A-roll, B-roll, graphic) - Create 3-5 alternative visual approaches - Specify focal points to guide viewer attention - Plan visual variety rhythm within segment **Step 3: Audio Layering** - Select music that matches segment emotion - Identify all moments needing SFX - Plan riser/hit/drone placement for emotional manipulation - Sync audio peaks with visual moments **Step 4: Engagement Optimization** - Apply four-pillar framework to each moment - Identify potential drop-off points and add hooks - Balance stimulation density with comprehension needs - Verify continuity with previous/next segments **Step 5: Technical Specification** - Define all effects, color grades, motion parameters - Specify transition types and durations - List overlay timing and animation styles - Note any special technical requirements **Output:** Complete segment specification using template format ### Phase 3: Asset Requirements Compilation **Process:** 1. Extract all unique visual assets needed from segment specs 2. Categorize by type (A-roll shots, B-roll footage, graphics, stock) 3. Prioritize based on importance and difficulty to obtain 4. Note alternatives if primary assets unavailable **Output:** - Organized asset list with descriptions - Shot list for A-roll filming - B-roll footage requirements (film vs. stock) - Motion graphic briefs - Stock footage search queries ### Phase 4: Technical Specification Sheet **Process:** 1. Compile all software requirements 2. Establish quality standards (resolution, bitrate, format) 3. Document color grading approach 4. List required plugins, fonts, sound libraries 5. Define export settings **Output:** Technical specification document covering: - Software and versions - Project settings - Quality standards - Required assets (sounds, fonts, plugins) - Export specifications ## Detailed Specification Guidelines ### Visual Specification Standards **A-Roll Specifications Must Include:** - Camera angle and framing (wide/medium/close-up) - Subject position in frame - Lighting mood - Background treatment (blur amount, color) - Eye contact point for viewer focus continuity - Duration and whether subject is static or moving **B-Roll Specifications Must Include:** - Exact content description - Required motion (pan, tilt, static, handheld) - Color treatment to match segment mood - Specific visual details that matter to narrative - Duration and if looped - Relationship to voiceover (illustrative/metaphorical/explanatory) **Motion Graphic Specifications Must Include:** - Content type (diagram, text, chart, icon, illustration) - Animation style (fade/slide/scale/perspective/custom) - Entry and exit timing and method - Color palette and relationship to brand - Complexity level (simple icon vs. detailed diagram) - Key information and visual hierarchy **Text Overlay Specifications Must Include:** - Exact text content - Font, size, weight, color - Animation in/out (with timing) - Position on screen - Duration on screen - Emphasis techniques (color shift, scale, glow) - Maximum 3 words at once for readability ### Audio Specification Standards **Music Selection Criteria:** - Primary emotion (anticipation, excitement, mystery, comfort, intensity) - Energy level (1-10 scale) - Tempo range (BPM) - Instrumentation character (electronic, organic, hybrid) - Sync points with visual beats - Volume levels for each segment - Fade in/out timing (sudden, quick, gradual) **Sound Effect Categories:** 1. **Movement SFX** - For every animated element - Whoosh variations (light, medium, heavy) - Swipe sounds for transitions - Slide sounds for moving graphics - Timing: Sync with visual motion start 2. **Emphasis SFX** - For important moments - Pop sounds for appearing elements - Hit sounds for impactful statements - Click sounds for interface elements - Timing: Exact frame alignment critical 3. **Emotional SFX** - For mood manipulation - Risers (duration 0.5-3s) building into important moments - Hits (sharp, immediate) releasing tension at reveals - Drones (sustained, 5-30s) creating underlying atmosphere - Timing: Strategic placement based on emotional arc 4. **Ambient SFX** - For environmental context - Background atmosphere appropriate to B-roll - Volume: Subtle, typically 10-30% of dialogue - Duration: Continuous through related segments ### Timing Specification Standards **Timestamp Format:** MM:SS.ms (00:00.000) **Timing Precision Requirements:** - Segment boundaries: Frame-accurate (±0.033s) - SFX placement: Frame-accurate - Music sync points: Frame-accurate - Visual cuts: Frame-accurate - Text appearance: Within 0.1s - Transitions: Exact duration specified **Reading Speed Standards:** - Casual speaking: 150-160 words per minute - Enthusiastic/fast: 170-190 words per minute - Measured/serious: 130-140 words per minute - Calculate segment duration by word count ÷ speaking rate ### Engagement Mechanics Specification For each segment, explicitly define: **Attention Hook** - First 0.5s element that captures focus - Options: Sudden motion, contrasting color, unexpected sound, text emphasis, scale change - Specify exact technique and timing **Viewer Emotion Target** - Desired emotional state - Categories: Curious, excited, informed, empowered, entertained, contemplative, energized - Link to music and pacing choices **Primary Pillar** - Which engagement pillar is emphasized - Variety: When introducing new information or building energy - Continuity: When explaining complex ideas requiring focus - Audio: When manipulating emotion or building anticipation - Pacing: When matching audience expectations for content type **Retention Strategy** - Specific technique preventing drop-off - Pattern interruption (sudden change after repetition) - Curiosity gap (promising information to come) - Emotional payoff (delivering satisfying conclusion) - Momentum building (increasing energy toward next segment) ## Reference Files ### Audio Element Library Reference For detailed sound effect selection guidance and categorization, see: - `references/audio_elements_guide.md` - Comprehensive SFX library organization - `references/music_mood_matrix.md` - Music selection framework by emotional target ### Visual Treatment Library Reference For specific effect implementation and parameter recommendations, see: - `references/visual_effects_catalog.md` - Complete effects library with use cases - `references/motion_graphics_templates.md` - Standard animation patterns ### Example Master Ledgers For complete production ledger examples across different video types, see: - `references/example_educational_ledger.md` - Detailed educational content breakdown - `references/example_entertainment_ledger.md` - High-energy entertainment specification - `references/example_vlog_ledger.md` - Authentic vlog minimal production approach ## Quality Standards ### Completeness Checklist A production-ready master ledger must include: - [ ] Complete timing for every segment (no gaps) - [ ] Primary visual specified for every moment - [ ] Minimum 3 alternatives per primary visual - [ ] Audio layers defined for every segment - [ ] At least one SFX per 5 seconds of content - [ ] Music track specified with volume and sync points - [ ] Focal point identified for every visual - [ ] Engagement mechanics defined per segment - [ ] All transitions specified with type and duration - [ ] Technical specifications for all effects - [ ] Complete asset requirements list - [ ] Editor notes for complex or critical moments ### Review Process Before delivering master ledger: 1. **Timing Review** - Verify no gaps, overlaps, or timing conflicts 2. **Continuity Review** - Check focal point flow across cuts 3. **Audio Sync Review** - Confirm music hits align with visual moments 4. **Alternative Viability** - Ensure all alternatives are genuinely usable 5. **Technical Feasibility** - Verify all specifications are achievable 6. **Engagement Density** - Confirm attention maintenance throughout 7. **Brand Alignment** - Check consistency with any provided guidelines ## Output Format Deliver master ledgers as structured markdown documents with: 1. **Project Overview Section** at top 2. **Segment Specifications** in chronological order 3. **Audio Architecture Map** showing all music and SFX timing 4. **Visual Asset Requirements** comprehensive list 5. **Technical Specifications** complete software/export details Use consistent formatting with clear hierarchy: - H1 for document title - H2 for major sections - H3 for segment titles - Clear visual dividers between segments - Code blocks or tables for technical specifications - Emphasis formatting for critical notes ## Advanced Techniques ### Sync Mapping for Music-Driven Segments When segment energy should match music perfectly: 1. Load music track and identify beat/measure boundaries 2. Align segment start with strong beat or measure start 3. Map visual hits (cuts, reveals, text) to music hits 4. Place risers before major music builds 5. Time transitions to finish on beat resolutions ### Emotional Arc Continuity Maintain emotional flow across segments: 1. Map desired emotional intensity (1-10) for each segment 2. Never jump more than 3 points between segments 3. Use transition type to bridge emotional gaps: - Small gap (1-2 points): Simple cut - Medium gap (2-3 points): Quick transition with SFX - Large gap (4+ points): Requires intermediary segment ### Viewer Attention Heatmapping Predict where viewer attention goes: 1. Mark primary focal point for each visual 2. Track eye position across cuts 3. Flag any jump larger than 30% of screen width 4. Add guidance elements (motion, highlight) when jumps occur 5. Maintain focal continuity for important information ### Cognitive Load Management Balance information delivery with engagement: 1. Rate each segment for cognitive demand (1-10) 2. High-demand segments (7-10): Reduce visual variety, maintain continuity 3. Low-demand segments (1-3): Increase variety, add stylistic elements 4. Medium-demand segments (4-6): Balance based on audience sophistication 5. Never exceed sustained cognitive load of 8 for more than 30 seconds ## Common Patterns and Templates ### Opening Hook Pattern (First 5-10 seconds) ``` SEGMENT 1: HOOK Timing: 00:00.000 - 00:08.000 Visual: Fast-paced montage of compelling B-roll - 5-7 quick cuts (1-2s each) - High-energy visuals showing end result or transformation - Each cut must contain movement or strong visual interest Audio: - Energetic music with strong beat - No dialogue or minimal teaser dialogue - Synchronized cutting to music beats - Optional riser building toward title card Purpose: Immediate engagement, sets energy level, promises value ``` ### Explanation Pattern (Teaching Complex Concept) ``` SEGMENT X: CONCEPT EXPLANATION Timing: [Start] - [End] Visual Primary: Motion graphic breaking down concept - Start simple, build complexity progressively - Use 3-5 second pauses at each build step - Highlight new elements with color/animation - Keep existing elements visible but de-emphasized Visual Support: Picture-in-picture A-roll in corner - Shows presenter explaining, builds connection - Small enough to not distract from main graphic - Can be removed during complex graphic moments Audio: - Music: Lower energy, more contemplative - Volume: 30% during explanation, 50% during pauses - SFX: Highlight sound for each new element introduced - No risers/hits that might distract from learning Purpose: Clear understanding, retention of complex information ``` ### Transition Between Topics Pattern ``` SEGMENT X: TOPIC TRANSITION Timing: [Start] - [End] (typically 1-3 seconds) Visual: Full-screen transition effect - Type matches energy level and topic relationship - Duration: 0.5-1.5s for related topics, 1-3s for major shifts - Can include text card announcing new topic Audio: - Music: Fades out end of Topic A, fades in for Topic B - Optional: Hit sound at transition point - If major shift: Brief moment of silence (0.2-0.5s) Purpose: Signal change, reset attention, prepare viewer for new context ``` ## Troubleshooting Common Issues ### Issue: Too Much Visual Variety (Viewer Confusion) **Symptoms:** Many cuts under 2 seconds, constant change without purpose, no focal point consistency **Solutions:** - Extend shots with inherent interest to 5-8 seconds - Group related information with single visual - Establish consistent focal point across 2-3 related cuts - Reduce transition effects ### Issue: Too Little Visual Variety (Viewer Boredom) **Symptoms:** Long stretches without visual change, static shots over 10 seconds, repetitive visual patterns **Solutions:** - Add motion graphics overlay to static shots - Implement slow push-in or pan on still shots - Cut to related B-roll every 5-7 seconds - Introduce text overlays for key points ### Issue: Audio-Visual Desynchronization **Symptoms:** Music hits not matching visual moments, SFX feeling detached, emotional tone mismatch **Solutions:** - Map music beat grid and align cuts to beats - Place all SFX at exact frame of visual event - Adjust music track timing or find better sync point - Consider music edit (splice) if structural mismatch exists ### Issue: Cognitive Overload **Symptoms:** Too much happening simultaneously, competing focal points, insufficient processing time **Solutions:** - Reduce SFX density in information-heavy segments - Extend duration of complex motion graphics - Remove text overlays during B-roll storytelling - Simplify color grading to reduce visual noise ### Issue: Engagement Drop-off at Specific Points **Symptoms:** Predictable viewer exit points, long segments without hooks, monotonous pacing **Solutions:** - Add pattern interruption (sudden change) every 8-10 seconds - Increase music energy leading into weak segment - Insert compelling B-roll or motion graphic - Add riser before important information to build anticipation ## Integration with Production Pipeline ### Pre-Production Phase Master ledger informs: - Shot list creation for filming - B-roll footage requirements - Motion graphics briefs for designers - Music licensing and selection - SFX library acquisition ### Production Phase Master ledger guides: - A-roll filming priorities and setup - B-roll capture shopping list - On-set timing considerations - Contingency planning for missed shots ### Post-Production Phase Master ledger serves as: - Editor's primary reference document - Timeline structure blueprint - Audio mix specification - QA checklist for completeness - Revision communication tool ## Best Practices ### Do's ✅ **Specify alternatives:** Always provide 3-5 visual options per segment ✅ **Time precisely:** Use frame-accurate timestamps for all elements ✅ **Layer audio:** Multiple audio elements create richness ✅ **Guide focus:** Explicitly state where viewer eye should be ✅ **Connect emotion:** Link all creative choices to target emotional state ✅ **Plan continuity:** Consider flow across cuts, not just within shots ✅ **Justify choices:** Explain "why" in editor notes when non-obvious ✅ **Build flexibility:** Structure allows substitutions without breaking flow ### Don'ts ❌ **Don't under-specify:** "Add B-roll here" is insufficient - describe exactly what B-roll ❌ **Don't ignore timing:** Vague durations lead to pacing problems ❌ **Don't forget audio:** Visual-only specs miss half the engagement equation ❌ **Don't neglect alternatives:** Single-option specs fail when assets unavailable ❌ **Don't break continuity:** Jumping focal points destroy immersion ❌ **Don't over-complicate simple moments:** Not every second needs maximum stimulation ❌ **Don't contradict pillars:** Every choice should support engagement framework ❌ **Don't create in isolation:** Each segment must connect to adjacent segments ## Success Metrics A high-quality master ledger enables: - **Editor efficiency:** 50%+ faster editing with clear specifications - **First-cut accuracy:** 80%+ of choices match creative intent - **Asset optimization:** Zero missing assets, minimal unused footage - **Revision reduction:** Clear specs reduce back-and-forth iterations - **Quality consistency:** Systematic approach ensures professional results - **Scalability:** Framework applies to videos of any length or type ## Conclusion The Video Production Master Ledger transforms the editing process from creative guesswork into systematic execution of a proven engagement framework. By specifying every visual, audio, timing, and technical element with precision and alternatives, the master ledger ensures editors can create compelling, attention-holding content that serves the video's purpose while maintaining production efficiency. The four-pillar framework underlying all specifications—visual variety, visual continuity, immersive audio, and healthy pacing—provides a unified theory of viewer engagement that guides every creative decision from conceptual outline to final export.