--- name: ai-podcast-creation description: "Create AI-powered podcasts with text-to-speech, music, and audio editing. Tools: Kokoro TTS, DIA TTS, Chatterbox, AI music generation, media merger. Capabilities: multi-voice conversations, background music, intro/outro, full episodes. Use for: podcast production, audiobooks, voice content, audio newsletters. Triggers: podcast, ai podcast, text to speech podcast, audio content, voice over, ai audiobook, multi voice, conversation ai, notebooklm alternative, audio generation, podcast automation, ai narrator, voice content, audio newsletter, podcast maker" allowed-tools: Bash(infsh *) --- # AI Podcast Creation Create AI-powered podcasts and audio content via [inference.sh](https://inference.sh) CLI. ![AI Podcast Creation](https://cloud.inference.sh/u/4mg21r6ta37mpaz6ktzwtt8krr/01jz00krptarq4bwm89g539aea.png) ## Quick Start > Requires inference.sh CLI (`infsh`). Get installation instructions: `npx skills add inference-sh/skills@agent-tools` ```bash infsh login # Generate podcast segment infsh app run infsh/kokoro-tts --input '{ "prompt": "Welcome to the AI Frontiers podcast. Today we explore the latest developments in generative AI.", "voice": "am_michael" }' ``` ## Available Voices ### Kokoro TTS | Voice ID | Description | Best For | |----------|-------------|----------| | `af_sarah` | American female, warm | Host, narrator | | `af_nicole` | American female, professional | News, business | | `am_michael` | American male, authoritative | Documentary, tech | | `am_adam` | American male, conversational | Casual podcast | | `bf_emma` | British female, refined | Audiobooks | | `bm_george` | British male, classic | Formal content | ### DIA TTS (Conversational) | Voice ID | Description | Best For | |----------|-------------|----------| | `dia-conversational` | Natural conversation | Dialogue, interviews | ### Chatterbox | Voice ID | Description | Best For | |----------|-------------|----------| | `chatterbox-default` | Expressive | Casual, entertainment | ## Podcast Workflows ### Simple Narration ```bash # Single voice podcast segment infsh app run infsh/kokoro-tts --input '{ "prompt": "Your podcast script here. Make it conversational and engaging. Add natural pauses with punctuation.", "voice": "am_michael" }' ``` ### Multi-Voice Conversation ```bash # Host introduction infsh app run infsh/kokoro-tts --input '{ "prompt": "Welcome back to Tech Talk. Today I have a special guest to discuss AI developments.", "voice": "am_michael" }' > host_intro.json # Guest response infsh app run infsh/kokoro-tts --input '{ "prompt": "Thanks for having me. I am excited to share what we have been working on.", "voice": "af_sarah" }' > guest_response.json # Merge into conversation infsh app run infsh/media-merger --input '{ "audio_files": ["", ""], "crossfade_ms": 500 }' ``` ### Full Episode Pipeline ```bash # 1. Generate script with Claude infsh app run openrouter/claude-sonnet-45 --input '{ "prompt": "Write a 5-minute podcast script about the impact of AI on creative work. Format as a two-person dialogue between HOST and GUEST. Include natural conversation, questions, and insights." }' > script.json # 2. Generate intro music infsh app run infsh/ai-music --input '{ "prompt": "Podcast intro music, upbeat, modern, tech feel, 15 seconds" }' > intro_music.json # 3. Generate host segments infsh app run infsh/kokoro-tts --input '{ "prompt": "", "voice": "am_michael" }' > host.json # 4. Generate guest segments infsh app run infsh/kokoro-tts --input '{ "prompt": "", "voice": "af_sarah" }' > guest.json # 5. Generate outro music infsh app run infsh/ai-music --input '{ "prompt": "Podcast outro music, matching intro style, fade out, 10 seconds" }' > outro_music.json # 6. Merge everything infsh app run infsh/media-merger --input '{ "audio_files": [ "", "", "", "" ], "crossfade_ms": 1000 }' ``` ### NotebookLM-Style Content Generate podcast-style discussions from documents. ```bash # 1. Extract key points infsh app run openrouter/claude-sonnet-45 --input '{ "prompt": "Read this document and create a podcast script where two hosts discuss the key points in an engaging, conversational way. Include questions, insights, and natural dialogue.\n\nDocument:\n" }' > discussion_script.json # 2. Generate Host A infsh app run infsh/kokoro-tts --input '{ "prompt": "", "voice": "am_michael" }' > host_a.json # 3. Generate Host B infsh app run infsh/kokoro-tts --input '{ "prompt": "", "voice": "af_sarah" }' > host_b.json # 4. Interleave and merge infsh app run infsh/media-merger --input '{ "audio_files": ["", "", "", ""], "crossfade_ms": 300 }' ``` ### Audiobook Chapter ```bash # Long-form narration infsh app run infsh/kokoro-tts --input '{ "prompt": "Chapter One. It was a dark and stormy night when the first AI achieved consciousness...", "voice": "bf_emma", "speed": 0.9 }' ``` ## Audio Enhancement ### Add Background Music ```bash # 1. Generate podcast audio infsh app run infsh/kokoro-tts --input '{ "prompt": "", "voice": "am_michael" }' > podcast.json # 2. Generate ambient music infsh app run infsh/ai-music --input '{ "prompt": "Soft ambient background music for podcast, subtle, non-distracting, loopable" }' > background.json # 3. Mix with lower background volume infsh app run infsh/media-merger --input '{ "audio_files": [""], "background_audio": "", "background_volume": 0.15 }' ``` ### Add Sound Effects ```bash # Transition sounds between segments infsh app run infsh/ai-music --input '{ "prompt": "Short podcast transition sound, whoosh, 2 seconds" }' > transition.json ``` ## Script Writing Tips ### Prompt for Claude ```bash infsh app run openrouter/claude-sonnet-45 --input '{ "prompt": "Write a podcast script with these requirements: - Topic: [YOUR TOPIC] - Duration: 5 minutes (about 750 words) - Format: Two hosts (HOST_A and HOST_B) - Tone: Conversational, informative, engaging - Include: Hook intro, 3 main points, call to action - Mark speaker changes clearly Make it sound natural, not scripted. Add verbal fillers like \"you know\" and \"I mean\" occasionally." }' ``` ## Podcast Templates ### Interview Format ``` HOST: Introduction and welcome GUEST: Thank you, happy to be here HOST: First question about background GUEST: Response with story HOST: Follow-up question GUEST: Deeper insight ... continue pattern ... HOST: Closing question GUEST: Final thoughts HOST: Thank you and outro ``` ### Solo Episode ``` Introduction with hook Topic overview Point 1 with examples Point 2 with examples Point 3 with examples Summary and takeaways Call to action Outro ``` ### News Roundup ``` Intro music Welcome and date Story 1: headline + details Story 2: headline + details Story 3: headline + details Analysis/opinion segment Outro ``` ## Best Practices 1. **Natural punctuation** - Use commas and periods for pacing 2. **Short sentences** - Easier to speak and listen 3. **Varied voices** - Different speakers prevent monotony 4. **Background music** - Subtle, at 10-15% volume 5. **Crossfades** - Smooth transitions between segments 6. **Edit scripts** - Remove filler before generating ## Related Skills ```bash # Text-to-speech models npx skills add inference-sh/skills@text-to-speech # AI music generation npx skills add inference-sh/skills@ai-music-generation # LLM for scripts npx skills add inference-sh/skills@llm-models # Content pipelines npx skills add inference-sh/skills@ai-content-pipeline # Full platform skill npx skills add inference-sh/skills@agent-tools ``` Browse all apps: `infsh app list --category audio`