--- name: "cli-anything-videocaptioner" description: >- AI-powered video captioning — transcribe speech, optimize/translate subtitles, and burn them into video via the stable VideoCaptioner backend. Free ASR and translation included. --- # cli-anything-videocaptioner AI-powered video captioning tool. Transcribe speech → optimize subtitles → translate → burn into video. ## Installation ```bash pip install cli-anything-videocaptioner ``` **Prerequisites:** - Python 3.10-3.12 (`videocaptioner` 1.4.1 requires `>=3.10,<3.13`) - `videocaptioner` must be installed (`pip install videocaptioner`) - FFmpeg required for video synthesis ## Usage ### Basic Commands ```bash # Show help cli-anything-videocaptioner --help # Start interactive REPL mode cli-anything-videocaptioner # Transcribe a video (free, no setup) cli-anything-videocaptioner transcribe video.mp4 --asr bijian # Translate subtitles (free Bing translator) cli-anything-videocaptioner subtitle input.srt --translator bing --target-language en # Full pipeline: transcribe → translate → burn subtitles cli-anything-videocaptioner process video.mp4 --asr bijian --translator bing --target-language en --subtitle-mode hard # Review subtitle/script consistency before a final hard-burn cli-anything-videocaptioner synthesize video.mp4 -s subtitles.srt \ --subtitle-mode hard \ --review-script approved_script.txt # Render a one-frame subtitle preview for review cli-anything-videocaptioner review subtitles.srt \ --script approved_script.txt \ --preview-video video.mp4 \ --preview-output review_5s.png # JSON output (for agent consumption) cli-anything-videocaptioner --json transcribe video.mp4 --asr bijian ``` ### REPL Mode When invoked without a subcommand, the CLI enters an interactive REPL session: ```bash cli-anything-videocaptioner # Enter commands interactively with tab-completion and history ``` ## Command Groups ### transcribe — Speech to subtitles ``` transcribe [--asr bijian|jianying|whisper-api|whisper-cpp] [--language CODE] [--format srt|ass|txt|json] [-o PATH] ``` - `bijian` (default): Free, Chinese & English, no setup - `whisper-api`: All languages, requires `--whisper-api-key` ### subtitle — Optimize and translate ``` subtitle [--translator llm|bing|google] [--target-language CODE] [--layout target-above|source-above|target-only|source-only] [--no-optimize] [--no-translate] [-o PATH] ``` - Three steps: Split → Optimize → Translate - Bing/Google translators are free - 38 target languages supported (BCP 47 codes) ### synthesize — Burn subtitles into video ``` synthesize