--- name: convert-url-html-to-md description: "Convert HTML web pages to Markdown for documentation extraction. Use when fetching documentation from websites, extracting structured content, getting clean main content (clean=true), or getting full page with nav/sidebar to discover URLs (clean=false). Ideal for learning libraries and building context from web sources." --- # Convert URL HTML to Markdown Extract web content as markdown using a two-phase approach for comprehensive documentation gathering. ## Two-Phase Workflow For optimal documentation extraction: 1. **Discovery (clean=false)**: Get full page including navigation and sidebars to discover all documentation URLs 2. **Extraction (clean=true)**: Extract main content from discovered URLs ## Usage ```bash # From the skill directory cd ~/.claude/skills/convert-url-html-to-md # Clean mode - main content only (recommended for docs) node scripts/convert_url.js --clean=true # Full page mode - includes nav/sidebar (for discovering URLs) node scripts/convert_url.js --clean=false # Default is clean=true node scripts/convert_url.js ``` ## Examples ```bash # Get all navigation links from a docs site node scripts/convert_url.js https://ui.shadcn.com/docs --clean=false # Extract specific documentation content node scripts/convert_url.js https://ui.shadcn.com/docs/components/radix/aspect-ratio --clean=true ``` ## Installation Dependencies are included. Run once: ```bash cd ~/.claude/skills/convert-url-html-to-md npm install ``` ## Output The script outputs markdown directly to stdout. Redirect to file if needed: ```bash node scripts/convert_url.js --clean=true > output.md ``` ## Credits This skill is based on [urltomarkdown](https://github.com/macsplit/urltomarkdown) by [Lee Hanken](https://github.com/macsplit), licensed under MIT. Modified and adapted as a Claude skill by [1naichii](https://github.com/1naichii).