--- name: browser-content-capture description: Capture content from JavaScript-rendered pages, login-protected sites, and multi-page documentation using agent-browser CLI. Use when capturing browser content, extracting web data, saving page content. context: fork agent: data-pipeline-engineer version: 2.0.0 author: OrchestKit AI Agent Hub tags: [browser, agent-browser, scraping, spa, authentication, 2026] user-invocable: false allowed-tools: Bash, Read, Write --- # Browser Content Capture **Capture web content that traditional scrapers cannot access using agent-browser CLI.** ## Overview This skill enables content extraction from sources that require browser-level access: - **JavaScript-rendered SPAs** (React, Vue, Angular apps) - **Login-protected documentation** (private wikis, gated content) - **Dynamic content** (infinite scroll, lazy loading, client-side routing) - **Multi-page site crawls** (documentation trees, tutorial series) ## Overview **Use when:** - `WebFetch` returns empty or partial content - Page requires JavaScript execution to render - Content is behind authentication - Need to navigate multi-page structures - Extracting from client-side routed apps **Do NOT use when:** - Static HTML pages (use `WebFetch` - faster) - Public API endpoints (use direct HTTP calls) - Simple RSS/Atom feeds --- ## Quick Start ### Basic Capture Pattern ```bash # 1. Navigate to URL agent-browser open https://docs.example.com # 2. Wait for content to render agent-browser wait --load networkidle # 3. Get interactive snapshot agent-browser snapshot -i # 4. Extract text content agent-browser get text body # 5. Take screenshot agent-browser screenshot /tmp/capture.png # 6. Close when done agent-browser close ``` --- ## agent-browser Commands Reference | Command | Purpose | When to Use | |---------|---------|-------------| | `open ` | Go to URL | First step of any capture | | `snapshot -i` | Get interactive element tree | Understanding page structure | | `eval "