--- name: tapestry description: Unified content extraction and action planning. Use when user says "tapestry ", "weave ", "help me plan ", "extract and plan ", "make this actionable ", or similar phrases indicating they want to extract content and create an action plan. Automatically detects content type (YouTube video, article, PDF) and processes accordingly. allowed-tools: - Bash - Read - Write --- # Tapestry: Unified Content Extraction + Action Planning This is the **master skill** that orchestrates the entire Tapestry workflow: 1. Detect content type from URL 2. Extract content using appropriate skill 3. Automatically create a Ship-Learn-Next action plan ## When to Use This Skill Activate when the user: - Says "tapestry [URL]" - Says "weave [URL]" - Says "help me plan [URL]" - Says "extract and plan [URL]" - Says "make this actionable [URL]" - Says "turn [URL] into a plan" - Provides a URL and asks to "learn and implement from this" - Wants the full Tapestry workflow (extract โ†’ plan) **Keywords to watch for**: tapestry, weave, plan, actionable, extract and plan, make a plan, turn into action ## How It Works ### Complete Workflow: 1. **Detect URL type** (YouTube, article, PDF) 2. **Extract content** using appropriate skill: - YouTube โ†’ youtube-transcript skill - Article โ†’ article-extractor skill - PDF โ†’ download and extract text 3. **Create action plan** using ship-learn-next skill 4. **Save both** content file and plan file 5. **Present summary** to user ## URL Detection Logic ### YouTube Videos **Patterns to detect:** - `youtube.com/watch?v=` - `youtu.be/` - `youtube.com/shorts/` - `m.youtube.com/watch?v=` **Action:** Use youtube-transcript skill ### Web Articles/Blog Posts **Patterns to detect:** - `http://` or `https://` - NOT YouTube, NOT PDF - Common domains: medium.com, substack.com, dev.to, etc. - Any HTML page **Action:** Use article-extractor skill ### PDF Documents **Patterns to detect:** - URL ends with `.pdf` - URL returns `Content-Type: application/pdf` **Action:** Download and extract text ### Other Content **Fallback:** - Try article-extractor (works for most HTML) - If fails, inform user of unsupported type ## Step-by-Step Workflow ### Step 1: Detect Content Type ```bash URL="$1" # Check for YouTube if [[ "$URL" =~ youtube\.com/watch || "$URL" =~ youtu\.be/ || "$URL" =~ youtube\.com/shorts ]]; then CONTENT_TYPE="youtube" # Check for PDF elif [[ "$URL" =~ \.pdf$ ]]; then CONTENT_TYPE="pdf" # Check if URL returns PDF elif curl -sI "$URL" | grep -i "Content-Type: application/pdf" > /dev/null; then CONTENT_TYPE="pdf" # Default to article else CONTENT_TYPE="article" fi echo "๐Ÿ“ Detected: $CONTENT_TYPE" ``` ### Step 2: Extract Content (by Type) #### YouTube Video ```bash # Use youtube-transcript skill workflow echo "๐Ÿ“บ Extracting YouTube transcript..." # 1. Check for yt-dlp if ! command -v yt-dlp &> /dev/null; then echo "Installing yt-dlp..." brew install yt-dlp fi # 2. Get video title VIDEO_TITLE=$(yt-dlp --print "%(title)s" "$URL" | tr '/' '_' | tr ':' '-' | tr '?' '' | tr '"' '') # 3. Download transcript yt-dlp --write-auto-sub --skip-download --sub-langs en --output "temp_transcript" "$URL" # 4. Convert to clean text (deduplicate) python3 -c " import sys, re seen = set() vtt_file = 'temp_transcript.en.vtt' try: with open(vtt_file, 'r') as f: for line in f: line = line.strip() if line and not line.startswith('WEBVTT') and not line.startswith('Kind:') and not line.startswith('Language:') and '-->' not in line: clean = re.sub('<[^>]*>', '', line) clean = clean.replace('&', '&').replace('>', '>').replace('<', '<') if clean and clean not in seen: print(clean) seen.add(clean) except FileNotFoundError: print('Error: Could not find transcript file', file=sys.stderr) sys.exit(1) " > "${VIDEO_TITLE}.txt" # 5. Cleanup rm -f temp_transcript.en.vtt CONTENT_FILE="${VIDEO_TITLE}.txt" echo "โœ“ Saved transcript: $CONTENT_FILE" ``` #### Article/Blog Post ```bash # Use article-extractor skill workflow echo "๐Ÿ“„ Extracting article content..." # 1. Check for extraction tools if command -v reader &> /dev/null; then TOOL="reader" elif command -v trafilatura &> /dev/null; then TOOL="trafilatura" else TOOL="fallback" fi echo "Using: $TOOL" # 2. Extract based on tool case $TOOL in reader) reader "$URL" > temp_article.txt ARTICLE_TITLE=$(head -n 1 temp_article.txt | sed 's/^# //') ;; trafilatura) METADATA=$(trafilatura --URL "$URL" --json) ARTICLE_TITLE=$(echo "$METADATA" | python3 -c "import json, sys; print(json.load(sys.stdin).get('title', 'Article'))") trafilatura --URL "$URL" --output-format txt --no-comments > temp_article.txt ;; fallback) ARTICLE_TITLE=$(curl -s "$URL" | grep -oP '\K[^<]+' | head -n 1) ARTICLE_TITLE=${ARTICLE_TITLE%% - *} curl -s "$URL" | python3 -c " from html.parser import HTMLParser import sys class ArticleExtractor(HTMLParser): def __init__(self): super().__init__() self.content = [] self.skip_tags = {'script', 'style', 'nav', 'header', 'footer', 'aside', 'form'} self.in_content = False def handle_starttag(self, tag, attrs): if tag not in self.skip_tags and tag in {'p', 'article', 'main'}: self.in_content = True def handle_data(self, data): if self.in_content and data.strip(): self.content.append(data.strip()) def get_content(self): return '\n\n'.join(self.content) parser = ArticleExtractor() parser.feed(sys.stdin.read()) print(parser.get_content()) " > temp_article.txt ;; esac # 3. Clean filename FILENAME=$(echo "$ARTICLE_TITLE" | tr '/' '-' | tr ':' '-' | tr '?' '' | tr '"' '' | cut -c 1-80 | sed 's/ *$//') CONTENT_FILE="${FILENAME}.txt" mv temp_article.txt "$CONTENT_FILE" echo "โœ“ Saved article: $CONTENT_FILE" ``` #### PDF Document ```bash # Download and extract PDF echo "๐Ÿ“‘ Downloading PDF..." # 1. Download PDF PDF_FILENAME=$(basename "$URL") curl -L -o "$PDF_FILENAME" "$URL" # 2. Extract text using pdftotext (if available) if command -v pdftotext &> /dev/null; then pdftotext "$PDF_FILENAME" temp_pdf.txt CONTENT_FILE="${PDF_FILENAME%.pdf}.txt" mv temp_pdf.txt "$CONTENT_FILE" echo "โœ“ Extracted text from PDF: $CONTENT_FILE" # Optionally keep PDF echo "Keep original PDF? (y/n)" read -r KEEP_PDF if [[ ! "$KEEP_PDF" =~ ^[Yy]$ ]]; then rm "$PDF_FILENAME" fi else # No pdftotext available echo "โš ๏ธ pdftotext not found. PDF downloaded but not extracted." echo " Install with: brew install poppler" CONTENT_FILE="$PDF_FILENAME" fi ``` ### Step 3: Create Ship-Learn-Next Action Plan **IMPORTANT**: Always create an action plan after extracting content. ```bash # Read the extracted content CONTENT_FILE="[from previous step]" # Invoke ship-learn-next skill logic: # 1. Read the content file # 2. Extract core actionable lessons # 3. Create 5-rep progression plan # 4. Save as: Ship-Learn-Next Plan - [Quest Title].md # See ship-learn-next/SKILL.md for full details ``` **Key points for plan creation:** - Extract actionable lessons (not just summaries) - Define a specific 4-8 week quest - Create Rep 1 (shippable this week) - Design Reps 2-5 (progressive iterations) - Save plan to markdown file - Use format: `Ship-Learn-Next Plan - [Brief Quest Title].md` ### Step 4: Present Results Show user: ``` โœ… Tapestry Workflow Complete! ๐Ÿ“ฅ Content Extracted: โœ“ [Content type]: [Title] โœ“ Saved to: [filename.txt] โœ“ [X] words extracted ๐Ÿ“‹ Action Plan Created: โœ“ Quest: [Quest title] โœ“ Saved to: Ship-Learn-Next Plan - [Title].md ๐ŸŽฏ Your Quest: [One-line summary] ๐Ÿ“ Rep 1 (This Week): [Rep 1 goal] When will you ship Rep 1? ``` ## Complete Tapestry Workflow Script ```bash #!/bin/bash # Tapestry: Extract content + create action plan # Usage: tapestry <URL> URL="$1" if [ -z "$URL" ]; then echo "Usage: tapestry <URL>" exit 1 fi echo "๐Ÿงต Tapestry Workflow Starting..." echo "URL: $URL" echo "" # Step 1: Detect content type if [[ "$URL" =~ youtube\.com/watch || "$URL" =~ youtu\.be/ || "$URL" =~ youtube\.com/shorts ]]; then CONTENT_TYPE="youtube" elif [[ "$URL" =~ \.pdf$ ]] || curl -sI "$URL" | grep -iq "Content-Type: application/pdf"; then CONTENT_TYPE="pdf" else CONTENT_TYPE="article" fi echo "๐Ÿ“ Detected: $CONTENT_TYPE" echo "" # Step 2: Extract content case $CONTENT_TYPE in youtube) echo "๐Ÿ“บ Extracting YouTube transcript..." # [YouTube extraction code from above] ;; article) echo "๐Ÿ“„ Extracting article..." # [Article extraction code from above] ;; pdf) echo "๐Ÿ“‘ Downloading PDF..." # [PDF extraction code from above] ;; esac echo "" # Step 3: Create action plan echo "๐Ÿš€ Creating Ship-Learn-Next action plan..." # [Plan creation using ship-learn-next skill] echo "" echo "โœ… Tapestry Workflow Complete!" echo "" echo "๐Ÿ“ฅ Content: $CONTENT_FILE" echo "๐Ÿ“‹ Plan: Ship-Learn-Next Plan - [title].md" echo "" echo "๐ŸŽฏ Next: Review your action plan and ship Rep 1!" ``` ## Error Handling ### Common Issues: **1. Unsupported URL type** - Try article extraction as fallback - If fails: "Could not extract content from this URL type" **2. No content extracted** - Check if URL is accessible - Try alternate extraction method - Inform user: "Extraction failed. URL may require authentication." **3. Tools not installed** - Auto-install when possible (yt-dlp, reader, trafilatura) - Provide install instructions if auto-install fails - Use fallback methods when available **4. Empty or invalid content** - Verify file has content before creating plan - Don't create plan if extraction failed - Show preview to user before planning ## Best Practices - โœ… Always show what was detected ("๐Ÿ“ Detected: youtube") - โœ… Display progress for each step - โœ… Save both content file AND plan file - โœ… Show preview of extracted content (first 10 lines) - โœ… Create plan automatically (don't ask) - โœ… Present clear summary at end - โœ… Ask commitment question: "When will you ship Rep 1?" ## Usage Examples ### Example 1: YouTube Video (using "tapestry") ``` User: tapestry https://www.youtube.com/watch?v=dQw4w9WgXcQ Claude: ๐Ÿงต Tapestry Workflow Starting... ๐Ÿ“ Detected: youtube ๐Ÿ“บ Extracting YouTube transcript... โœ“ Saved transcript: Never Gonna Give You Up.txt ๐Ÿš€ Creating action plan... โœ“ Quest: Master Video Production โœ“ Saved plan: Ship-Learn-Next Plan - Master Video Production.md โœ… Complete! When will you ship Rep 1? ``` ### Example 2: Article (using "weave") ``` User: weave https://example.com/how-to-build-saas Claude: ๐Ÿงต Tapestry Workflow Starting... ๐Ÿ“ Detected: article ๐Ÿ“„ Extracting article... โœ“ Using reader (Mozilla Readability) โœ“ Saved article: How to Build a SaaS.txt ๐Ÿš€ Creating action plan... โœ“ Quest: Build a SaaS MVP โœ“ Saved plan: Ship-Learn-Next Plan - Build a SaaS MVP.md โœ… Complete! When will you ship Rep 1? ``` ### Example 3: PDF (using "help me plan") ``` User: help me plan https://example.com/research-paper.pdf Claude: ๐Ÿงต Tapestry Workflow Starting... ๐Ÿ“ Detected: pdf ๐Ÿ“‘ Downloading PDF... โœ“ Downloaded: research-paper.pdf โœ“ Extracted text: research-paper.txt ๐Ÿš€ Creating action plan... โœ“ Quest: Apply Research Findings โœ“ Saved plan: Ship-Learn-Next Plan - Apply Research Findings.md โœ… Complete! When will you ship Rep 1? ``` ## Dependencies This skill orchestrates the other skills, so requires: **For YouTube:** - yt-dlp (auto-installed) - Python 3 (for deduplication) **For Articles:** - reader (npm) OR trafilatura (pip) - Falls back to basic curl if neither available **For PDFs:** - curl (built-in) - pdftotext (optional - from poppler package) - Install: `brew install poppler` (macOS) - Install: `apt install poppler-utils` (Linux) **For Planning:** - No additional requirements (uses built-in tools) ## Philosophy **Tapestry weaves learning content into action.** The unified workflow ensures you never just consume content - you always create an implementation plan. This transforms passive learning into active building. Extract โ†’ Plan โ†’ Ship โ†’ Learn โ†’ Next. That's the Tapestry way.