--- name: claims-validator description: Validate documentation for unsupported claims, made-up metrics, and unverifiable statements. Use when relevant to the task. --- # claims-validator Validate documentation for unsupported claims, made-up metrics, and unverifiable statements. ## Triggers - "check for unsupported claims" - "validate claims" - "review for BS" - "check metrics" - "verify claims in this document" - "find made-up stats" ## Purpose This skill identifies statements that make claims without evidence, including: - Performance metrics without benchmarks or data - Time/cost estimates without basis - Percentage claims without citation - Comparative statements without baselines - Features described as implemented that don't exist - Marketing superlatives presented as facts ## Behavior When triggered, this skill: 1. **Scans for metric claims**: - Percentage improvements ("40% faster", "reduces by 60%") - Time estimates ("saves 2-3 hours", "in minutes not hours") - Cost projections ("$50-150/month", "ROI of 3x") - Performance numbers ("99x faster", "sub-millisecond") 2. **Identifies unsupported comparatives**: - "faster than", "better than", "more efficient" - "best", "leading", "revolutionary", "game-changing" - "comprehensive", "complete", "full-featured" 3. **Checks for feature claims**: - Commands or flags mentioned that don't exist in codebase - Features described in present tense that aren't implemented - Integration claims without actual integration code 4. **Validates citations**: - Claims that reference data should have sources - Benchmarks should link to methodology - Statistics should be reproducible 5. **Generates report**: - List each claim found - Classification (metric, comparative, feature, cost) - Recommendation (remove, add citation, verify, rephrase) ## Claim Categories ### Metrics Without Data ```markdown # Flagged "Time Saved: 92-96% (9-15 hours → 45-60 minutes)" "99x faster routing" "45x cache speedup" # Problem No benchmark data, methodology, or reproducible test # Fix Remove claim, or add: "Based on [benchmark/test], measured [how]" ``` ### Cost Estimates Without Basis ```markdown # Flagged "Budget $20-50/month for moderate use" "Light usage: ~$10-20/month" "Enterprise teams may see $100-500+/month" # Problem No actual usage data, varies wildly by use case # Fix Remove specific numbers, or link to pricing calculator/methodology ``` ### Time Estimates Without Data ```markdown # Flagged "Deploy Full SDLC Framework (2 Minutes)" "5 minutes, replaces 2-4 hours manual work" "campaign setup from 2-3 weeks → 1 week" # Problem No measurement, varies by project complexity # Fix Remove time claims, describe what it does instead ``` ### Comparative Claims Without Baseline ```markdown # Flagged "faster than manual processes" "more efficient than traditional approaches" "better than existing solutions" # Problem No specific comparison, no baseline defined # Fix Remove comparison, or specify exactly what's being compared ``` ### Feature Claims for Unimplemented Features ```markdown # Flagged "aiwg -migrate-workspace # Optional migration tool" "Run 'config-validator --fix' to apply automated fixes" # Problem Command doesn't exist in codebase # Fix Remove until implemented, or mark as "Planned:" ``` ### Marketing Superlatives ```markdown # Flagged "comprehensive", "revolutionary", "game-changing" "best-in-class", "industry-leading", "cutting-edge" "seamless", "effortless", "zero-friction" # Problem Subjective claims that can't be verified # Fix Replace with specific, factual descriptions ``` ## Validation Report Format ```markdown # Claims Validation Report **Document**: README.md **Date**: 2025-12-09 **Claims Found**: 12 **Issues**: 8 ## Summary | Category | Found | Unsupported | Action Needed | |----------|-------|-------------|---------------| | Metrics | 5 | 4 | Remove or cite | | Time estimates | 3 | 3 | Remove | | Cost estimates | 2 | 2 | Remove | | Comparatives | 1 | 1 | Rephrase | | Features | 1 | 1 | Remove (not implemented) | ## Issues ### 1. Unsupported Metric **Line 204**: "Time Savings: 20-98% reduction across 5 core use cases" **Problem**: No benchmark data or methodology **Action**: Remove claim ### 2. Cost Estimate Without Data **Line 1180**: "Light usage: ~$10-20/month" **Problem**: No actual usage data to support estimate **Action**: Remove specific numbers ### 3. Non-existent Feature **Line 583**: "aiwg -migrate-workspace" **Problem**: Command not implemented **Action**: Remove until implemented ## Recommendations 1. **Remove all percentage-based time savings claims** 2. **Remove cost estimates** - direct users to measure their own usage 3. **Remove non-existent commands** - only document what exists 4. **Replace superlatives with specifics** - say what it does, not how good it is ``` ## Usage Examples ### Validate a Document ``` User: "Check README.md for unsupported claims" Output: "Claims Validation: README.md Found 8 unsupported claims: Metrics (4): - Line 204: '20-98% reduction' - no data - Line 362: '56-63% time saved' - no data - Line 588: '99x faster routing' - no benchmark - Line 589: '45x cache speedup' - no benchmark Time Estimates (2): - Line 187: '5 minutes, replaces 2-4 hours' - unmeasured - Line 366: '2 Minutes' in heading - varies by project Non-existent Features (2): - Line 583: 'aiwg -migrate-workspace' - not implemented - Line 651: 'aiwg -rollback-workspace' - not implemented Recommendation: Remove these claims. Describe what features do, not how much time/money they save." ``` ### Quick Check ``` User: "Any BS in this section?" Output: "Found 2 issues: 1. 'Revolutionary approach' - marketing speak, be specific 2. '10x productivity gain' - no measurement Suggest: Replace with factual descriptions of functionality." ``` ## Integration This skill complements: - **Voice Framework**: Voice defines *how* to write, claims-validator checks *what* you claim - **config-validator**: Validates config files, claims-validator validates prose claims ## What This Skill Does NOT Flag - Factual descriptions of features that exist - Documented benchmarks with methodology - Qualified statements ("may vary", "depending on", "in our testing") - User testimonials clearly attributed - Comparative claims with specific baselines cited ## Output Location - Validation reports: `.aiwg/reports/claims-validation.md`