--- name: benchmark provider: codex description: Compare the current design against quality baselines. Shows where the design ranks relative to typical AI output, typical professional output, and exceptional output. user-invokable: true --- # /benchmark — Quality Baseline Comparison Run /score and compare results against three baselines: **AI Default Output (typical score: 35-50)** - Inter font, purple gradient, nested cards - No type scale, arbitrary spacing, minimal motion - No accessibility considerations **Professional Baseline (typical score: 70-80)** - Intentional font choice, cohesive palette - Consistent spacing, responsive design - Basic accessibility compliance **Exceptional Standard (typical score: 90+)** - Mathematical type scale, computed color system - Perfect grid alignment, purposeful motion - Full WCAG compliance, design system tokens Show the current score against all three baselines. Identify the biggest gap between current and the next tier. Suggest the single highest-impact fix.