--- name: estimate type: workflow description: "Produces time and complexity estimates for features, tasks, or sprints using story points, t-shirt sizing, or day estimates. Use when the user asks for an estimate, wants to size a feature, or mentions estimation or planning poker." argument-hint: "[task-description]" user-invocable: true allowed-tools: Read, Glob, Grep effort: 1 when_to_use: "When estimating task effort, complexity, or duration for planning purposes" --- When this skill is invoked: 1. **Read the task description** from the argument. If the description is too vague to estimate meaningfully, ask for clarification before proceeding. 2. **Read CLAUDE.md** for project context: tech stack, coding standards, architectural patterns, and any estimation guidelines. 3. **Read relevant design documents** from `design/docs/` if the task relates to a documented feature or system. 4. **Scan the codebase** to understand the systems affected by this task: - Identify files and modules that would need to change - Assess the complexity of those files (size, dependency count, cyclomatic complexity) - Identify integration points with other systems - Check for existing test coverage in the affected areas 5. **Read past sprint data** from `production/sprints/` if available: - Look for similar completed tasks and their actual effort - Calculate historical velocity (planned vs actual) - Identify any estimation bias patterns (consistently over or under) 6. **Analyze the following factors**: **Code Complexity**: - Lines of code in affected files - Number of dependencies and coupling level - Whether this touches core/engine code vs leaf/feature code - Whether existing patterns can be followed or new patterns are needed **Scope**: - Number of systems touched - New code vs modification of existing code - Amount of new test coverage required - Data migration or configuration changes needed **Risk**: - New technology or unfamiliar libraries - Unclear or ambiguous requirements - Dependencies on unfinished work - Cross-system integration complexity - Performance sensitivity 7. **Generate the estimate**: ```markdown ## Task Estimate: [Task Name] Generated: [Date] ### Task Description [Restate the task clearly in 1-2 sentences] ### Complexity Assessment | Factor | Assessment | Notes | |--------|-----------|-------| | Systems affected | [List] | [Core, business, UI, etc.] | | Files likely modified | [Count] | [Key files listed below] | | New code vs modification | [Ratio, e.g., 70% new / 30% modification] | | | Integration points | [Count] | [Which systems interact] | | Test coverage needed | [Low / Medium / High] | [Unit, integration, manual] | | Existing patterns available | [Yes / Partial / No] | [Can follow existing code or new ground] | **Key files likely affected:** - `[path/to/file1]` -- [what changes here] - `[path/to/file2]` -- [what changes here] - `[path/to/file3]` -- [what changes here] ### Effort Estimate | Scenario | Days | Assumption | |----------|------|------------| | Optimistic | [X] | Everything goes right, no surprises, requirements are clear | | Expected | [Y] | Normal pace, minor issues, one round of review feedback | | Pessimistic | [Z] | Significant unknowns surface, blocked for a day, requirements change | **Recommended budget: [Y days]** [If historical data is available: "Based on [N] similar tasks that averaged [X] days actual vs [Y] days estimated, a [correction factor] adjustment has been applied."] ### Confidence: [High / Medium / Low] **High** -- Clear requirements, familiar systems, follows existing patterns, similar tasks completed before. **Medium** -- Some unknowns, touches moderately complex systems, partial precedent from previous work. **Low** -- Significant unknowns, new technology, unclear requirements, or cross-cutting concerns across many systems. [Explain which factors drive the confidence level for this specific task.] ### Risk Factors | Risk | Likelihood | Impact | Mitigation | |------|-----------|--------|------------| | [Specific risk] | [High/Med/Low] | [Days added if realized] | [How to reduce] | | [Another risk] | [Likelihood] | [Impact] | [Mitigation] | ### Dependencies | Dependency | Status | Impact if Delayed | |-----------|--------|-------------------| | [What must be done first] | [Done / In Progress / Not Started] | [How it affects this task] | ### Suggested Breakdown | # | Sub-task | Estimate | Notes | |---|----------|----------|-------| | 1 | [Research / spike] | [X days] | [If unknowns need investigation first] | | 2 | [Core implementation] | [X days] | [The main work] | | 3 | [Integration with system X] | [X days] | [Connecting to existing code] | | 4 | [Testing and validation] | [X days] | [Writing tests, manual verification] | | 5 | [Code review and iteration] | [X days] | [Review feedback, fixes] | | | **Total** | **[Y days]** | | ### Historical Comparison [If similar tasks exist in sprint history:] | Similar Task | Estimated | Actual | Relevant Difference | |-------------|-----------|--------|-------------------| | [Past task 1] | [X days] | [Y days] | [What makes it similar/different] | | [Past task 2] | [X days] | [Y days] | [What makes it similar/different] | ### Notes and Assumptions - [Key assumption that affects the estimate] - [Another assumption] - [Any caveats about scope boundaries -- what is included vs excluded] - [Recommendations: e.g., "Consider a spike first if requirement X is unclear"] ``` 8. **Output the estimate** to the user with a brief summary: recommended budget, confidence level, and the single biggest risk factor. ### Guidelines - Always give a range (optimistic / expected / pessimistic), never a single number. Single-point estimates create false precision. - The recommended budget should be the expected estimate, not the optimistic one. Padding is not dishonest -- it is realistic. - If confidence is Low, recommend a time-boxed spike or prototype before committing to the full estimate. - Be explicit about what is included and excluded. Scope ambiguity is the most common source of estimation error. - Round to half-day increments. Estimating in hours implies false precision for tasks longer than a day. - If the task is too large to estimate confidently (more than 10 days expected), recommend breaking it into smaller tasks and estimating those individually. - Do not pad estimates silently. If risk exists, call it out explicitly in the risk factors section so the team can decide how to handle it. ## Protocol - **Question**: Reads task description from argument; asks for clarification if scope is too vague to estimate - **Options**: Skip — single analysis path - **Decision**: Skip — estimate is advisory - **Draft**: Full estimate shown in conversation only - **Approval**: Skip — read-only; no files written ## Output Deliver exactly: - **Estimate**: X days (rounded to half-day increments) - **Confidence**: High / Medium / Low with one-sentence reason - **Risk factors**: explicit list — no silent padding - **Recommendation**: proceed / run a spike first / break into smaller tasks