# Jobs To Be Done ## Engineering Leaders: Define the Engineering Standard **Trigger:** Two managers disagree on what 'senior' means — neither can point to a written definition. **Big Hire:** Help me define what good engineering means so roles have clear, defensible expectations. → **Map** **Little Hire:** Help me define or update a role's expectations knowing structural mistakes get caught. → **Map** **Competes With:** tribal knowledge; borrowed frameworks; per-manager intuition; tolerating the ambiguity. **Forces:** - **Push:** Inconsistent expectations produce contested promotions. - **Pull:** A shared definition that makes quality visible and discussable. - **Habit:** Each manager carries a private mental model of what 'good' means. - **Anxiety:** Getting definitions wrong feels worse than having none. **Fired When:** definitions drift from practice; or a reorg removes the mandate. ## Engineering Leaders: Measure Engineering Outcomes **Trigger:** Quarterly review is due and the only data is ticket counts — singling out individuals, not illuminating the system. **Big Hire:** Help me demonstrate engineering progress without making individuals feel surveilled. → **Landmark** **Little Hire:** Help me tell whether culture investments are working before the next budget cycle. → **Landmark** **Competes With:** sprint velocity; ticket counts; annual reviews; asking managers 'how's the team doing?'; not measuring. **Forces:** - **Push:** Pressure to show ROI with no way to separate signal from noise. - **Pull:** System-level trends that show direction without naming individuals. - **Habit:** Equating velocity and throughput with team health. - **Anxiety:** Measurement feels like surveillance regardless of intent. **Fired When:** metrics get used punitively; or leadership turnover deprioritizes measurement. ## Engineering Leaders: Staff Teams to Succeed **Trigger:** A post-mortem surfaces the same skill gap that caused the last incident — nobody saw it before staffing. **Big Hire:** Help me make staffing decisions I can defend by seeing what each role requires. → **Summit** **Little Hire:** Help me spot capability gaps and check whether a candidate fills them. → **Summit** **Competes With:** résumé screening; gut feel; copying a team that seems to work; accepting gaps as inevitable. **Forces:** - **Push:** Capability gaps appear as incidents, never in advance. - **Pull:** Confidence that a staffing change strengthens the team as a system. - **Habit:** Hiring on résumés and gut rather than team composition analysis. - **Anxiety:** Modeling human capability feels reductive. **Fired When:** the skill taxonomy drifts from actual roles; or a hiring freeze removes staffing pressure. ## Empowered Engineers: Be Prepared and Productive **Trigger:** Walking into a meeting cold because context was scattered across email, Slack, and last week's notes. **Big Hire:** Help me keep track of people, projects, and threads without depending on memory. → **Outpost** **Little Hire:** Help me walk into every meeting already oriented. → **Outpost** **Competes With:** morning email scan; re-reading calendar invites; personal notes; asking colleagues for context. **Forces:** - **Push:** Context-switching compounds; starting cold wastes everyone's time. - **Pull:** Continuous awareness without continuous effort — context assembled. - **Habit:** Morning routines and tracking systems that feel 'good enough.' - **Anxiety:** Delegating awareness to a system feels like losing control. **Fired When:** the briefing becomes noise; or simpler tools absorb the preparation. ## Empowered Engineers: Get Judgment Grounded in the Standard **Trigger:** The promotion conversation ended with 'not yet' but no specifics, and agent output gets reviewed line by line because nobody can tell whether it meets the bar. **Big Hire:** Help me get career guidance and output review grounded in my organization's standard, not impressions or generic advice. → **Guide** **Little Hire:** Help me ask a growth question or get a second opinion on a deliverable before approving it. → **Guide** **Competes With:** annual reviews; mentor conversations; reviewing every line; trusting uncritically; hoping the next project makes readiness obvious. **Forces:** - **Push:** Career conversations and agent review both run on subjective judgment with no shared evidence base. - **Pull:** A second opinion grounded in the standard, whether on a career or a deliverable. - **Habit:** Waiting for annual feedback and reviewing every line by hand. - **Anxiety:** Structured assessment might confirm being further behind than assumed. **Fired When:** assessments disconnect from promotion and approval decisions; or a reorg makes the standard feel irrelevant. ## Empowered Engineers: See What's Expected of Humans and Agents **Trigger:** 'Meets expectations' on the review form has no definition anyone can point to, and an agent's work was just rejected for following generic practices instead of the organization's. **Big Hire:** Help me see what's expected of me at my level and of the agents I configure, from one shared standard. → **Pathway** **Little Hire:** Help me generate a job definition, a level comparison, or an agent profile from discipline, track, and level. → **Pathway** **Competes With:** asking peers; inferring from manager comments; custom system prompts; copying another team's config; living with ambiguity. **Forces:** - **Push:** Ambiguous expectations create review anxiety for humans and rejected work from agents. - **Pull:** One standard that produces role definitions for people and profiles for agents. - **Habit:** Inferring expectations from peer behavior and hand-tuning agent prompts per team. - **Anxiety:** Looking up the standard might confirm gaps easier left ignored. **Fired When:** defined expectations diverge from actual promotions; or configuration overhead exceeds the quality gain. ## Platform Builders: Build Agent-Capable Systems **Trigger:** Building an agent that needs structured knowledge or typed contracts, with no way to prove the next change improves outcomes. **Big Hire:** Help me give humans and agents shared capabilities through the same interface, with tooling to prove changes improved outcomes. → **Gear** **Little Hire:** Help me query graphs, run vector search, expose services as MCP tools, or chart agent metrics without building infrastructure. → **Gear** **Competes With:** hand-rolled retrieval; bespoke wrappers; custom eval harnesses; monolithic platforms; shipping without measuring. **Forces:** - **Push:** Every agent project rebuilds plumbing, and improvements are claimed but never measured. - **Pull:** Interoperable libraries and reproducible evals that work standalone or together. - **Habit:** Copy-pasting infrastructure and assuming changes helped because the prompt looks better. - **Anxiety:** Shared libraries mean losing control over the interface contract. **Fired When:** the libraries diverge from how agents consume them; or eval overhead exceeds the value of knowing. ## Teams Using Agents: Run a Continuously Improving Agent Team **Trigger:** Agents are shipping work but nobody can tell whether the team is getting better — the only feedback loop is reading every diff. **Big Hire:** Help me run an autonomous, continuously improving development team that plans, ships, studies its own traces, and acts on findings. → **Kata** **Little Hire:** Help me onboard a Kata installation that runs the Plan-Do-Study-Act loop without per-team prompt engineering. → **Kata** **Competes With:** bespoke per-agent system prompts; manual orchestration scripts; not measuring agent outcomes; abandoning agent investment after a failed pilot. **Forces:** - **Push:** Agent regressions are silent until users complain. - **Pull:** A closed loop that surfaces what improved and what regressed, grounded in evidence. - **Habit:** Treating each agent run as a one-off rather than an iteration. - **Anxiety:** Autonomy might amplify bad patterns faster than humans can intervene. **Fired When:** the autonomous loop becomes harder to operate than direct prompting; or organizational policy bans autonomous agent execution.