Sprint Velocity Analysis

Analyze sprint velocity data to produce an honest engineering team health report. The goal is not to generate optimistic-looking charts — it is to surface delivery patterns, identify dysfunction early, and give the team and their manager actionable recommendations. Look for: velocity trends (improving, declining, flat, erratic), story point calibration consistency, carry-over patterns that indicate chronic over-commitment, and capacity-related signals. Produce text-based trend visualizations, a health diagnosis, and specific improvement recommendations with measurable targets.

Required Inputs

Ask for these if not already provided:

Sprint history — for each sprint: sprint name/number, committed story points, completed story points, and number of items carried over to next sprint; ideally 6–8 sprints minimum
Team size and any changes — current team size and any additions or departures during the data window
Known disruptions — holidays, company all-hands, on-call incidents, or other events that affected specific sprints
Cycle time data (optional) — if available, p50 and p90 cycle time per sprint (time from start to done)
Definition of Done — what "completed" means for this team (merged to main? deployed to prod? accepted by PO?)

If cycle time data is not provided, omit that section and note it as a recommended data source to add.

Output Format

Sprint Velocity Analysis: [Team Name]

Analysis period: Sprint [N] through Sprint [N+7] ([Date range]) Team size: [X engineers] ([note any changes during period]) Report date: [Date] Data source: [Where this data came from — Jira, Linear, spreadsheet, etc.]

Velocity Trend

Raw Data

Sprint	Committed	Completed	Completion Rate	Carried Over	Notes
[Sprint N]	[X pts]	[X pts]	[X%]	[X pts / X items]	[disruption or context]
[Sprint N+1]	[X pts]	[X pts]	[X%]	[X pts / X items]
[Sprint N+2]	[X pts]	[X pts]	[X%]	[X pts / X items]
[Sprint N+3]	[X pts]	[X pts]	[X%]	[X pts / X items]
[Sprint N+4]	[X pts]	[X pts]	[X%]	[X pts / X items]
[Sprint N+5]	[X pts]	[X pts]	[X%]	[X pts / X items]
[Sprint N+6]	[X pts]	[X pts]	[X%]	[X pts / X items]
[Sprint N+7]	[X pts]	[X pts]	[X%]	[X pts / X items]
Average	[X pts]	[X pts]	[X%]	[X pts]

Velocity Chart (Completed Points per Sprint)

Points
  60 |
  55 |          ●
  50 |    ●           ●
  45 | ●        ●          ●
  40 |               ●          ●
  35 |
  30 |
     +--+--+--+--+--+--+--+--
      N N+1 N+2 N+3 N+4 N+5 N+6 N+7
      Sprint

  ● = Completed points   — = Average ([X pts])

Generate this chart using ASCII characters based on the actual data provided. Scale the Y-axis to the data range. Plot completed (not committed) points. Mark the average as a dashed line.

Trend Diagnosis

Metric	Value	Interpretation
Average velocity	[X pts/sprint]	[Baseline for planning]
Velocity std deviation	[±X pts]	[Low < 15% of avg = stable; High > 25% = erratic]
Trend direction	[Improving / Flat / Declining / Erratic]	[3-sprint trailing average vs. 3-sprint leading average]
Average completion rate	[X%]	[Healthy: 80–95%; < 75% = chronic over-commitment]
Carry-over rate	[X% of committed points carried over per sprint]	[Healthy: < 15%; > 25% = systemic issue]
Sprints with completion rate < 75%	[X of 8 sprints]	[> 3 of 8 = structural problem, not noise]

Story Point Calibration

Story points are only useful if they are applied consistently. Look for these calibration signals in the data:

Signal	Observed	Interpretation
High variance in velocity despite stable team size	[Yes / No]	Suggests inconsistent estimation — same effort scored differently week to week
Consistent over-commitment (committed >> completed)	[Yes / No — by avg X pts per sprint]	Team is sandbagging estimates or ignoring historical capacity
Consistent under-commitment (completed >> committed by > 20%)	[Yes / No]	Team is over-padding estimates or pulling in unplanned work frequently
Frequent large items (> 13 pts) in carry-over	[Yes / No]	Items are too large to estimate reliably — need better decomposition
Velocity cliff after team change	[Yes / No — Sprint N+X]	Team did not re-baseline capacity after composition changed

Calibration verdict: [Well-calibrated / Needs recalibration / Severely uncalibrated — one sentence explanation tied to the signals above]

If recalibration is needed: [Specific recommendation — e.g., "Run a calibration session using the last 20 completed items, re-score them as a team, and use the resulting relative sizes to anchor future estimates."]

Carry-Over Pattern Analysis

Carry-over is the most reliable leading indicator of commitment reliability problems.

Sprint	Carried-Over Items	Common Themes in Carry-Over
[Sprint N]	[X items / X pts]	[Technical debt, dependency blocked, scoped wrong, etc.]
[Sprint N+1]	[X items / X pts]	[Theme]
[Sprint N+2]	[X items / X pts]	[Theme]

Carry-over root causes identified:

[Root cause 1: e.g., "5 of 12 carry-overs were blocked on a third-party API integration — external dependency, not estimation failure"]
[Root cause 2: e.g., "4 of 12 carry-overs were items estimated at 8+ points that were later found to be 2–3x larger than expected"]
[Root cause 3: e.g., "3 of 12 carry-overs were interruptions from on-call incidents consuming unplanned capacity"]

Capacity Utilization

Sprint	Team Size	Available Capacity (pts)	Committed	Utilization %	Disruptions
[Sprint N]	[X engineers]	[X pts]	[X pts]	[X%]	[Holiday / incident / none]
[Sprint N+1]	[X engineers]	[X pts]	[X pts]	[X%]

Capacity calculation used: [X engineers × Y pts/person/sprint = Z pts available. Adjust: if team capacity changed during the window, note which sprints used which team size.]

Average utilization: [X%] Utilization interpretation: [< 70% = team is under-loaded or over-padding | 70–90% = healthy range | > 90% = no slack for unplanned work — fragile]

Health Diagnosis

Dimension	Score	Evidence	Priority
Delivery predictability	[Green / Yellow / Red]	[Average completion rate X%, std dev Y pts]	[High / Med / Low]
Commitment accuracy	[Green / Yellow / Red]	[Team over-commits by avg X pts/sprint]
Estimation consistency	[Green / Yellow / Red]	[Velocity std dev ±X pts, calibration verdict]
Carry-over hygiene	[Green / Yellow / Red]	[X% carry-over rate, root causes]
Capacity management	[Green / Yellow / Red]	[Avg utilization X%, disruption handling]
Trend direction	[Green / Yellow / Red]	[Trailing 3-sprint avg vs. leading 3-sprint avg]

Scoring guide: Green = operating within healthy range; Yellow = marginal — watch closely or single-sprint anomaly; Red = chronic issue requiring active intervention.

Overall health: [Green / Yellow / Red] — [One sentence summary: "The team delivers consistently at X pts/sprint but chronic over-commitment is eroding morale and creating a misleading picture for stakeholders."]

Blocker Frequency Analysis

If blocker data was provided, complete this section. If not, note it as a recommended tracking addition.

Blocker Category	Frequency (last 8 sprints)	Avg Days Blocked	Impact (pts delayed)
External dependency	[X occurrences]	[X days]	[X pts]
Technical debt / rework	[X occurrences]	[X days]	[X pts]
Unclear requirements	[X occurrences]	[X days]	[X pts]
On-call interruptions	[X occurrences]	[X days]	[X pts]
Environment / tooling	[X occurrences]	[X days]	[X pts]

Top blocker to address: [Name the single highest-impact blocker category and what addressing it would mean for velocity.]

Improvement Recommendations

Provide 3 specific recommendations ordered by expected impact. Each recommendation must include a measurable success target and implementation steps.

Recommendation 1: [Title]

Problem it addresses: [Which health dimension is Red or Yellow, and what the data shows]

What to do:

[Specific action step — concrete enough that a tech lead can assign it]
[Next step]
[Next step]

Who owns it: [Tech lead / Engineering manager / Whole team] When to start: [This sprint / Next sprint / Within 2 weeks]

Measurable target: [e.g., "Carry-over rate drops below 15% within 3 sprints" or "Completion rate above 80% for 4 consecutive sprints"]

How to know it's working: [Leading indicator to watch before the outcome metric improves — e.g., "Carry-over items decreasing sprint-over-sprint even before the target is hit"]

Recommendation 2: [Title]

Problem it addresses: [Health dimension and evidence]

What to do:

[Step]
[Step]
[Step]

Who owns it: [Role] When to start: [Timing]

Measurable target: [Specific metric and timeframe]

How to know it's working: [Leading indicator]

Recommendation 3: [Title]

Problem it addresses: [Health dimension and evidence]

What to do:

[Step]
[Step]

Who owns it: [Role] When to start: [Timing]

Measurable target: [Specific metric and timeframe]

How to know it's working: [Leading indicator]

Next-Sprint Capacity Forecast

Next sprint: [Sprint N+8] Known team size: [X engineers] Known capacity reducers: [PTO: X days total, on-call rotation: ~Y pts of unplanned capacity, etc.]

Factor	Impact
Base capacity (historical average)	[X pts]
PTO / planned absences	−[X pts]
On-call overhead (estimate)	−[X pts]
Carry-over from Sprint [N+7]	+[X pts committed capacity already spoken for]
Recommended commitment ceiling	[X pts]

Confidence: [High — stable team and known capacity | Medium — some uncertainty in disruption level | Low — team composition uncertain]

Recommendation for planning: [One sentence — e.g., "Plan to Sprint [N+8] ceiling of X pts. Given the carry-over items, prioritize completing those before pulling in new scope."]

Cycle Time Distribution (if data provided)

Sprint	p50 Cycle Time	p90 Cycle Time	Items Completed
[Sprint N]	[X days]	[X days]	[X items]
[Average]	[X days]	[X days]

Cycle time interpretation: [p90 > 2× p50 indicates a long-tail of stuck items that deserve investigation. p50 increasing over time indicates slowing throughput independent of story point changes.]

If cycle time data was not provided: Cycle time data was not included in this analysis. Recommend adding p50 and p90 cycle time per sprint to your tracking to detect throughput issues that story points alone cannot reveal.

Quality Checks

Velocity chart is generated from the actual data provided — not a generic placeholder chart
Trend diagnosis states a direction (Improving / Flat / Declining / Erratic) with a quantitative basis (trailing vs. leading average)
Carry-over root causes are specific categories with counts — not a generic observation that carry-over exists
Each of the 3 recommendations includes a named owner, a start date, and a measurable target with a timeframe
Next-sprint capacity forecast uses historical average as the baseline and deducts specific known reducers
Health diagnosis table uses Red/Yellow/Green with evidence cited in the Evidence column — no unsupported scores
If metrics are missing (cycle time, blocker log), the report explicitly calls them out as recommended additions

Anti-Patterns

Do not generate the velocity chart from placeholder data — it must reflect the actual sprint data provided
Do not diagnose trend direction without computing trailing vs leading averages — "it looks like it's declining" is not a diagnosis
Do not list carry-over as a generic observation — identify root cause categories with counts for the analysis to be actionable
Do not produce recommendations without a named owner, a start date, and a measurable target
Do not score health dimensions without citing evidence in the Evidence column — unsupported Red/Yellow/Green scores are not credible

Sprint Velocity Analysis

Sprint Velocity Analysis

Required Inputs

Output Format

Sprint Velocity Analysis: [Team Name]

Velocity Trend

Raw Data

Velocity Chart (Completed Points per Sprint)

Trend Diagnosis

Story Point Calibration

Carry-Over Pattern Analysis

Capacity Utilization

Health Diagnosis

Blocker Frequency Analysis

Improvement Recommendations

Recommendation 1: [Title]

Recommendation 2: [Title]

Recommendation 3: [Title]

Next-Sprint Capacity Forecast

Cycle Time Distribution (if data provided)

Quality Checks

Anti-Patterns

Bundled with this artifact

More on the bench

Runbook Writer

Scrum Master

Confluence Expert