Understand your team's change failure rate
Learn about how often deployments require remediation after the fact, and how that has changed over time
Trend
See how the Change Failure Rate has trended over time with the overall CFR shown in blue, as well as an "Urgent" CFR shown in green. Deployments including a remediation within 48 hours of the previous deployment are considered urgent.
💡Increases in either the overall CFR or Urgent CFR can surface reduction in quality over time, so track this metric closely, and consider putting a target for your team to track against.
Failures by Repository
Observe where the failures are occurring to look for hotspots in your codebase. This bar chart indicates how many failures have occurred in a given repository, with the length of the horizontal line representing the proportion of failures across all repositories. Use this chart to determine how to mitigate risk from particularly troublesome parts of the codebase.
Breakdown Table
Explore the data to learn how frequently teams deployments require remediation by looking at the CFR(%), how often there was urgent remediation required, as well as the total number of Deployments, Failures (non-urgent), and Fix-Only deployments. This table can be pivoted both by people properties like team and report group.
Note that Uplevel attributes deployments to both the person that initiated the deployment workflow, as well as authors of PRs that were included in a deployment. This means that a single deployment (successful or otherwise) can count for multiple groups of people in the breakdown table. Additionally, a person can be a member of multiple segments. These will cause the sum of successful production deployments in this breakdown to be slightly greater than the totals shown above.
Example: a ‘Manager’ segment where the manager is in their own team’s segment (e.g., Chris Riccio's Team), in addition to being part of their manager’s team (Joseph Levy's Team).
💡 Tip: If there are repositories or groups of people with elevated CFR rates, then it could merit a conversation about what's leading to these statistics. It might be time to sharpen the axe or reduce some tech-debt to make deploying code easier, which should improve both users and developers experiences.