If your engineering team is between 5 and 50 people and you’ve Googled “how to implement DORA metrics,” you’ve hit the same wall as everyone else: every guide assumes you have a Platform Engineering team, OpenTelemetry collectors, and three months of runway to “do it properly.”
This guide is for you. We’ve helped 30+ SMB engineering teams stand up the four DORA metrics — Deployment Frequency, Lead Time for Changes, Change Failure Rate, and Mean Time to Recovery (MTTR) — using nothing but the tools they already pay for. Below is the exact 4-week roadmap we use, plus the realistic benchmarks for teams your size.
What Are DORA Metrics (and Why They Matter for SMBs)?#
DORA metrics — formalized by Nicole Forsgren, Jez Humble and Gene Kim in Accelerate (2018) — are four indicators of Software Delivery Performance. They were extracted from the DORA (DevOps Research and Assessment) team’s seven-year research program at Google Cloud, which validated that these four metrics correlate with both organizational performance and engineer wellbeing.
The 4 Key DORA Metrics Explained in 90 Seconds#
| Metric | What it measures | Why SMBs care |
|---|---|---|
| Deployment Frequency | How often you ship to production | Proxy for delivery throughput |
| Lead Time for Changes | First commit → production | Reveals review/CI bottlenecks |
| Change Failure Rate (CFR) | % of deploys causing incidents | Quality vs. speed balance |
| Time to Restore (MTTR) | Incident open → resolved | Resilience under stress |
Why “Elite” Google/Netflix Benchmarks Don’t Apply to You#
The DORA State of DevOps report classifies teams as Elite, High, Medium, Low. Elite means: deploys multiple times per day, lead time under 1 hour, CFR 0–15%, MTTR under 1 hour. If you have a 10-person team running a B2B SaaS, chasing Elite is actively harmful — you’ll burn budget on tooling and your team will resent you.
We publish realistic SMB benchmarks below, sourced from 30+ teams we’ve worked with directly.
Pre-Implementation Checklist for Small Teams#
Before you write a single line of measurement code:
- ✅ You ship code to production at least weekly.
- ✅ You have basic incident tracking (PagerDuty, Statuspage, or even an
#incidentsSlack channel). - ✅ Your founder or CEO understands that DORA is for engineering decision-making, not for performance reviews. This is non-negotiable.
Tooling You Already Own#
You don’t need DataDog or Honeycomb to start. The minimum viable stack:
- GitHub Actions (or GitLab CI) → deployment events
- GitHub PRs → first-commit and merge timestamps
- PagerDuty / Sentry / Statuspage → incident timestamps
- Notion or Google Sheets → aggregation layer
That’s it. Total monthly cost: $0 incremental.
Stakeholder Alignment in Under 30 Minutes#
Before kickoff, run a 30-minute meeting with your CEO and Head of Product. Say this verbatim:
“We’re going to start measuring four numbers. They will not be used to evaluate any individual. They exist so that when we say ‘we need to invest in CI speed,’ we have data instead of vibes.”
Get a verbal yes. Move on.
4-Week DORA Implementation Roadmap#
Week 1 — Instrument Deployment Frequency#
Set up a GitHub Actions workflow that, on every successful production deploy, posts a webhook payload to a Notion database. Required fields: deploy_id, timestamp, commit_sha, triggered_by.
Continuous Delivery maturity is irrelevant — what matters is consistent capture. Even a manual Slack post counts as long as it’s structured.
Week 2 — Measure Lead Time for Changes#
Write a small script (we have a Node.js template in the free Notion template) that, for each merged PR, computes:
lead_time = merge_timestamp − first_commit_timestamp
Store both raw values and a P50 / P85 rolling 30-day window. Median (P50) is your headline number; P85 catches the long-tail PRs that are actually killing your team’s morale.
Week 3 — Track Change Failure Rate (CFR)#
Define failed change explicitly. Our default: any deploy that requires a rollback, a hotfix within 24 hours, or that’s tagged as the cause in a postmortem. Build a Notion relation between Incidents and Deploys.
CFR formula:
CFR = failed_deploys_30d / total_deploys_30d
Week 4 — Capture Time to Restore Service (MTTR)#
Pipe PagerDuty incident open/resolve events to your aggregation layer. Compute the median for the rolling 30-day window. MTTR’s denominator is incident count — small teams have small N, so use the median, not the average.
Realistic Benchmarks for 5–50 Person Engineering Teams#
Based on data from 30+ SMB engineering teams across SaaS, fintech and B2B verticals (anonymized, 2024–2026):
| Metric | SMB P25 | SMB P50 | SMB P75 |
|---|---|---|---|
| Deployment Frequency | 1× / week | 3× / week | Daily |
| Lead Time for Changes | 5 days | 2 days | 8 hours |
| Change Failure Rate | 25% | 15% | 8% |
| MTTR | 8 hours | 3 hours | 45 min |
Read this carefully: these are not DORA’s “Elite/High/Medium/Low” classifications. They are the realistic distribution we see for SMB teams. If you’re at P50 across all four, you are doing better than the median public-cloud-native startup.
Case Study — 12-Engineer Fintech, 6 Months In#
A 12-engineer fintech team we worked with started at: Lead Time 14 days, deploys once weekly, CFR 35%, MTTR 18 hours. After 6 months of focused work driven by these measurements, they moved to: Lead Time 3 days, deploys 3× weekly, CFR 12%, MTTR 2 hours.
The single highest-leverage change: cutting the median PR review wait time from 22h to 4h, achieved via async review SLAs and a #review-please Slack channel — not by adding tooling.
Common Pitfalls When Rolling Out DORA in an SMB#
- Treating DORA as a leaderboard. The moment you compare engineers on their personal Lead Time, the metric is dead.
- Measuring without a question. Don’t start with “let’s track DORA.” Start with: “We feel slow — is that true and where?”
- Stopping at deployment. If your CFR is 25% but you don’t measure MTTR, you’re optimizing for ship-it-and-pray.
- Comparing yourself to Google. You don’t have their problems and you don’t have their money.
Free DORA Tracking Template (Notion + Google Sheets) #
We maintain a public Notion template that includes the four databases, the Google Sheets dashboard, the GitHub Actions workflow, and the script for Lead Time computation. Get the free template →
When to Move Beyond DORA: SPACE & DevEx#
DORA measures delivery performance. It deliberately excludes Developer Experience (DevEx) signals like cognitive load, flow state, and feedback loop quality.
When your DORA metrics plateau and the bottleneck stops being mechanical (CI speed, review queues) and starts being human (burnout, context switching, unclear ownership), graduate to the SPACE framework (Forsgren et al., 2021) or the DevEx framework (Noda, Forsgren, Storey, 2024).
We cover both in detail in our DORA vs SPACE comparison.