The DORA metrics have shaped how engineering teams measure performance for nearly a decade. In 2026, the framework continues to evolve. AI-assisted workflows have pushed deployment frequency higher, change failure rate lower, and the bar for elite performance has shifted. The teams that lead now operate under different assumptions than the leaders of three years ago.
According to the 2024 Accelerate State of DevOps report, organizations using AI tools in development report meaningful improvements in throughput and quality, with the gap between elite and low-performing organizations widening. The data confirms a pattern many engineering leaders observe firsthand. DevX explored the broader workforce impact in its report on AI acceleration and jobs.
The Four Core Metrics
The DORA framework tracks deployment frequency, lead time for changes, change failure rate, and mean time to restore. Each measures a different aspect of delivery health, and together they paint a balanced picture. The genius of the framework is that improvements on any one metric without trade-offs against the others reliably signal real engineering progress.
In 2026, the benchmarks for elite performance are higher than they used to be. Multiple deployments per day, lead times measured in hours, change failure rates below 5%, and recovery times under an hour are now table stakes for top-tier teams. Mid-tier teams hit these benchmarks weekly rather than daily.
How AI Has Shifted the Numbers
AI-assisted development raises throughput and, when applied carefully, improves quality. Code generation, AI code review, and intelligent test selection all compress lead time. Automated incident triage and runbooks compress recovery time. Teams that integrate AI into their daily workflow see measurable gains across all four metrics.
The risk is that AI without guardrails inflates throughput while degrading quality. Change failure rates can rise if AI-generated code skips review. Recovery times can grow if alerts are noisy and humans tune them out. The teams that win are the ones that pair AI productivity with stronger guardrails. DevX described this pattern in its analysis of why AI feels new again.
The New Fifth Metric: Reliability
DORA’s recent work has emphasized operational performance as a fifth dimension. It captures whether teams can keep services running well under real-world conditions, not just whether they can ship fast. The metric is harder to define crisply, but it forces conversations about service-level objectives, incident response, and customer experience.
Adding reliability prevents the common failure mode where teams optimize for delivery speed at the expense of resilience. Both matter, and the most effective teams treat them as a paired goal rather than a trade-off.
What Elite Teams Actually Do
Elite teams share a few habits. They invest in platform engineering to make the easy path the right path. They automate testing, security scanning, and policy enforcement so engineers do not have to remember every requirement. They keep batch sizes small, deploying frequent, low-risk changes rather than periodic large releases.
They also pay attention to developer experience. The GitLab Global DevSecOps report consistently finds that satisfied developers ship better software. Tools that reduce friction, clear documentation, and supportive culture all show up in the metrics.
Common Pitfalls
The most common pitfall is metric theater: gaming the numbers without improving outcomes. Deployment frequency rises if teams deploy trivial changes, but the metric loses meaning. Lead time drops if teams skip review, but quality suffers. The framework only works when teams measure honestly and act on what they see.
Another pitfall is treating DORA as the only set of metrics. Customer satisfaction, business outcomes, and team health all matter. DORA measures delivery health, not product success. Combine it with metrics that reflect what users and the business actually care about. DevX noted a related point about quantifying cyber risk in critical infrastructure: numbers must connect to consequences.
How to Start
If your team has not adopted DORA, start with instrumentation. Pull data from your CI, deployment, and incident management tools. Establish a baseline for each metric over the past quarter. Avoid setting aggressive targets immediately. Use the baseline to spot the bottleneck that, if removed, would unlock the most improvement.
Tackle one bottleneck at a time. Quick wins often come from automating manual approval steps, reducing batch sizes, or improving test reliability. Each small improvement compounds, and after a quarter or two, the metrics tell a clear story of progress.
The Outlook
DORA will continue to evolve as AI reshapes how software is built and operated. Expect more attention to operational metrics, more sophisticated benchmarks segmented by industry, and more guidance on how to combine DORA with platform engineering and developer experience measures.
The fundamental insight remains. Fast, reliable delivery, measured honestly, is the foundation of strong engineering teams. In 2026, the teams that take that insight seriously continue to outperform those that do not.
Related Coverage on DevX
Rashan is a seasoned technology journalist and visionary leader serving as the Editor-in-Chief of DevX.com, a leading online publication focused on software development, programming languages, and emerging technologies. With his deep expertise in the tech industry and her passion for empowering developers, Rashan has transformed DevX.com into a vibrant hub of knowledge and innovation. Reach out to Rashan at [email protected]

















