Simple is not easy

DORA 2025, Part 9. How to choose the right metrics and frameworks to measure development and AI impact in IT organizations

How to choose DORA, SPACE, DevEx, and HEART metrics and frameworks to assess development, team experience, and AI impact.

  • This chapter was prepared with the participation of
  • We'll send you the materials you need or a commercial proposal
  • Choosing metrics for organizational goals
  • Using measurement frameworks in the AI era

About this publication

19 Dec 2025 This publication is an adapted translation of Google Cloud and DORA's report, "2025 State of AI-Assisted Software Development." The original is available at the link. Reading time: 10 min. This chapter was prepared with the participation of: Sarah D’Angelo, Ph.D. - user experience researcher, Google Ambar Murillo - user experience researcher, Google AI & Infrastructure Sarah Inman, Ph.D. - user experience researcher, Google Kevin M. Storer, Ph.D. - user experience researcher, Google Cloud

Why development measurement frameworks matter

  1. Measuring the development process helps drive real change.

  2. But it is a difficult task: first you need to understand what is worth measuring and what can actually be measured.

  3. The key is to use metrics not for the numbers themselves, but to change how the team and the organization as a whole work.

  4. For this, it is better to rely on established frameworks.

  5. A framework breaks a broad concept, such as developer experience, into specific measurable elements like speed, quality, satisfaction, and so on.

  6. When the industry or academia talks about engineering metrics, the frameworks most often mentioned are SPACE, DevEx, and HEART.

  7. Choosing the right framework can be difficult, but it is a key step. Where should you start? Define which decisions you want to make based on data.

Three directions of measurement frameworks

  1. Different frameworks serve different purposes.

  2. They are usually focused on three areas:

  3. Each of these groups offers its own perspective on the evolution of the engineering system and how it should be measured.

  4. To determine which framework best fits your organization's goals, it helps to think of frameworks as the why behind measurement.

  5. They help explain why you are measuring and what actions should follow from the data you collect.

  6. Frameworks set the lens through which you view the data and help ensure your efforts align with organizational goals.

  7. To choose a framework, you can ask yourself:

  8. What has changed that makes measurement necessary?

  9. How will you use the insights?

  10. What decisions or improvements will measurement enable?

  11. Next comes the what - the metrics themselves.

  12. These are the core concepts that feed the framework, such as velocity metrics or usage metrics.

Self-reported data - data reported by developers

  1. There are usually two main approaches to data collection.

  2. The first is self-reported data: collecting information directly from developers about their experience.

  3. Surveys are answers to questions about opinions, satisfaction, and perceptions of different aspects of work.

  4. Interviews and focus groups - individual and group discussions for deeper exploration of topics.

  5. Diary studies - collecting data about actions, thoughts, and experiences during work.

  6. The main advantage of self-reported data is its ability to capture subjective experience and things that cannot be measured automatically: satisfaction, well-being, and perceived effectiveness.

  7. The advantage is that it does not require complex instrumentation or deep observability into toolchains.

  8. But there are also limitations: challenges with standardization, comparison across teams, and scaling.

  9. Subjective data leads to variation in interpretation and vulnerability to bias (for example, recall bias and social desirability bias).

Logs-based measures - data collected automatically

  1. The second approach is logs-based measures: metrics extracted from the systems and tools developers use. Examples:

  2. Quantity means counting artifacts: number of commits, number of users.

  3. Time - how long an activity takes, such as coding or review time.

  4. Frequency is the rate of events: for example, the number of deployments per month or PRs per week.

  5. The main advantage is continuous, standardized data at scale that provides a detailed view of activity.

  6. The limitation is that you need sufficient observability into the toolchain: integrations, data collection, and infrastructure.

  7. It is also important to understand that logs are not absolutely objective.

  8. Instrumentation approaches vary, errors are possible, and interpretation always depends on context and can be biased.

How frameworks and metrics are connected

A framework defines which concepts you want to measure, but the choice of specific metrics depends on your resources and data availability.

Do you have observability for logs-based metrics?

Do you have a research team for self-reported data? Different organizations have different capabilities.

Frameworks are guidance, but they cannot fully describe complex behavior.

It is an approximation of the truth, but it is impossible to measure everything.

A useful analogy: metrics are ingredients.

Frameworks are the recipes that use those ingredients.

The same ingredients can be combined in different ways to produce different "dishes" - that is, frameworks.

Some ingredients are unique, but in many cases you can still get a good result even if some ingredients are missing.

Frameworks differ because they are designed for different goals.

But their metrics often overlap. For example, productivity metrics such as number of commits and PRs appear in several frameworks.

An organization can use these metrics to assess the impact of a new team structure (organizational effectiveness), understand how well a developer tool works (product quality), and evaluate developer workload (developer experience). At the same time, some metrics are more specialized.

For example, developer well-being is an important element of developer experience frameworks, but it is usually not a core metric in models of organizational effectiveness or product quality.

Using a single framework helps focus action and can be a good starting point.

But you do not need to limit yourself to that.

As measurement goals and capabilities change, using multiple frameworks produces complementary insights that together create a more complete picture than any single one alone.

The key is to measure what helps you and your organization stay focused on goals and be ready to act on the data you gather.

We'll curate materials for your task

We'll reply within 30 minutes and send relevant cases, diagrams, or analyses tailored to your context.

Using measurement frameworks in the AI era

  1. You may ask: does the arrival of AI in development change everything?

  2. Are the existing frameworks still enough, or do we need new ones?

  3. Any technology transformation creates the impression that metrics must be rebuilt from scratch.

  4. In practice, the changes are usually much more specific.

  5. If your goal is to understand how AI affects the developer experience, it is enough to update a small subset of metrics while keeping the overall measurement structure.

  6. You do not need to abandon the entire framework. On the contrary, existing metrics provide a baseline against which shifts can be seen. For example, you can add indicators for AI suggestion adoption, model quality, or trust in the model, while keeping the existing developer experience metrics such as perceived productivity, review time, and so on.

  7. As more advanced AI tools emerge, the roles themselves and the set of tasks in development will change.

  8. Metrics will need to adapt to new user profiles and changed processes, but the goals of measuring developer experience will likely remain the same.

  9. If the goal does not change, there is no need to change the framework either; it is enough to expand the set of metrics.

  10. Even if goals do change, that does not mean measurement has to start over.

  11. Many metrics fit multiple frameworks, so they can be quickly reassigned to new tasks.

Example: measuring code quality during AI adoption

Studying the impact of AI tools on the code being produced has become a new goal for companies - and a difficult one, since the technology is evolving very quickly.

A common question is how AI affects code quality.

Faster development looks positive in the short term, but if quality drops, long-term speed will also start to decline.

So a company might set a goal to maintain high code quality while actively adopting AI tools.

This goal draws on elements from all types of frameworks and includes metrics that are often already collected: code quality, engagement with tool usage, and perceived speed. In this case, you can keep using your current metrics and add new ones. For example, combining DORA software delivery metrics with a product quality framework like HEART provides deep insight into how teams perceive AI tools and how those tools affect delivery.

The PDCA cycle as the foundation of a sustainable measurement system

  1. Measuring development processes is a complex and ongoing task.

  2. There are many frameworks and approaches, but any of them is only useful when the organization can act on the data it gets.

  3. The key requirement is alignment with company goals and executive support; without that, measurement turns into reporting with no results.

  4. A deliberate choice of framework and metrics helps build a sustainable measurement system.

  5. If you follow the PDCA cycle, the work may look like this: Plan: define goals, choose the right framework, and secure leadership support. Do: establish baseline metrics and try changing part of the process. Check: measure again and assess progress toward the goals. Adjust: adapt the approach based on the new data.

  6. We do not propose a single best framework.

  7. The approach should fit your goals: the right one is the one that helps you make decisions and motivates the organization to act.

  8. Although frameworks differ in structure and emphasis, many of their metrics overlap.

  9. This means that metrics introduced today can be reused and adapted as goals or context change.

DORA AI Capabilities Model

  1. Conclusion written with the participation of Nathen Harvey - Head of DORA, Google Cloud.

  2. For more than a decade, DORA has remained a reliable guide for the engineering community, helping teams become stronger and more effective.

  3. The industry is changing quickly: AI, platform engineering, and new ways of organizing development are entering the workflow.

  4. But our mission remains the same: to study and highlight practices that help create truly high-performing teams. This year, we introduced the first version of the DORA AI capabilities model, an important step in advancing our research.

  5. As companies navigate the complex process of adopting AI, this model provides a grounded, data-driven structure.

  6. It shows seven key capabilities that amplify AI's positive impact on important organizational outcomes.

  7. A clear and open company stance on AI use

  8. Internal data available for AI

  9. We see it as a starting point for further dialogue with the DORA community and with companies that are introducing AI into development processes.

  10. We would be glad to learn how you apply these findings in practice, and we will continue refining and developing the model in future research.

A user focus as a critical factor in AI success

  1. This year's research reveals an important conclusion: we are still at an early stage of AI-assisted software development, when technologies are changing rapidly and practices are only beginning to take shape.

  2. Early attempts to standardize here are premature.

  3. Simply introducing AI tools does not lead to transformation.

  4. Moreover, AI's impact on team performance depends directly on one critical factor: user focus. With high confidence, we found that when a team is user-oriented, the positive impact of AI on its work increases.

  5. When that focus is missing, introducing AI can worsen team performance.

  6. This conclusion highlights a key condition for success: a deep understanding of end users is not a bonus, but a required foundation for effective AI use.

  7. If the user is not at the center of your product strategy, AI will not help - and in some cases may even harm team performance.

Practical use of the research: how to experiment

  1. This year's report findings are complex and may seem contradictory in places.

  2. This is a normal reflection of a rapidly changing environment.

  3. We suggest treating the findings not as strict instructions, but as hypotheses for experiments within your team.

  4. Here is how you can apply these ideas in practice:

  5. Run experiments in your organization by using DORA findings to form hypotheses and test them with your teams.

  6. This will help you better understand your operational context and identify the areas where improvements will have the greatest impact.

  7. Run internal surveys by using questions from this year's study as a starting point and adapting them to your needs.

  8. Add more nuanced questions that are relevant to your teams and current projects.

  9. Look at the platform as a whole: improving one feature does not fix a weak platform.

  10. Treat internal platforms as products: improve the full developer experience chain, from feedback to automation.

  11. Share what you learn - when you run experiments and gather insights, spread that knowledge across the organization through reports, internal communities of practice, or informal conversations.

  12. The goal is to build a culture of continuous learning.

  13. The main risk in the current situation is not falling behind, but widespread, chaotic action without meaningful results.

  14. Choose approaches and frameworks that fit your organization and drive meaningful change.

Join the DORA community

  1. Thank you for engaging with our research.

  2. We invite you to continue the journey with us.

  3. Share your experience, learn from peers, and find inspiration by joining the DORA community: https://dora.community.

  4. The DORA research is an example of the power of a global community united by a common goal.

  5. We are grateful to the thousands of researchers, experts, practitioners, leaders, and change agents who generously share their knowledge and experience.

  6. This report is a direct result of collective work.

  7. We hope the research results will help you and your teams create real change and build a culture of continuous improvement within your organizations.

Contacts

Let's Discuss Your Project

Leave your current contact details and describe your task. We will come back with clarifying questions and a proposal for the next step.