Building Less Flawed Metrics: Dodging Goodhart and Campbell's Laws

7 min read Original article ↗
References:

APA (American Psychiatric Association). (2013). Diagnostic and statistical manual of mental disorders. BMC Med, 17, 133–137.

Atkins, A., Wanick, V., & Wills, G. (2017). Metrics Feedback Cycle: measuring and improving user engagement in gamified eLearning systems. International Journal of Serious Games, 4(4), 3–19.

Berry, L. M., & Houston, J. P. (1993). Psychology at work: An introduction to industrial and organizational psychology. Brown & Benchmark/Wm. C. Brown Publ.

Blanchard, B. S., & Fabrycky, W. J. (1990). Systems engineering and analysis (4th ed.). Prentice Hall Englewood Cliffs, NJ.

Borsboom, D., Mellenbergh, G. J., & van Heerden, J. (2004). The concept of validity. Psychological review, 111(4), 1061.

Bradbury, A. (2014, sep). ‘Slimmed down’ assessment or increased accountability? Teachers, elections and UK government assessment policy. Oxford Review of Education, 40(5), 610–627. Retrieved from https://doi.org/10.1080/03054985.2014.963038 doi: 10.1080/03054985.2014.963038

Cames, M., Harthan, R. O., Fu¨ssler, J., Lazarus, M., Lee, C., Erickson, P., & SpaldingFecher, R. (2016). How additional is the clean development mechanism. Analysis of application of current tools and proposed alternatives. Oeko-Institut EV CLlMA. B, 3.

Campbell, D. T. (1979). Assessing the impact of planned social change. Evaluation and program planning, 2(1), 67–90.

Caplan, B. (2018). The case against education. Why the education system is a waste of time and money. Princeton University Press.

Caudill, H. L., & Porter, C. D. (2014, dec). An Historical Perspective of Reward Systems: Lessons Learned from the Scientific Management Era. International Journal of Human Resource Studies; Vol 4, No 4 (2014)DO - 10.5296/ijhrs.v4i4.6605. Retrieved from http://www.macrothink.org/journal/index.php/ijhrs/article/view/6605

Choi, J., Hecht, G. W., & Tayler, W. B. (2012). Lost in translation: The effects of incentive compensation on strategy surrogation. The Accounting Review, 87(4), 1135–1163.

Clifton, P. M., & Keogh, J. B. (2017). A systematic review of the effect of dietary saturated and polyunsaturated fat on heart disease. Nutrition, Metabolism and Cardiovascular Diseases, 27(12), 1060–1080.

Dai, H., Dietvorst, B. J., Tuckfield, B., Milkman, K. L., & Schweitzer, M. E. (2017, aug). Quitting When the Going Gets Tough: A Downside of High Performance Expectations. Academy of Management Journal, 61(5), 1667–1691. Retrieved from https://doi.org/10.5465/amj.2014.1045 doi: 10.5465/amj.2014.1045

Deresiewicz, W. (2015). Excellent sheep: The miseducation of the American elite and the way to a meaningful life. Free Press.

Duff, F. J., Mengoni, S. E., Bailey, A. M., & Snowling, M. J. (2015). Validity and sensitivity of the phonics screening check: implications for practice. Journal of Research in Reading, 38(2), 109–123.

Faeh, D., Paccaud, F., Cornuz, J., & Chiolero, A. (2008, apr). Consequences of smoking for body weight, body fat distribution, and insulin resistance. The American Journal of Clinical Nutrition, 87(4), 801–809. Retrieved from https://dx.doi .org/10.1093/ajcn/87.4.801 doi: 10.1093/ajcn/87.4.801

Flacker, J. M., & Kiely, D. K. (2003). Mortality-related factors and 1-year survival in nursing home residents. Journal of the American Geriatrics Society, 51(2), 213–221.

Fraade-Blanar, L., Blumenthal, M. S., Anderson, J. M., & Kalra, N. (2018). Measuring Automated Vehicle Safety.

Frances, A. (2017). Trump isn’t crazy. Psychology Today. Retrieved from https://www. psychologytoday.com/blog/saving-normal/201701/trump-isnt-crazy.

Gelman, A. (2010). Causality and Statistical Learning. American Journal of Sociology, 117(3), 955–966. Retrieved from http://arxiv.org/abs/1003.2619 doi: 10 .1086/662659

Goodhart, C. A. E. (1975). Problems of monetary management: the UK experience. In Papers in monetary economics. Reserve Bank of Australia.

Herzberg, F. (1968). One more time: How do you motivate employees. Harvard Business Review Boston, MA.

Hess, F. (2018, sep). Straight Up Conversation: Scholar Jay Greene on the Importance of field Trips. Education Week.

Holmstrom, B., & Milgrom, P. (1991). Multitask principal-agent analyses: Incentive contracts, asset ownership, and job design. JL Econ. & Org., 7, 24.

Hubbard, D. W. (2007). How to Measure Anything: finding the Value of Intangibles in Business (Second ed.). doi: 10.1002/9781118983836

Kalra, N., Hallegatte, S., Lempert, R., Brown, C., Fozzard, A., Gill, S., & Shah, A. (2014). Agreeing on Robust Decisions New Processes for Decision Making Under Deep Uncertainty. World Bank Policy Research Working Paper, No. 6906(June). doi: doi:10.1596/1813-9450-6906

Kenny, G. (2014). five questions to identify key stakeholders. HBR Harvard Business Review.

Klein, G. (2007). Performing a project premortem. Harvard Business Review, 85(9), 18–19.

Klein, G., Sonkin, P. D., & Johnson, P. (2019). Rendering a Powerful Tool Flaccid: The Misuse of Premortems on Wall Street.

Lempert, R. J., Groves, D. G., Popper, S. W., & Bankes, S. C. (2006). A General, Analytic Method for Generating Robust Strategies and Narrative Scenarios. Management Science, 52(4), 514–528. Retrieved from http://pubsonline.informs.org/ doi/abs/10.1287/mnsc.1050.0472 doi: 10.1287/mnsc.1050.0472

Liebowitz, S., & Kelly, M. L. (2018, nov). Everything You Know About State Education Rankings Is Wrong: Minds and dollars are a terrible thing to waste. Reason. Retrieved from https://reason.com/archives/2018/10/07/everything-you-know-about-stat

Liska, D. J., Cook, C. M., Wang, D. D., Gaine, P. C., & Baer, D. J. (2016). Trans fatty acids and cholesterol levels: An evidence map of the available science. Food and Chemical Toxicology, 98, 269–281.

Manheim, D. (2016). Overpowered Metrics Eat Underspecified Goals (Vol. 2016). Retrieved from https://www.ribbonfarm.com/2016/09/29/soft-bias -of-underspecified-goals/

Manheim, D. (2018). Value of Information for Policy Analysis (Doctoral dissertation, Pardee RAND).

Manheim, D., & Garrabrant, S. (2018). Categorizing Variants of Goodhart’s Law. , 1–10.

Mika, E., & Lee, B. (2017). Who Goes Trump? Tyranny as a Triumph of Narcissism. St. Martin’s Press.

Mitchell, D. J., Edward Russo, J., & Pennington, N. (1989). Back to the future: Temporal perspective in the explanation of events. Journal of Behavioral Decision Making, 2(1), 25–38. Retrieved from https://doi.org/10.1002/bdm.3960020103 doi: 10.1002/bdm.3960020103

Muller, J. Z. (2018). The tyranny of metrics. Princeton University Press.

O’Keefe, C., Cihon, P., Garfinkel, B., Flynn, C., Leung, J., & Dafoe, A. (2019). The Windfall Clause: Distributing the Benefits of AI for the Common Good. arXiv preprint arXiv:1912.11595.

Poulis, K., & Poulis, E. (2016). Problematizing fit and survival: transforming the law of requisite variety through complexity misalignment. Academy of Management Review, 41(3), 503–527.

Rasul, I., & Rogger, D. (2017). Management of bureaucrats and public service delivery: Evidence from the nigerian civil service. The Economic Journal, 128(608), 413– 446.

Rasul, I., Rogger, D., & Williams, M. (2017). Management and bureaucratic effectiveness: A scientific replication.

Rasul, I., Rogger, D., & Williams, M. J. (2018). Autonomy, incentives, and the effectiveness of bureaucrats. VoxDev. Retrieved from https://voxdev.org/topic/public-economics/autonomy-incentives-and-effectiveness-bureaucrats

Rodamar, J. (2017). There ought to be a law! Campbell v. Goodhart. Rogers, P. J., Petrosino, A., Huebner, T. A., & Hacsi, T. A. (2000). Program theory evaluation: Practice, promise, and problems. New directions for evaluation, 2000(87), 5–13. 23 Rosenhead, J., & Mingers, J. (2001). Rational analysis for a problematic world revisited (No. 2nd). John Wiley and Sons. Ruch, W. A. (1994). Measuring and managing individual productivity. Organizational linkages: Understanding the productivity paradox, 105–130.

Saltelli, A. (2020). Ethics of quantification or quantification of ethics? Futures, 116, 102509. Retrieved from http://www.sciencedirect.com/science/article/ pii/S0016328719303714 doi: https://doi.org/10.1016/j.futures.2019.102509

Schoeller, D. A. (1990). How accurate is self-reported dietary energy intake? Nutrition reviews, 48(10), 373–379.

Shorrock, S. (2019, may). Shorrock’s Law of Limits. Blog Post. Retrieved from https:// humanisticsystems.com/2019/10/24/shorrocks-law-of-limits/

Simon, H. A. (1947). Administrative behavior; a study of decision-making processes in administrative organization. Macmillan.

Simon, H. A. (1956). Rational choice and the structure of the environment. Psychological review, 63(2), 129.

Soares, N. (2015). Half-assing it with everything you’ve got. Retrieved 2019-07-22, from http://mindingourway.com/half-assing-it-with-everything-youve-got/

Strathern, M. (1997). ’Improving ratings’: audit in the British University system. European Review. doi: 10.1002/(SICI)1234-981X(199707)5:33.0.CO;2-4

Sturla, K., Shah, B., & McManus, J. (2018). The Great DIB-ate: Measurement for Development Impact Bonds. Stanford Social Innovation Review.

Szajewska, H., & Szajewski, T. (2016). Saturated fat controversy: importance of systematic reviews and meta-analyses. Critical reviews in food science and nutrition, 56(12), 1947–1951.

Taplin, D. H., & Clark, H. (2012). Theory of change basics: A primer on theory of change.

van Gelder, T., Vodicka, R., & Armstrong, N. (2016, sep). Augmenting Expert Elicitation with Structured Visual Deliberation. Asia & the Pacific Policy Studies, 3(3), 378–388. Retrieved from https://doi.org/10.1002/app5.145 doi: 10.1002/app5.145

Wigert, B., & Harter, J. (2017). Re-engineering performance management. Gallup. com. Viewed: March, 6, 2019.