Improvement and evaluation

Robert L Wears

doi:10.1136/bmjqs-2014-003889

Article Text

PDF

Editorial

Improvement and evaluation

Free

Robert L Wears

Correspondence to Dr Robert L Wears, Department of Emergency Medicine/CSRU, Univ of Florida/Imperial College London, Jacksonville, FL 32209, USA; wears{at}ufl.edu

https://doi.org/10.1136/bmjqs-2014-003889

Statistics from Altmetric.com

Request Permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.

Two related papers1 ,2 in this issue of BMJ Quality & Safety provide interesting insights into the difficulties of evaluating improvement activities, and also illustrate why improvement is so hard. In a carefully crafted set of controlled, interrupted time series experiments, the authors examined the effectiveness in the operating theatre of two popular improvement interventions: standardised procedures and teamwork training. The primary outcomes in both were process measures: the theatre teams’ non-technical skills performance, and the count of ‘glitches’—omissions, interruptions or other untoward events that disrupted flow and had potential to affect safety or quality. In both experiments, the investigators took care to ensure the interventions were ‘owned’ by the frontline workers, and not imposed from without by managers disconnected with the realities of the workplace (although this also means that higher level support important for sustainability may have been lacking).

The papers report insufficient evidence to support improved performance from introducing standard operating procedures, even when those procedures were developed and implemented by the frontline staff themselves.1 However, they also report a partial success, in that, when accompanied by teamwork training, the combination of standard operating procedures and teamwork significantly improved non-technical skills performance.2 Curiously, in the combined experiment, technical performance as measured by ‘glitches’ per hour improved in experimental and control groups. Taken as a whole, the two papers suggest an interaction, or synergism, between the two interventions. Standardisation alone was not effective, but standardisation in conjunction with teamwork training, was (although we cannot be certain whether teamwork alone might have been similarly effective).

These two papers make a valuable contribution to the safety and quality literature by showing that the same intervention (standardisation) can be ineffective in one context (without teamwork training) but effective in another (with teamwork). One wonders how many negative reports of quality interventions were negative only because an important effect modifier was missing from the analysis; or conversely, how many positive reports attributed success to the planned intervention, when it was actually facilitated by an unmeasured interaction variable. There is a significant risk here of drawing the wrong lessons from previous work. This is a possible explanation for the heterogeneity that bedevils the safety and quality literature—a confusing patchwork of claims and counterclaims, reports of interventions that worked or failed, or worked here but not there (sometimes even within the same organisation).3 Systematic reviews of these reports have not helped much; by dealing with context as a nuisance variable and averaging it out, they tend to cast everything in a dim grey light—across the board, most interventions are neutral or dull average at best, further investigation is required.

These papers fall into a well-established evaluation framework that has become an orthodoxy in healthcare: the technical, rational, deterministic and reductionist approach of positivist ‘normal science’. The success of this approach in much of science, and the parallel success in industry of its philosophical cousin, statistical process control, has led healthcare into mistaking the map for the territory. Since positivist science has been such a successful lens through which to view aspects of the world, these aspects have been mistaken for the world and anything that does not fit or cannot be accommodated in a positivist paradigm is tacitly presumed to be unimportant or non-existent.

These methods were largely developed for static, engineered, inanimate systems; the paradigmatic model for statistical process control is the assembly line. They are approaches suitable to machines—there are seldom interactions among components, it is possible to change only one thing at a time, as a change in one part does not produce a consequent change in another.

However, healthcare systems are not assembly lines. They are complex, intractable, sociotechnical systems and4–6 organic rather than engineered. Their basic ‘physics’ is poorly understood at best. They do not simply accept change (eg, interventions), but adapt and reconfigure themselves in response to it; those adaptations reverberate and ramify throughout the system via positive and negative feedback loops with varying delays. These interactions among components are more important than the components themselves; the behaviour of one component depends in part on the behaviour of others, and the evolving cycles of reciprocal action and reaction reshape the universe of possibilities.7 ,8 This makes systems path dependent; the past trajectory of changes, reactions, and interactions influences future paths, opening some while closing others.9 Furthermore, sociotechnical systems are composed at least in part of sentient beings, so how those actors in the system understand and interpret interventions in context, and develop strategies to manage or integrate them within existing workflow, have strong influences.

These properties make it impossible to change only one thing,10 ,11 and difficult to predict the overall effect of changes by ‘summing’ across the individual effects.7 Thus, interventions in a complex sociotechnical system produce a chain of consequences that extend over time and cannot be fully anticipated. Such systems cannot be directly controlled in the Taylorist, rationalist way that managers or regulators would like; and evaluations of interventions in such systems can never be ‘one and done’, but must always be formative rather than summative.

The problem is exacerbated when the intervention itself is a complex social one.12 In the two papers discussed here, teamwork training is clearly a complex social intervention, but what about standard operating procedures? Standardisation is often viewed as a purely objective, technical exercise, but this is a misconception.13 However, objective, rationalised, complete and internally consistent a set of standardised procedures might be, their development, interpretation and application are social processes, subject to the context, history, politics and goals of actors in the system.14 In addition, there are inevitably gaps between the imagined world of the procedures and the real world of work,15 and conflicts among competing goals; both must be recognised, negotiated and resolved in action by workers in a community of practice. Finally, the cycle of adaptations set in motion by the intervention can feed back onto the original intervention itself, so that it also changes with time, triggering yet another cycle of adaptations.

Although complex sociotechnical systems cannot be directly controlled, all is not lost, because they can be influenced.8 Interventions may not lead directly to the desired behaviours, but they can ‘set the stage’ to enhance and sustain the emergence of those behaviours.16 This realisation will require us to modify our approach to both improvement and its evaluation. It will require accepting a broader range of sciences and methodologies as admissible; abandoning many of the Taylorist principles that have informed improvement efforts;17 and fundamentally re-examining the Newtonian-Cartesian assumptions that underlie them.18

Similarly, we will have to expand our evaluation methods to move beyond a certain methodological fetishism19 aimed at answering the ‘horse race’ question “Does A work better than B?” and adopt more nuanced methods20–22 aimed at a more complex set of questions: “Which works, how, why, for whom, to what extent and in what context?” These questions are often best addressed by qualitative, ethnographic methods aimed at providing a ‘thick description’ in a case study of an improvement effort.23–25 The value of this type of approach has been shown by careful, theory-driven studies of how and why initiatives are successful26: for example, discovering that the theory of improvement motivating a project at its beginning was not the way in which improvement actually, eventually occurred; or illuminating tensions and paradoxes in contrasting understandings of interventions.27

However, progress in this area is haunted by a difficult question: why is it that safety and quality in healthcare has been so strongly wedded to rationalist, Taylorist, Cartesian-Newtonian thinking about the nature of clinical practice, and how to improve it? Three factors supporting this marriage may be difficult to overcome. First, it offers the comforting modernist illusion that the muscular application of science can at last tame risk, uncertainty, and disorder, leading to a better, safer, more controllable world.28 Second, it offers a satisfying explanation for drawing meaning out of the inevitable failures that still must occur,29 while simultaneously not threatening those in power.30 And finally, it supports a long-standing secular trend increasing the power and influence of a technocratic elite18 of scientific-bureaucratic managers31 that accompanies the progressive industrialisation of healthcare.32 ,33 Ironically, the external pressures on healthcare to achieve the precision, safety and efficiencies of linear production systems is driving some very counter-productive behaviours and undermining our desired goals.

References

↵
1. Morgan L,
2. New S,
3. Robertson E
, et al. Effectiveness of facilitated introduction of a standard operating procedure into routine processes in the operating theatre: a controlled interrupted time series. BMJ Qual Saf 2015;24:120–27.
↵
1. Morgan L,
2. Pickering SP,
3. Hadi M
, et al. A combined teamwork training and work standardisation intervention in operating theatres: controlled interrupted time series study. BMJ Qual Saf 2015;24:111–19.
↵
1. Davidoff F
. Heterogeneity is not always noise. JAMA 2009;302:2580–6. doi:10.1001/jama.2009.1845
OpenUrl CrossRef PubMed Web of Science
↵
1. Waterson P
. Sociotechnical design of work systems. In: Wilson JR, Corlett N, eds. Evaluation of human work. 3rd edn. London, UK: Taylor & Francis, 2005:769–92.
↵
1. Eason K
. Afterword: the past, present and future of sociotechnical systems theory. Appl Ergon 2014;45:213–20. doi:10.1016/j.apergo.2013.09.017
OpenUrl CrossRef PubMed
↵
1. Kleiner BM
. Macroegonomics: work system analysis and design. Hum Factors 2008;50:461–7. doi:10.1518/001872008X288501
OpenUrl Abstract/FREE Full Text
↵
1. Jervis R
. System effects: complexity in political and social life. Princeton, NJ: Princeton University Press, 1998:328.
↵
1. Axelrod R,
2. Cohen MD
. Harnessing complexity: organizational implications of a scientific frontier. New York, NY: Basic Books, 2000:184.
↵
1. Vaughan D
. System effects: on slippery slopes, repeating negative patterns, and learning from mistake? In: Starbuck HW, Farjoun M, eds. Organization at the limits: NASA and the Columbia Accident. London, UK: Blackwell, 2005:41–59.
↵
1. Thomas L
. On meddling. N Engl J Med 1976;294:599–600. doi:10.1056/NEJM197603112941108
OpenUrl CrossRef PubMed Web of Science
↵
1. Sterman JD
. System dynamics modeling: tools for learning in a complex world. California Manag Rev 2001;43:8–25. doi:10.2307/41166098
OpenUrl
↵
1. Davidoff F
. Improvement interventions are social treatments, not pills. Ann Intern Med 2014;161:526–7. doi:10.7326/M14-1789
OpenUrl CrossRef PubMed Web of Science
↵
1. Wears RL
. Standardisation and its discontents. Cogn Technol Work 2014. doi:10.1007/s10111-014-0299-6 [epub ahead of print 26 Sep 2014].
↵
1. Høyland S,
2. Aase K,
3. Hollund JG, et al
. What is it about checklists? Exploring safe work practices in surgical teams. In: Bieder C, Bourier M, eds. Trapping safety into rules: how desireable or avoidable is proceduralization. Farnham UK: Ashgate, 2013:121–38.
↵
1. Hollnagel E
. Why is work-as-imagined different from work-as-done? In: Wears RL, Hollnagel E, Braithwaite J, eds. Resilience in everyday clinical work. Farnham, UK: Ashgate, 2015 (in press):249–64.
↵
1. Hilligoss B
. Selling patients and other metaphors: A discourse analysis of the interpretive frames that shape emergency department admission handoffs. Soc Sci Med 2014;102: 119–28. doi:10.1016/j.socscimed.2013.11.034
OpenUrl CrossRef PubMed
↵
1. Berwick DM
. Improvement, trust, and the healthcare workforce. Qual Saf Health Care 2003;12:448–52. doi:10.1136/qhc.12.6.448
OpenUrl Abstract/FREE Full Text
↵
1. Wears RL,
2. Hunte GS
. Seeing patient safety ‘Like a State’. Saf Sci 2014;67:50–7. doi:10.1016/j.ssci.2014.02.007
OpenUrl CrossRef
↵
1. Greenhalgh T,
2. Howick J,
3. Maskrey N
. Evidence based medicine: a movement in crisis? BMJ 2014;348:g3725. doi:10.1136/bmj.g3725
OpenUrl FREE Full Text
↵
1. Berwick DM
. The science of improvement. JAMA 2008;299:1182–4. doi:10.1001/jama.299.10.1182
OpenUrl CrossRef PubMed Web of Science
↵
1. Pawson R,
2. Tilley N
. Realistic evaluation. London, UK: Sage Publications, Ltd, 1997:235.
↵
1. Greenhalgh T,
2. Russell J
. Why do evaluations of eHealth Programs fail? An alternative set of guiding principles. PLoS Med 2010;7:e1000360. doi:10.1371/journal.pmed.1000360
OpenUrl CrossRef PubMed
↵
1. Flyvbjerg B
. Case study. In: Denzin NK, Lincoln YS, eds. Sage handbook of qualitative research. 4th edn. Thousand Oaks, CA: Sage, 2011:301–16.
↵
1. Geertz C
. Thick description: toward an interpretive theory of culture. In: The interpretation of cultures: selected essays. New York, NY: Basic Books, 1973:3–30.
↵
1. Leslie M,
2. Paradis E,
3. Gropper MA
, et al. Applying ethnography to the study of context in healthcare quality and safety. BMJ Qual Saf 2014;23:99–105. doi:10.1136/bmjqs-2013-002335
OpenUrl Abstract/FREE Full Text
↵
1. Dixon-Woods M,
2. Bosk CL,
3. Aveling EL
, et al. Explaining Michigan: developing an ex post theory of a quality improvement program. Milbank Q 2011;89:167–205. doi:10.1111/j.1468-0009.2011.00625.x
OpenUrl CrossRef PubMed Web of Science
↵
1. Greenhalgh T,
2. Potts HW,
3. Wong G
, et al. Tensions and paradoxes in electronic patient record research: a systematic literature review using the meta-narrative method. Milbank Q 2009;87:729–88. doi:10.1111/j.1468-0009.2009.00578.x
OpenUrl CrossRef PubMed Web of Science
↵
1. Dekker SWA,
2. Nyce J,
3. Myers D
. The little engine who could not: “rehabilitating” the individual in safety research. Cogn Technol Work 2013;15:277–82. doi:10.1007/s10111-012-0228-5
OpenUrl CrossRef Web of Science
↵
1. Dekker SWA
. The psychology of accident investigation: epistemological, preventive, moral and existential meaning-making. Theor Issues Ergon Sci 2014. doi:10.1080/1463922X.2014.955554 [epub ahead of print 14 Oct 2014].
↵
1. Dekker SWA,
2. Nyce JM
. There is safety in power, or power in safety. Saf Sci 2014;67:44–9. doi:10.1016/j.ssci.2013.10.013
OpenUrl CrossRef
↵
1. Harrison S,
2. Moran M,
3. Wood B
. Policy emergence and policy convergence: the case of ‘scientific-bureaucratic medicine’ in the United States and United Kingdom. Br J Politics Int Relations 2002;4:1–24. doi:10.1111/1467-856X.41068
OpenUrl CrossRef
↵
1. Starr P
. The social transformation of American Medicine: the rise of a sovereign profession and the making of a vast industry. New York, NY: Basic Books, 1983:528.
↵
1. Kleinke JD
. The industrialization of health care. JAMA 1997;278:1456–7. doi:10.1001/jama.278.17.1456
OpenUrl CrossRef PubMed

Footnotes

Competing interests None.
Provenance and peer review Not commissioned; internally peer reviewed.

Linked Articles

Original research
A combined teamwork training and work standardisation intervention in operating theatres: controlled interrupted time series study

Lauren Morgan Sharon P Pickering Mohammed Hadi Eleanor Robertson Steve New Damian Griffin Gary Collins Oliver Rivero-Arias Ken Catchpole Peter McCulloch
BMJ Quality & Safety 2014; 24 111-119 Published Online First: 22 Jul 2014. doi: 10.1136/bmjqs-2014-003204
Original research
Effectiveness of facilitated introduction of a standard operating procedure into routine processes in the operating theatre: a controlled interrupted time series

Lauren Morgan Steve New Eleanor Robertson Gary Collins Oliver Rivero-Arias Ken Catchpole Sharon P Pickering Mohammed Hadi Damian Griffin Peter McCulloch
BMJ Quality & Safety 2014; 24 120-127 Published Online First: 03 Nov 2014. doi: 10.1136/bmjqs-2014-003158

[1] ↵
Morgan L,
New S,
Robertson E
, et al. Effectiveness of facilitated introduction of a standard operating procedure into routine processes in the operating theatre: a controlled interrupted time series. BMJ Qual Saf 2015;24:120–27.

[2] Morgan L,

[3] New S,

[4] Robertson E

[5] ↵
Morgan L,
Pickering SP,
Hadi M
, et al. A combined teamwork training and work standardisation intervention in operating theatres: controlled interrupted time series study. BMJ Qual Saf 2015;24:111–19.

[6] Morgan L,

[7] Pickering SP,

[8] Hadi M

[9] ↵
Davidoff F
. Heterogeneity is not always noise. JAMA 2009;302:2580–6. doi:10.1001/jama.2009.1845
OpenUrl CrossRef PubMed Web of Science

[10] Davidoff F

[11] ↵
Waterson P
. Sociotechnical design of work systems. In: Wilson JR, Corlett N, eds. Evaluation of human work. 3rd edn. London, UK: Taylor & Francis, 2005:769–92.

[12] Waterson P

[13] ↵
Eason K
. Afterword: the past, present and future of sociotechnical systems theory. Appl Ergon 2014;45:213–20. doi:10.1016/j.apergo.2013.09.017
OpenUrl CrossRef PubMed

[14] Eason K

[15] ↵
Kleiner BM
. Macroegonomics: work system analysis and design. Hum Factors 2008;50:461–7. doi:10.1518/001872008X288501
OpenUrl Abstract/FREE Full Text

[16] Kleiner BM

[17] ↵
Jervis R
. System effects: complexity in political and social life. Princeton, NJ: Princeton University Press, 1998:328.

[18] Jervis R

[19] ↵
Axelrod R,
Cohen MD
. Harnessing complexity: organizational implications of a scientific frontier. New York, NY: Basic Books, 2000:184.

[20] Axelrod R,

[21] Cohen MD

[22] ↵
Vaughan D
. System effects: on slippery slopes, repeating negative patterns, and learning from mistake? In: Starbuck HW, Farjoun M, eds. Organization at the limits: NASA and the Columbia Accident. London, UK: Blackwell, 2005:41–59.

[23] Vaughan D

[24] ↵
Thomas L
. On meddling. N Engl J Med 1976;294:599–600. doi:10.1056/NEJM197603112941108
OpenUrl CrossRef PubMed Web of Science

[25] Thomas L

[26] ↵
Sterman JD
. System dynamics modeling: tools for learning in a complex world. California Manag Rev 2001;43:8–25. doi:10.2307/41166098
OpenUrl

[27] Sterman JD

[28] ↵
Davidoff F
. Improvement interventions are social treatments, not pills. Ann Intern Med 2014;161:526–7. doi:10.7326/M14-1789
OpenUrl CrossRef PubMed Web of Science

[29] Davidoff F

[30] ↵
Wears RL
. Standardisation and its discontents. Cogn Technol Work 2014. doi:10.1007/s10111-014-0299-6 [epub ahead of print 26 Sep 2014].

[31] Wears RL

[32] ↵
Høyland S,
Aase K,
Hollund JG, et al
. What is it about checklists? Exploring safe work practices in surgical teams. In: Bieder C, Bourier M, eds. Trapping safety into rules: how desireable or avoidable is proceduralization. Farnham UK: Ashgate, 2013:121–38.

[33] Høyland S,

[34] Aase K,

[35] Hollund JG, et al

[36] ↵
Hollnagel E
. Why is work-as-imagined different from work-as-done? In: Wears RL, Hollnagel E, Braithwaite J, eds. Resilience in everyday clinical work. Farnham, UK: Ashgate, 2015 (in press):249–64.

[37] Hollnagel E

[38] ↵
Hilligoss B
. Selling patients and other metaphors: A discourse analysis of the interpretive frames that shape emergency department admission handoffs. Soc Sci Med 2014;102: 119–28. doi:10.1016/j.socscimed.2013.11.034
OpenUrl CrossRef PubMed

[39] Hilligoss B

[40] ↵
Berwick DM
. Improvement, trust, and the healthcare workforce. Qual Saf Health Care 2003;12:448–52. doi:10.1136/qhc.12.6.448
OpenUrl Abstract/FREE Full Text

[41] Berwick DM

[42] ↵
Wears RL,
Hunte GS
. Seeing patient safety ‘Like a State’. Saf Sci 2014;67:50–7. doi:10.1016/j.ssci.2014.02.007
OpenUrl CrossRef

[43] Wears RL,

[44] Hunte GS

[45] ↵
Greenhalgh T,
Howick J,
Maskrey N
. Evidence based medicine: a movement in crisis? BMJ 2014;348:g3725. doi:10.1136/bmj.g3725
OpenUrl FREE Full Text

[46] Greenhalgh T,

[47] Howick J,

[48] Maskrey N

[49] ↵
Berwick DM
. The science of improvement. JAMA 2008;299:1182–4. doi:10.1001/jama.299.10.1182
OpenUrl CrossRef PubMed Web of Science

[50] Berwick DM

[51] ↵
Pawson R,
Tilley N
. Realistic evaluation. London, UK: Sage Publications, Ltd, 1997:235.

[52] Pawson R,

[53] Tilley N

[54] ↵
Greenhalgh T,
Russell J
. Why do evaluations of eHealth Programs fail? An alternative set of guiding principles. PLoS Med 2010;7:e1000360. doi:10.1371/journal.pmed.1000360
OpenUrl CrossRef PubMed

[55] Greenhalgh T,

[56] Russell J

[57] ↵
Flyvbjerg B
. Case study. In: Denzin NK, Lincoln YS, eds. Sage handbook of qualitative research. 4th edn. Thousand Oaks, CA: Sage, 2011:301–16.

[58] Flyvbjerg B

[59] ↵
Geertz C
. Thick description: toward an interpretive theory of culture. In: The interpretation of cultures: selected essays. New York, NY: Basic Books, 1973:3–30.

[60] Geertz C

[61] ↵
Leslie M,
Paradis E,
Gropper MA
, et al. Applying ethnography to the study of context in healthcare quality and safety. BMJ Qual Saf 2014;23:99–105. doi:10.1136/bmjqs-2013-002335
OpenUrl Abstract/FREE Full Text

[62] Leslie M,

[63] Paradis E,

[64] Gropper MA

[65] ↵
Dixon-Woods M,
Bosk CL,
Aveling EL
, et al. Explaining Michigan: developing an ex post theory of a quality improvement program. Milbank Q 2011;89:167–205. doi:10.1111/j.1468-0009.2011.00625.x
OpenUrl CrossRef PubMed Web of Science

[66] Dixon-Woods M,

[67] Bosk CL,

[68] Aveling EL

[69] ↵
Greenhalgh T,
Potts HW,
Wong G
, et al. Tensions and paradoxes in electronic patient record research: a systematic literature review using the meta-narrative method. Milbank Q 2009;87:729–88. doi:10.1111/j.1468-0009.2009.00578.x
OpenUrl CrossRef PubMed Web of Science

[70] Greenhalgh T,

[71] Potts HW,

[72] Wong G

[73] ↵
Dekker SWA,
Nyce J,
Myers D
. The little engine who could not: “rehabilitating” the individual in safety research. Cogn Technol Work 2013;15:277–82. doi:10.1007/s10111-012-0228-5
OpenUrl CrossRef Web of Science

[74] Dekker SWA,

[75] Nyce J,

[76] Myers D

[77] ↵
Dekker SWA
. The psychology of accident investigation: epistemological, preventive, moral and existential meaning-making. Theor Issues Ergon Sci 2014. doi:10.1080/1463922X.2014.955554 [epub ahead of print 14 Oct 2014].

[78] Dekker SWA

[79] ↵
Dekker SWA,
Nyce JM
. There is safety in power, or power in safety. Saf Sci 2014;67:44–9. doi:10.1016/j.ssci.2013.10.013
OpenUrl CrossRef

[80] Dekker SWA,

[81] Nyce JM

[82] ↵
Harrison S,
Moran M,
Wood B
. Policy emergence and policy convergence: the case of ‘scientific-bureaucratic medicine’ in the United States and United Kingdom. Br J Politics Int Relations 2002;4:1–24. doi:10.1111/1467-856X.41068
OpenUrl CrossRef

[83] Harrison S,

[84] Moran M,

[85] Wood B

[86] ↵
Starr P
. The social transformation of American Medicine: the rise of a sovereign profession and the making of a vast industry. New York, NY: Basic Books, 1983:528.

[87] Starr P

[88] ↵
Kleinke JD
. The industrialization of health care. JAMA 1997;278:1456–7. doi:10.1001/jama.278.17.1456
OpenUrl CrossRef PubMed

[89] Kleinke JD

Log in using your username and password

Main menu

Log in using your username and password

You are here

Statistics from Altmetric.com

Request Permissions

References

Footnotes

Linked Articles

Read the full text or download the PDF:

Log in using your username and password