Identifying early warning signs for diagnostic errors in primary care: a qualitative study
John Balla1, Carl Heneghan1, Clare Goyder1, Matthew Thompson2

1Centre for Evidence-based Medicine, University of Oxford, Oxford, UK
2Department of Primary Health Care, University of Oxford, Oxford, UK

Correspondence to Dr John Balla; john.balla@phc.ox.ac.uk

Abstract

Objective We investigate the mechanisms of diagnostic error in primary care consultations to detect warning signs of possible error. We aim to identify the steps in the diagnostic reasoning process associated with the greatest risk of error.

Design A qualitative study using semistructured interviews with open-ended questions.

Setting A 2-month study in primary care conducted in Oxfordshire, UK.

Participants We approached about 25 experienced general practitioners by email or word of mouth; 15 volunteered for the interviews and were available at a convenient time.

Intervention Interview transcripts provided 45 cases of error. Three researchers searched these independently for underlying themes in relation to our conceptual framework.

Outcome measures Locating steps in the diagnostic reasoning process associated with major risk of error and detecting warning signs that can alert clinicians to increased risk of error.

Results Initiation and closure of the cognitive process are the steps most exposed to the risk of error. Cognitive biases developed early in the process lead to errors at its end; these biases can be used as warning signs to alert clinicians to an increased risk of diagnostic error. Ignoring red flags or critical cues was related to the process being biased through the initial frame, but could equally be explained by knowledge gaps.

Conclusions Cognitive biases developed at the initial framing of the problem relate to errors at the end of the process. We refer to these biases as warning signs that can alert clinicians to an increased risk of diagnostic error. We also conclude that lack of knowledge is likely to be an important factor in diagnostic error. Reducing diagnostic errors in primary care should focus on early and systematic recognition of errors, including near misses, and on a continuing professional development environment that promotes reflection in action to highlight possible causes of process bias and of knowledge gaps.

  • Education & Training (see Medical Education & Training)
  • Primary Care
  • Qualitative Research
  • Clinical reasoning
  • Clinical decision making

This is an open-access article distributed under the terms of the Creative Commons Attribution Non-commercial License, which permits use, distribution, and reproduction in any medium, provided the original work is properly cited, the use is non-commercial and is otherwise in compliance with the license. See: http://creativecommons.org/licenses/by-nc/2.0/ and http://creativecommons.org/licenses/by-nc/2.0/legalcode.

Article summary

Article focus

  • Investigation of the relation of clinical reasoning to diagnostic error.

  • Identifying places in the diagnostic reasoning process associated with major risk of error.

  • Detection of warning signs that can alert clinicians to an increased risk of error.

Key messages

  • Initiation and closure of the cognitive process are most exposed to the risk of diagnostic error in primary care.

  • Cognitive biases developed at initiation may be used as warning signs to alert clinicians to the increased risk of diagnostic error.

  • Deliberate practice of looking for warning signs may be a potential method of professional development to reduce error through reflection in action.

Strengths and limitations of this study

  • The design of the study rests on the strong theoretical framework provided by the dual theory of cognition and is grounded in real-life situations.

  • Hindsight bias may lead to unjustified causal explanations of errors.

  • Limitations concern generalisability: participants were self-selected individuals, and the findings have not been replicated in other cohorts.

Introduction

Diagnostic error has been defined ‘as a diagnosis that was unintentionally delayed (sufficient information was available earlier), wrong (another diagnosis was made before the correct one), or missed (no diagnosis was ever made)’.1–3

These definitions have also been refined to emphasise the ‘longitudinal aspect of diagnosis’, where a precise diagnosis need not be made, or it can wait, ‘because other decisions may take priority’.4 The relative importance of diagnostic errors to the widespread health problem of medical errors in general is unclear, yet several US sources suggest that they are a major contributor.5 A systematic review of diagnostic error in primary care in the UK reported that diagnostic error accounts for the greatest proportion (63%) of medical malpractice claims against general practitioners (GPs).6 An analysis of 1000 claims against GPs in the UK identified 631 alleged delayed diagnoses.7 A New Zealand review of primary care claims showed that delays in diagnosis were few, but were associated with a disproportionate number of serious and sentinel injuries (16%) and deaths (50%).8 Studies of diagnostic error outside of insurance claims generally focus on hospital populations and specialties.9 A recent review quotes a diagnostic error rate of between 5% and 15%, depending on the specialty and the methods of data collection.10

Several commentators have noted the importance of attempting to reduce diagnostic error by understanding in more depth the cognitive reasoning processes underlying diagnostic decision-making.11 Winters et al12 noted that any attempts to improve diagnostic safety must ‘intuitively support how our brains work rather than how we would like them to work.’ Norman13 noted the potential importance of reflecting on the clinician's ‘own performances and identify places where their reasoning may have failed’. Indeed, Elstein11 suggested that a major objective for improving cognitive reasoning in future research should include delineating how feedback on clinical reasoning could be used to ‘guide clinicians to identify priority tasks for reducing diagnostic errors’. Ely et al4 propose that one method for doing this could be to identify those red flags ‘that should prompt a time-out’ as a priority for the study of diagnostic error.

We have previously suggested a model based on the Dual Theory of Cognition (DTC) as a framework for exploring diagnostic reasoning in primary care. This model identifies high-risk points in the diagnostic reasoning process where the clinician is most likely to commit an error, which is consistent with our objective.14 In this study, we investigate the mechanisms of diagnostic error in primary care clinical practice using qualitative methods with a well-defined cohort and a strong theoretical base.

Methods

As indicated in the introduction, the majority of studies of diagnostic error rely on retrospective reviews of data collected for malpractice claims. A smaller number of reports deal with hospital incidents and consist of ill-defined cohorts studied with a mix of poorly defined methodologies. What happened during the actual consultation is rarely known and is, by the nature of the data, unavailable; instead, chart reviewers tend to make personal judgments. A further confounding problem is that clinicians may never find out about their errors, and if they do, it may be weeks or months after the event. These issues are discussed in more detail in the section on the strengths and limitations of this study. Our choice of method, relying on interviews for data collection, was made with the aim of getting a little closer to the truth of what happens when a consultation effectively fails. Knowing the limitations of the method, our expectation was that this might lead us, and hopefully others, to further fruitful research. We leave the discussion of these matters to the section on implications for future research at the end of this paper.

Design

Semistructured interviews using open-ended questions were based on a model of clinical reasoning derived from the DTC. This is a generally accepted model of human cognition that encompasses an initial fast response to a problem (System 1), occurring at a subconscious level, which is associated with or followed by a slow, analytic, reflective response (System 2).15 The initial response is based on a mental construct16 made up of the individual's interpretation of past experience, understanding of theories of professional knowledge, value judgments and the social context. Initial judgments based on these pre-existing constructs, in relation to the context of the new case, lead to the construction of a frame17 that limits and gives direction to the rest of the cognitive process.18

Closure of the process may occur with minimal or no exposure to the analytic System 2; just how much analysis occurs depends on the relative dominance of System 1. The use of closure rules19 is consistent with System 2. Diagnostic error frequently occurs at initiation, but most often at closure, and both Systems 1 and 2 may be involved.4 It is the congruence of the DTC with what we know about clinical reasoning that led us to use it as the model for this study.

Participants

Primary care clinicians (known as general practitioners, or GPs, in the UK) were asked first to define diagnostic error and then to discuss such cases from their practice. Fifteen GPs (>5 years in practice) working in primary care in Oxfordshire provided 45 cases. The model highlights key areas of knowledge and critical value judgments used in the clinical encounter; these provided the questions for our interviews and are described in appendix 1. We agreed that data saturation had been reached about two-thirds of the way through the interviews and did not recruit more participants.

Analysis

Three researchers searched the text independently for underlying themes in relation to our conceptual framework. The steps in the dual theory provided the structure for this, starting with themes for the initial salient features of the case and ending with closure. Categories were coded according to emerging themes and were added to or changed as new concepts emerged.20 The researchers compared their findings in person, on Skype and through email, up to once or twice a week over a 2–3 month period. Differences in interpretation were few and were discussed; there were no major differences, and consensus was reached on clustering to common themes. Direct quotes from the interviews appear in single quotation marks; numbers preceded by G are interview identifiers.

Results

Participants indicated that our questions were easy to follow and most commented that they enjoyed the interview as a good way to reflect on practice.

Definitions of diagnostic error

The definitions of diagnostic error provided by respondents were consistent with previous definitions. Over half of the 45 cases analysed were associated with the clinician focusing on a single diagnosis from presentation to closure of the cognitive process. Respondents divided errors into two categories (G12): first, the wrong diagnostic label, ‘a diagnosis that's wrong or proven to be wrong by yourself or someone else at a later date’ (G3), and second, delayed diagnosis, described as ‘missed the boat: you should have done something but you didn't’ (G2).

GPs raised two issues that caused difficulties in defining an error: first, variation in how to deal with the severity of the impact of the error: ‘I rarely give someone a firm diagnosis … it would be an error if there was something serious and I had told someone [it] wasn't’ (G8); and second, what constitutes unacceptable delay, given that ‘lots of what we see is at low prevalence and evolving, so at the very front end it's very vague so actually most of that by definition should be delayed’ (G1).

Process of clinical reasoning in relation to diagnostic error

Results are reported with reference to the 45 cases and not to individual GPs. The cohort provided sufficient data for a case-by-case analysis of errors, but not for comparisons across the 15 GPs, though there were no indications of differing themes between individuals. The themes identified fell into two main groups: initiation and setting the initial diagnostic frame, followed by stopping the search for further clinical information and achieving diagnostic closure. Additional themes that emerged are also discussed.

Initiation and setting the initial diagnostic frame

Salient features link the individual clinician's personal knowledge of similar cases to the new presentation. A number of themes emerged from our analysis that shaped the frame for the new case in its specific context. In most cases, GPs formed instantaneous diagnoses. For example, the patient's appearance, ‘thought actually looked OK for a first child’ (G19), or ‘she didn't sound too unwell over the phone’ (G9), provided a powerful bias for framing the case before other information was available. In over two-thirds (31/45) of cases the focus was on a single diagnosis, in about half of these (16/31) based on presentation with a pre-existing diagnostic label. Box 1 reconstructs such a case.

Box 1

Case 27: illustration of initiation of the process and setting the initial diagnostic frame

The patient's history of previous psychosocial problems or abnormal behaviour was predominant at this stage of the presentation: ‘previous consultation which had set her up as a particular kind of person’ (4). Other salient features led GPs to instant recognition of a diagnosis or a limited number of differentials. For instance, ‘just focusing on the vomiting’ (39) made the GP think of a gastric problem and delayed the diagnosis of an obstructed hernia. Wrong localisation framed the cognitive process and biased further information gathering, directly impacting on diagnostic closure thresholds.

Participants made repeated references to needing to focus on the natural history of disease and the expected response to treatment. For instance, referring to a case of missed cancer: ‘people that have haemorrhoids that respond beautifully to treatment and have no other symptoms we don't tend to think, oh have they got a colonic cancer’ (14). Or, on the need for experience: ‘lacked experience at that time [to] potential of this case…. and work on the possible diagnosis’ (9). Most of these references to experience were suggestive of knowledge gaps rather than of cognitive error alone. Further examples of biases arising from the initial framing appear in table 1.

Table 1

Biases arising from salient features of presentation which initiate the diagnostic process and frame the direction of subsequent information gathering

Conditions and thresholds for diagnostic closure

Participants did not use numerical criteria to describe the thresholds they used to decide when to stop searching for more clinical information, nor did they express confidence in their decisions this way. When pressed, some responded in terms such as ‘the test would exclude X in 70% of cases’ or ‘I was more than 80% certain that I excluded Y’, but the basis for these numerical values was very unclear. Because a number of participants appeared to find these questions judgmental, we dropped them as the interview schedule progressed.
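
To make explicit what such figures would entail if taken at face value (a minimal worked example with hypothetical numbers, not drawn from our data), a claim that a negative test ‘excludes X in 70% of cases’ is, formally, a claim about post-test probability. With a pre-test probability $P(X)=0.10$ and a test with sensitivity 0.70 and specificity 0.90, Bayes' theorem gives the probability of disease after a negative result as

$$P(X \mid T^{-}) = \frac{(1-0.70)\times 0.10}{(1-0.70)\times 0.10 + 0.90\times 0.90} \approx 0.04.$$

Arriving at such a value requires defensible estimates of prevalence, sensitivity and specificity, none of which our participants could articulate; this is consistent with their closure thresholds being intuitive rather than numerical.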

A number of GPs raised safety netting, either spontaneously or in response to our questions about diagnostic closure. It soon became apparent that recollections were hazy and that they were unsure whether they had actually used safety netting or just thought that they should have. We therefore do not include safety netting in our analysis, while recognising its importance as a potential cause of error.

The decisions made at closure were affected by biases from the setting of the initial frame, effectively impeding the reflective System 2 review expected at this stage. Box 2 reconstructs such a case to provide insights into how biases formed at initiation may affect decisions at closure.

Box 2

Case 14: illustration of dominant System 1 impeding System 2 review at closure, leading to error

Other themes related to diagnostic errors that followed ignoring or misinterpreting the predictive value of critical information coming from the patient, as did ignoring ‘gut feelings’.21 Other respondents noted the need to be circumspect when responding to patient needs, including a poor outcome with a patient who did not wish to follow advice: ‘had a cruise booked and he chose to cancel the appointment and go on the cruise’ (15). Some GPs raised issues about their own behaviours: ‘one's own state of confidence or call it what you want competence confidence arrogance or risk taking or not risk caution all play in the actual what you decide to do’ (12); ‘I'm right at one end of low referrers’ (43). Contextual factors were often raised as contributing to faulty decisions: ‘[knowing how busy it is before a weekend] do I want to send a frail, elderly lady up to the hospital on a Friday afternoon when it would be mayhem [there]’ (34); or, explaining a missed diagnosis: ‘we were really busy and I think they came in as an emergency’ (39). Table 2 provides examples of biases affecting thresholds for ruling disease in or out.

Table 2

Effect of framing biases on closure thresholds for ruling disease in or out

Discussion

Main findings

The findings of this study identify the initiation and closure of the cognitive process in the clinical consultation as the steps most exposed to the risk of diagnostic error. Initiation is a critical step as it sets the frame for the subsequent information search, whereas closure occurs when the thresholds for stopping the search have been met. We show that cognitive biases developed at framing appear to relate directly to errors at the end of the process. We refer to these as warning signs (table 2), as we believe they can be used to alert the clinician to the increased risk of diagnostic error. Previous studies4 have also highlighted these two steps as the points where most cognitive errors occur; our findings build on these, providing insights into the underlying sources of the biases that made the process go wrong. This is consistent with Wears and Nemeth's17 observation that ‘[we] do not learn much by asking why the way a practitioner framed the problem turned out to be wrong. We do learn when we discover why that framing seemed so reasonable at the time’.

The initial process of framing a new case is mostly subconscious and occurs within the domain of System 1 of the DTC (the instant, fast response). The salient feature of the case leads to recognition of similar cases, and framing is modulated by constructs drawn from experience, knowledge, values and social context.22 If there is no instant recognition, System 2 (slow, analytic thinking) may be engaged, but this may consist of no more than gut feelings.21 In the cases analysed here, there was a dominant focus on a single diagnosis; it is likely that other options were also entertained but were forgotten by interviewees. The most significant biases occurring at this stage related to patients with pre-existing diagnostic labels and those with underlying psychosocial problems (table 1). Ignoring red flags or critical cues may have been related to the process being biased through the frame, but could equally be explained by lack of knowledge of the significance of these clinical features.

We have previously suggested23 that the informal ‘rules’ clinicians use to cease their search for further clinical features during a consultation (ie, stopping rules for diagnostic closure) involve three criteria: (1) high-risk conditions have been excluded and other options appropriately ruled in; (2) there has been a direct response to the patient's needs; and (3) there is a reliable safety net in place. Based on our current findings, it appears that in some cases red flags and critical cues suggesting alternative diagnoses were ignored or misinterpreted as a result of the biases we describe. The focus was on ruling in early diagnoses, rather than the usually preferred option of first ruling hazardous conditions out. This is consistent with System 1 being dominant and the reflective System 2 being ignored. We found that ignoring, or failing to search for, important cues may be due to knowledge gaps as well as to biased reasoning processes. Participants' frequent references to not being experienced suggest that knowledge gaps contributed to a number of errors. For these reasons, and contrary to much of the literature (with the exception of Norman and Eva10), we conclude that lack of knowledge is indeed likely to be an important factor in diagnostic error.
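
Stated compactly (our formalisation, offered only as an illustration and not as the participants' own reasoning), these stopping rules make safe diagnostic closure a conjunction:

$$\text{close} \iff C_1 \wedge C_2 \wedge C_3,$$

where $C_1$ denotes high-risk conditions excluded and remaining options appropriately ruled in, $C_2$ a direct response to the patient's needs, and $C_3$ a reliable safety net in place. The errors described above correspond to $C_1$ failing silently: the frame set at initiation hid an unexcluded dangerous diagnosis, so the conjunction appeared satisfied when it was not.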

Finally, we found that participants did not use numerical values as thresholds for stopping the collection of more clinical data. We hypothesise that this relates to the complexity of these judgments, which must satisfy decision rules based largely on subjective inputs. First, the clinician needs to take into account the basis for ruling disease in or out. Most early formulations are at least partly based on System 1 knowledge and have a tendency towards bias unless one takes time for reflection under the influence of System 2, and robust data on predictive values may not exist or may not be powerful enough to lead to safe closure. Second, assessing the values and individual needs of patients in the context of an illness is subjective. Third, confidence levels in a safety net may not be reliable, yet confidence in safe closure must depend closely on confidence in the quality of the net.23

If we add to these issues the impact of the context or environment on the clinician's decision-making24 and on the willingness or ability to involve System 2, it becomes apparent why Bayesian and similar ‘rational’ approaches are not the norm. It comes as no surprise when one of our GPs says of closure that ‘it feels like a nebulous thing’.23 We believe that clinicians use individualised and often tacit guides for how to deal with the problem and the other ‘multifarious factors that come into reckoning when making decisions’.22 With this package they then make judgments of the kind described by Polanyi,16 writing about personal knowledge: ‘in respect of choices made in the exercise of personal judgment … there is always a range of discretion open in a choice … In view of the unspecifiability of the particulars on which such a decision will be based, it is heavily affected by the participation of the person pouring himself into these particulars and may in fact represent a major feat of his originality’. He concludes that valid choices can be made by submitting to one's own sense of responsibility. This last statement sums up the core of professionalism, where we expect a deep sense of responsibility from the decision maker.
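
For contrast, consider what a fully ‘rational’ closure rule would demand (a sketch of the normative model only, not of anything our participants did). In the odds form of Bayes' theorem, each new cue $E$ updates the prior odds of a diagnosis $D$ by its likelihood ratio $LR$:

$$O(D \mid E) = LR \times O(D),$$

and the search would stop only once the posterior probability of every dangerous diagnosis had fallen below a pre-specified threshold. Every element of this rule (the prior odds, the likelihood ratio of each cue and the threshold itself) requires explicit numbers, which helps explain why closure instead ‘feels like a nebulous thing’.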

So, what can we offer to reduce error? In an ideal world, clinicians would have robust clinical data with high true-positive and true-negative characteristics, which could then be used to develop decision aids, as suggested by Buntinx et al.25 However, such information is scarce, and a great deal of subjective judgment would still be required. Providing indicators of early warning signs of where errors may occur in the diagnostic process may help to prompt a reflective review of the cognitive process. We need to look at ways of presenting these as an integral part of the clinical process; this approach may then become part of ongoing continuing professional development for clinicians. Figure 1, based on our results, summarises how warning signs may be incorporated in a practice environment to constructively promote reflection in practice.

Figure 1

Questions to relate potential sources of error to cognitive process: anatomy of diagnostic error.

Strengths and limitations of the study

We designed the study to be consistent with desired future directions as outlined in the introduction. It is based on a strong theoretical framework provided by the DTC. The strength of this model is not only that it ‘intuitively supports how our brains work’,12 but particularly that the initiation of the process is consistent with System 1 working at a subconscious level. Our interviews explore what is salient in a presentation26 and show how initial biases influence the diagnostic process. We identify warning signs on the way, alerting clinicians to risk of error.4 These may have the potential to reduce diagnostic error.11

Limitations relate to three main issues. First, sampling: participants were not chosen at random but were self-selected individuals, making up more than half of the cohort we approached. This is an almost universal problem in studies involving busy, senior clinicians. Our research needs to be replicated in other settings in the UK, in countries with different health systems, or with other specialties. However, given that our findings are consistent with other studies using different methodologies and rest on a strong theoretical base, the results are likely to have a degree of generalisability. Second, hindsight bias: clinicians generally become aware of errors after the event, and the information may be fragmentary. They construct a narrative based on hindsight, ending up with ‘illusions that one has understood the past’.15 In the construction of the narrative, hindsight bias converts events ‘into a coherent causal framework’.17 Occurring at a subconscious level, the bias is inevitable, and all previous studies of error have been tainted by it. We attempted to minimise these limitations by examining clinicians in real-life situations rather than a laboratory and by avoiding data collected for other reasons (eg, litigation);17 our interviews tried to reconstruct the context in which each error took place. Third, we do not have evidence that alerts such as the warning signs we propose will reduce diagnostic errors in clinical practice. Similarly, deliberate practice27 has not been shown to be successful in non-procedural specialties, but in view of its success in procedural clinical settings it would be a strong contender to be explored.4

Implications for practice

There are two prerequisites for change. (1) Early and systematic recognition of errors, including near misses: this could be achieved through regular, non-threatening, in-practice audits or significant event analyses; without it, it will not be possible to reduce hindsight bias. (2) Provision of a clinical environment that promotes reflection in action to detect the causes of process bias and knowledge gaps. This may be more feasible when working in a group of trusted peers and using methods such as incident reviews and journal meetings focused on recent errors as a way of reviewing diagnostic reasoning processes as well as knowledge gaps. Since reflection will need to focus on both the cognitive process and evidence-based diagnostic resources, it would be important to have meetings facilitated by a trained, preferably internal, GP.

Implications for future research

We do not know whether the practice changes we propose will lead to better clinical care, and this needs separate evaluation. To explore the generalisability of our findings, we need to replicate this study in groups of clinicians in other specialties and settings. Future studies will need to deal with cases closer to the event and place even more emphasis on context. The model of deliberate practice27 is likely to be suitable for ongoing professional development and training to reduce error; such approaches have led to improved clinical care, albeit in interventional specialties, which suggests that similar improvements might be feasible in primary care, but this needs evaluation.

Conclusions

Initiation and closure of the cognitive process are the steps most exposed to the risk of diagnostic error in primary care. Cognitive biases developed at framing directly influence errors at the end of the process; we refer to these as warning signs that can alert clinicians to the increased risk of diagnostic error. The most significant reasoning biases we observed related to patients presenting with pre-existing diagnostic labels and psychosocial problems. Others included the use of heuristics, the patient's appearance, wrong initial localisation of the problem and probabilistic bias. Red flags or critical cues that were subsequently ignored may have reflected a process biased through the frame, but could also be explained by knowledge gaps.

We conclude that lack of knowledge is likely to be an important factor in diagnostic error. Reducing diagnostic errors in primary care should focus on early and systematic recognition of errors, including near misses, and on a continuing professional development environment that promotes reflection in action to highlight possible causes of process bias and of knowledge gaps. Alerting clinicians to warning signs where there is an increased risk of error may be one way to prompt a reflective review of the cognitive process. For this to become an integral part of the clinical process, we may need to experiment with deliberate practice in looking for warning signs as a potential method of professional development to reduce error.

Acknowledgments

We wish to acknowledge a number of colleagues who helped us: Margaret Balla participated in the interview analysis and produced figure 1; Nia Roberts assisted with the literature review; Mary Hodgkinson and Dawn Fraser had the arduous task of transcribing the interviews. We particularly thank our willing participants who freely discussed the sensitive issues relating to their experiences with diagnostic error.

References

Supplementary materials

  • Supplementary Data

Footnotes

  • Contributors JB designed the project, carried out the interviews and the analysis, wrote the first draft and was involved in all drafts and the final manuscript. CH and MT conceived the project and were involved in the design, all drafts and the final manuscript. CG was involved in the literature search, the analysis and the final manuscript. All authors contributed to the final document.

  • Competing interests None.

  • Data sharing statement There are no additional data available.