The data of diagnostic error: big, large and small

Gurpreet Dhaliwal; Kaveh G Shojania

doi:10.1136/bmjqs-2018-007917

Article Text

PDF

Editorial

The data of diagnostic error: big, large and small

Free

Gurpreet Dhaliwal1,2,
Kaveh G Shojania3

¹ Department of Medicine, University of California, San Francisco, San Francisco, California, USA
² Medical Service, San Francisco VA Medical Center, San Francisco, California, USA
³ Department of Medicine and Centre for Quality Improvement and Patient Safety, University of Toronto, Toronto, Ontario, Canada

Correspondence to Dr Gurpreet Dhaliwal, Department of Medicine University of California, San Francisco CA 94121, USA; gurpreet.dhaliwal{at}ucsf.edu

https://doi.org/10.1136/bmjqs-2018-007917

Statistics from Altmetric.com

Request Permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.

Diagnostic error research has mostly focused on methods to detect, characterise and analyse lapses in the diagnostic process by using incident reports, malpractice claims, autopsies and electronic trigger tools. The associated literature shows how frequently important diagnostic errors occur1 and examines cognitive2 and system-based3 causes of these errors. Relatively absent from this portfolio of research have been large-scale approaches for measuring institutional diagnostic performance, either for benchmarking purposes or for driving improvement efforts.

In this issue of BMJQS, Liberman and Newman-Toker introduce Symptom–Disease Pair Analysis of Diagnostic Error (SPADE) as a new approach to identify diagnostic errors by analysing large patient data sets (tens of thousands of patient encounters housed in electronic medical records or administrative databases).4 The SPADE methodology starts with a symptom that is misdiagnosed at an appreciable rate such as chest pain or dizziness. It then looks for instances within the data set where a patient with that symptom has two coded encounters in a short time frame. Misdiagnosis-related harm is inferred when there is a prespecified change in diagnosis over time. An example is a patient with acute dizziness (the symptom) who is discharged with a diagnosis of positional vertigo at an initial emergency department (ED) encounter and 1 week later returns to the ED and is diagnosed with an acute stroke (the disease).

The Symptom–Disease Pair at the core of the SPADE methodology refers to the presenting symptom (eg, dizziness) and the correct, but initially overlooked, diagnosis (eg, stroke). The validity of any symptom–diagnosis pair as a marker of diagnostic error is established using two approaches. A ‘look forward’ approach begins by identifying all patients who presented with dizziness and were discharged with a diagnosis of benign positional vertigo. It then looks at how often these patients present again within 30 days and receive a stroke diagnosis. A ‘look back’ approach establishes the proportion of patients admitted with stroke who presented within the preceding 30 days with dizziness. The SPADE approach assesses the frequency of this symptom–disease dyad across a large data set and counts each instance as an episode of misdiagnosis-related harm.

SPADE is an advance in diagnostic error measurement. It focuses on organisation-wide, disease-specific, misdiagnosis-related harm. It provides an estimate of diagnostic error frequency without human adjudication of flagged cases. And it offers a dynamic measure of diagnostic performance that can be followed over time. A companion short report and video in this issue of BMJQS illustrate how the SPADE approach can be used to populate an interactive diagnostic performance dashboard.5

SPADE, however, will capture only a subset of diseases, and within those conditions, only a subset of the diagnostic error burden. The SPADE methodology is best suited to detect diagnostic harm associated with symptoms where there is an elevated risk of acute worsening in the short term (eg, chest pain). There also must be a strong bidirectional statistical link between one symptom and one disease for SPADE to infer diagnostic error with confidence. Change in diagnosis must take place over two discrete episodes to be detected by SPADE. If a patient is misdiagnosed and later correctly diagnosed during a single episode of care (eg, within a hospitalisation), the change in diagnosis will not be detected. For instance, a patient admitted with syncope attributed to hypovolaemia and diagnosed days later during the same hospitalisation with ventricular tachycardia will not generate a signal of delayed diagnosis. These concerns, however, should be tempered by the understanding that organisations do not need a perfect account of diagnostic error in order to initiate improvement efforts. A general sense of the magnitude can spark change.

A recurrent theme in patient safety is that no single method provides the complete picture of the problem.6 Whether we are trying to identify cases involving diagnostic errors, generating strategies for avoiding diagnostic errors or measuring diagnostic performance, several complementary approaches are needed.7 For the part of the puzzle that relates to acute conditions with tight linkages between a single symptom and single morbid diagnosis, the SPADE methodology is a promising tactic.

Large data

The authors affiliate SPADE with ‘big data’, a term with an increasingly uncertain meaning on account of its ubiquity and hype. SPADE certainly depends on large data sets, but its data sources and analytical methods more closely resemble the size and types used in traditional health services research. Big data more distinctly refers to data sets that are constructed from multiple structured and unstructured sources and are often so voluminous and complex that traditional data processing software are inadequate to analyse them; oftentimes advanced computational methods, including artificial intelligence and machine learning, are necessary to gain insights from the information. In practice, the demarcation between traditional large data set analysis and big data analytics is fuzzy. Although SPADE may not count as ‘big data’ in the strictest sense, it nonetheless represents a foundational step that illustrates how diagnostic error investigators can leverage large data sets to identify important targets for improvements in diagnostic performance.

A promising starting point will be for large organisations to pilot a SPADE analysis with disease-specific diagnostic errors that have evidence-based solutions. For instance, if this approach unearths a rising number of patients with misdiagnosed spinal epidural abscess, an institution could implement a protocol incorporating risk factor assessment followed by testing for erythrocyte sedimentation rate and C-reactive protein, which has decreased diagnostic delays for this condition.8 Yet, even if SPADE creates a signal (and the system institutes a new protocol), there remains a risk that many clinicians will react without modifying their practice. What can help motivate that change?

Small data

Enter small data. ‘Small data’ is information that is comprehensible without analytics and comes in a volume and format that makes it manageable and informative.9 10 The most accessible version of small data is a story.11 And the most compelling stories for clinicians concern their own patients.

But the only way for clinicians to learn from their patient stories is to learn how each story ends.12 Countless patient encounters end with provisional or unknown diagnoses, where definitive diagnostic confirmation (or refutation) occurs days, weeks or months later as testing, treatment and natural history play out.13 Clinicians who wish to optimise their diagnostic judgements must establish patient tracking systems where they learn how the story ends—and how they can do better the next time they encounter the same diagnostic problem.14 This small data approach can help catalyse the change that big data output brings to our attention.

Suppose SPADE surfaces increasing rates of delayed diagnoses of spinal epidural abscess in elderly patients. As clinicians, we would certainly take an interest in this system-wide signal. But what specifically do we do the next time we are confronted with an octogenarian with back pain? To generate a plan for individual improvement, a clinician would need to analyse their own cases to see where they deviated from best practices and discern other shortcomings in their diagnostic approach.15 Such examination of small data could lead to insights about insufficient examination techniques, underestimation of the frequency of MRI misinterpretation or overestimation of the utility of fever or leukocytosis.

This small data complement to big data approaches will be particularly important with commonly misdiagnosed conditions like pneumonia and cellulitis16 where the causes of misdiagnosis are heterogeneous and complex and where few prepackaged solutions exist. Big data can surface the problem, but small data will provide the insights and motivation to do something about it.

Conclusion

Current research methods are insufficient to understand the magnitude and causes of diagnostic error. SPADE is not yet ready for high-stakes external benchmarking, but it can be piloted within large organisations to see if reliable internal metrics that drive improvements in diagnostic safety can be achieved. Studying diagnostic error is incredibly complex, and it will take time to develop methods. The SPADE approach is an important step forward.

Organisations should welcome large data sets into the portfolio of approaches to detect and measure diagnostic error. But they cannot lose sight of this persistent problem in efforts to improve quality: that data alone do not change practice. Patient stories have often found a place in efforts to motivate improvement efforts,17 18 as they are both emotionally and intellectually engaging.19 Clinicians who set up their own tracking systems quickly learn that the most powerful stories are ones where their own patients are the protagonists and that patient-specific rather than population-wide feedback is the strongest motivator for improvement.20

The curation and sharing of these stories should remain a priority even with the rise of big data. Technology has made amazing progress in recent decades, but the human brain has not changed one bit. The story, not the statistic, remains the brain’s preferred unit of learning—and the most powerful tool of persuasion.

References

↵
2. Singh H ,
3. Meyer AN ,
4. Thomas EJ
. The frequency of diagnostic errors in outpatient care: estimations from three large observational studies involving US adult populations. BMJ Qual Saf 2014;23:727–31.doi:10.1136/bmjqs-2013-002627
OpenUrl Abstract/FREE Full Text
↵
2. Graber ML ,
3. Kissam S ,
4. Payne VL , et al
. Cognitive interventions to reduce diagnostic error: a narrative review. BMJ Qual Saf 2012;21:535–57.doi:10.1136/bmjqs-2011-000149
OpenUrl Abstract/FREE Full Text
↵
2. Singh H ,
3. Graber ML ,
4. Kissam SM , et al
. System-related interventions to reduce diagnostic errors: a narrative review. BMJ Qual Saf 2012;21:160–70.doi:10.1136/bmjqs-2011-000150
OpenUrl Abstract/FREE Full Text
↵
2. Liberman AL ,
3. Newman-Toker DE
. Symptom-Disease Pair Analysis of Diagnostic Error (SPADE): a conceptual framework and methodological approach for unearthing misdiagnosis-related harms using big data. BMJ Qual Saf 2018;27:557–66.doi:10.1136/bmjqs-2017-007032
OpenUrl Abstract/FREE Full Text
↵
Mane et al . Diagnostic Performance Dashboards—Tracking Diagnostic Errors using Big Data. BMJ Qual Saf 2018;27:567–70.
OpenUrl FREE Full Text
↵
2. Levtzion-Korach O ,
3. Frankel A ,
4. Alcalai H , et al
. Integrating incident data from five reporting systems to assess patient safety: making sense of the elephant. Jt Comm J Qual Patient Saf 2010;36:402–AP18.doi:10.1016/S1553-7250(10)36059-4
OpenUrl PubMed
↵
2. Shojania KG
. The elephant of patient safety: what you see depends on how you look. Jt Comm J Qual Patient Saf 2010;36:399–AP3.doi:10.1016/S1553-7250(10)36058-2
OpenUrl PubMed
↵
2. Davis DP ,
3. Salazar A ,
4. Chan TC , et al
. Prospective evaluation of a clinical decision guideline to diagnose spinal epidural abscess in patients who present to the emergency department with spine pain. J Neurosurg Spine 2011;14:765–70.doi:10.3171/2011.1.SPINE1091
OpenUrl PubMed
↵
10 Reasons 2014 will be the Year of Small Data. http://www.zdnet.com/article/10-reasons-2014-will-be-the-year-of-small-data/ (accessed 3 Feb 2018).
↵
2. Sacristán JA ,
3. Dilla T
. No big data without small data: learning health care systems begin and end with the individual patient. J Eval Clin Pract 2015;21:1014–7.doi:10.1111/jep.12350
OpenUrl
↵
2. Aronson L
. A piece of my mind. Story as Evidence, Evidence as Story. JAMA 2015;314:125–6.doi:10.1001/jama.2015.3930
OpenUrl
↵
2. Bowen JL ,
3. Ilgen JS ,
4. Irby DM , et al
. "You Have to Know the End of the Story": Motivations to Follow Up After Transitions of Clinical Responsibility. Acad Med 2017;92:S48–S54.doi:10.1097/ACM.0000000000001919
OpenUrl
↵
2. Schiff GD
. Minimizing diagnostic error: the importance of follow-up and feedback. Am J Med 2008;121:S38–S42.doi:10.1016/j.amjmed.2008.02.004
OpenUrl PubMed
↵
2. Dhaliwal G
. Annals for Hospitalists Inpatient Notes - Diagnostic Excellence Starts With an Incessant Watch. Ann Intern Med 2017;167:HO2–HO3.doi:10.7326/M17-2447
OpenUrl
↵
2. Bhise V ,
3. Meyer AND ,
4. Singh H , et al
. Errors in Diagnosis of Spinal Epidural Abscesses in the Era of Electronic Health Records. Am J Med 2017;130:975–81.doi:10.1016/j.amjmed.2017.03.009
OpenUrl
↵
2. Singh H ,
3. Giardina TD ,
4. Meyer AN , et al
. Types and origins of diagnostic errors in primary care settings. JAMA Intern Med 2013;173:418–25.doi:10.1001/jamainternmed.2013.2777
OpenUrl
↵
2. Wachter RM ,
3. Shojania KG ,
4. Markowitz AJ , et al
. Quality grand rounds: the case for patient safety. Ann Intern Med 2006;145:629–30.doi:10.7326/0003-4819-145-8-200610170-00013
OpenUrl CrossRef PubMed Web of Science
↵
2. Stang AS ,
3. Wong BM
. Patients teaching patient safety: the challenge of turning negative patient experiences into positive learning opportunities. BMJ Qual Saf 2015;24:4–6.doi:10.1136/bmjqs-2014-003655
OpenUrl FREE Full Text
↵
2. Cox K
. Stories as case knowledge: case knowledge as stories. Med Educ 2001;35:862–6.doi:10.1046/j.1365-2923.2001.01016.x
OpenUrl CrossRef PubMed Web of Science
↵
2. Mitchell E ,
3. Sullivan F ,
4. Grimshaw JM , et al
. Improving management of hypertension in general practice: a randomised controlled trial of feedback derived from electronic patient data. Br J Gen Pract 2005;55:94–101.
OpenUrl Abstract/FREE Full Text

Footnotes

Contributor GD and KGS contributed to the conception of the paper; they critically read and modified subsequent drafts and approved the final version. KGS is an editor at BMJ Quality & Safety.
Funding This research received no specific grant from any funding agency in the public, commercial or not-for-profit sectors.
Competing interests GD reports receiving honoraria from ISMIE Mutual Insurance Company and Physicians’ Reciprocal Insurers.
Provenance and peer review Commissioned; internally peer reviewed.

Linked Articles

Research and reporting methodology
Symptom-Disease Pair Analysis of Diagnostic Error (SPADE): a conceptual framework and methodological approach for unearthing misdiagnosis-related harms using big data

Ava L Liberman David E Newman-Toker
BMJ Quality & Safety 2018; 27 557-566 Published Online First: 22 Jan 2018. doi: 10.1136/bmjqs-2017-007032
Short report
Diagnostic performance dashboards: tracking diagnostic errors using big data

Ketan K Mane Kevin B Rubenstein Najlla Nassery Adam L Sharp Ejaz A Shamim Navdeep S Sangha Ahmed Hassoon Mehdi Fanai Zheyu Wang David E Newman-Toker
BMJ Quality & Safety 2018; 27 567-570 Published Online First: 17 Mar 2018. doi: 10.1136/bmjqs-2018-007945

[1] ↵

Singh H ,
Meyer AN ,
Thomas EJ
. The frequency of diagnostic errors in outpatient care: estimations from three large observational studies involving US adult populations. BMJ Qual Saf 2014;23:727–31.doi:10.1136/bmjqs-2013-002627
OpenUrl Abstract/FREE Full Text

[3] Singh H ,

[4] Meyer AN ,

[5] Thomas EJ

[6] ↵

Graber ML ,
Kissam S ,
Payne VL , et al
. Cognitive interventions to reduce diagnostic error: a narrative review. BMJ Qual Saf 2012;21:535–57.doi:10.1136/bmjqs-2011-000149
OpenUrl Abstract/FREE Full Text

[8] Graber ML ,

[9] Kissam S ,

[10] Payne VL , et al

[11] ↵

Singh H ,
Graber ML ,
Kissam SM , et al
. System-related interventions to reduce diagnostic errors: a narrative review. BMJ Qual Saf 2012;21:160–70.doi:10.1136/bmjqs-2011-000150
OpenUrl Abstract/FREE Full Text

[13] Singh H ,

[14] Graber ML ,

[15] Kissam SM , et al

[16] ↵

Liberman AL ,
Newman-Toker DE
. Symptom-Disease Pair Analysis of Diagnostic Error (SPADE): a conceptual framework and methodological approach for unearthing misdiagnosis-related harms using big data. BMJ Qual Saf 2018;27:557–66.doi:10.1136/bmjqs-2017-007032
OpenUrl Abstract/FREE Full Text

[18] Liberman AL ,

[19] Newman-Toker DE

[20] ↵
Mane et al . Diagnostic Performance Dashboards—Tracking Diagnostic Errors using Big Data. BMJ Qual Saf 2018;27:567–70.
OpenUrl FREE Full Text

[21] ↵

Levtzion-Korach O ,
Frankel A ,
Alcalai H , et al
. Integrating incident data from five reporting systems to assess patient safety: making sense of the elephant. Jt Comm J Qual Patient Saf 2010;36:402–AP18.doi:10.1016/S1553-7250(10)36059-4
OpenUrl PubMed

[23] Levtzion-Korach O ,

[24] Frankel A ,

[25] Alcalai H , et al

[26] ↵

Shojania KG
. The elephant of patient safety: what you see depends on how you look. Jt Comm J Qual Patient Saf 2010;36:399–AP3.doi:10.1016/S1553-7250(10)36058-2
OpenUrl PubMed

[28] Shojania KG

[29] ↵

Davis DP ,
Salazar A ,
Chan TC , et al
. Prospective evaluation of a clinical decision guideline to diagnose spinal epidural abscess in patients who present to the emergency department with spine pain. J Neurosurg Spine 2011;14:765–70.doi:10.3171/2011.1.SPINE1091
OpenUrl PubMed

[31] Davis DP ,

[32] Salazar A ,

[33] Chan TC , et al

[34] ↵
10 Reasons 2014 will be the Year of Small Data. http://www.zdnet.com/article/10-reasons-2014-will-be-the-year-of-small-data/ (accessed 3 Feb 2018).

[35] ↵

Sacristán JA ,
Dilla T
. No big data without small data: learning health care systems begin and end with the individual patient. J Eval Clin Pract 2015;21:1014–7.doi:10.1111/jep.12350
OpenUrl

[37] Sacristán JA ,

[38] Dilla T

[39] ↵

Aronson L
. A piece of my mind. Story as Evidence, Evidence as Story. JAMA 2015;314:125–6.doi:10.1001/jama.2015.3930
OpenUrl

[41] Aronson L

[42] ↵

Bowen JL ,
Ilgen JS ,
Irby DM , et al
. "You Have to Know the End of the Story": Motivations to Follow Up After Transitions of Clinical Responsibility. Acad Med 2017;92:S48–S54.doi:10.1097/ACM.0000000000001919
OpenUrl

[44] Bowen JL ,

[45] Ilgen JS ,

[46] Irby DM , et al

[47] ↵

Schiff GD
. Minimizing diagnostic error: the importance of follow-up and feedback. Am J Med 2008;121:S38–S42.doi:10.1016/j.amjmed.2008.02.004
OpenUrl PubMed

[49] Schiff GD

[50] ↵

Dhaliwal G
. Annals for Hospitalists Inpatient Notes - Diagnostic Excellence Starts With an Incessant Watch. Ann Intern Med 2017;167:HO2–HO3.doi:10.7326/M17-2447
OpenUrl

[52] Dhaliwal G

[53] ↵

Bhise V ,
Meyer AND ,
Singh H , et al
. Errors in Diagnosis of Spinal Epidural Abscesses in the Era of Electronic Health Records. Am J Med 2017;130:975–81.doi:10.1016/j.amjmed.2017.03.009
OpenUrl

[55] Bhise V ,

[56] Meyer AND ,

[57] Singh H , et al

[58] ↵

Singh H ,
Giardina TD ,
Meyer AN , et al
. Types and origins of diagnostic errors in primary care settings. JAMA Intern Med 2013;173:418–25.doi:10.1001/jamainternmed.2013.2777
OpenUrl

[60] Singh H ,

[61] Giardina TD ,

[62] Meyer AN , et al

[63] ↵

Wachter RM ,
Shojania KG ,
Markowitz AJ , et al
. Quality grand rounds: the case for patient safety. Ann Intern Med 2006;145:629–30.doi:10.7326/0003-4819-145-8-200610170-00013
OpenUrl CrossRef PubMed Web of Science

[65] Wachter RM ,

[66] Shojania KG ,

[67] Markowitz AJ , et al

[68] ↵

Stang AS ,
Wong BM
. Patients teaching patient safety: the challenge of turning negative patient experiences into positive learning opportunities. BMJ Qual Saf 2015;24:4–6.doi:10.1136/bmjqs-2014-003655
OpenUrl FREE Full Text

[70] Stang AS ,

[71] Wong BM

[72] ↵

Cox K
. Stories as case knowledge: case knowledge as stories. Med Educ 2001;35:862–6.doi:10.1046/j.1365-2923.2001.01016.x
OpenUrl CrossRef PubMed Web of Science

[74] Cox K

[75] ↵

Mitchell E ,
Sullivan F ,
Grimshaw JM , et al
. Improving management of hypertension in general practice: a randomised controlled trial of feedback derived from electronic patient data. Br J Gen Pract 2005;55:94–101.
OpenUrl Abstract/FREE Full Text

[77] Mitchell E ,

[78] Sullivan F ,

[79] Grimshaw JM , et al

Log in using your username and password

Main menu

Log in using your username and password

You are here

Statistics from Altmetric.com

Request Permissions

Large data

Small data

Conclusion

References

Footnotes

Linked Articles

Read the full text or download the PDF:

Log in using your username and password