Background Preventable diagnostic errors are a large burden on healthcare. Cognitive reasoning tools, that is, tools that aim to improve clinical reasoning, are commonly suggested interventions. However, quantitative estimates of tool effectiveness have been aggregated over both workplace-oriented and education-oriented tools, leaving the impact of workplace-oriented cognitive reasoning tools alone unclear. This systematic review and meta-analysis aims to estimate the effect of cognitive reasoning tools on improving diagnostic performance among medical professionals and students, and to identify factors associated with larger improvements.
Methods Controlled experimental studies that assessed whether cognitive reasoning tools improved the diagnostic accuracy of individual medical students or professionals in a workplace setting were included. Embase.com, Medline ALL via Ovid, Web of Science Core Collection, Cochrane Central Register of Controlled Trials and Google Scholar were searched from inception to 15 October 2021, supplemented with handsearching. Meta-analysis was performed using a random-effects model.
Results The literature search resulted in 4546 articles of which 29 studies with data from 2732 participants were included for meta-analysis. The pooled estimate showed considerable heterogeneity (I2=70%). This was reduced to I2=38% by removing three studies that offered training with the tool before the intervention effect was measured. After removing these studies, the pooled estimate indicated that cognitive reasoning tools led to a small improvement in diagnostic accuracy (Hedges’ g=0.20, 95% CI 0.10 to 0.29, p<0.001). There were no significant subgroup differences.
Conclusion Cognitive reasoning tools resulted in small but clinically important improvements in diagnostic accuracy in medical students and professionals, although no factors could be distinguished that resulted in larger improvements. Cognitive reasoning tools could be routinely implemented to improve diagnosis in practice, but going forward, more large-scale studies and evaluations of these tools in practice are needed to determine how these tools can be effectively implemented.
PROSPERO registration number CRD42020186994.
- Cognitive biases
- Diagnostic errors
Data availability statement
Data are available from the corresponding author on reasonable request. The study protocol was preregistered and is available online in the PROSPERO database.
This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/.
WHAT IS ALREADY KNOWN ON THIS TOPIC
Cognitive reasoning tools, that is, tools that aim to improve clinical reasoning, are often recommended to reduce diagnostic errors. Quantitative effect estimates have been aggregated over workplace-oriented and education-oriented tools. It is unknown what the impact of workplace-oriented cognitive reasoning tools is and what factors are associated with greater effectiveness.
WHAT THIS STUDY ADDS
Workplace-oriented cognitive reasoning tools lead to small improvements in diagnostic accuracy, but based on the current evidence no factors could be isolated that lead to greater improvements.
HOW THIS STUDY MIGHT AFFECT RESEARCH, PRACTICE AND/OR POLICY
This meta-analysis suggests that cognitive reasoning tools could improve diagnostic accuracy in practice, but that more large-scale studies are necessary to evaluate the effects of cognitive reasoning tools in practice and under which circumstances cognitive reasoning tools are most effective.
Diagnostic errors, defined as missed, delayed and wrong diagnoses, are a large burden on healthcare and a threat to patient safety. The National Academies of Sciences, Engineering, and Medicine, the collective national academy of the USA, estimated that most people will experience a diagnostic error in their lifetime, sometimes with devastating consequences.1 A significant portion of diagnostic errors is considered preventable and effective interventions are crucial to reduce these errors.2–4
The use of interventions focused on cognitive factors is often recommended3 5–8: these factors are thought to be a primary cause of errors and have been identified in more than 75% of error cases.4 9–11 Such interventions, referred to as cognitive reasoning tools in this study, are aimed at improving clinical reasoning and decision-making skills by improving clinicians’ intuitive and rational processing during diagnosis.3 Examples include checklists,12 reflective practices,2 7 12–15 cognitive forcing strategies12 and clinical decision support systems.12 16 Experiments testing the effectiveness of cognitive reasoning tools are relatively scarce,3 17 but overall the current literature indicates these tools could improve diagnostic accuracy. Previous studies suggest that this effect differs between subgroups: for example, tool effectiveness between studies differed depending on the participants’ level of expertise and the difficulty level of the cases.18
Previous quantitative estimates of the impact of these tools on diagnostic accuracy were made by Prakash et al 2 and Kwan et al,16 who examined the impact of reflective practices and decision support systems, respectively. Crucially, these meta-analyses and other reviews3 7 19 20 have aggregated studies of cognitive reasoning tools in settings where the tools are used to improve learning and competence (education-oriented settings) with studies in settings where the tools are used to improve performance (workplace-oriented settings), a distinction commonly made in the literature.7 21 The exact impact of cognitive reasoning tools on performance in workplace-oriented settings therefore remains unknown. This study aimed to separate the two settings and provide insight into the effectiveness of cognitive reasoning tools aimed at workplace-oriented settings. Additionally, there is no consensus on what makes a reasoning tool effective. In this systematic review and meta-analysis, we first aimed to extend previous estimates of the effect of cognitive reasoning tools on diagnostic accuracy among medical students and professionals. Second, we aimed to identify factors in study or intervention design that were associated with higher overall effectiveness.
The PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) statement for reporting systematic reviews and meta-analyses of studies that evaluate healthcare interventions22 was followed in this study. The review’s objectives and methods were specified in advance in the PROSPERO Database.
Data sources and searches
All searches were conducted with the assistance of biomedical information specialists of the medical library. The complete search strategy is documented in online supplemental appendix A. The following electronic databases were searched: Embase.com (1971–present), Medline ALL (1946–present) via Ovid, Web of Science Core Collection (1975–present) and Cochrane Central Register of Controlled Trials (1992–present). Additionally, a search was performed in Google Scholar from which the 200 most relevant references were downloaded. All searches included unpublished ‘grey’ literature. After the original search was performed in April 2020, the search was last updated on 15 October 2021. Further studies were identified by reviewing reference lists of included studies and conference proceedings (Diagnostic Error in Medicine conferences in Diagnosis) and asking colleagues about unpublished work. Authors were contacted for missing information if necessary.
Three reviewers independently performed the title and abstract screening. An article was included for full-text review if at least one reviewer included it. For articles that were not available in English, a translation was generated via Google Translate and checked by an author who understood the language (ie, Dutch, French, German, Swedish, Russian). No other languages were encountered. Two reviewers subsequently screened all selected full-text studies. Disagreements were resolved via consensus and, if no consensus was reached, via consultation of the third reviewer. Inter-rater reliability was assessed using Cohen’s kappa statistic.23
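Cohen's kappa expresses the agreement between two raters corrected for the agreement expected by chance. A minimal illustrative sketch of the statistic; the reviewer decisions below are invented, not data from this review:

```python
def cohens_kappa(rater_a, rater_b):
    """Cohen's kappa for two equal-length lists of categorical ratings."""
    assert len(rater_a) == len(rater_b)
    n = len(rater_a)
    labels = set(rater_a) | set(rater_b)
    # Observed agreement: proportion of items both raters labelled the same.
    p_o = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Chance agreement: product of each rater's marginal label proportions.
    p_e = sum((rater_a.count(lab) / n) * (rater_b.count(lab) / n)
              for lab in labels)
    return (p_o - p_e) / (1 - p_e)

# Two hypothetical reviewers screening ten abstracts ("in" = include).
rater_1 = ["in", "in", "out", "out", "in", "out", "out", "in", "out", "out"]
rater_2 = ["in", "out", "out", "out", "in", "out", "out", "in", "out", "in"]
print(round(cohens_kappa(rater_1, rater_2), 2))  # prints 0.58
```

A kappa of 1 indicates perfect agreement and 0 indicates agreement no better than chance; values around 0.4–0.8 are conventionally read as moderate to substantial.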
We included all studies that evaluated cognitive reasoning tools focused on medical specialists (including students and those in training) with the aim to improve diagnosis. Although we excluded educational interventions, studies that included medical students could still be considered if they measured performance using workplace-oriented tools. We defined cognitive reasoning tools as structured tools that focus on improving clinical reasoning and decision-making skills.3 There were no restrictions for publication status or publication year. Searching was limited to controlled studies (quasi-experimental or experimental studies, controlled and crossover trials or before–after designs) that measured diagnostic performance (either as diagnostic error or diagnostic accuracy).
We excluded tools that focused on specific diseases (eg, diagnostic guidelines) because these present a set of decision rules that predict whether or not the patient should be diagnosed with a certain disease, instead of improving the diagnostic process in general. We further excluded studies in which the tool was not explicitly available while diagnosing cases (eg, studies that focused on using the tool for learning and education and not on implementing it into practice). Lastly, we excluded studies focused on psychiatric diseases, because psychiatric diagnosis is largely based on identifying a certain number of behaviours in a patient that match to a disorder in the Diagnostic and Statistical Manual of Mental Disorders,24 which is similar to using a checklist-like tool. We expected that the effectiveness of cognitive reasoning tools in psychiatric settings would not be comparable to other clinical settings.
Data extraction and quality assessment
Two reviewers independently performed data extraction and quality assessment for 30% of the studies. Disagreements were resolved via discussion, after which a single reviewer completed the remaining studies. Data were extracted using the Cochrane Data Collection Form for intervention reviews on randomised controlled trials (RCTs) and non-RCTs (version 12-08-2013).25 This form was adapted by removing questions specific to medication trials and adding questions specific to cognitive reasoning tools. Information extracted from each study included year of publication, country, participant characteristics (years of experience, level of expertise, area of expertise), type of intervention (type of tool, phase of the diagnostic process where the tool is used, diagnostic tasks the tool applies to, whether the tool’s items have to be acknowledged or reported), outcome measure (measure of cases diagnosed correctly or incorrectly), setting and research design (control group, randomisation). The adapted form was pilot-tested on five randomly selected included studies.
The methodological quality of included studies was assessed using the Cochrane Collaboration Risk of Bias (RoB 2) template.26 This form assessed study randomisation, deviations from the intended intervention, allocation concealment and blinding, outcome measures and selective outcome reporting. On each domain, a study could be rated as being at high, medium or low risk of bias. If insufficient information was available, the domain was rated as ‘no information’ and the study authors were contacted. The overall risk of bias rating for a study was equal to the highest risk rating across its domains.
The overall strength of the evidence was assessed using the Grading of Recommendations Assessment, Development, and Evaluation (GRADE) group’s tool.27 This tool assesses the quality of evidence along the domains of risk of bias, consistency, directness, precision and publication bias. The tool rates the confidence in the evidence as high, moderate, low or very low.
Two studies reported diagnostic error rates28 29; these percentages were inverted to be comparable to diagnostic accuracy rates.
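The inversion is simply accuracy% = 100 − error%; the resulting accuracy means and SDs then feed into the standardised mean difference (Hedges' g). A sketch with invented group statistics, not values from any included study:

```python
import math

def hedges_g(m1, s1, n1, m2, s2, n2):
    """Standardised mean difference with Hedges' small-sample correction."""
    df = n1 + n2 - 2
    # Pooled standard deviation across the two groups.
    s_pooled = math.sqrt(((n1 - 1) * s1 ** 2 + (n2 - 1) * s2 ** 2) / df)
    d = (m1 - m2) / s_pooled          # Cohen's d
    j = 1 - 3 / (4 * df - 1)          # Hedges' small-sample correction factor
    return j * d

# A study reporting a 12.5% error rate has an accuracy rate of 87.5%.
accuracy = 100 - 12.5

# Invented groups: intervention accuracy mean 75% (SD 10, n=20)
# versus control mean 70% (SD 10, n=20).
print(round(hedges_g(75, 10, 20, 70, 10, 20), 2))  # prints 0.49
```

By the conventional reading, g values around 0.2 are small, 0.5 medium and 0.8 large, which is why the review describes its pooled estimate as a small improvement.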
The primary outcome was the difference in diagnostic performance between the control group or baseline measurement and the intervention group. For continuous data, the mean and SD of diagnostic performance were used to compute the standardised mean difference (Hedges’ g) and the 95% CI of g; for dichotomous data, the reported effect size (ie, OR) was transformed to Hedges’ g. These results were pooled using a random-effects model meta-analysis with the Hartung-Knapp adjustment,30 using the restricted maximum likelihood method to estimate variation between studies. One trial was included per study in the main analysis. If a study directly compared a control group or baseline measurement with the intervention group, this comparison was included; if there were multiple comparisons in one study, comparisons that satisfied our inclusion criteria were aggregated. Between-study heterogeneity was estimated using the I2 statistic, which was categorised as: might not be important (0%–40%), moderate (30%–60%), substantial (50%–90%) and considerable (75%–100%).31 It was considered feasible to combine the included studies for meta-analysis if heterogeneity did not exceed 40%, which indicated consistency in the study outcomes. Further study differences could then be explored using subgroup analyses. Heterogeneity was further explored via influence and sensitivity analyses based on the risk of bias assessment. Influence was measured using leave-one-out estimates of heterogeneity and covariance ratios, where a study was considered influential if the covariance ratio was below 1. Publication bias was assessed using a funnel plot and Egger’s regression.32
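The actual analysis used restricted maximum likelihood estimation with the Hartung-Knapp adjustment via metafor in R. As a simplified, illustrative stand-in, the classic DerSimonian-Laird estimator below captures the core idea of random-effects pooling and of the I2 statistic: weight each study by the inverse of its within- plus between-study variance, and express inconsistency as the share of total variation beyond chance. All numbers are invented:

```python
def pool_random_effects(effects, variances):
    """DerSimonian-Laird random-effects pool: (pooled g, tau^2, I^2 in %)."""
    w = [1 / v for v in variances]
    sw = sum(w)
    fixed = sum(wi * g for wi, g in zip(w, effects)) / sw
    # Cochran's Q: weighted squared deviations from the fixed-effect mean.
    q = sum(wi * (g - fixed) ** 2 for wi, g in zip(w, effects))
    df = len(effects) - 1
    c = sw - sum(wi ** 2 for wi in w) / sw
    tau2 = max(0.0, (q - df) / c)            # between-study variance
    w_star = [1 / (v + tau2) for v in variances]
    pooled = sum(wi * g for wi, g in zip(w_star, effects)) / sum(w_star)
    i2 = max(0.0, (q - df) / q) * 100 if q > 0 else 0.0
    return pooled, tau2, i2

# Four hypothetical studies: Hedges' g and its sampling variance.
effects = [0.05, 0.10, 0.30, 0.55]
variances = [0.010, 0.020, 0.015, 0.030]
pooled, tau2, i2 = pool_random_effects(effects, variances)
print(round(pooled, 2), round(i2, 1))
```

When the between-study variance tau² is zero, the random-effects pool reduces to the familiar fixed-effect inverse-variance average.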
Subgroup analyses were performed for participant expertise, several intervention characteristics (ie, intervention type, moment of intervention, intervention items) and study characteristics (ie, diagnostic task, case difficulty, same cases used with and without intervention, study intention). Variable definitions are given in table 1. The subgroup analyses for the level of expertise and intervention characteristics were prespecified; the analyses for study characteristics were based on observations made during study characteristic extraction. Analyses were performed with the metafor package33 in R (V.1.4.1106),34 with significance levels set at p<0.05.
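A subgroup comparison tests whether pooled effects differ between groups via a between-group Q statistic, referred to a chi-squared distribution with (number of groups − 1) degrees of freedom. The sketch below is a bare-bones fixed-effect version of that test, not the review's actual random-effects models in metafor, on invented data:

```python
def subgroup_q_between(effects, variances, groups):
    """Fixed-effect Q-between statistic (df = number of groups - 1)."""
    w = [1 / v for v in variances]
    overall = sum(wi * g for wi, g in zip(w, effects)) / sum(w)
    q_between = 0.0
    for label in set(groups):
        idx = [i for i, grp in enumerate(groups) if grp == label]
        wg = sum(w[i] for i in idx)
        mean_g = sum(w[i] * effects[i] for i in idx) / wg
        # Each subgroup contributes its weighted squared deviation
        # from the overall pooled effect.
        q_between += wg * (mean_g - overall) ** 2
    return q_between

# Invented example: two studies per diagnostic-task subgroup.
effects = [0.2, 0.2, 0.4, 0.4]
variances = [0.01] * 4
groups = ["written", "written", "patients", "patients"]
print(round(subgroup_q_between(effects, variances, groups), 2))  # prints 4.0
```

A large Q-between relative to its degrees of freedom (as for the diagnostic-task comparison reported in the results) indicates that the subgroup pooled effects genuinely differ.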
Our database search yielded 4546 studies and an additional 24 studies were identified through other search activities (figure 1). After removing duplicates, 2963 studies remained for initial screening. Of these, 2822 studies were excluded because their title and abstract did not meet the inclusion criteria, leaving 141 studies for full-text screening. Inter-rater reliability was moderate to substantial for title and abstract screening and substantial for full-text screening, although the overall rate of agreement was almost perfect (online supplemental appendix B). One hundred and twelve studies did not meet our inclusion criteria. Examples of excluded studies were studies where the intervention under study was not focused on supporting cognitive processes,35 36 studies that did not measure diagnostic accuracy or diagnostic errors37–40 or studies that did not describe an experiment.41 42 The remaining 29 studies were included for review and meta-analysis. All studies were available in English. All studies were published except for unpublished data from one study (Staal et al, Impact of diagnostic checklists on the interpretation of normal and abnormal electrocardiograms, 2021). This unpublished experiment compared diagnostic accuracy in ECG interpretation when using a debiasing checklist versus an ECG-specific checklist. The data were obtained from the authors. Three studies43 44 (Staal et al, Impact of diagnostic checklists on the interpretation of normal and abnormal electrocardiograms, 2021) each contained two trials (two separate interventions were tested and compared with, in these cases, the same control group). These trials were aggregated for calculation of the main effect to prevent double counting of the control group. The different interventions were evaluated separately in a subgroup analysis. The characteristics of the included studies are detailed in table 2. The findings of the individual included studies are reported in online supplemental appendix C.
A variety of interventions was included for analysis; these were divided into four categories based on Lambe et al 7: checklists, computerised decision support systems, instructions at test (ie, interventions that instruct participants to use a certain reasoning approach) and guided reflection (table 2). First, checklists were paper-based or online lists that guided participants through all important factors that need to be considered before coming to a final diagnosis. Second, computerised decision support tools were electronic algorithms that guided participants by suggesting differential diagnoses for certain symptoms. Third, interventions providing instructions at test aimed to guide participants’ thinking in a certain way that was hypothesised to reduce errors. Finally, reflective reasoning tools were based on the deliberate reflection procedure designed by Mamede et al.45 In some cases, similar procedures were named differently; for example, Ilgen et al 46 47 used an abbreviated deliberate reflection which they called ‘directed search instructions’. Reflective reasoning tools ask participants to consider a diagnosis for a case, then consider all information in the case that confirms or contradicts that diagnosis, as well as information that would have been expected if the diagnosis were correct but is not presented. Participants are then asked to repeat this process for all differential diagnoses they come up with and, finally, all diagnoses are ranked in order of likelihood. Details on the interventions of each individual study and how these have been classified are listed in table 3.
Despite variations in the format of these interventions, most shared the common focus on prompting participants to consider certain information in a specific manner (content specific) or to consider one’s reasoning processes during diagnosis (process focused) (table 1).
Risk of bias assessment
For 25 studies, risk of bias was low in all categories except ‘Selection of reported results’, because these studies had no preregistered analysis plans available to verify whether selection bias was present (Staal et al, Impact of diagnostic checklists on the interpretation of normal and abnormal electrocardiograms, 2021). Only one study was preregistered. Three studies were assessed as at medium or high risk of bias. First, O’Sullivan and Schofield29 had a medium risk of bias due to a large drop-out rate during the study. Second, Shimizu et al 43 was scored at high risk because of their quasi-random participant allocation. Third, Cairns et al 48 was scored at high risk because of missing outcome data: participants were asked to diagnose at least one ECG, with a maximum of 10, but only six participants completed two or more ECGs. Inter-rater reliability for the total risk of bias score could not be calculated using Cohen’s kappa, but overall agreement was high (online supplemental appendix B). See online supplemental appendix D for the overall risk of bias assessment score.
Data on diagnostic accuracy were available for 29 studies. This resulted in analysable data for 2732 participants. A random-effects meta-analysis showed that the use of cognitive reasoning tools led to a small improvement in diagnostic accuracy (0.28, 95% CI 0.14 to 0.43, p<0.001). There was evidence of considerable heterogeneity in this estimate (I2=70%, χ2(28)=93.82, p<0.001), although this was not unexpected given the broad inclusion of cognitive reasoning tools. Retrospective exploration of influential studies indicated that Martinez-Franco et al,49 Talebian et al 50 and Thompson et al 51 seemed to differ from the other studies: their participants had received training with the intervention directly before diagnostic accuracy was measured in the intervention group. Excluding these studies reduced heterogeneity (I2=38%, χ2(25)=40.22, p=0.028) sufficiently to interpret the meta-analysis. The effect estimate was slightly reduced (0.20, 95% CI 0.10 to 0.29, p<0.001), although the effect magnitude and direction remained unchanged (figure 2). A more elaborate exploration of the heterogeneity is presented in online supplemental appendix E.
A funnel plot was drawn to check for small study effects due to publication bias and to further explore heterogeneity (online supplemental appendix F). The funnel plot did not show significant asymmetry based on Egger’s regression test (t(27)=1.84, p=0.077). This indicated there was no reason to suspect an influence of small study effects, nor did the funnel plot offer an explanation for the heterogeneity.
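Egger's test quantifies funnel-plot asymmetry by regressing each study's standardised effect (g divided by its standard error) on its precision (1 divided by the standard error); an intercept far from zero suggests small-study effects. A minimal sketch of the point estimates on invented data (a full test would also compute the intercept's t statistic):

```python
def egger_regression(effects, ses):
    """OLS fit of (g / SE) on (1 / SE); returns (intercept, slope).
    A non-zero intercept flags funnel-plot asymmetry."""
    z = [g / se for g, se in zip(effects, ses)]
    prec = [1 / se for se in ses]
    n = len(z)
    mean_x = sum(prec) / n
    mean_y = sum(z) / n
    # Ordinary least squares via centred sums of squares and products.
    sxx = sum((x - mean_x) ** 2 for x in prec)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(prec, z))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope

# Invented effect sizes and standard errors for five studies.
effects = [0.15, 0.22, 0.30, 0.41, 0.18]
ses = [0.08, 0.12, 0.20, 0.30, 0.10]
intercept, slope = egger_regression(effects, ses)
print(round(intercept, 2))
```

In this formulation, the slope estimates the underlying (precision-weighted) effect, while the intercept captures the asymmetry that small studies with extreme results would introduce.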
Several subgroup analyses were performed to explore study heterogeneity and possible moderators of the effectiveness of clinical reasoning tools. The results for each subgroup are detailed in online supplemental appendix G. Only the type of diagnostic task seemed to moderate the effect of clinical reasoning tools: studies using real or standardised patients had a higher effect estimate than studies using visual tasks or written cases (Q(2)=22.10, p<0.001). However, only two studies had participants diagnose real or standardised patients,28 52 reducing the reliability of this comparison. There was no difference in performance between visual or written diagnostic tasks (Q(1)=0.63, p=0.426). No significant differences were found for the other subgroup comparisons.
Descriptively, participants of an intermediate level (ie, residents and fellows) seemed to benefit more from using cognitive reasoning tools than novices (ie, medical students). Experts seemed to benefit somewhat more than novices, but less than intermediates. Furthermore, content interventions seemed more effective than process interventions. Finally, studies where errors were induced and then remedied with the tool were more successful than studies that simply evaluated their tool, although it should be noted that only four studies induced and then remedied errors.
Finally, the overall quality of evidence was assessed for the meta-analysis excluding studies with extensive training49–51 (table 4). The GRADE assessment indicated moderate quality of evidence, which shows that cognitive reasoning tools may benefit diagnostic performance as opposed to diagnosis without such a tool. The level of evidence was downgraded because of the moderate risk of bias on the selection of reported results, since prespecified analysis plans were available for only one study (Staal et al, Impact of diagnostic checklists on the interpretation of normal and abnormal electrocardiograms, 2021).
This systematic review and meta-analysis of 29 studies involving 2732 medical students and physicians showed that workplace-oriented cognitive reasoning tools modestly improved diagnostic accuracy (0.28, 95% CI 0.14 to 0.43, p<0.001). This estimate exhibited substantial heterogeneity (I2=70%), which was largely attributable to three studies that offered training with their tool before measuring performance.49–51 Removing these studies resulted in a lower but more precise effect size (0.20, 95% CI 0.10 to 0.29, p<0.001) and reduced heterogeneity (I2=38%). Further subgroup analyses indicated that participant expertise, intervention characteristics (type of intervention, moment of intervention and intervention items) and design characteristics (study design, case difficulty, same cases used with and without intervention and study intention) could not explain the remaining between-study heterogeneity (table 1). Only the type of diagnostic task influenced tool effectiveness: tools appeared more effective for the diagnosis of real or simulated patients (0.41, 95% CI 0.33 to 0.49) than for written (0.16, 95% CI 0.05 to 0.28) or visual cases (0.16, 95% CI 0.05 to 0.28). However, because only two studies included patient encounters, this result should be interpreted cautiously and verified in future research.
The modest improvement in diagnostic accuracy when using cognitive reasoning tools is largely in line with existing narrative and systematic reviews. Many of these reviews examined a broad range of interventions and outcomes, including several interventions that were defined as cognitive reasoning tools in the current review. Recommended interventions primarily included reflection strategies,2 3 7 12 13 53 clinical decision support systems,12 19 20 54 cognitive forcing strategies7 12 and checklists.12 20 53 However, these recommendations were given with a cautionary note as evidence was often mixed and study designs were too divergent to draw strong conclusions.12 15 53 A more direct comparison can be made with Graber et al 3 and Lambe et al,7 who specifically examined cognitive interventions. They concluded the interventions seemed promising but also cautioned that empirical evidence was scarce and preliminary. Lastly, the current estimate is in line with the meta-analysis by Prakash et al,2 who reported a modest improvement of diagnostic decision-making when using reflection strategies (0.38, 95% CI 0.23 to 0.52, I2=31%). The discrepancy in effect size with our estimate might be explained by differences in the included studies. Prakash et al only quantified the effect of reflection strategies and did not consider other tools, whereas we included a range of tools. Additionally, Prakash et al included both education-oriented studies (ie, studies that tested interventions with the aim to teach someone how to solve cases in the future) and workplace-oriented studies (ie, studies that tested interventions with the aim to measure performance when the tool is used for diagnosis). We quantified the effect of workplace-oriented studies alone, so Prakash et al’s larger effect size could reflect differences in how effective cognitive reasoning tools are for teaching versus practical use.
Taken together, cognitive reasoning tools are often recommended in the literature as promising interventions and this is corroborated by the improvement in accuracy we found. Caution should, however, be taken when interpreting this improvement due to the limited underlying evidence base.
The factors determining the effectiveness of cognitive reasoning tools remain unclear. Although several individual studies suggested that cognitive reasoning tools are more effective in specific subgroups,15 18 38 43 50 the current review found little indication of this. Of note might be the subset of three studies we excluded due to their contribution to the heterogeneity.49–51 These studies were methodologically different because participants trained with the diagnostic task and intervention before performance was measured, which seemed to result in better performance than the other included studies. When considering all subset analyses, it would be premature to take our findings as evidence that cognitive reasoning tools are equally effective under most circumstances. This is due to the many different factors that might theoretically impact tool effectiveness and the combinations of these factors across studies. For example, several studies showed that process-focused interventions (ie, aimed at preventing flaws in reasoning processes) were often less effective than content-focused interventions (ie, aimed at providing or triggering relevant knowledge).18 However, this distinction was difficult to make in the current review, as most interventions included both process and content elements to a certain extent. It was furthermore difficult to account for interactions between process or content interventions and other factors: for example, content interventions might be more beneficial for one subgroup, whereas process interventions might be more useful for another subgroup. There are many potential influences on tool effectiveness and not enough studies with the same combination of factors. The current evidence base is simply not extensive enough to reliably assess such interactions and as a result we were unable to isolate the effect of individual factors or determine under which circumstances the tools are most effective.
In summary, cognitive reasoning tools modestly improved diagnostic accuracy. This effect should, however, be considered within the context of clinical practice. Diagnostic errors occur in about 10% of diagnoses, meaning that the majority of diagnoses are correct.1 The small improvement in overall diagnostic accuracy would, therefore, translate to a larger and clinically important improvement in the small subset of diagnoses that would otherwise be erroneous, indicating that cognitive reasoning tools are a promising type of intervention. Whether this effect can be maximised to increase its potential use in practice will depend on our understanding of the factors that influence tool effectiveness.
Future research should focus on performing more large-scale studies, as the small sample sizes contribute to mixed conclusions in the literature. Additional studies should be performed that examine factors that might influence tool effectiveness to determine the effects in different subgroups. Indications for potentially interesting factors may be taken from descriptive differences in our subgroup comparisons (online supplemental appendix G), which suggest diagnostic task and intervention type (content-focused or process-focused intervention) as factors of interest. Furthermore, the excluded subset of studies49–51 seemed to indicate the effect of the interventions was larger when participants were first given time to practice. This effect could translate well to medical education and especially cognitive reasoning tools that offer structured guidance (such as deliberate reflection45 or checklists55) might provide benefits to learners. Finally, this effect could give an indication of what the effect of cognitive reasoning tools in practice could be: after all, clinicians will first be trained to use any tool before it will be used on real diagnoses. Future research should investigate the implementation of cognitive reasoning tools in practice to determine whether the improvement of accuracy can be replicated.
Our review has three important limitations relating to the studies included in the review and the review process. The first limitation is the high heterogeneity in the initial study sample, which likely reflected the methodological and statistical differences between the interventions included under our broad inclusion criteria. We explored this heterogeneity by examining the influence each individual study had on the estimate and excluded three studies that allowed participants to train with the tool before using it.49–51 This reduced heterogeneity sufficiently to allow interpretation of the meta-analysis. Because we expected some heterogeneity, we used a random-effects meta-analysis model, which takes extra variability in underlying population distributions into account. As a result, our pooled estimate is a reasonable summary of the effectiveness of cognitive reasoning tools given the available literature. Additionally, the broad inclusion criteria we applied are also a strength of the review: they allowed us to give a generalisable overview of the effectiveness of similar tools in different settings.
A second limitation is that only studies measuring diagnostic accuracy or diagnostic errors as percentages could be compared in this meta-analysis. Several studies measured diagnostic performance in other ways that were not comparable to the predominant accuracy measure in the literature, such as the number of errors made,37–39 56 whether the correct diagnosis was included in the differential57 58 or whether a new diagnostic plan was made for a patient based on the leading diagnosis.59 There were too few studies with these measures to perform an additional meta-analysis. However, given that these studies mostly show small, positive improvements, we would expect a summary of these diagnostic performance measures to be in line with the current estimate.
The third limitation concerns the available literature: studies that tested their intervention in practice are lacking, a result of the trade-off between performing well-designed, methodologically strong experimental studies and evaluating a tool in a less controlled but more relevant environment. The current estimate for workplace-oriented tools generalises across diagnostic tasks and specialisms in artificial settings, but the effectiveness of cognitive reasoning tools in practice remains unclear. Although there have been calls to reconfirm current findings in practice for the last decade,3 7 12 54 only two studies performed outside an artificial setting could be identified for this review.28 50 Additionally, the long-term effects of cognitive reasoning tools are unknown, as the included studies used single-session designs. Future research should replicate the findings of existing studies and measure tool effectiveness in practice.
In conclusion, cognitive reasoning tools led to a small but clinically important improvement in diagnostic accuracy. Going forward, more studies should aim to identify the factors that influence tool effectiveness and the conditions under which these tools are most beneficial. Cognitive reasoning tools could be routinely implemented in practice to improve diagnosis. However, a larger evidence base, consisting of more large-scale studies and evaluations of cognitive reasoning tools in practice, is needed to guide implementation in a way that optimises their effectiveness.
Data availability statement
Data are available on reasonable request. The datasets used and/or analysed during the current study are available from the corresponding author on reasonable request. The study protocol was preregistered and is available online in the PROSPERO database.
Patient consent for publication
Contributors All authors had full access to all the study data and take responsibility for the integrity of the data and the accuracy of the analysis. All authors read and approved the final manuscript. Guarantor: JS and LZ. Study conception and design: JS, JH, SM and LZ. Development of study materials: JS, JH, STGG and LZ. Acquisition of data: JS, JH, STGG and LZ. Analysis or interpretation of the data: JS, JH, SM and LZ. Drafting of the manuscript: JS and LZ. Critical revision of the manuscript for important intellectual content: JS, JH, STGG, SM, MAF, WWVdB, JA and LZ. Statistical analysis: JS and LZ. Administrative, technical or material support: JS, STGG and LZ. Supervision: JS and LZ.
Funding The authors are supported by a VENI grant from the Dutch National Scientific Organization (NWO; 45116032) and an Erasmus Medical Center Fellowship.
Disclaimer The funding body was not involved in the design of the study; the collection, analysis and interpretation of data; or the writing of the manuscript.
Competing interests None declared.
Provenance and peer review Not commissioned; externally peer reviewed.
Supplemental material This content has been supplied by the author(s). It has not been vetted by BMJ Publishing Group Limited (BMJ) and may not have been peer-reviewed. Any opinions or recommendations discussed are solely those of the author(s) and are not endorsed by BMJ. BMJ disclaims all liability and responsibility arising from any reliance placed on the content. Where the content includes any translated material, BMJ does not warrant the accuracy and reliability of the translations (including but not limited to local regulations, clinical guidelines, terminology, drug names and drug dosages), and is not responsible for any error and/or omissions arising from translation and adaptation or otherwise.