Table 5

Inter-rater reliability between criterion-based scores (proportion of criteria stated as being met) for the same record by different reviewers

| Reviewer pairs | Condition | Site* | No. of paired reviews | ICC between scores (95% CI) | Weighted mean ICC (95% CI) |
|---|---|---|---|---|---|
| Doctor vs doctor | Heart failure | F | 14 | 0.96 (0.87 to 0.99) | 0.88 (0.83 to 0.93) |
| | COPD | G | 50 | 0.65 (0.46 to 0.79) | |
| | Heart failure | B | 46 | 0.65 (0.50 to 0.77) | |
| | Heart failure | E | 12 | 0.64 (0.13 to 0.88) | |
| Nurse/clinical vs nurse/clinical | COPD | J | 25 | 0.86 (0.71 to 0.94) | 0.74 (0.66 to 0.82) |
| | COPD | D | 48 | 0.70 (0.52 to 0.82) | |
| | Heart failure | D | 21 | 0.69 (0.38 to 0.86) | |
| | Heart failure | H | 50 | 0.27 (0.00 to 0.51) | |
| Non-clinical audit staff vs non-clinical audit staff | COPD | E | 40 | 0.69 (0.49 to 0.82) | 0.61 (0.47 to 0.76) |
| | COPD | A | 29 | 0.33 (−0.04 to 0.61) | |

  • COPD, chronic obstructive pulmonary disease; ICC, intraclass correlation.

  • * Only sites with more than one reviewer are included in reliability analysis; therefore, some sites do not appear on this table.

  • Mean ICC per staff type, weighted by inverse variances to account for differing numbers of paired reviews. A single ICC was calculated for the three doctors at site B and this was combined with the other doctor pairs in the weighted mean ICC.

  • Non-specialist doctors.
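
The inverse-variance weighting described in the footnote on the weighted mean ICC can be illustrated with a minimal Python sketch. The table does not report the variance of each ICC, so this sketch assumes it can be approximated from the width of the 95% CI as if the interval were symmetric and normal-based; that assumption, the function name `weighted_mean_icc`, and the variable `doctor_rows` are illustrative and are not taken from the source.

```python
import math


def weighted_mean_icc(rows, z=1.96):
    """Inverse-variance weighted mean of ICCs.

    rows: list of (icc, ci_lower, ci_upper) tuples.
    Assumption (not stated in the source): each standard error is
    approximated from the CI width as (upper - lower) / (2 * z).
    Returns (mean, lower, upper) for the pooled estimate.
    """
    weights = []
    for _icc, lower, upper in rows:
        se = (upper - lower) / (2 * z)   # approximate standard error
        weights.append(1.0 / se ** 2)    # inverse-variance weight

    total = sum(weights)
    mean = sum(w * icc for w, (icc, _, _) in zip(weights, rows)) / total
    se_mean = math.sqrt(1.0 / total)     # SE of the weighted mean
    return mean, mean - z * se_mean, mean + z * se_mean


# Doctor vs doctor rows from Table 5: (ICC, CI lower, CI upper)
doctor_rows = [
    (0.96, 0.87, 0.99),  # heart failure, site F
    (0.65, 0.46, 0.79),  # COPD, site G
    (0.65, 0.50, 0.77),  # heart failure, site B
    (0.64, 0.13, 0.88),  # heart failure, site E
]

mean, lower, upper = weighted_mean_icc(doctor_rows)
print(f"{mean:.2f} ({lower:.2f} to {upper:.2f})")
```

Under these assumptions, the doctor rows give a pooled value close to the tabulated 0.88 (0.83 to 0.93). The other staff groups agree only approximately, which is expected because the reported CIs are rounded and are not exactly symmetric about each ICC.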