TY - JOUR T1 - Random variation and rankability of hospitals using outcome indicators JF - BMJ Quality & Safety JO - BMJ Qual Saf SP - 869 LP - 874 DO - 10.1136/bmjqs.2010.048058 VL - 20 IS - 10 AU - Anne-Margreet van Dishoeck AU - Hester F Lingsma AU - Johan P Mackenbach AU - Ewout W Steyerberg Y1 - 2011/10/01 UR - http://qualitysafety.bmj.com/content/20/10/869.abstract N2 - Objective There is a growing focus on quality and safety in healthcare. Outcome indicators are increasingly used to compare hospital performance and to rank hospitals, but the reliability of ranking (rankability) is under debate. This study aims to quantify the rankability of several outcome indicators of hospital performance currently used by the Dutch government.Methods From 52 indicators used by the Netherlands Inspectorate, the authors selected nine outcome indicators presenting a fraction and absolute numbers. Of these indicators, four were combined into two, resulting in seven indicators for analysis. The official data of 97 Dutch hospitals for the year 2007 were used. Uncertainty in the observed outcomes within the hospitals (within hospital variance, σ2) was estimated using fixed effect logistic regression models. Heterogeneity (between hospital variance, τ2) was measured with random effect logistic regression models. Subsequently, the rankability was calculated by relating heterogeneity to uncertainty within and between hospitals (τ2/(τ2 +median σ2)).Results Sample sizes varied but were typically around 200 per hospital (range of median 90–277) with a median of 2–21 cases, causing a substantial uncertainty in outcomes per hospital. Although fourfold to eightfold differences between hospitals were noted, the uncertainty within hospitals caused a poor (<50%) rankability in three indicators and moderate rankability (50–75%) in the other four indicators.Conclusion The currently used Dutch outcome indicators are not suitable for ranking hospitals. When judging hospital quality the influence of random variation must be accounted for to avoid overinterpretation of the numbers in the quest for more transparency in healthcare. Adequate sample size is a prerequisite in attempting reliable ranking. ER -