Literature DB >> 35879943

The Doctors' Effect on Patients' Physical Health Outcomes Beyond the Intervention: A Methodological Review.

Christoph Schnelle1, Mark A Jones1.   

Abstract

Background: Previous research suggests that when a treatment is delivered, patients' outcomes may vary systematically by medical practitioner. Objective: To conduct a methodological review of studies reporting on the effect of doctors on patients' physical health outcomes and to provide recommendations on how this effect could be measured and reported in a consistent and appropriate way.
Methods: The data source was 79 included studies and randomized controlled trials from a systematic review of doctors' effects on patients' physical health. We qualitatively assessed the studies and summarized how the doctors' effect was measured and reported.
Results: The doctors' effects on patients' physical health outcomes were reported as fixed effects, identifying high and low outliers, or random effects, which estimate the variation in patient health outcomes due to the doctor after accounting for all available variables via the intra-class correlation coefficient. Multivariable multilevel regression is commonly used to adjust for patient risk, doctor experience and other demographics, and also to account for the clustering effect of hospitals in estimating both fixed and random effects.
Conclusion: This methodological review identified inconsistencies in how the doctor's effect on patients' physical health outcomes is measured and reported. For grading doctors from worst to best performances and estimating random effects, specific recommendations are given along with the specific data points to report.
© 2022 Schnelle and Jones.

Entities:  

Keywords:  doctors’ effect; meta-epidemiological review; meta-epidemiology; methodological study; research methods

Year:  2022        PMID: 35879943      PMCID: PMC9307914          DOI: 10.2147/CLEP.S357927

Source DB:  PubMed          Journal:  Clin Epidemiol        ISSN: 1179-1349            Impact factor:   5.814


Introduction

A fundamental question in medical research is whether medical practitioners have an effect on patients’ health beyond the intervention, patient risk, and hospital variables. Previous research has revealed that when a treatment is delivered by a doctor (ie surgeon or medical physician), patient outcomes may vary systematically by medical practitioner.1,2 It is well known that hospitals can have an influence on patients’ health outcomes, with wide variation between hospitals.3–7 Such outcomes include adverse events,4 prescribing errors,4 hospital readmission,5,6 and mortality.7–9 Comparing hospitals requires a sound methodology and reliable estimates that take into account the multiple variables involved.8,10 In contrast to the substantial research on hospital effects, there is minimal research on the effect of doctors. The influence of doctor-patient communication has been investigated as a “doctor effect” on patients’ health outcomes,1,11,12 including symptoms,13,14 readmission rates in the emergency department,13,15 health-related quality of life,16 and improved diabetes control.17 Research on the therapist effect in psychotherapy has shown significant effects of therapists on patient outcomes beyond the therapy technique or modality applied.18,19 This wide variation among practitioners has been acknowledged and incorporated into the training material for psychotherapists.20,21 In surgery, outcomes associated with procedure volume, seniority, level of experience, or doctor specialty, include mortality rate,22 length of hospital stay,23,24 postoperative complications,25 and readmission.26,27 While research on the doctors’ effect in non-surgical specialties is limited, there is evidence from studies in primary care,1,28 intensive care,29 acute care,30 and obstetrics,31 where medical practitioners had an effect on patients’ health outcomes. Given the significant therapist effect in psychotherapy, and the known wide variation in patient outcomes across hospitals, but unclear effect of individual doctors on patient outcomes, we conducted a systematic review of the effect of doctors on patients’ physical outcomes. We aimed to assess whether doctor effects vary with specificity, outcome and intervention. However, in conducting the review, we found substantial variation in the way a doctor effect is measured and reported, therefore making data synthesis challenging and meta-analysis impossible. This has led to the present study where we have conducted a methodological review of studies that measure and report on doctors’ effect on physical patient outcomes. The focus of the methodological review is on the method of measurement of the doctors’ effect as well as how it is reported. The data source for the review is the included studies from our systematic review.32

Objective

To conduct a methodological review of studies reporting on the effect of doctors on patients’ physical health outcomes and to provide recommendations on how this effect could be measured and reported in a consistent and appropriate way.

Materials and Methods

Design

The present study is a methodological review where the focus is on statistical analysis and reporting.33 The search strategy, data collection, and extraction are explained in detail in a previous report of a systematic review of the surgeons’ effect on patients’ physical health outcomes.32

Search Strategy

Three databases were searched initially: PubMed, Embase, and PsycINFO; and over 10,000 publications were screened. For each of the studies identified that met the inclusion criteria, a citation analysis on Scopus was conducted to identify further eligible studies. The full search strategy and keywords can be found in the .

Study Selection and Eligibility Criteria

The studies selected in the initial electronic search and the studies added through the citation analysis were independently reviewed by two researchers with a third reviewer acting as an arbitrator if required. This process resulted in 79 included studies, all of which are included in the present study. Any physical patient health-related outcome was eligible for inclusion. Studies that fulfilled any of the following criteria were excluded: (1) studies that only described a doctors’ effect on particular doctor-related variables (such as specialty of doctor), (2) studies with fewer than 15 doctors, (3) cross-sectional studies, and (4) studies that mention fixed or random effects but did not list them either graphically or in numerical form.

Data Extraction and Quality Assessment

CS extracted the relevant information for assessing doctor effects from each included study, and the extracted data was then reviewed by a second researcher. The data items extracted can be found in Table 1. For quality assessment, the Newcastle-Ottawa Scale (NOS) was used, with the majority of studies scoring between 8 and 9 (9 being the maximum total).34–36
Table 1

Data Items Extracted

Data ItemComment
PublicationFirst author, year
Surgeon or Other Medical SpecialtySurgeon, Other
Practitioner TypeSurgeon, GP, Cardiologist, etc.
Medical Specialty of Doctor
Detailed Intervention
General Outcome
Specific OutcomeOften same as General Outcome
Type of StudyCohort or Randomized Controlled Trial
Newcastle Ottawa Scale Score0–9
Count of Doctors in Study
Count of Patients
Count of Institutions
Doctor ICCIntra-class correlation coefficient, here a measure of the strength of the effect on patients’ physical health
Multivariate Data Analysis usedY/N
Percentage of Doctors that are OutliersPositive and Negative Outliers
Country of dataset analyzed
Data Items Extracted

Methodological Review

We planned to describe the methods used to estimate and report the doctors’ effect on patients’ physical outcomes including the statistical model used, types of confounding variables adjusted for (patient variables, hospital/institution variables, doctor variables), and the method of reporting the doctor effect.

Results

Of the 79 included studies, 62 used a multivariable multilevel regression model to estimate the doctors’ effect, 72 studies included patient variables in their model, 41 studies included hospital or institution variables in the model, 60 studies included doctors’ volume, and 24 studies included other doctor variables. There were two different ways that the doctors’ effect was reported: fixed effects and random effects,37,38 with 54 studies reporting fixed effects and 34 studies reporting random effects. Table 2 provides details for each included study, presenting in part the wide variety of statistical methods used.
Table 2

Detailed Results for Each Study

PublicationDoctorsPatients/ProceduresInstitutionsICC%Neg Outlier%Pos Outlier%CountryMLR*MV**Statistical AnalysisPV^HV^^DVo#ODV##Confidence Interval Calculation
Anderson, 201622NS288035OtherOtherUSY“Gaussian Kernel Densities were constructed to show the relative distributions of the effects of individual institutions and surgeons”YYYNNone
Aquina, 201551 pg e163NS158,596NSOtherOtherUSY“Mixed Effects Multivariable Logistic Regression”, conference abstractYYYN95% CI given, but not method
Aquina, 20155222314,8759913.028.0USY“Bivariate and hierarchical logistic regression with further multivariable analysis” R 3.1 SAS 9.3YYYN95% CI given, but not method
Aquina, 2016533481125,16021024.3OtherOtherUSY“Three-level mixed-effects logistic regression analyses were performed” R 3.2.0 SAS 9.4YYYYNone
Aquina, 2017541572/2012124,416/78,267260/25640.5/14USY“Mixed-effects Cox proportional hazards analyses” R 3.2.1 SAS 9.4YYYY95% CI given, but not method
Arvidsson, 2005552510687OtherOtherSwedenNYSAS 8.2 NL Mixed modelYYNNNone
Becerra, 2017561503/81412,3321877.9USY“Multilevel logistic regression”, “multilevel competing-risks Cox models” SAS 9.3 RYYYY95% CI given, but not method
Beckett, 2018302221,5701UKNNAnalysis based on r-squareNNYNNone
Begg, 20025715910,737728/13/93/14/3USYCorrelation-adjusted and GEE logistic regressionYYYNNone
Bianco, 2005581595238NS7.52.5USYLogistic regression, binomial distribution, histograms, extra-binomial variationYYYNNone
Bianco, 20105954772549.313.0USY“[M]ultivariable, parametric random-effects regression survival-time model, using a log-logistic survival distribution to model hazard over time” Stata 9.2YNYN95% CI given, but not method
Bolling, 201060108828,5076396.67.4USYGEE logistic regression SAS 9.2 GENMODYNYNFunnel plot with 95% CI
Bridgewater, 20036123857240.00.0UKYUnspecified, using SASYNYN95% CI given, but not method
Bridgewater, 200562251097/906640.00.0UKYUnspecified, using SASYNYNClopper-Pearson 95% CI
Brown, 20166313314,033846.06.8USYBayesian hierarchical logistic regressionYNYY95% CI given, but not method, f did better
Burns, 2011641557246,4691560.74.5UKNYLogistic regressionYYYN“We constructed funnel plots using exact Poisson control limits by means of the web tool available at www.erpho.org.uk/topics/tools/funnel.aspx.”
Cirillo, 2020653219,82413.10.0ItalyYLogistic regression, random effects meta-analysisYNYN95% CI given, but not method
Cromwell, 2013664901194126/1290.00.0UKNYStata Funnel plot, Wilcoxon extended by CuzickYYYNBinomial distribution 95% CI
Dagenais, 201967191461114.410.510.5USYHierarchical logistic regressionYNYY95% CI given, but not method
Davenport, 2020685525,5961USYSAS 9.4 inference testingYNYY95% CI given, but not method, though not relevant for mortality
Duclos, 201269282357/2904510/32FranceYMixed effects logistic regressionYYYYBinomial distribution 95% CI
Eastham, 2003704446292OtherOtherUSYLogistic mixed modelYYYNNone
Eijkenaar, 201371447/53726,684/37,832N/A2.5/0.6NetherlandsYGeneralized Linear Multilevel Models using SAS 9.2 GLIMMIXYN/AYN95% CI given, but not method
Eklund, 200972481275>12.1SwedenNYRCT Pearson Chi2, Fisher’s exact, Cox regression, “Z-test for heterogeneity”YNYYNone
Faschinger, 2011731736,3291OtherOtherAustriaNNot specifiedCorrelations calculatedYN/AYNNone
Fountain, 20047443876/504287.4UKYSAS NLMIXED, dealing with convergence issuesYNNNStandard error calculated
Gani, 2015265622,55912.8USY“[M]ultilevel multivariable logistic regression” Stata 12.1 GLLAMYN/AYY95% CI given, but not method
Glance, 20067513851,750335.95.18.7USYStata 8.2 SAS GLIMMIXYYYY“Quality outliers were identified using 1) the ratio of observed-to-expected mortality rates (O/E ratio) and confidence intervals (CIs) calculated using both parametric (Poisson distribution) and nonparametric (bootstrapping) techniques; and 2) shrinkage estimators.”
Glance, 201645420/24155,436400.5/1.80.0/3.30.0/1.7USYHierarchical logistic regressionYYNN95% CI given, but not method
Goodwin, 2013421099131,7102680.750.61.5USY“[H]ierarchical general linear model”YYNN95% CI given, but not method
Gossl, 20137621818730.04.8USNYLogistic regressionYN/ANNDeviation from normal distribution
Grant, 2008773114,63740.00.0UKNYSAS 8.2 Logistic regressionYNYN95% CI given, but not method
Gutacker, 201843212–376024,505–405,67130–1520.4–12.7UKY“Three-level hierarchical generalised linear mixed models”YYYNNone
Hannan, 20177840327,5606012.018.612.7USYHierarchical logistic regressionYNYY95% CI given, but not method
Harley, 200531143NSMultiple6.32.1UKNYMultivariate AnalysisNNNN95% CI given, but not method
Healy, 201725973118/20784610.3/9.37.2/4.1USY“Multi-level mixed-effects logistic regression” Stata 13YNYN95% CI given, but not method
Hermanek, 19997943112179.316.3GermanyNY“Multiple logistic regression analyses”NNNN95% CI given, but not method
Hermann, 2002802016,4431OtherOtherAustriaNNChi-square, Brandt and Snedecor contingency tables for binomial distributionsNNNNNone
Hofer, 199981232364231.0USY“hierarchical regression for general linear models”YNN/ANNone
Hoffman, 2017821128183,2836016.2USY“Generalized linear mixed effects models”YYYYConference abstract ICC CI not specified how
Holmboe, 20108323622,52613 states12.0USYSAS 9.1.3 NLMIXEDYNYYDelta method for 95% CI
Huesch, 200984398221,327751.2OtherUSNYUsing SEMA by SEMATECHYYNNBinomial distribution 95% CI
Hyder, 20138557514882980.3USYMultilevel Models SAS 9.3YYYN95% CI given, but not method
Jemt, 20168623880818.7OtherSwedenNNChi-squareNN/ANNNone
Johnston, 20108740455,51512OtherOtherUKNNFunnel plotsNNYNNone
Justiniano, 2019883451251118OtherOtherUSYBayesian hierarchical regressionYYYY95% CI given, but not method
Kaczmarski, 2019895337291,065NS17.53.7USYHierarchical logistic regression SAS 7.1YNYY95% CI given, but not method
Kaplan, 200990210757433.0/30.627.643.8USYBinary mixed models SAS NLMIXEDYNNNStandard error calculated
Kerlin, 20189134511,2681041.822.925.2USYBayesian hierarchical regression Stata 14.2YYYYBayesian 95% credible intervals of odds ratios
Kissenberth, 201892571703NS44.0USNYLinear regressionYNNNConference abstract, no CI
Krein, 20029325812,11091.0USYMultilevel analysis MLwiN 2000YYNNNone
Kunadian, 200940261149,888481.61.1USYMultivariate Logistic RegressionYYYNBinomial distribution 95% CI
Landercasper, 201994713954NS5.74.3USYMixed effects multivariate model SAS 9.4YNYY95% CI given, but not method
LaPar, 20149593419417OtherOtherUSNY[M]ultivariable, mortality risk-adjusted models with restricted cubic splinesYYYNNone
Likosky, 2012963211,8388OtherOtherUSNYMultivariate Logistic RegressionYNNNNone
Luan, 201997381277212.615.8USYMultivariate Mixed Effects Logistic regression Stata 15YYYNBonferroni corrected 95% CI, no further details
Martin, 2013982986091432.5Graph too smallGraph too smallUSYLogistic regressionYYYNBayesian 95% coverage intervals, surgeon performance assumed normally distributed
McCahill, 201299542206411.131.5USYLogistic regressionYYYN95% CI given, but not method
Navar-Boggan, 201210047597916.412.8USY“Multilevel multivariable random-effects logistic regression” Stata 9YN/AYY95% CI given, but not method
O’Connor, 20081011202589180.8USY“Multivariate hierarchical models” MLwiNYYYYNone
Orueta, 201510214792,207,1751304.2SpainY“Four-level mixed effect models” inc district SAS 9.2 GLIMMIXYYYY95% CI given, but not method
Papachristofi, 201410324/1818,42610.1/2.80.0/16.70.0/0.0UKY“Logistic random effects regression” with random effectsYN/AYN95% CI given, but not method
Papachristofi, 201639190/127110,769100.25/4.00.0/15.00.0/6.3UKY“[L]ogistic random-effects regression analysis” using R 3.01YYYN95% CI given, but not method for practitioners, comment why no 95% CI for ICC
Papachristofi, 201723190/127107,038100.19/2.82.1/11.80.5/14.2UKY“Logistic mixed effects models” using R 3.2.2YYYN95% CI given, but not method
Quinn, 20181042724123,141512.20.20.2USY“3-level crossed random effects logistic regression models” Stata MP 14.2, SAS 9.4YYYN“Ninety-five percent CIs were calculated according to Agency for Healthcare Research and Quality methods for risk-adjusted rates.”
Rudmik, 2017105432168Multiple16.34.7CanadaYLogistic regressionYNYNBinomial distribution 95% CI
Selby, 20101061005/1,049169,156/232,053351.9/1.9USY“Multilevel linear and logistic regression”YYNNStandard error calculated
Shih, 201510734550332414.0USY“Hierarchical logistic regression”, Stata 12.0YNYNNone
Singh, 20151552548,88314315.012.812.5USY“[M]ultilevel, multi-variable models”YYYY95% CI given, but not method
Singh, 201824398739,884NS10.0/0.17.2/0.0USYMixed models, SAS GLMMYYYY95% CI given, but not method
Singh, 2019284230565,5790.00.1USY“Multilevel logistic regression” SAS 9.4 GENMOD, GLIMMIX, Stata 15.1 marginsYNYNFormula for 95% CI given and bootstrapping
Thigpen, 2018273499515.98.8USNY“Linear regression model”YNYNEfron’s bootstrap for 95% CI
Tuerk, 200810842138112.0USY“Hierarchical linear models” HLM6YNNNICC as per Bryk Raudenbusch, 95% CI not calculated
Udyavar, 20181092149569,7672242.3USY“Multilevel random effects modelling” Stata 14 MELOGITYYYY95% CI given, but not method
Udyavar, 201811017565,706318.7USY“[M]ultilevel random effects models” Stata 14YYYYICC 95% CI not calculated
Udyavar, 20191115816215,74519827.3USY“[M]ultilevel mixed effects modeling”YYYYOdds ratio 95% CI given, but not method
Verma, 2020112135103,085718.514.8CanadaYSix different multivariable regression analyses R 3.5YYYN95% CI given, but not method
Xu, 20161132762525443.30.0USY“Logistic regression and post-estimation”YYYYNone
Xu, 201911414,5981,884,842OtherOtherUSY“Multivariable logistic regressions” Stata MP 14YNYY95% CI given, but not method

Abbreviations: *MLR, Multi-level regression; **MV, If no MLR, was multivariate regression used? ^PV, Patient variables; ^^HV, Hospital variables; #DVo, Doctors’ volume of procedures used; ##ODV, Other doctor variables than volume used.

Detailed Results for Each Study Abbreviations: *MLR, Multi-level regression; **MV, If no MLR, was multivariate regression used? ^PV, Patient variables; ^^HV, Hospital variables; #DVo, Doctors’ volume of procedures used; ##ODV, Other doctor variables than volume used.

Fixed Effects – Grading Doctors by Their Effect

Fixed effects are represented by the range of patient outcomes that doctors are responsible for after all available confounding variables have been accounted for. They are shown visually using a caterpillar plot, which ranks doctors by outcomes from lowest to highest, or a funnel plot, which shows each doctor as a dot and indicates whether doctors are outside a 95% or 99% confidence interval. For example, Papachristofi et al39 showed caterpillar graphs with an ICC of 4.0% (surgeons) and an ICC of 0.25% (anesthetists) (Figure 1), while Kunadian et al40 showed a funnel plot with an ICC of 6.5% (Figure 2), redone at a higher resolution by the authors (Figure 3) and the same data as a caterpillar plot (Figure 4). Measuring fixed effects allows identification of high and low outliers and how heterogeneously doctors perform. They also show whether variation in performance is consistent with chance or whether the variation is more significant than that. Fixed effects are calculated through “modelling fixed provider effects”.41
Figure 1

Estimated probability of in-hospital death within three months of surgery for a patient with average Euro-SCORE risk: (a) surgeons adjusted for centre and anaesthetist; (c) anaesthetists adjusted for centre and surgeon. The horizontal line is average probability (1.8%) for the study cohort. Error bars = 95% CI.

Figure 2

A funnel plot with each cardiologist represented by a black dot with 95% and 99% confidence intervals. The grey horizontal line is the average mortality for percutaneous coronary intervention (PCI) in New York State 2002–2004.

Figure 3

This figure was created by the authors and is a higher resolution version of Figure 2 using the same data. It is a funnel plot with each cardiologist represented by a dot with 95% and 99% confidence intervals. Cardiologists whose mortality confidence interval is above the 95% line are marked in red, those below marked in green.

Figure 4

A caterpillar plot created by the authors. It uses the same data as Figures 2 and 3. Beige (on left) and brown (on right) confidence intervals have an upper limit above 10%. Green confidence intervals are wholly below average mortality, red confidence intervals wholly above.

Estimated probability of in-hospital death within three months of surgery for a patient with average Euro-SCORE risk: (a) surgeons adjusted for centre and anaesthetist; (c) anaesthetists adjusted for centre and surgeon. The horizontal line is average probability (1.8%) for the study cohort. Error bars = 95% CI. A funnel plot with each cardiologist represented by a black dot with 95% and 99% confidence intervals. The grey horizontal line is the average mortality for percutaneous coronary intervention (PCI) in New York State 2002–2004. This figure was created by the authors and is a higher resolution version of Figure 2 using the same data. It is a funnel plot with each cardiologist represented by a dot with 95% and 99% confidence intervals. Cardiologists whose mortality confidence interval is above the 95% line are marked in red, those below marked in green. A caterpillar plot created by the authors. It uses the same data as Figures 2 and 3. Beige (on left) and brown (on right) confidence intervals have an upper limit above 10%. Green confidence intervals are wholly below average mortality, red confidence intervals wholly above.

Random Effects – Estimating the Variation Due to the Doctor

Random effects represent a percentage of the total variation in outcomes between patients that the doctors are responsible for. They are estimated via the intra-class correlation coefficient (ICC), which is the proportion of the total variation in the patient outcome attributed to doctors. There are many different ways to describe this effect.37 The ICCs measured and reported in the studies ranged from 0% to 47% with a median of 3%.

Discussion

This methodological review of studies that report a doctors’ effect on a patient's physical outcomes has identified wide variations in how researchers measure and report a doctors’ effect. However, there were 2 broad methods identified: fixed effects that allow doctors to be ranked; and random effects where the proportion of variance attributed to unexplained differences between doctors is estimated. The most common statistical model used in the analyses was a multivariable multilevel regression where the types of confounding variables adjusted for included those assessing patient risk, known doctor attributes, and, to a lesser degree, differences between hospitals or institutions. Glance et al38 discuss in some detail three approaches of provider profiling for binary outcomes, namely conventional logistic regression, hierarchical logistic regression, and fixed-effects logistic regression. They conclude that hierarchical logistic regression is generally preferred, except in the case where providers have low case volume, where hierarchical logistic regression understates the provider effect. We agree that hierarchical logistic regression is an acceptable method for provider profiling as it allows measurement of the strength of the providers’ effect on physical patient health. This review identified substantial heterogeneity in how the percentage of the variation due to the doctors is reported. For example, Goodwin et al42 reported the percentage of the variation for the null model as the “ICC” and the variation calculated after taking all available information into account as “partitioned variance”. It is helpful to calculate the variation of the null model as, if there is negligible or no variation, there is no need to include additional levels in the analysis. In both cases, the null and adjusted models, the ICC was calculated. In contrast, Gutacker et al43 referred to the random effect measure as the “variance partition coefficient”. A crucial element of reporting fixed effects is the calculation of the confidence intervals of each doctors’ performance. Glance et al38,44,45 provide a detailed technical discussion of the respective advantages of using fixed (grading doctors from worst to best) and random effects (calculating the percentage of outcome variation due to the doctor). One pertinent issue discussed is that the smaller the cluster is, ie the fewer patients the doctor has, the greater the shrinkage towards the mean,46,47 reducing the calculated ICC, and leading to an underestimate of the difference in performance between doctors.

Interpreting the Doctor’s Effect

The clinical importance of the findings from the studies assessed in this methodological review depends on how common the outcome assessed is and how varied the doctors’ effect is among practitioners. The more common and the more varied, the more the finding is clinically important. The choice between a doctor with an above or below average effect will have implications for the patients’ health outcomes at different levels of how common the outcome is and how strong the doctors’ effect is. The stronger the doctors' effect and the more important the outcome, the more the choice of doctor matters for the individual. The more common the outcome is, the more the choice of doctor matters for population health. Table 3 by Baldwin et al,21 originally from Wampold et al,48 and augmented by Kraemer et al,49 shows effect sizes for different ICCs. The intra-class correlation coefficient (ICC) can measure the percentage of the variation in patients’ physical health outcomes due to each component of a medical interaction,21 which is typically the patient, the doctor, the hospital, and the intervention. Table 3 shows a scenario where 50% of the patients recover from an intervention when there is no doctors’ effect, ie for an ICC of zero. However, an ICC of 5.9% is reported to produce a medium-sized effect (Cohen’s d) with a Number Needed to Treat (NNT) of 3.6. Under such circumstances, an ICC of 5.9% can mean that doctors have a clinically significant effect that is greater than many interventions.
Table 3

Relationship Between ICC, Cohen’s d, Success Rates and NNT

ICCCohen’s d50Proportion of Untreated Controls Below Mean of Treated PersonsSuccess Rate of Untreated PersonsSuccess Rate of Treated PersonsNNT – Numbers Needed to Treat49
Small
0.0%0.00.5000.5000.500
0.2%0.10.5400.4750.52517.7
1.0%0.20.5790.4500.5508.9
Medium
2.2%0.30.6180.4260.5746.0
3.8%0.40.6550.4020.5984.5
5.9%0.50.6910.3790.6213.6
8.3%0.60.7260.3560.6443.0
10.9%0.70.7580.3350.6652.6
Large
13.8%0.80.7880.3140.6862.3

Notes: Cohen’s d’s aim is to describe the magnitude of response to treatments between two groups, for example, a treatment and a control group. More technically, “The difference between the Treatment and Control group means, divided by the within-group standard deviation”.50 The Number Needed to Treat (NNT) is defined as the number of patients one would expect to treat with Treatment to have one more success (or one less failure) than if the same number were treated with Control.49

Relationship Between ICC, Cohen’s d, Success Rates and NNT Notes: Cohen’s d’s aim is to describe the magnitude of response to treatments between two groups, for example, a treatment and a control group. More technically, “The difference between the Treatment and Control group means, divided by the within-group standard deviation”.50 The Number Needed to Treat (NNT) is defined as the number of patients one would expect to treat with Treatment to have one more success (or one less failure) than if the same number were treated with Control.49

Recommendations

How to Report a Doctors’ Effect

If researchers wish to report a doctors’ effect that has been estimated, we recommend the following: Including “doctors” effect’ or “physicians” effect’ in the keywords and optionally in the title or abstract Using multivariable multilevel regression for the analysis with adjustments for patient risk, doctor experience, hospital effects, and any other potential confounding variable For describing fixed effects – grading doctors from worst to best, showing individual results for each doctor in a Table or a Figure For describing random effects, calculation of the intra-class correlation coefficient (ICC), describing the variation with 95% confidence interval and whether the outcome is a binary or continuous variable Making the dataset used for the analysis available for other researchers to conduct their own analyses.

What to Report

Observational Studies

We recommend reporting doctor effects after all available confounding variables have been taken into account, either by (a) the percentage of variation in the patient outcomes which is attributed to the doctor but unexplained by known attributes such as their experience, or (b) the ordering of doctors by their patients’ physical health outcomes, or (c) ideally both. Reporting this data offers the potential to identify both low and high outliers among doctors, as well as how much of an unexplained doctors’ effect there is on patient outcomes.

Data Points to Report

Table 4 lists the data points that are recommended to report. Table 5 shows a specific example of those reported data points employing the dataset used in Kunadian et al.40
Table 4

Data Points to Report

Data Points to ReportDescription
1. InterventionType of intervention
2. Type of studyWe do not recommend using cross-sectional studies (surveys), as response rates can introduce a selection bias. This does not concern patient-reported data recorded by the doctor, like levels of pain or mobility.
3. Count of doctorsCount of doctors overall. For randomized controlled trials, the count of doctors for each arm.
4. Count of patients or proceduresIf available, both patients and procedures.
Randomized controlled trials:In addition to the above: number of arms to the study.If the trial is not too large, a matrix showing how many patients of each arm were served by each doctor.
5. Count of higher aggregation, if any – hospital, practices, counties, statesIf there are more than two levels, ie not just patients/procedures and doctors, but also hospital, or medical practice, or county, or state, reporting their number could be useful. As there is a well-known hospital effect, distinguishing between hospital and doctors’ effects will be useful.
6. Outcome typeThe patients’ physical health outcomes measured, for example mortality, length of stay, complications, pregnancies, blood pressure or HbA1c levels under control/ not under control.Definitions for each outcome. For example, with mortality, whether it is in-hospital, 30-days, or five years. Whether the outcome is binary, ordinal, or continuous. If feasible, all 30-days, in-hospital, and longer times, if they are available.
7. Percentage of patients/procedures with this outcomeFor binary outcomes, the percentage of patients by doctor with that outcome – lowest percentage, highest, mean, and median.For ordinal or continuous outcomes, lowest, highest, average, mean, and median outcome by doctor.
8. Multivariate analysis (Y/N)Has there been a multivariate analysis, and which variables were considered for exclusion in the analysis, and which were included in the final analysis?
9. Volume effect Y/N/NS (NS=’not stated’)Was the number of patients/procedures per doctor included in the analysis?Was the effect, if any, reported as being substantial, not substantial, or not stated?
10. Observed vs expected recorded Y/N/NSWere investigations done to identify low and high outliers among the doctors, and their count, or proportion recorded?Were funnel plot(s) provided, pointing out 95% and optionally, 90% and or 99% outliers?Alternatively, a caterpillar plot, ie a fixed effect chart showing the patient outcome for each doctor, together with the individual doctor’s 95% CI, sorted by patient outcome, showing outliers among doctors.
Confidence interval options:115Binomial (normal distribution in patient outcomes)Delta method – what are the details, and how is it done?Other – bootstrap, simulation116
11. Percentage variation number/NSThe variation due to the doctors in the patients’ physical health outcome as a percentage of the total variance of all investigated levels, with 95% confidence levels. Optionally, absolute variance and total variance as well.
12. ICC calculated during multilevel, multivariate analysisAs the percentage of the total variance of all investigated levels is the definition of the ICC, reporting of the ICC (intra-class correlation coefficient) as such with 95% confidence intervals as a more detailed alternative to reporting only the variation.
13. Pre-shrinkage ICC calculated through simulationThe ICC calculated in multilevel analysis is often reported as lower than it really is due to shrinkage.46,47 In order to find the pre-shrinkage ICC, the following approach can be taken:Simulated datasets that have the same distribution as the doctor/patient clusters in the data investigated can be generated using increasing ICCs until a generating ICC is found that has the same post-shrinkage ICC as the dataset investigated. Reporting this pre-shrinkage ICC can be valuable, as it can be much larger than the post-shrinkage ICC when, for example, the patients’ physical effect is not common (under 10%).
Table 5

Data Points Reported by Kunadian et al

Data Points to ReportKunadian et al:40 an Example*
1. Type of interventionPercutaneous Coronary Interventions (PCI) in New York State 2002–2004, also known as angioplasty.
2. Type of studyCohort study from medical records.
3. Count of doctors261
4. Count of patients or procedures149,888 patients, procedures not stated.
5. Count of higher aggregation, if any – hospital, practices, counties, states48 hospitals
6. Outcome type30-day and 3-year mortality following PCI.
7. Percentage of patients/procedures with this outcomeOverall, 944 deaths out of 149,888 PCI procedures. After excluding patients listed as “All Other doctors in this hospital”, 912 deaths in 146,781 procedures.
8. Multivariate analysis (Y/N)Yes. Risk-adjusted mortality rate.
9. Volume effect Y/N/NS (NS=’not stated’)Yes. Neither the downloadable paper nor Kunadian state whether there is a volume effect for cardiologists. Kunadian states there is no significant relationship between hospital volume and risk of in-hospital death from these data.
10. Observed vs expected recorded Y/N/NSYes.
Were funnel plot(s) provided?Yes, provided in Kunadian as Figure 2.
Were caterpillar plots provided?Not by Kunadian et al40. See Figure 4 as provided by authors.
Were confidence intervals calculated?Neither the downloadable document nor Kunadian state how the confidence interval was calculated.
11. Percentage variation Number/NSNS
12. ICC calculated during multilevel, multivariate analysisICC was calculated by the authors of this paper to be 6.54%, 95% CI (4.32%, 9.79%).
13. Pre-shrinkage ICC calculated through simulationUsing simulated data with the same number of doctors, cases per doctor, and deaths per doctor, resulted in an average ICC of 6.48%, 95% CI (4.47%, 9.32%) after 550 simulations. Therefore, there is no substantial shrinkage at work, which is not unexpected as the mean number of cases per doctor is high at 558.

Notes: *Kunadian et al's 2009 paper40 refers to a version of the original dataset117 that can be freely downloaded and is sufficiently detailed for our purposes.

Data Points to Report Data Points Reported by Kunadian et al Notes: *Kunadian et al's 2009 paper40 refers to a version of the original dataset117 that can be freely downloaded and is sufficiently detailed for our purposes.

Strengths and Limitations

This is the first methodological review on the reporting of doctors’ effect on patient outcomes. The clarity and simplicity of how doctors’ and surgeons’ effects are described here and the suggested standardization of such reporting should allow meta-analysis to be conducted, allow robust identification of outliers, and make the re-analysis of much existing data feasible. However, a limitation is that, as all of the included studies were conducted in North America or Europe, it is unclear whether the findings can be generalized to other regions, particularly in developing nations.

Conclusion

A doctors’ effect on patients’ physical health can be measured and reported in two ways: Firstly, by calculating the percentage of variation in patients’ physical health outcomes due to the doctor in the form of the intra-class correlation coefficient (ICC). Secondly, by grading doctors from best to worst patients’ physical health outcomes, assigning a confidence interval to those outcomes, and reporting how many doctors’ confidence intervals fall wholly above or below the overall average. Ideally, both should be reported.
  105 in total

1.  A comparison of confidence interval methods for the intraclass correlation coefficient in cluster randomized trials.

Authors:  Obioha C Ukoumunne
Journal:  Stat Med       Date:  2002-12-30       Impact factor: 2.373

2.  Critical evaluation of the Newcastle-Ottawa scale for the assessment of the quality of nonrandomized studies in meta-analyses.

Authors:  Andreas Stang
Journal:  Eur J Epidemiol       Date:  2010-07-22       Impact factor: 8.082

3.  Predictors of mitral valve repair: clinical and surgeon factors.

Authors:  Steven F Bolling; Shuang Li; Sean M O'Brien; J Matthew Brennan; Richard L Prager; James S Gammie
Journal:  Ann Thorac Surg       Date:  2010-12       Impact factor: 4.330

4.  Profiling individual physicians using administrative data from a single insurer: variance components, reliability, and implications for performance improvement efforts.

Authors:  Frank Eijkenaar; René C J A van Vliet
Journal:  Med Care       Date:  2013-08       Impact factor: 2.983

Review 5.  Proportion of hospital readmissions deemed avoidable: a systematic review.

Authors:  Carl van Walraven; Carol Bennett; Alison Jennings; Peter C Austin; Alan J Forster
Journal:  CMAJ       Date:  2011-03-28       Impact factor: 8.262

6.  Randomized clinical trial comparing 5-year recurrence rate after laparoscopic versus Shouldice repair of primary inguinal hernia.

Authors:  D Arvidsson; F H Berndsen; L G Larsson; C-E Leijonmarck; G Rimbäck; C Rudberg; S Smedberg; L Spangen; A Montgomery
Journal:  Br J Surg       Date:  2005-09       Impact factor: 6.939

7.  A practical approach to a multi-level analysis with a sparse binary outcome within a large surgical trial.

Authors:  Jayne Fountain; James Gallagher; Julia Brown
Journal:  J Eval Clin Pract       Date:  2004-05       Impact factor: 2.431

8.  The Cataract National Dataset electronic multi-centre audit of 55,567 operations: variation in posterior capsule rupture rates between surgeons.

Authors:  R L Johnston; H Taylor; R Smith; J M Sparrow
Journal:  Eye (Lond)       Date:  2009-08-14       Impact factor: 3.775

9.  Variation in Bariatric Surgery Costs and Complication Rates in the Military Health System.

Authors:  William Patrick Luan; Todd C Leroux; Cara Olsen; Douglas Robb; Jonathan S Skinner; Patrick Richard
Journal:  Mil Med       Date:  2020-08-14       Impact factor: 1.437

10.  Variation in quality of diabetes care at the levels of patient, physician, and clinic.

Authors:  Patrick J O'Connor; William A Rush; Gestur Davidson; Thomas A Louis; Leif I Solberg; Lauren Crain; Paul E Johnson; Robin R Whitebird
Journal:  Prev Chronic Dis       Date:  2007-12-15       Impact factor: 2.830

View more
  1 in total

1.  Characteristics of Exceptionally Good Doctors: A Protocol for a Cross-Sectional Survey of Adults.

Authors:  Christoph Schnelle; Mark A Jones
Journal:  Patient Relat Outcome Meas       Date:  2022-08-12
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.