Literature DB >> 35502434

Is There a Surgeons' Effect on Patients' Physical Health, Beyond the Intervention, That Requires Further Investigation? A Systematic Review.

Christoph Schnelle1, Justin Clark1, Rachel Mascord2, Mark A Jones1.   

Abstract

Objective: To find and review published papers researching surgeons' effects on patients' physical health. Clinical outcomes of surgery patients with similar prognoses cannot be fully explained by surgeon skill or experience. Just as there are "hospital" and "psychotherapist" effects, there may be "surgeons" effects that persist after controlling for known variables like patient health and operation riskiness.
Methods: Cohort studies and randomized controlled trials (RCTs) of any surgical intervention, which, after multivariate adjustment, either showed proportion of variance in patients' physical health outcomes due to surgeons (random effects) or graded surgeons from best to worst (fixed effects). Studies with <15 surgeons or only ascribing surgeons' effects to known variables excluded. Medline, PubMed, Embase, and PsycINFO were used for search until June 2020. Manual search for papers referring/referred by resulting studies. Risk of bias assessed by Cochrane risk-of-bias tool and Newcastle-Ottawa Scale.
Results: Included studies: 52 cohort studies and three RCTs of 52,436+ surgeons covering 102 outcomes (33 unique). Studies either graded surgeons from best to worst or calculated the intra-class correlation coefficient (ICC), the percentage of patients' variation due to surgeons, in diverse ways. Sixteen studies showed exceptionally good and/or bad performers with confidence intervals wholly above or below the average performance. ICCs ranged from 0 to 47%, median 4.0%. There are no well-established reporting standards; highly heterogeneous reporting, therefore no meta-analysis. Discussion: Interpretation: There is a surgeons' effect on patients' physical health for many types of surgeries and outcomes, ranging from small to substantial. Surgeons with exceptional patient outcomes appear regularly even after accounting for all known confounding variables. Many existing cohort studies and RCTs could be reanalyzed for surgeons' effects especially after methodological reporting guidelines are published.
Conclusion: In terms of patient outcomes, it can matter which surgeon is chosen. Surgeons with exceptional patient outcomes are worth studying further.
© 2022 Schnelle et al.

Entities:  

Keywords:  clinical competence; delivery of health care; doctors’ effect; physicians; physicians’ effect; practice effect; professional practice gap; quality of health care; surgeons’ practice pattern; therapists’ effect

Year:  2022        PMID: 35502434      PMCID: PMC9056050          DOI: 10.2147/TCRM.S357934

Source DB:  PubMed          Journal:  Ther Clin Risk Manag        ISSN: 1176-6336            Impact factor:   2.755


Introduction

What is already known on this topic: Previous research has shown associations between characteristics of surgeons, such as their level of surgical experience, and patient health outcomes. It is unclear whether surgeons have an influence on patients’ physical health that has not been captured by known variables and how large that influence is. What this study adds: This study is the first systematic review of unexplained surgeons’ influence on patients’ physical health. Findings are highly variable, depending on the type of outcome and surgery that can result in substantial differences in patient health outcomes between surgeons.

Rationale

If you want to find a good surgeon, an internet query will provide advice from many sources.3–5 There are also databases of what the database provider considers to be the best surgeons, calculated from raw death and complication rates plus other doctors’ recommendations.6 Surgeons themselves have given their opinion on what makes a good or outstanding surgeon,7–11 with Barry Jackson’s essay perhaps being the most comprehensive.12 However, this information mostly relies on personal experiences, although Jackson’s essay does mention “First-class outcomes after allowing for case-mix”. Existing evidence suggests that some surgeons are more effective at applying interventions than others as there is, for example, a substantial volume effect, ie case volume, and years of practice effect in a number of surgical specialties.13,14 In fact, there are few studies where author-selected outstanding practitioners are investigated,15–17 with only Schenck et al mentioning surgeons. It is well established that there is a hospital effect, ie that hospitals have a substantial influence on patients’ health outcomes and that there are wide variations in patients’ health outcomes between hospitals.18–23 There is also substantial research on a therapist effect in psychotherapy with wide variations among therapists, so much so that this finding has made it into training material for psychotherapists.24,25 Recent research also suggests that provider expectations could have a causal role in treatment effectiveness.26 At the same time, the placebo effect, which can be substantial,27 including in surgery28–31 with some dissent for orthopedic surgery,32 is suggestive of a surgeon’s effect. The placebo effect shows that even with an inert or inactive intervention, there is an effect on patients. It is possible that part of this effect is due to the surgeon administering the placebo, usually a type of sham surgery.28 However, there are currently no well-established standards on how to assess surgeons as an intervention in their own right or as an effect modifier of a given intervention. Recent research has endeavored to analyze the effect size of surgeons33 by investigating 10 surgical trials for surgeon intra-cluster correlation coefficients (ICCs) and found that surgeons have a range of effects on patient health that differ between surgical specialties. In the study by Udyavar et al34 of 2149 surgeons performing 569,767 emergency surgeries it was shown that in five out of seven types of surgery, surgeons were responsible for 23% to 47% of the variability in patient mortality. This difference in outcome could not be explained by the choice of treatment, prognostic or diagnostic factors, patient clinical or demographic factors, hospital-level factors, or surgeon volume. To date studies such as Udyavar et al have not been synthesized. In this systematic review we have addressed this gap in the literature.

Objectives

This systematic review aims to identify and evaluate all the research to date examining the effect of surgeons on patient physical health outcomes after known variables have been accounted for. It is part of a larger research project that includes a systematic review of non-surgical practitioners, and a methodological study on how to report practitioners’ effects on patients’ physical health.

Methods

Eligibility Criteria

A systematic review was conducted following Synthesis without meta-analysis (SWiM) guidelines.35 This review limits itself to studies that investigated actual patients’ physical outcomes and excluded studies that focused on patients’ opinions or satisfaction levels, with the rationale that these outcomes are often a more ambiguous way to measure surgeons’ effects.36 The PICO is as follows:

Information Sources and Search Strategy

We initially searched three databases: Medline via its PubMed interface, Embase, and PsycINFO from inception to June 2020 to identify relevant studies that investigate the influence of surgeons on patients’ physical health outcomes. The search strategy used for each database is reported in and was designed by JMC, a specialist in this area. In addition to the electronic search of databases, we further manually searched the references lists of the eligible articles and previous systematic reviews to identify potentially relevant studies that did not appear in the literature search. The following systematic review registries were searched for similar reviews: PROSPERO and Cochrane’s CENTRAL register. One study was suggested by a reviewer.37

Selection Process and Further Eligibility Criteria

Two reviewers independently screened titles/abstracts for inclusion. Any discrepancies were resolved by discussion or in consultation with a third reviewer. Study designs considered for inclusion were retrospective and prospective observational studies, case-control studies, and randomized controlled studies, where either the proportion of variance in patient outcomes explained by differences between practitioners, ie practitioners’ random effects, are measured, or the difference between the individual practitioners is highlighted, ranging from best to worst, ie practitioners’ fixed effects are measured. Any medical practitioner except psychotherapists were included. At this stage both surgeons and doctors who were not surgeons were included, however this paper only includes studies of surgeons. All other medical doctors are reviewed in a separate paper. Any patient’s physical health-related outcome was eligible, examples of which are repair reoperations, readmission rate, survival/mortality rate, embryo transfer rate, length of hospital stay, infection rate, estimated blood loss, recurrence rates, pain, and other post-operative complications. There were no date or language restrictions. We excluded studies that only ascribed a surgeons’ effect to particular surgeon-related variables, such as volume of procedures performed or specialty of surgeon; studies with fewer than 15 surgeons; cross-sectional studies, ie surveys of doctors or patients, as they had an increased risk of bias; and two studies that mentioned fixed or random effects but did not actually list the effects either graphically or in numerical form.38,39 The authors could not find a recommendation for the minimum number of clusters in a study for a systematic review – in this case the minimum number of practitioners. We took 15 practitioners as the smallest cluster size but appreciate that this is an arbitrary number. (Figure 1).
Figure 1

Flow diagram of selection of included documents.

Flow diagram of selection of included documents.

Data Collection Process and Data Items

Titles and abstracts were collected using Endnote 9 and uploaded into Rayyan for inclusion or exclusion where the two reviewers independently screened titles and abstracts. The resulting eligible studies were marked as members of a group in the original Endnote library and their full text documents were added to the library. CS and a second extractor independently and in duplicate extracted the relevant data from each eligible study and collected the following variables using Excel: Unique publication identifier consisting of first author and year Surgical specialty Type of study (RCT, Cohort) Type of intervention (can be multiple) Outcome type (multiple) Significant surgeons’ effect as per authors’ evaluation Y/N Number of surgeons Number of patients or procedures Number of hospitals/institutions ICC (intra-class correlation coefficient) Number/NS Multivariate analysis Y/N Number of negative and positive outliers Country of origin

Study Risk of Bias Assessment

Two reviewers independently used the Cochrane risk-of-bias assessment tool2 for the three included randomized controlled studies (Figures 2 and 3), and the Newcastle-Ottawa Scale (NOS) for the included cohort studies.40,41
Figure 2

First risk of bias chart for the three randomized controlled trials included.

Figure 3

Second risk of bias chart for the three randomized controlled trials included.

First risk of bias chart for the three randomized controlled trials included. Second risk of bias chart for the three randomized controlled trials included.

Effect Measures

The metric for the fixed effects is the percentage of positive and negative outliers as defined in the individual study reports. The metric for the random effects is the variance due to the practitioner or the intra-class correlation coefficient, defined as the variation in patient outcome due to the practitioner as a percentage of the total variation.

Synthesis Methods

As the data are highly heterogenous and there are no established standards on recording doctors’ effects or surgeons’ effects, no statistical synthesis was used. There were 14 surgical specialties plus two papers covering multiple surgeries, 50 separate interventions and 31 separate outcomes. The surgeons’ effect on patients’ physical health is described in two ways, using multilevel mixed effects regression modelling or hierarchical regression to understand both surgeon and system-level variation.42,43

Percentage of Variation in Patient Outcome Due to the Surgeon in the Form of the Intra-Class Correlation Coefficient (ICC)

Post-regression estimation gives the ICC, which as a number ranging from 0 to 1, gives the percentage of variation in outcome due to each level in the regression model. For example, in a three-level model of patients clustered per doctor, who in turn were clustered within hospitals, each level has an ICC with the total ICCs adding up to 1. In order to realize this, the studies included random effects for surgeons, and at times hospitals or other aggregators, such as county. Patient risk scores and other available variables like surgeon demographic data or year of intervention were included as fixed effects in the regression analysis. The quality and depth of the analysis varied greatly between papers. Confidence intervals for the ICC were not reported.44 A high quality study is Papachristofi et al.45 There is also substantial other research on the ICC.46–51

Grading Surgeons from Best to Worst

In this approach surgeons are ranked by their patient results, usually with a 95% confidence interval and either the raw, unadjusted scores are reported, or patient risk scores and/or surgeon demographic variables and other data, such as year of operation, are included in the model. In the majority of cases the method to calculate the interval is not mentioned, though there are exceptions52–59 and surgeons whose 95% confidence intervals rank wholly above or below the mean rate of outcomes are considered to be outliers. Reporting is done by listing the count of outliers, or graphically through a caterpillar or a funnel plot,60 with a caterpillar plot being an outcome-ordered forest plot.

Reporting Bias and Certainty Assessment

Due to there being no synthesis, reporting bias and certainty assessments were not undertaken.

Results

Study Selection

Overall, 4713 records were identified from electronic records, in addition to 6461 from other sources. After removing the 1224 duplicates, 10,239 studies underwent screening for eligibility. Then, full-text versions were retrieved for 471 records. One study was added by a reviewer. Finally, after exclusion of ineligible articles, 55 studies of more than 52,436 surgeons were included in the final synthesis.

Study Characteristics

The 55 studies that are included reported 102 outcomes, 33 of which are unique. Of the outcomes, 28 (20 studies)55–57,59,61–76 graded individual surgeons’ performance from best to worst; 38 (12 studies)34,77–87 recorded an ICC due to surgeons in a multivariate multi-level analysis; 14 (8 studies)44,45,53,58,88–91 recorded both; 20 (13 studies)37,52,92–102 provided a non-standard description of fixed effects; and 1 provided an ICC plus a non-standard description of fixed effects.103 One study104 graded surgeons from best to worst in one outcome (complications) and used a non-standard fixed effects description for another outcome (mortality). Of the 55 studies, three were randomized controlled trials,37,80,93 and 52 were observational cohort studies. The studies included various surgical specialties or aggregates thereof, including 8 or more specialties,81,91 breast surgery,59,73,74,101 cardiac surgery,44,45,52,53,56,65–67,70,89,90,99,100 colorectal surgery,34,61,68,71,78,85,92,103,104 ENT surgery,75 gastrointestinal surgery,34,83 general surgery,34,37,57,77,79,82,86,93,95,96,102 obstetrics,80 ophthalmology,97 orthopedic surgery,55,76,84,102 rectal surgery,72,98 spinal surgery,58 trauma surgery,87 and urology.62–64,69,88,94,102 38 studies were conducted in the USA, 10 in the UK, two in Austria and Sweden, one each in Canada, France, and Germany. The volume of included surgeons ranged from 17 to 14,598. The characteristics of the included studies are summarized in Table 1.
Table 1

Characteristics of Included Studies

PublicationSpecialtyDetailed InterventionSurgeonsPatients/ProceduresInstitutionsOutcomeNOS **
Anderson, 201652Cardiac surgeryNorwood operationNS288035Mortality9
Aquina, 2015a92 pg e163Colorectal surgeryColorectal resectionNS158,596NSC. difficile infection9
Aquina, 2015b61Colorectal surgeryUpper GI cancer resection22314,87599Blood transfusion, wound infection, pneumonia, sepsis9
Aquina, 2016103Colorectal surgeryColorectal resection3481125,160210Blood transfusion, wound infection, pneumonia, sepsis, intra-abdominal abscess9
Aquina, 201777General surgeryInguinal hernia operation1572124,416260Reoperation9
Ventral hernia operation201278,267256Reoperation9
Arvidsson, 200593General surgeryHernia operation2510687RecurrenceRCT
Becerra, 201778Colorectal surgeryLymph node examination in colectomy150312,332187Suboptimal care9
Begg, 200262UrologyRadical prostatectomy15910,73772Postoperative complications9
Incontinence9
Late urinary complications9
Mortality9
Bianco, 200563UrologyRadical prostatectomy1595238NSComplications9
Incontinence9
Late urinary complications9
Bianco, 201064UrologyRadical prostatectomy5477254Cancer recurrence9
Bolling, 201065Cardiac surgeryMitral valve repair/replacement108828,507639Mitral valve repair rates9
Bridgewater, 200367Cardiac surgeryCoronary artery surgery2385724Mortality9
Bridgewater, 200566Cardiac surgeryAortic valve surgery2510974Mortality8
Coronary artery surgery2590664Mortality8
Burns, 201168Colorectal surgeryColorectal surgery1557246,469156Reoperation9
Cromwell, 201369Urology and GynecologyUrinary-genital tract fistula4901194129Reoperation8
Dagenais, 201988UrologyPartial nephrectomy1914611Estimated blood loss9
Duclos, 201279General surgeryThyroid surgery2835745Hypoparathyroidism9
Recurrent laryngeal nerve palsy9
Eastham, 200394UrologyRadical prostatectomy4446292Positive surgical margins9
Eklund, 200937General surgeryInguinal hernia repair surgery481275NSRecurrenceRCT
Faschinger, 201195General surgeryCataract surgery1736,3291Capsule rupture9
Fountain, 200480ObstetricsHysterectomy, Abdominal43*87628*ComplicationsRCT
Hysterectomy, Vaginal43*50428*ComplicationsRCT
Gani, 201581[See on right]8 (cardiac, GI* surgery, trauma, HPB*, BME*, thoracic, transplant, vascular)5622,5591Readmission9
Glance, 200653Cardiac surgeryCardiac surgery13851,75033Mortality9*
Glance, 201689Cardiac surgeryCABG*24155,43640Major complications or mortality9
Grant, 200870Cardiac surgeryCardiac surgery3114,6374Mortality9
Healy, 201771Colorectal surgeryMinimally invasive colectomy97311846Complications8
Open colectomy97207846Complications8
Hermanek, 199972Rectal surgeryRectal carcinoma resection4311217Mortality9
Hermann, 200296General surgeryPrimary surgery for benign thyroid disease2016,4431Recurrent laryngeal nerve injury (RLNI)8
Hoffman, 201782General surgeryGeneral surgery1128183,283601Complications9
Huesch, 200956Cardiac surgeryCABG*398221,32775Mortality8
Hyder, 201383Gastrointestinal surgeryPancreatoduodenectomy5751488298Readmission9
Johnston, 201097OphthalmologistCataract surgery40455,51512Posterior capsule rupture (PCR)8
Justiniano, 201998Rectal surgeryRectal carcinoma resection3451251118Mortality9
Kaczmarski, 201973Breast surgeryBreast-conserving surgery5337291,065NSReoperation9
Kissenberth, 201884Orthopedic surgeryRotator cuff repair571703NSSingle Assessment Numeric Evaluation (SANE) score8
Landercasper, 201974Breast surgeryBreast-conserving surgery713954NSReoperation9
LaPar, 201499Cardiac surgeryMitral valve repair/replacement93419417Lack of repair8
Likosky, 2012100Cardiac surgeryCABG*3211,8388Postoperative low-output failure9
Luan, 201957General surgeryBariatric surgery38127721Complications9
Martin, 201358Spinal surgeryLumbar fusion298609143Complications9
Martin, 201358Spinal surgeryLumbar fusion298609143Reoperation9
McCahill, 201259Breast surgeryBreast-conserving surgery5422064Reoperation9
Papachristofi, 201490Cardiac surgeryCardiac surgery1818,4261Mortality9
Papachristofi, 201644Cardiac surgeryCardiac surgery127110,76910Mortality9
Papachristofi, 201745Cardiac surgeryCardiac surgery127107,03810Length of stay9
Quinn, 201891All surgeriesAll surgeries2724123,14151Any morbidity9
Death or serious morbidity9
Mortality9
Readmission9
Reoperation9
Surgical site infection9
Rudmik, 201775ENT surgeryEndoscopic sinus surgery432168NSESS* revision rate9
Schumacher, 2017101Breast surgeryBreast-conserving surgery93347056Reoperation7
Shih, 201585Colorectal surgeryColectomy345503324Complications9
Singh, 201876Orthopedic surgery/NeurosurgerySpine surgery398739,884NSLength of stay8
Readmission8
Thigpen, 201855Orthopedic surgeryRotator cuff repair349951ASES* performance score8
Udyavar, 2018a34Colorectal surgeryColectomy2149*569,767*225*Complications9
Mortality9
Readmission9
Gastrointestinal surgeryPeptic ulcer disease2149*569,767*225*Complications9
Mortality9
Readmission9
Small bowel resection2149*569,767*225*Complications9
Mortality9
Readmission9
General surgeryAppendectomy2149*569,767*225*Complications9
Mortality9
Readmission9
Cholecystectomy2149*569,767*225*Complications9
Mortality9
Readmission9
Laparotomy2149*569,767*225*Complications9
Mortality9
Readmission9
Lysis of adhesions2149*569,767*225*Complications9
Mortality9
Readmission9
Overall2149569,767225Complications9
Mortality9
Readmission9
Udyavar, 2018b87Trauma surgeryTrauma surgery17565,70631Mortality9
Udyavar, 201986General surgeryEmergency surgery5816215,745198Complications9
Xu, 2016104Colorectal surgeryColectomy276252544Complications9
Mortality9
Xu, 2019102General surgeryLaparoscopic cholecystectomy24761,884,842*NSComplications8
UrologyOther transurethral prostatectomy1663*1,884,842*NSComplications8
Radical prostatectomy1663*1,884,842*NSComplications8
Orthopedic surgeryCervical spinal fusion10,459*1,884,842*NSComplications8
Lumbar spinal fusion, anterior column10,459*1,884,842*NSComplications8
Lumbar spinal fusion, posterior column10,459*1,884,842*NSComplications8
Total hip arthroplasty10,459*1,884,842*NSComplications8
Total knee replacement10,459*1,884,842*NSComplications8

Notes: *The values are for the whole study population. Values for each subgroup were not reported.

Abbreviations: **ASES, American Shoulder and Elbow Surgeons; BME, breast, melanoma, and endocrine surgery; CABG, coronary artery bypass grafting; ESS, endoscopic sinus surgery; EQ-5D, quality of life via the Euro-Qol; GI, gastro-intestinal; HPB, hepatopancreatic biliary; NOS, Newcastle-Ottawa Scale for assessing risk of bias of cohort studies; NS, not stated, the number is not given and most likely greater than one.

Characteristics of Included Studies Notes: *The values are for the whole study population. Values for each subgroup were not reported. Abbreviations: **ASES, American Shoulder and Elbow Surgeons; BME, breast, melanoma, and endocrine surgery; CABG, coronary artery bypass grafting; ESS, endoscopic sinus surgery; EQ-5D, quality of life via the Euro-Qol; GI, gastro-intestinal; HPB, hepatopancreatic biliary; NOS, Newcastle-Ottawa Scale for assessing risk of bias of cohort studies; NS, not stated, the number is not given and most likely greater than one.

Risk of Bias in Studies

For the cohort studies, of 97 outcomes in 52 studies, (1) scored 7 stars, (21) 8 stars and (75) 9 stars out of a maximum of 9 stars on the Newcastle-Ottawa Scale.40,41 All studies scored the maximum points on the selection criteria and the outcome criteria. Those with 7 and 8 stars scored either 0 or 1 on comparability while the 9-star studies scored 2 (Table 1). The detailed risk of bias assessment of the three randomized controlled trials, using Cochrane RoB, is described in Figures 2 and 3, and .

Results of Individual Studies

Altogether 10 studies published caterpillar plots59,61,64,67,71,74–76,89,91 and five studies presented funnel plots.65,66,68–70 The plots showed the performance of surgeons for a particular patient outcome, usually sorted by performance, providing a 95% confidence interval for each surgeon and indicating whether that confidence interval was wholly above or below the average performance. Results ranged from no over- or underperformer62,66,67,69,70,91 to substantial numbers of both.59,61,62,64,72,73,75,76 Of the papers that reported fixed effects, 15 recorded exceptional performers after taking account of all known variables, including demographic variables of the practitioners, such as experience, volume of patients/procedures, and hospital effects (which themselves can be substantial).55,59,61–66,68,71,73–76,91 Other studies (n=22) published a random effect, worded many different ways, that showed the Intra-Class Correlation (ICC) effect.34,44,45,53,57,58,77–91,103 The random effects reported ranged from zero (ICC of 0.0%) to substantial (ICC of 10% or higher). (Tables 2 and 3, and Figures 4 and 5).
Table 2

Publications by Outcome and Numerical Results

OutcomeSpecialtyDetailed InterventionPublicationICC^Outliers %
NegativePositive
ComplicationsAny morbidityAll surgeriesAll surgeriesQuinn, 2018912.2%0.18%0.22%
Blood transfusion, wound infection, pneumonia, sepsisColorectal surgeryUpper GI cancer resectionAquina, 2015b6113.0%28.0%
Blood transfusion, wound infection, pneumonia, sepsis, intra-abdominal abscessColorectal surgeryColorectal resectionAquina, 201610324.3%OtherOther
C. difficile infectionColorectal surgeryColorectal resectionAquina, 2015a92 pg e163OtherOther
Capsule ruptureGeneral surgeryCataract surgeryFaschinger, 201195OtherOther
Complications (postoperative)Colorectal surgeryColectomyShih, 20158514.0%
Udyavar, 2018a342.3%
Xu, 20161043.3%NS
Minimally invasive colectomyHealy, 20177110.3%7.2%
Open colectomyHealy, 2017719.3%5.2%
Gastrointestinal surgeryPeptic ulcer diseaseUdyavar, 2018a340.03%
Small bowel resectionUdyavar, 2018a340.02%
General surgeryAppendectomyUdyavar, 2018a340.2%
Bariatric surgeryLuan, 2019572.6%15.8%
CholecystectomyUdyavar, 2018a340.1%
Emergency surgeryUdyavar, 20198627.3%
General surgeryHoffman, 2017826.2%
Laparoscopic cholecystectomyXu, 2019102OtherOther
LaparotomyUdyavar, 2018a340.1%
Lysis of adhesionsUdyavar, 2018a340.0%
Overall (Emergency general surgeries)Udyavar, 2018a340.1%
ObstetricsHysterectomy, AbdominalFountain, 2004807.4%
Hysterectomy, VaginalFountain, 2004800.5%
Spinal surgeryLumbar fusionMartin, 2013582.6%3.7%0.0%
UrologyOther transurethral prostatectomyXu, 2019102OtherOther
Radical prostatectomyBegg, 2002628.0%3.0%
Bianco, 200563 *7.5%2.5%
Xu, 2019102OtherOther
Orthopedic/Neuro-surgeryCervical spinal fusionXu, 2019102OtherOther
Lumbar spinal fusion, anterior columnXu, 2019102OtherOther
Lumbar spinal fusion, posterior columnXu, 2019102OtherOther
Total hip arthroplastyXu, 2019102OtherOther
Total knee replacementXu, 2019102OtherOther
Death or serious morbidityAll surgeriesAll surgeriesQuinn, 2018912.0%0.15%0.15%
Estimated blood lossUrologyPartial nephrectomyDagenais, 201988 ##14.4%10.5%10.5%
HypoparathyroidismGeneral surgeryThyroid surgeryDuclos, 20127932.0%
IncontinenceUrologyRadical prostatectomyBegg, 2002629.0%3.0%
Bianco, 200563 *9.4%2.5%
Late urinary complicationsUrologyRadical prostatectomyBegg, 20026213.0%14.0%
Bianco, 200563 *13.2%14.5%
Major complications or mortalityCardiac surgeryCardiac surgeryGlance, 2016891.76%3.3%1.7%
Posterior capsule rupture (PCR)OphthalmologistCataract surgeryJohnston, 201097OtherOther
Recurrent laryngeal nerve injury (RLNI)General surgeryPrimary surgery for benign thyroid diseaseHermann, 200296OtherOther
Recurrent laryngeal nerve palsyGeneral surgeryThyroid surgeryDuclos, 20127910.0%
Surgical site infectionAll surgeriesAll surgeriesQuinn, 2018914.5%0.29%0.07%
Length of stayCardiac surgeryCardiac surgeryPapachristofi, 2017452.79%11.8%14.2%
Orthopedic/Neuro-surgerySpine surgerySingh, 20187610.0%7.2%
MortalityAll surgeriesAll surgeriesQuinn, 2018911.4%0.0%0.0%
Cardiac surgeryAortic valve surgeryBridgewater, 200566 *0.0%0.0%
CABGHuesch, 200956 ***1.2%Other
Glance, 2006535.9%3.3%8.7%
Grant, 200870 *0.0%0.0%
Cardiac surgeryPapachristofi, 2014902.79%16.7%0.0%
Papachristofi, 2016444.0%15.0%6.3%
Coronary artery surgeryBridgewater, 2003670.0%0.0%
Bridgewater, 200566 *0.0%16.0%
Norwood operationAnderson, 201652OtherOther
Colorectal surgeryColectomyUdyavar, 2018a3422.9%
Xu, 2016104OtherOther
Gastrointestinal surgeryPeptic ulcer diseaseUdyavar, 2018a3447.3%
Small bowel resectionUdyavar, 2018a3423.1%
General surgeryAppendectomyUdyavar, 2018a346.9%
CholecystectomyUdyavar, 2018a343.5%
LaparotomyUdyavar, 2018a3433.2%
Lysis of adhesionsUdyavar, 2018a3435.5%
Overall (Emergency general surgeries)Udyavar, 2018a3432.7%
Rectal surgeryRectal carcinoma resectionHermanek, 1999729.3%16.3%
Justiniano, 201998OtherOther
Trauma surgeryTrauma surgeryUdyavar, 2018b878.7%
UrologyRadical prostatectomyBegg, 2002620.0%0.0%
Readmission8 (Cardiac, GIS, Trauma, HPB, BME, Thoracic, Transplant, Vascular)8 (Cardiac, GIS, Trauma, HPB, BME, Thoracic, Transplant, Vascular)Gani, 2015812.8%
All surgeriesAll surgeriesQuinn, 2018910.7%0.0%0.0%
Colorectal surgeryColectomyUdyavar, 2018a343.1%
Gastrointestinal surgeryPancreatoduodenectomyHyder, 2013830.3%
Peptic ulcer diseaseUdyavar, 2018a346.8%
Small bowel resectionUdyavar, 2018a342.9%
General surgeryAppendectomyUdyavar, 2018a343.5%
CholecystectomyUdyavar, 2018a343.0%
LaparotomyUdyavar, 2018a346.0%
Lysis of adhesionsUdyavar, 2018a344.9%
Overall (Emergency general surgeries)Udyavar, 2018a342.3%
ReoperationReoperationAll surgeriesAll surgeriesQuinn, 2018913.8%0.04%0.0%
Breast surgeryBreast-conserving surgeryKaczmarski, 20197317.5%3.7%
Landercasper, 2019745.7%4.3%
McCahill, 20125913.0%31.5%
Schumacher, 2017101OtherOther
Colorectal surgeryColorectal surgeryBurns, 201168 **0.7%4.5%
General surgeryHernia operationArvidsson, 200593OtherOther
Inguinal hernia operationAquina, 20177740.5%
Ventral hernia operationAquina, 20177714.0%
Spinal surgeryLumbar fusionMartin, 2013589.0%##
UrologyUrinary-genital tract fistulaCromwell, 201369 **0.0%0.0%
Suboptimal careColorectal surgeryLymph node examination in colectomyBecerra, 2017787.9%
Success or failureASES scoreOrthopedic surgeryRotator cuff repairThigpen, 2018555.9%8.8%
Cancer recurrenceUrologyRadical prostatectomyBianco, 2010648.3%36.1%
ESS revision rateENT surgeryEndoscopic sinus surgeryRudmik, 20177516.3%4.7%
Mitral valve repair ratesCardiac surgeryMitral valve repair/replacementBolling, 2010656.6%7.4%
LaPar, 201499OtherOther
Positive surgical marginsUrologyRadical prostatectomyEastham, 200394OtherOther
Postoperative low-output failureCardiac surgeryCABGLikosky, 2012100OtherOther
ReadmissionOrthopedic/Neuro-surgerySpine surgerySingh, 2018760.1%0.03%
RecurrenceGeneral surgeryInguinal hernia repair surgeryEklund, 2009372.1%
ScoreASES^ scoreOrthopedic surgeryRotator cuff repairThigpen, 2018555.9%8.8%
Single Assessment Numeric Evaluation (SANE) scoreOrthopedic surgeryRotator cuff repairKissenberth, 20188444.0%

Notes: ^ASES score is American Shoulder and Elbow Surgeons (ASES) performance score; ICC is intra-class correlation coefficient and shows percentage of variance due to practitioner as percentage of total variance after accounting for all known variables. Outliers are listed for papers where the surgeons were ordered in their effect on patients’ physical health from best to worst or vice versa. The percentages listed are those practitioners whose 95% confidence interval is wholly below or above the mean. Outliers listed as “Other” sorted their surgeons by physical patient effect but used a different way to present their data. Common examples are a caterpillar plot without confidence intervals or a bar chart. *99% confidence interval to define outliers used. **99.8% confidence interval to define outliers used. ***90% confidence interval to define outliers used. #Graph too small to calculate positive or negative outliers. ##Dagenais et al88 also shows precisely 0.00 between-surgeon variance for length of stay, glomerular filtration rate (GFR) preservation, positive margins, chronic kidney disease (CKD) upstaging, Clavien grade ≥ 1 complications, and 30-day readmission. Operative time had an ICC of 33.4%.

Table 3

ICC Summary Statistics

ICC n=53 Outcomes
Minimum0.001%
Maximum47.3%
Average10.2%
Median4.0%
IQR2.2–14.0%
Standard deviation0.13
Figure 4

Boxplot of ICC (intra-class correlation coefficient).

Figure 5

Intra-class correlation coefficients (ICCs) by paper, intervention, and patient outcome.

Publications by Outcome and Numerical Results Notes: ^ASES score is American Shoulder and Elbow Surgeons (ASES) performance score; ICC is intra-class correlation coefficient and shows percentage of variance due to practitioner as percentage of total variance after accounting for all known variables. Outliers are listed for papers where the surgeons were ordered in their effect on patients’ physical health from best to worst or vice versa. The percentages listed are those practitioners whose 95% confidence interval is wholly below or above the mean. Outliers listed as “Other” sorted their surgeons by physical patient effect but used a different way to present their data. Common examples are a caterpillar plot without confidence intervals or a bar chart. *99% confidence interval to define outliers used. **99.8% confidence interval to define outliers used. ***90% confidence interval to define outliers used. #Graph too small to calculate positive or negative outliers. ##Dagenais et al88 also shows precisely 0.00 between-surgeon variance for length of stay, glomerular filtration rate (GFR) preservation, positive margins, chronic kidney disease (CKD) upstaging, Clavien grade ≥ 1 complications, and 30-day readmission. Operative time had an ICC of 33.4%. ICC Summary Statistics Boxplot of ICC (intra-class correlation coefficient). Intra-class correlation coefficients (ICCs) by paper, intervention, and patient outcome. Only for a. complications after colectomy and b. mortality after cardiac surgery was there more than one study included that reported an ICC. As these are the only outcomes with multiple ICCs, a more detailed analysis follows: For colectomy, Shih et al85 reported an ICC of 14.0% and Udyavar et al34 an ICC of 2.3%. Udyavar defined complications as any of “pulmonary embolism, sepsis, myocardial infarction, acute renal failure, and cardiac arrest” while Shih defined a much longer list of items as complications, including surgical site infection; wound disruption; multiple types of infection; unplanned intubation; transfusion; multiple stroke or clotting diagnoses; multiple heart issues; renal complications or failure; extended coma or mechanical ventilation; nerve damage; failure of the graft or prosthesis; bowel obstruction; and anastomotic leak. For mortality after cardiac surgery three studies44,53,90 reported an ICC of 2.8% to 5.9% (Table 2).

Results of Syntheses, Reporting Biases and Certainty of Evidence

Not applicable as there was no synthesis.

Discussion

In this review, the objective was to determine whether there is a surgeons’ effect on patients' physical health that is apparent even after accounting for all known variables, such as level of experience. Included studies graded surgeons in order of performance or listed the proportion of variation that is due to practitioners after taking account of all known variables. All but three studies were cohort studies. The other three studies were randomized controlled trials. Findings showed substantial heterogeneity that may be related to type of surgery and type of outcome. After accounting for surgeons’ experience, patients’ risk, and all other known variables, there remained at times substantial differences in patients’ physical health outcomes between surgeons. More than a quarter of all studies (15 out of 55) showed high-volume outliers whose performance is well above the average. In contrast, there were types of surgery/intervention/outcome combinations that showed little evidence of a surgeons' effect on patients' physical health. These findings are somewhat consistent with the substantial body of research on a therapist effect in psychotherapy showing a wide variation in patient outcomes. With two exceptions the authors only found one study per combination of surgical specialty, intervention and patient outcome. The first exception was two studies covering complications after colectomy and they had very different ICCs of 14.0%85 and 2.3%.34 It seems the much wider definition of “complication” in Shih led to a bigger influence of surgeons on the outcome and therefore a higher ICC. The second exception was for mortality after cardiac surgery with three studies44,53,90 reporting an ICC of 2.8% to 5.9%. Clearly, standardized definitions of physical patient outcomes would assist comparisons across studies. A major limitation of the evidence identified in this review is that there is currently no standard way to report surgeons’ performance on patient's physical health. What does get reported can be divided into either grading individual surgeons by performance or calculating the percentage of variation in patients’ outcome that is due to the surgeon after all known variables have been taken into account. Both types of reporting are worded in many different ways, making discovery of such research difficult as can be seen in that more than 10,000 publications had to be reviewed. A further limitation is that only for very few papers the primary purpose was to report surgeons’ performance after taking account of all known variables. Much of the time the reviewed publications’ authors emphasized other aspects of healthcare.

Summary

In terms of this systematic review, it was revealed that surgeons’ performance data on physical patient health is available to the authors of many published research studies. However, this data is in most cases either not at all reported or only in a limited way. This data could easily be included in an article prepared for publication as the data is already available and often requires minimal or no extra analysis to provide it in the format recommended in the methodological review that is reviewed for publishing. Publishing this data will also allow these studies to be part of future meta-analyses, gaining further dissemination of the work. It seems that the possibility that surgeons are an intervention in their own right, an intervention that can be more or less effective and an intervention whose effect can be measured, is an area where there has been little systematic research. This is despite the fact that in psychotherapy it is well established that doctors (therapists) constitute an intervention in their own right, independent of the actual intervention they use.24,25 Furthermore, if the intervention is held constant, then surgeons are an effect modifier whose strength varies substantially depending on the intervention and the patients’ physical health outcome measured. If it can be established when and how much surgeons constitute an intervention or a substantial effect modifier in their own right, independent of the intervention they use, then this opens up the possibility that this intervention (surgeons) can be systematically managed and improved to the benefit of patients, the surgeons themselves, and the entire health system. None of the studies that identified outstanding surgeons61,64–66,68,71,74–76,89,91 made any recommendations on how to use this potential quality improvement resource. So far, we see little or no evidence in the literature that even when exceptional performers have been quantitatively identified, these exceptional performers are used as role models or as research subjects for qualitative research in order to find out what makes them exceptional. A key point of this systematic review is that the authors specifically looked for studies that showed a surgeons’ effect for which there was no explanation, ie a residual effect after all known information had been included in the statistical analysis. Therefore, the cause of the surgeons’ effect measured is, by the definition of the research question for this systematic review, not known. This leaves open the question whether the cause is unknowable, or if there are one or more causes that could be identified in future research. If we want to know what makes a good surgeon beyond the well-founded opinions of surgeons12 or those who work with surgeons – and how to train surgeons to be good surgeons – then the first step beyond all the current measures taken to train surgeons could be to reliably identify outstanding surgeons. Consequently, we can find out if their ability can be passed on to others and, if yes, to lift the overall standard of healthcare by transferring their exceptional ability to other surgeons. This is especially so as identifying data is already available in the many datasets consisting of medical records, some of which were accessed in the cohort studies covered here. Exceptional performances may be due to personality characteristics that may be hard or impossible to emulate, or we may find out that the surgeons employ easy to emulate techniques like connecting with patients, or simply have higher expectations of patient outcomes,26 or we may find that they live stress-resistant lives, or that they are rarely exhausted, or any other of a myriad of possibilities. If research that investigates exceptional performers identifies simple techniques or choices made at work, or out of work, that could be emulated relatively easily by many other surgeons, then this could lead to fewer complications and more successful surgeries, and there could be large beneficial effects on healthcare costs and patient health. However, the misuse of identifying supposed underperformers, for example by disciplining or evicting practitioners whose performance appears substandard but who are not statistical outliers or whose performance appears substandard due to a small number of high-risk patients, or due to other confounders like incomplete case-mix or risk score data, is a danger that can cause substantial harm to the surgeons. Further, an old saying is that what gets measured gets managed.105 If more data is available for each surgeon, then this data can be misused to disempower practitioners by adding more and more rules and regulations, and by giving practitioners less opportunity to use their experience and ability. Such data can also be misused in being available online, especially with insufficient explanations of proper usage; or being very much out of date, as is the case for two publicly available databases of surgeons whose data in 2021 only went until 2013106 and 2014.6 Moreover, giving surgeons key performance indicators of patient outcomes could be an unwarranted intrusion into the doctor/patient relationship and lead to surgeons avoiding high-risk patients, as even a few such patients can skew an individual surgeon’s patient outcome statistics, confirmed anecdotally here.66 However, this fact is denied if patients’ risk was accounted for.107 Hence crude performance data should not be published.67

Strengths and Limitations of This Study

The strength of this work lies in the broad search of the literature, the condensed and clear reporting of effect size, and the importance behind the finding that the surgeons’ effect at times has a significant effect size, as big as many non-surgical interventions themselves. The search term strategy used to identify studies was a complex and complete combination of terms that should have identified most of the relevant published studies. Furthermore, the references list of relevant articles and studies citing these articles were screened. This review was not limited by language or by timeframe. On the other hand, there are at least three broad limitations. First, the Newcastle-Ottawa Scale (NOS) was used for quality assessment with the majority of studies scoring between 8–9 (9 being the maximum total); however, the NOS has been critiqued for being “difficult to use and [having] vague decision rules”1 which derived from poor or fair inter-rater reliability between reviewers. However, it is important to note that associations between individual quality domains or overall quality score and effect estimates were not found. Moreover, the NOS has been endorsed by The Cochrane Collaboration2 for its implementation in systematic reviews of non-randomized studies. Second, as all of the review’s studies were conducted in North America and Europe, it is unclear whether the findings can be generalized to other regions, particularly in developing nations. Finally, while the outcome data was heterogeneous and did not enable a meta-analysis, there was also heterogeneity regarding surgical specialty, type of intervention, and type of outcome. Thereby, it is difficult to draw conclusions and synthesize studies with inconsistent outcome measures, and these characteristics have often been found attributable to a lack of a high level of evidence on the specific research subject.

Conclusions and Implications

Even after accounting for surgeons’ experience, patients’ risk and all other known variables there remain sometimes substantial differences in patients’ physical health outcomes between surgeons. Therefore it can matter which surgeon is chosen. At times it is possible to identify high-volume outliers whose performance is well above the average, and it could be worthwhile to study these surgeons to see whether their excellence can be passed on to their peers. It is evident that there are currently no well-established standards on how to assess surgeons as an intervention in their own right, thus systematic approaches to establishing standardized measures are needed, and researching the surgeons’ effect on patients’ physical health is still in its early stages.
PopulationPSurgeons
InterventionI
ComparisonC
OutcomeOSurgeons’ effect on patients’ physical health outcome
  85 in total

1.  Surgeon-, pathologist-, and hospital-level variation in suboptimal lymph node examination after colectomy: Compartmentalizing quality improvement strategies.

Authors:  Adan Z Becerra; Christopher T Aquina; Mariana Berho; Francis P Boscoe; Maria J Schymura; Katia Noyes; John R Monson; Fergal J Fleming
Journal:  Surgery       Date:  2017-01-11       Impact factor: 3.982

2.  THE MOST IMPORTANT FACTOR IN THE TREATMENT OF WAR WOUNDS AND THE MOST IMPORTANT FACTOR IN CIVILIAN SURGERY-THE GOOD SURGEON.

Authors:  G W Crile
Journal:  Ann Surg       Date:  1919-10       Impact factor: 12.969

Review 3.  Proportion of hospital readmissions deemed avoidable: a systematic review.

Authors:  Carl van Walraven; Carol Bennett; Alison Jennings; Peter C Austin; Alan J Forster
Journal:  CMAJ       Date:  2011-03-28       Impact factor: 8.262

4.  A Strategy for Risk-adjusted Ranking of Surgeons and Practices Based on Patient-reported Outcomes After Elective Lumbar Surgery.

Authors:  Ahilan Sivaganesan; Anthony L Asher; Mohamad Bydon; Inamullah Khan; Panagoitis Kerezoudis; Kevin T Foley; Hui Nian; Frank E Harrell; Kristin R Archer; Clinton J Devin
Journal:  Spine (Phila Pa 1976)       Date:  2019-05-01       Impact factor: 3.468

5.  Racial Differences in Complication Risk Following Emergency General Surgery: Who Your Surgeon Is May Matter.

Authors:  Nidhi Rhea Udyavar; Ali Salim; Edward E Cornwell; Zain Hashmi; Stuart R Lipsitz; Joaquim M Havens; Adil H Haider
Journal:  J Surg Res       Date:  2018-11-16       Impact factor: 2.192

6.  Variations in morbidity after radical prostatectomy.

Authors:  Colin B Begg; Elyn R Riedel; Peter B Bach; Michael W Kattan; Deborah Schrag; Joan L Warren; Peter T Scardino
Journal:  N Engl J Med       Date:  2002-04-11       Impact factor: 91.245

7.  Randomized clinical trial comparing 5-year recurrence rate after laparoscopic versus Shouldice repair of primary inguinal hernia.

Authors:  D Arvidsson; F H Berndsen; L G Larsson; C-E Leijonmarck; G Rimbäck; C Rudberg; S Smedberg; L Spangen; A Montgomery
Journal:  Br J Surg       Date:  2005-09       Impact factor: 6.939

8.  A practical approach to a multi-level analysis with a sparse binary outcome within a large surgical trial.

Authors:  Jayne Fountain; James Gallagher; Julia Brown
Journal:  J Eval Clin Pract       Date:  2004-05       Impact factor: 2.431

9.  Variation in Bariatric Surgery Costs and Complication Rates in the Military Health System.

Authors:  William Patrick Luan; Todd C Leroux; Cara Olsen; Douglas Robb; Jonathan S Skinner; Patrick Richard
Journal:  Mil Med       Date:  2020-08-14       Impact factor: 1.437

10.  Retrospective cohort study on patterns of care and outcomes of surgical treatment for lower urinary-genital tract fistula among English National Health Service hospitals between 2000 and 2009.

Authors:  David Cromwell; Paul Hilton
Journal:  BJU Int       Date:  2012-09-03       Impact factor: 5.588

View more
  4 in total

Review 1.  Is There a Doctors' Effect on Patients' Physical Health, Beyond the Intervention and All Known Factors? A Systematic Review.

Authors:  Christoph Schnelle; Justin Clark; Rachel Mascord; Mark A Jones
Journal:  Ther Clin Risk Manag       Date:  2022-07-21       Impact factor: 2.755

2.  Qualitative Study of Medical Doctors on Their Experiences and Opinions of the Characteristics of Exceptionally Good Doctors.

Authors:  Christoph Schnelle; Mark A Jones
Journal:  Adv Med Educ Pract       Date:  2022-07-17

3.  Characteristics of Exceptionally Good Doctors: A Protocol for a Cross-Sectional Survey of Adults.

Authors:  Christoph Schnelle; Mark A Jones
Journal:  Patient Relat Outcome Meas       Date:  2022-08-12

4.  The Doctors' Effect on Patients' Physical Health Outcomes Beyond the Intervention: A Methodological Review.

Authors:  Christoph Schnelle; Mark A Jones
Journal:  Clin Epidemiol       Date:  2022-07-18       Impact factor: 5.814

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.