| Literature DB >> 30568510 |
Sengwee Toh1, Robert Wellman2, R Yates Coley2, Casie Horgan1, Jessica Sturtevant1, Erick Moyneur3, Cheri Janning4, Roy Pardee2, Karen J Coleman5, David Arterburn2, Kathleen McTigue6, Jane Anau2, Andrea J Cook2.
Abstract
PURPOSE: Sharing of detailed individual-level data continues to pose challenges in multi-center studies. This issue can be addressed in part by using analytic methods that require only summary-level information to perform the desired multivariable-adjusted analysis. We examined the feasibility and empirical validity of 1) conducting multivariable-adjusted distributed linear regression and 2) combining distributed linear regression with propensity scores, in a large distributed data network. PATIENTS AND METHODS: We compared percent total weight loss 1-year postsurgery between Roux-en-Y gastric bypass and sleeve gastrectomy procedure among 43,110 patients from 36 health systems in the National Patient-Centered Clinical Research Network. We adjusted for baseline demographic and clinical variables as individual covariates, deciles of propensity scores, or both, in three separate outcome regression models. We used distributed linear regression, a method that requires only summary-level information (specifically, sums of squares and cross products matrix) from sites, to fit the three ordinary least squares linear regression models. A comparison set of analyses that used pooled deidentified individual-level data from sites served as the reference.Entities:
Keywords: distributed data networks; distributed regression; privacy-protecting methods; propensity score
Year: 2018 PMID: 30568510 PMCID: PMC6267363 DOI: 10.2147/CLEP.S178163
Source DB: PubMed Journal: Clin Epidemiol ISSN: 1179-1349 Impact factor: 4.790
Figure 1Computation process of a typical regression analysis.
Note: Numbers are hypothetical and for demonstrative purposes only.
Abbreviations: SSCP, sums of squares and cross products; SE, standard error.
Figure 2Distributed regression in a multicenter study.
Note: Numbers are hypothetical and for demonstrative purposes only.
Abbreviation: SSCP, sums of squares and cross products.
Figure 3Workflow to perform pooled individual-level data analysis and distributed regression analysis.
Baseline characteristics of patients who underwent Roux-en-Y gastric bypass or sleeve gastrectomy procedure from 36 health systems participating in the PCORnet Bariatric Study
| Variable | Roux-en-Y gastric bypass | Sleeve gastrectomy | ||
|---|---|---|---|---|
|
| ||||
| Number | Proportion | Number | Proportion | |
|
| ||||
| Total | 23,963 | 100.0 | 19,147 | 100.0 |
| Age | ||||
| 20–44 | 11,059 | 46.2 | 9,547 | 49.9 |
| 45–64 | 11,728 | 48.9 | 8,648 | 45.2 |
| 65–80 | 1,176 | 4.9 | 952 | 5.0 |
| Male sex | 4,701 | 19.6 | 3,748 | 19.6 |
| Race | ||||
| White | 16,995 | 70.8 | 10,970 | 57.3 |
| Black | 3,468 | 14.5 | 4,618 | 24.1 |
| Other | 3,540 | 14.8 | 3,559 | 18.6 |
| Hispanic ethnicity | 3,813 | 15.9 | 4,624 | 24.2 |
| Year of procedure | ||||
| 2005–2009 | 1,937 | 8.1 | 424 | 2.2 |
| 2010 | 3,487 | 14.6 | 1,195 | 6.2 |
| 2011 | 4,767 | 19.9 | 3,138 | 16.4 |
| 2012 | 6,101 | 25.5 | 4,011 | 21.0 |
| 2013 | 3,850 | 16.1 | 4,771 | 24.9 |
| 2014 | 3,315 | 13.9 | 4,908 | 25.6 |
| 2015 | 506 | 2.1 | 700 | 3.7 |
| Comorbidity score (SD) | −0.01 | 0.9 | −0.04 | 0.9 |
| Baseline weight (SD) | 280.37 | 57.0 | 274.86 | 57.6 |
| Baseline weight proximity | −21.64 | 35.3 | −14.60 | 31.2 |
| Days of hospitalization (SD) | 0.60 | 8.0 | 0.49 | 6.6 |
| Smoking | 2,278 | 9.5 | 1,605 | 8.4 |
| Diagnosis of | ||||
| Anxiety | 5,404 | 22.6 | 4,080 | 21.3 |
| Deep vein thrombosis | 181 | 0.8 | 150 | 0.8 |
| Depression | 8,150 | 34.0 | 5,540 | 28.9 |
| Diabetes | 10,608 | 44.3 | 5,677 | 29.7 |
| Dyslipidemia | 12,663 | 52.8 | 8,868 | 46.3 |
| Eating disorder | 3,833 | 16.0 | 1,168 | 6.1 |
| GERD | 11,196 | 46.7 | 6,917 | 36.1 |
| Hypertension | 15,452 | 64.5 | 10,798 | 56.4 |
| Infertility | 172 | 0.7 | 159 | 0.8 |
| Kidney disease | 2,172 | 9.1 | 1,469 | 8.0 |
| NAFLD | 6,853 | 28.6 | 3,231 | 16.9 |
| Osteoarthritis | 461 | 1.9 | 337 | 1.8 |
| PCOS | 1,281 | 5.4 | 919 | 4.8 |
| Psychosis | 1,240 | 5.2 | 701 | 3.7 |
| Pulmonary embolism | 320 | 1.3 | 225 | 1.2 |
| Sleep apnea | 13,116 | 54.7 | 8,255 | 43.1 |
| Substance use disorder | 522 | 2.2 | 462 | 2.4 |
Notes:
Measured in the year prior to the surgery unless otherwise specified;
number of days between baseline weight measurement and index procedure.
Abbreviations: GERD, gastroesophageal reflux disease; NAFLD, nonalcoholic fatty liver disease; PCORnet, National Patient-Centered Clinical Research Network; PCOS, polycystic ovarian syndrome.
Results from a linear regression model that adjusted for sites and confounders as individual covariates (Model 1) from 36 health systems participating in the PCORnet Bariatric Study
| Variable | Parameter estimate | SE | ||
|---|---|---|---|---|
|
| ||||
| Pooled individual-level data analysis | Distributed regression | Pooled individual-level data analysis | Distributed regression | |
|
| ||||
| Exposure | −0.05312 | −0.05312 | 0.00105 | 0.00105 |
| Age | ||||
| 20–44 | −0.01662 | −0.01662 | 0.00106 | 0.00106 |
| 45–64 | Reference | Reference | Reference | Reference |
| 65–80 | 0.01339 | 0.01339 | 0.00218 | 0.00218 |
| Male sex | 0.02162 | 0.02162 | 0.00133 | 0.00133 |
| Race | ||||
| White | Reference | Reference | Reference | Reference |
| Black | 0.02873 | 0.02873 | 0.00130 | 0.00130 |
| Other | 0.00883 | 0.00883 | 0.00153 | 0.00153 |
| Hispanic ethnicity | 0.00227 | 0.00227 | 0.00147 | 0.00147 |
| Year of procedure | ||||
| 2005–2009 | −0.00319 | −0.00319 | 0.00218 | 0.00218 |
| 2010 | −0.00301 | −0.00301 | 0.00169 | 0.00169 |
| 2011 | −0.00326 | −0.00326 | 0.00144 | 0.00144 |
| 2012 | Reference | Reference | Reference | Reference |
| 2013 | 0.00384 | 0.00384 | 0.00141 | 0.00141 |
| 2014 | 0.00569 | 0.00569 | 0.00145 | 0.00145 |
| 2015 | 0.03664 | 0.03664 | 0.00288 | 0.00288 |
| Comorbidity score | 0.00576 | 0.00576 | 0.00069243 | 0.00069243 |
| Baseline weight | −0.00025096 | −0.00025096 | 0.00000921 | 0.00000921 |
| Baseline weight proximity | 0.00012330 | 0.00012330 | 0.00001430 | 0.00001430 |
| Smoking | −0.00657 | −0.00657 | 0.00163 | 0.00163 |
| Days of hospitalization | 0.00017944 | 0.00017944 | 0.00006129 | 0.00006129 |
| Diagnosis of | ||||
| Anxiety | 0.00036721 | 0.00036721 | 0.00119 | 0.00119 |
| Deep vein thrombosis | 0.00245 | 0.00245 | 0.00530 | 0.00530 |
| Depression | 0.00412 | 0.00412 | 0.00107 | 0.00107 |
| Diabetes | 0.01914 | 0.01914 | 0.00107 | 0.00107 |
| Dyslipidemia | 0.00169 | 0.00169 | 0.00103 | 0.00103 |
| Eating disorder | −0.00247 | −0.00247 | 0.00236 | 0.00236 |
| GERD | −0.00132 | −0.00132 | 0.00095329 | 0.00095329 |
| Hypertension | 0.01454 | 0.01454 | 0.00124 | 0.00124 |
| Infertility | 0.00842 | 0.00842 | 0.00521 | 0.00521 |
| Kidney disease | 0.00059350 | 0.00059350 | 0.00176 | 0.00176 |
| NAFLD | −0.00652 | −0.00652 | 0.00150 | 0.00150 |
| Osteoarthritis | −0.00253 | −0.00253 | 0.00337 | 0.00337 |
| PCOS | 0.00118 | 0.00118 | 0.00212 | 0.00212 |
| Psychosis | 0.00006296 | 0.00006296 | 0.00226 | 0.00226 |
| Pulmonary embolism | 0.00722 | 0.00722 | 0.00414 | 0.00414 |
| Sleep apnea | −0.00153 | −0.00153 | 0.00098117 | 0.00098117 |
| Substance use disorder | −0.00729 | −0.00729 | 0.00310 | 0.00310 |
Notes:
Also adjusted for sites (35 indicator variables; results not shown for brevity);
Roux-en-Y gastric bypass vs sleeve gastrectomy;
measured in the year prior to the surgery;
modeled as a continuous variable;
number of days between baseline weight measurement and index procedure.
Abbreviations: GERD, gastroesophageal reflux disease; NAFLD, nonalcoholic fatty liver disease; PCORnet, National Patient-Centered Clinical Research Network; PCOS, polycystic ovarian syndrome; SE, standard error.
Results from a linear regression model that adjusted for sites and confounders as propensity score deciles (Model 2) from 36 health systems participating in the PCORnet Bariatric Study
| Variable | Parameter estimate | SE | ||
|---|---|---|---|---|
|
| ||||
| Pooled individual-level data analysis | Distributed regression | Pooled individual-level data analysis | Distributed regression | |
|
| ||||
| Exposure | −0.05470 | −0.05470 | 0.00113 | 0.00113 |
| PS stratum 1 | Reference | Reference | Reference | Reference |
| PS stratum 2 | −0.00754 | −0.00754 | 0.00209 | 0.00209 |
| PS stratum 3 | −0.00671 | −0.00671 | 0.00210 | 0.00210 |
| PS stratum 4 | −0.00717 | −0.00717 | 0.00211 | 0.00211 |
| PS stratum 5 | 0.00034218 | 0.00034218 | 0.00212 | 0.00212 |
| PS stratum 6 | −0.00583 | −0.00583 | 0.00213 | 0.00213 |
| PS stratum 7 | −0.00135 | −0.00135 | 0.00214 | 0.00214 |
| PS stratum 8 | −0.00435 | −0.00435 | 0.00216 | 0.00216 |
| PS stratum 9 | −0.00523 | −0.00523 | 0.00218 | 0.00218 |
| PS stratum 10 | −0.00812 | −0.00812 | 0.00222 | 0.00222 |
Notes:
Also adjusted for sites (35 indicator variables; results not shown for brevity);
Roux-en-Y gastric bypass vs sleeve gastrectomy.
Abbreviations: PCORnet, National Patient-Centered Clinical Research Network; PS, propensity score; SE, standard error.
Results from a linear regression model that adjusted for sites and confounders as both individual covariates and propensity score deciles (Model 3) from 36 health systems participating in the PCORnet Bariatric Study
| Variable | Parameter estimate | SE | ||
|---|---|---|---|---|
|
| ||||
| Pooled individual-level data analysis | Distributed regression | Pooled individual-level data analysis | Distributed regression | |
|
| ||||
| Exposure | −0.05355 | −0.05355 | 0.00108 | 0.00108 |
| Age | ||||
| 20–44 | −0.01668 | −0.01668 | 0.00106 | 0.00106 |
| 45–64 | Reference | Reference | Reference | Reference |
| 65–80 | 0.01364 | 0.01364 | 0.00218 | 0.00218 |
| Male sex | 0.02189 | 0.02189 | 0.00134 | 0.00134 |
| Race | ||||
| White | Reference | Reference | Reference | Reference |
| Black | 0.02917 | 0.02917 | 0.00132 | 0.00132 |
| Other | 0.00877 | 0.00877 | 0.00153 | 0.00153 |
| Hispanic ethnicity | 0.00218 | 0.00218 | 0.00148 | 0.00148 |
| Year of procedure | ||||
| 2005–2009 | −0.00372 | −0.00372 | 0.00241 | 0.00241 |
| 2010 | −0.00352 | −0.00352 | 0.00187 | 0.00187 |
| 2011 | −0.00348 | −0.00348 | 0.00146 | 0.00146 |
| 2012 | Reference | Reference | Reference | Reference |
| 2013 | 0.00453 | 0.00453 | 0.00146 | 0.00146 |
| 2014 | 0.00683 | 0.00683 | 0.00157 | 0.00157 |
| 2015 | 0.03793 | 0.03793 | 0.00295 | 0.00295 |
| Comorbidity score | 0.00585 | 0.00585 | 0.00069368 | 0.00069368 |
| Baseline weight | −0.00025150 | −0.00025150 | 0.00000922 | 0.00000922 |
| Baseline weight proximity | 0.00012628 | 0.00012628 | 0.00001507 | 0.00001507 |
| Smoking | −0.00653 | −0.00653 | 0.00163 | 0.00163 |
| Days of hospitalization | 0.00018276 | 0.00018276 | 0.00006130 | 0.00006130 |
| Diagnosis of | ||||
| Anxiety | 0.00040061 | 0.00040061 | 0.00119 | 0.00119 |
| Deep vein thrombosis | 0.00272 | 0.00272 | 0.00530 | 0.00530 |
| Depression | 0.00403 | 0.00403 | 0.00107 | 0.00107 |
| Diabetes | 0.01859 | 0.01859 | 0.00115 | 0.00115 |
| Dyslipidemia | 0.00164 | 0.00164 | 0.00103 | 0.00103 |
| Eating disorder | −0.00226 | −0.00226 | 0.00236 | 0.00236 |
| GERD | −0.00164 | −0.00164 | 0.00097208 | 0.00097208 |
| Hypertension | 0.01451 | 0.01451 | 0.00124 | 0.00124 |
| Infertility | 0.00880 | 0.00880 | 0.00521 | 0.00521 |
| Kidney disease | 0.00055973 | 0.00055973 | 0.00176 | 0.00176 |
| NAFLD | −0.00680 | −0.00680 | 0.00150 | 0.00150 |
| Osteoarthritis | −0.00244 | −0.00244 | 0.00337 | 0.00337 |
| PCOS | 0.00127 | 0.00127 | 0.00212 | 0.00212 |
| Psychosis | 0.00003263 | 0.00003263 | 0.00226 | 0.00226 |
| Pulmonary embolism | 0.00757 | 0.00757 | 0.00415 | 0.00415 |
| Sleep apnea | −0.00177 | −0.00177 | 0.00098965 | 0.00098965 |
| Substance use disorder | −0.00745 | −0.00745 | 0.00310 | 0.00310 |
| PS stratum 1 | Reference | Reference | Reference | Reference |
| PS stratum 2 | 0.00180 | 0.00180 | 0.00204 | 0.00204 |
| PS stratum 3 | 0.00362 | 0.00362 | 0.00207 | 0.00207 |
| PS stratum 4 | 0.00058495 | 0.00058495 | 0.00211 | 0.00211 |
| PS stratum 5 | 0.00731 | 0.00731 | 0.00217 | 0.00217 |
| PS stratum 6 | 0.00056111 | 0.00056111 | 0.00223 | 0.00223 |
| PS stratum 7 | 0.00508 | 0.00508 | 0.00231 | 0.00231 |
| PS stratum 8 | 0.00383 | 0.00383 | 0.00242 | 0.00242 |
| PS stratum 9 | 0.00516 | 0.00516 | 0.00257 | 0.00257 |
| PS stratum 10 | 0.00336 | 0.00336 | 0.00285 | 0.00285 |
Notes:
Also adjusted for sites (35 indicator variables; results not shown for brevity);
Roux-en-Y gastric bypass vs sleeve gastrectomy;
measured in the year prior to the surgery;
modeled as a continuous variable;
number of days between baseline weight measurement and index procedure.
Abbreviations: GERD, gastroesophageal reflux disease; NAFLD, nonalcoholic fatty liver disease; PCORnet, National Patient-Centered Clinical Research Network; PCOS, polycystic ovarian syndrome; PS, propensity score; SE, standard error.