| Literature DB >> 30873729 |
Gillis Carrigan1, Samuel Whipple1, Michael D Taylor1, Aracelis Z Torres2, Anala Gossai2, Brandon Arnieri1, Melisa Tucker2, Philip P Hofmeister2, Peter Lambert1, Sandra D Griffith2, William B Capra1.
Abstract
PURPOSE: The aim of this study was to assess the impact of missing death data on survival analyses conducted in an oncology EHR-derived database.Entities:
Keywords: lung cancer; missing deaths; overall survival; pharmacoepidemiology; survival analyses
Mesh:
Year: 2019 PMID: 30873729 PMCID: PMC6594237 DOI: 10.1002/pds.4758
Source DB: PubMed Journal: Pharmacoepidemiol Drug Saf ISSN: 1053-8569 Impact factor: 2.890
Figure 1Overview of methods and analysis [Colour figure can be viewed at wileyonlinelibrary.com]
Figure 2Attrition diagram [Colour figure can be viewed at wileyonlinelibrary.com]
Demographic and clinical characteristics of aNSCLC patients
| Clinical or Demographic Characteristic |
A |
B |
C |
D |
|---|---|---|---|---|
| Age categories (binary) at advanced diagnosis | ||||
| <65 y | 1931 (31.4%) | 32 (23.5%) | 196 (30.7%) | 1164 (35.7%) |
| 65+ y | 4226 (68.6%) | 104 (76.5%) | 443 (69.3%) | 2099 (64.3%) |
| Gender | ||||
| Female | 2706 (43.9%) | 69 (50.7%) | 302 (47.3%) | 1727 (52.9%) |
| Male | 3451 (56.1%) | 67 (49.3%) | 336 (52.6%) | 1536 (47.1%) |
| Unknown | 0 (0.00%) | 0 (0.00%) | 1 (0.16%) | 0 (0.00%) |
| Race/Ethnicity | ||||
| White | 3998 (78.9%) | 80 (67.2%) | 340 (65.8%) | 2133 (75.7%) |
| Black or African American | 414 (8.17%) | 8 (6.72%) | 71 (13.7%) | 258 (9.16%) |
| Asian | 90 (1.78%) | 10 (8.40%) | 19 (3.68%) | 116 (4.12%) |
| Other race | 567 (11.2%) | 21 (17.6%) | 87 (16.8%) | 310 (11.0%) |
| Region | ||||
| Northeast | 1679 (27.3%) | 33 (25.0%) | 112 (18.3%) | 944 (29.4%) |
| Midwest | 1198 (19.5%) | 33 (25.0%) | 66 (10.8%) | 589 (18.3%) |
| South | 2394 (39.0%) | 37 (28.0%) | 243 (39.6%) | 1143 (35.6%) |
| West | 868 (14.1%) | 29 (22.0%) | 192 (31.3%) | 536 (16.7%) |
| Validation period (y) | ||||
| 2011 | 333 (5.41%) | 7 (5.15%) | 17 (2.66%) | 9 (0.28%) |
| 2012 | 920 (14.9%) | 15 (11.0%) | 64 (10.0%) | 34 (1.04%) |
| 2013 | 1401 (22.8%) | 34 (25.0%) | 137 (21.4%) | 76 (2.33%) |
| 2014 | 1672 (27.2%) | 42 (30.9%) | 211 (33.0%) | 150 (4.60%) |
| 2015 | 1831 (29.7%) | 38 (27.9%) | 210 (32.9%) | 2994 (91.8%) |
| Histology | ||||
| Non–squamous cell carcinoma | 4124 (67.0%) | 98 (72.1%) | 445 (69.6%) | 2339 (71.7%) |
| Squamous cell carcinoma | 1586 (25.8%) | 31 (22.8%) | 147 (23.0%) | 779 (23.9%) |
| NSCLC histology NOS | 447 (7.26%) | 7 (5.15%) | 47 (7.36%) | 145 (4.44%) |
| Group stage at diagnosis | ||||
| Stage I/II | 685 (11.1%) | 12 (8.82%) | 65 (10.2%) | 563 (17.3%) |
| Stage III/IIIA | 539 (8.75%) | 7 (5.15%) | 62 (9.70%) | 289 (8.86%) |
| Stage IIIB/IV | 4608 (74.8%) | 112 (82.4%) | 478 (74.8%) | 2261 (69.3%) |
| Group stage is not reported | 325 (5.28%) | 5 (3.68%) | 34 (5.32%) | 150 (4.60%) |
| Smoking status | ||||
| History of smoking | 5307 (86.2%) | 113 (83.1%) | 521 (81.5%) | 2704 (82.9%) |
| No history of smoking | 608 (9.87%) | 22 (16.2%) | 92 (14.4%) | 509 (15.6%) |
| Unknown/not documented | 242 (3.93%) | 1 (0.74%) | 26 (4.07%) | 50 (1.53%) |
| ALK status | ||||
| Rearrangement not present | 2390 (89.3%) | 62 (91.2%) | 220 (86.6%) | 1535 (89.1%) |
| Rearrangement present | 54 (2.02%) | 1 (1.47%) | 7 (2.76%) | 73 (4.24%) |
| Unsuccessful/indeterminate test | 194 (7.25%) | 4 (5.88%) | 23 (9.06%) | 97 (5.63%) |
| Unknown | 37 (1.38%) | 1 (1.47%) | 4 (1.57%) | 17 (0.99%) |
| EGFR status | ||||
| Mutation negative | 2489 (84.1%) | 53 (70.7%) | 235 (80.5%) | 1395 (73.9%) |
| Mutation positive | 294 (9.93%) | 17 (22.7%) | 42 (14.4%) | 401 (21.2%) |
| Unsuccessful/indeterminate test | 150 (5.07%) | 4 (5.33%) | 13 (4.45%) | 83 (4.40%) |
| Unknown | 28 (0.95%) | 1 (1.33%) | 2 (0.68%) | 9 (0.48%) |
| ROS1 status | ||||
| Rearrangement not present | 480 (88.7%) | 10 (90.9%) | 46 (83.6%) | 633 (90.8%) |
| Rearrangement present | 12 (2.22%) | 1 (9.09%) | 1 (1.82%) | 7 (1.00%) |
| Unsuccessful/indeterminate test | 46 (8.50%) | 0 (0.00%) | 6 (10.9%) | 50 (7.17%) |
| Unknown | 3 (0.55%) | 0 (0.00%) | 2 (3.64%) | 7 (1.00%) |
| KRAS status | ||||
| Mutation negative | 504 (64.2%) | 10 (55.6%) | 57 (63.3%) | 378 (64.2%) |
| Mutation positive | 248 (31.6%) | 5 (27.8%) | 28 (31.1%) | 188 (31.9%) |
| Unsuccessful/indeterminate test | 33 (4.20%) | 3 (16.7%) | 5 (5.56%) | 23 (3.90%) |
| PDL1 status | ||||
| PD‐L1 negative/not detected | 23 (52.3%) | 0 (0.00%) | 3 (60.0%) | 73 (54.5%) |
| PD‐L1 positive | 21 (47.7%) | 2 (100%) | 2 (40.0%) | 50 (37.3%) |
| Unsuccessful/indeterminate test | 0 (0.00%) | 0 (0.00%) | 0 (0.00%) | 8 (5.97%) |
| No interpretation given in report | 0 (0.00%) | 0 (0.00%) | 0 (0.00%) | 3 (2.24%) |
| Gap between systemic therapy and advanced diagnosis | ||||
| No treatment | 1527 (24.8%) | 30 (22.1%) | 220 (34.4%) | 846 (25.9%) |
| No, ≤90‐d gap | 3854 (62.6%) | 86 (63.2%) | 343 (53.7%) | 1924 (59.0%) |
| Yes, >90‐d gap | 776 (12.6%) | 20 (14.7%) | 76 (11.9%) | 493 (15.1%) |
| Gap between structured activity and advanced diagnosis | ||||
| No activity after advanced diagnosis date | 50 (0.81%) | 0 (0.00%) | 7 (1.10%) | 37 (1.13%) |
| No, ≤90‐d gap | 5443 (88.4%) | 121 (89.0%) | 552 (86.4%) | 2649 (81.2%) |
| Yes, >90‐d gap | 664 (10.8%) | 15 (11.0%) | 80 (12.5%) | 577 (17.7%) |
| Received platinum‐based 1L therapy: Yes | 3497 (56.8%) | 78 (57.4%) | 321 (50.2%) | 1757 (53.8%) |
| Received EGFR‐targeted 1L therapy: Yes | 396 (6.43%) | 14 (10.3%) | 37 (5.79%) | 301 (9.22%) |
| Received other chemotherapy in 1L: Yes | 647 (10.5%) | 11 (8.09%) | 51 (7.98%) | 233 (7.14%) |
| Follow‐up time from advanced diagnosis (mo), median [IQR] | 5.36 [2.04‐11.6] | 5.64 [2.53‐13.6] | 4.37 [1.55‐10.9] | 10.3 [3.95‐22.2] |
Among those tested and based upon most recent successful biomarker test.
PD‐L1 “Unsuccessful/indeterminate test” results also include “PD‐L1 equivocal” results.
Figure 3Censoring patterns of those patients with missing death dates (cell C false‐negative patients) [Colour figure can be viewed at wileyonlinelibrary.com]
Impact of missing deaths on measures of absolute risk (mOS)
| Median Overall Survival and 95% CI, mo | |||||||
|---|---|---|---|---|---|---|---|
| Exposure Group | Simulation 1 (63.4%) | Bias, % | Simulation 2 (72.5%) | Bias, % | EHR‐derived (90.6%) | Bias, % | Gold Standard Data |
| Platinum treatment | 12.3 (11.7‐13.0) | 36.7 | 11.2 (10.7‐11.7) | 24.4 | 9.5 (9.1‐9.9) | 5.6 | 9.0 (8.7‐9.4) |
| Other Chemo | 11.8 (10.1‐13.6) | 53.2 | 10.1 (8.8‐11.9) | 31.2 | 8.2 (7.3‐9.6) | 6.5 | 7.7 (6.9‐8.7) |
| EGFR+ | 20.7 (17.8‐25.9) | 46.8 | 19.4 (15.9‐22.2) | 37.6 | 15.0 (13.4‐18.9) | 6.4 | 14.1 (12.1‐17.3) |
| EGFR‐ | 21.2 (18.9‐25.3) | 43.2 | 19.2 (17.1‐21.4) | 29.7 | 16.0 (14.5‐18.3) | 8.1 | 14.8 (14.0‐17.0) |
| KRAS+ | 33.4 (28.5‐NA) | 40.9 | 29.9 (24.9‐36.2) | 25.8 | 24.3 (22.3‐29.4) | 2.5 | 23.7 (21.7‐27.3) |
| KRAS‐ | 16.3 (15.2‐17.2) | 39.3 | 14.6 (13.9‐15.3) | 24.8 | 12.4 (11.8‐13.1) | 6.0 | 11.7 (11.3‐12.4) |
Figure 4Impact of missing deaths on comparative analyses conducted with EHR‐derived data: current mortality sensitivity vs simulated sensitivities compared with gold standard benchmark [Colour figure can be viewed at wileyonlinelibrary.com]
Figure 5Impact of missing deaths on analyses that use the EHR‐derived data as an external control arm: current mortality sensitivity vs simulated sensitivities compared with gold standard benchmark. For the external control analyses, the experimental arm in all analyses is composed of the gold standard data, and the control arm is composed of the EHR‐derived data only. For simulations 1 and 2, the same approach is taken where the experimental arms are composed of the gold standard data, and the control arms are composed of the EHR‐derived data only (with their respective simulated lower sensitivities). Each analysis is in turn compared with an analysis conducted using the gold standard data only (solid red vertical line in Figure 5 represents the HR using the gold standard with dashed line representing its corresponding 95% CI) [Colour figure can be viewed at wileyonlinelibrary.com]