| Literature DB >> 31346542 |
Justin Bohn1, Sebastian Schneeweiss1, Robert J Glynn1, Sengwee Toh2, Richard Wyss1, Rishi Desai1, Joshua J Gagne1.
Abstract
PURPOSE: Little is known about how disease risk score (DRS) development should proceed under different pharmacoepidemiologic follow-up strategies. In an analysis of dabigatran vs. warfarin and risk of major bleeding, we compared the results of DRS adjustment when models were developed under "intention-to-treat" (ITT) and "as-treated" (AT) approaches.Entities:
Keywords: Health insurance claims data; comparative effectiveness research; confounding control; disease risk scores; prognostic scores
Year: 2019 PMID: 31346542 PMCID: PMC6640656 DOI: 10.5334/egems.254
Source DB: PubMed Journal: EGEMS (Wash DC) ISSN: 2327-9214
Figure 1Follow-up of patients in the study population. Schematic representation of follow-up in the study population, showing historical warfarin initiators A and B, concurrent warfarin initiators C and D, and concurrent dabigatran initiators E and F. Arrows indicate the duration of observation, becoming dashed when a patient discontinues their index treatment. Events are represented by an “X”, and censoring by a “|”. The disease risk score model is fit among patients A and B only. Note that under “as-treated” follow-up observation stops at discontinuation, and thus only patients B, D, and F have events that are observed.
Incidence of major bleeding under different follow-up approaches, by treatment initiation and period.
| Historical cohort | Concurrent cohort | |||||
|---|---|---|---|---|---|---|
| Warfarin | Warfarin | Dabigatran | ||||
| Follow-up | Events | IR | Events | IR | Events | IR |
| AT | 1,059 | 61 | 1,438 | 78 | 430 | 48 |
| ITT (6-Month) | 1,167 | 65 | 1,774 | 79 | 457 | 47 |
| ITT (1-Year) | 1,933 | 57 | 2,505 | 70 | 690 | 43 |
| ITT (2-Year) | 3,075 | 51 | 2,930 | 64 | 803 | 40 |
| ITT(All) | 3,936 | 48 | 2,944 | 64 | 803 | 40 |
Abbreviations: IR, incidence rate per 1,000 person-years; AT, as-treated; ITT, intention-to-treat.
Selected baseline characteristics by treatment initiation and period.
| Historical cohort | Concurrent cohort | ||
|---|---|---|---|
| Warfarin (N = 39,209) | Warfarin (N = 56,456) | Dabigatran (N = 22,809) | |
| Age, mean (sd) | 69.1 (12.4) | 71.1 (12.1) | 67.3 (12.2) |
| Male sex, n (%) | 23,747 (61) | 34,227 (61) | 14,600 (64) |
| Census region, n (%) | |||
| Northeast | 6,659 (17) | 7,974 (14) | 3,495 (15) |
| South | 11,720 (30) | 14,361 (25) | 7,414 (33) |
| Midwest | 13,076 (33) | 12,992 (23) | 4,555 (20) |
| West | 7,754 (20) | 21129 (37) | 7,345 (32) |
| HASBLED Score, mean (sd) | 2.27 (1.01) | 2.48 (1.10) | 2.18 (1.02) |
| CHA2DS2-VASc score, mean (sd) | 3.22 (1.59) | 3.56 (1.67) | 2.91 (1.57) |
| Renal dysfunction, n (%) | 4,775 (12) | 10,385 (18) | 2,146 (9) |
| Coronary artery disease, n (%) | 12,342 (32) | 19,717 (35) | 6,768 (30) |
| Upper GI bleed, n (%) | 243 (0.6) | 345 (0.6) | 62 (0.3) |
Discrimination and prognostic balance of disease risk score models, by period and treatment status.
| C-Index | Pseudo-bias | |||||||
|---|---|---|---|---|---|---|---|---|
| Historical cohort | Concurrent cohort | |||||||
| Follow-up | Warfarin | Warfarin | Dabigatran | All | Optimism | Estimate | 95% | CI |
| AT | 0.678 | 0.622 | 0.660 | 0.638 | 0.056 | –6% | –16% | 5% |
| ITT (6-Month) | 0.677 | 0.630 | 0.676 | 0.649 | 0.047 | –6% | –15% | 5% |
| ITT (1-Year) | 0.666 | 0.624 | 0.674 | 0.643 | 0.042 | –6% | –16% | 4% |
| ITT (2-Year) | 0.658 | 0.621 | 0.677 | 0.641 | 0.037 | –7% | –17% | 4% |
| ITT (All) | 0.653 | 0.619 | 0.683 | 0.641 | 0.034 | –6% | –15% | 4% |
Abbreviations: DRS, disease risk score; AT, as-treated; ITT, intention-to-treat; HR, hazard ratio; CI, confidence interval.
Footnotes:
a) DRS model optimism is the difference between the C-Index for the model in the historical cohort and the C-Index for the model among warfarin initiators in the concurrent cohort. All C-indices were estimated within the AT follow-up experience.
b) The dry run “pseudo-bias” is the ability of a DRS to induce prognostic balance. These are presented on the scale of percent bias in the hazard ratio. Values closer to 0% indicate better prognostic balance.
Figure 2Calibration of DRS models at 180 days. Calibration plots showing Kaplan-Meier predicted risk vs. DRS model predicted risk within deciles of the DRS. Calibration is shown at 180 days among historical warfarin initiators, concurrent warfarin initiators, concurrent dabigatran initiators, and the entire concurrent population in columns 1–3, and for the five DRS models (AT, 6-month ITT, 1-year ITT, 2-year ITT, and unlimited ITT) in rows 1–5, respectively. Each point represents a sample decile of the DRS. Dashed lines represent a slope of 1, or exact calibration. P-values and chi-squared test statistics for the Greenwood-Nam-D’Agostino test of the null hypothesis of “no lack of fit” are given at the top of each pane.
Disease risk score discrimination and prognostic balance in the resampling study.
| Follow-up | Sizea | Eventsb | C-Index | Pseudo biasg | |||||
|---|---|---|---|---|---|---|---|---|---|
| Historical cohort | Concurrent cohort | ||||||||
| Warfarinc | Warfarind | Dabigatrane | Allf | Estimate | 95% Empirical CI | ||||
| AT (All) | 10% | 105 | 0.787 | 0.524 | 0.538 | 0.531 | –19% | –28% | –9% |
| 25% | 265 | 0.729 | 0.568 | 0.590 | 0.579 | –15% | –25% | –4% | |
| 50% | 529 | 0.707 | 0.593 | 0.621 | 0.606 | –13% | –23% | –1% | |
| 75% | 794 | 0.698 | 0.602 | 0.633 | 0.616 | –11% | –22% | 1% | |
| 90% | 954 | 0.695 | 0.605 | 0.636 | 0.619 | –11% | –21% | 2% | |
| ITT (6m) | 10% | 116 | 0.765 | 0.540 | 0.556 | 0.547 | –18% | –27% | –8% |
| 25% | 292 | 0.718 | 0.583 | 0.612 | 0.596 | –14% | –24% | –3% | |
| 50% | 584 | 0.701 | 0.606 | 0.640 | 0.621 | –11% | –22% | 0% | |
| 75% | 874 | 0.693 | 0.613 | 0.651 | 0.629 | –9% | –21% | 2% | |
| 90% | 1,050 | 0.690 | 0.616 | 0.656 | 0.633 | –9% | –20% | 3% | |
| ITT (1y) | 10% | 193 | 0.736 | 0.552 | 0.576 | 0.562 | –17% | –26% | –6% |
| 25% | 483 | 0.698 | 0.590 | 0.626 | 0.605 | –13% | –23% | –1% | |
| 50% | 965 | 0.684 | 0.607 | 0.649 | 0.624 | –10% | –22% | 1% | |
| 75% | 1,450 | 0.678 | 0.613 | 0.657 | 0.630 | –9% | –21% | 3% | |
| 90% | 1,739 | 0.676 | 0.615 | 0.660 | 0.632 | –9% | –19% | 3% | |
| ITT (2y) | 10% | 307 | 0.708 | 0.568 | 0.600 | 0.580 | –16% | –26% | –5% |
| 25% | 769 | 0.680 | 0.597 | 0.640 | 0.613 | –12% | –22% | –1% | |
| 50% | 1,537 | 0.671 | 0.609 | 0.658 | 0.627 | –10% | –20% | 1% | |
| 75% | 2,305 | 0.667 | 0.613 | 0.665 | 0.632 | –9% | –20% | 3% | |
| 90% | 2,768 | 0.665 | 0.614 | 0.667 | 0.633 | –9% | –19% | 3% | |
| ITT (All) | 10% | 393 | 0.694 | 0.573 | 0.612 | 0.587 | –15% | –25% | –3% |
| 25% | 984 | 0.671 | 0.598 | 0.649 | 0.616 | –11% | –22% | 0% | |
| 50% | 1,967 | 0.664 | 0.608 | 0.666 | 0.629 | –9% | –20% | 2% | |
| 75% | 2,951 | 0.660 | 0.612 | 0.672 | 0.633 | –8% | –19% | 3% | |
| 90% | 3,543 | 0.659 | 0.613 | 0.673 | 0.634 | –8% | –18% | 4% | |
a The size of the resampled historical cohorts is given as a percentage of the full historical cohort sample size (N = 39,209 warfarin initiators).
b Number of events refers to the number of events corresponding to the follow-up strategy used for disease risk score development (column 1). This has been averaged across the 1,000 resampled cohorts of each size.
c C-index among the AT experience of warfarin initiators in the resampled historical cohorts. This has been averaged across the 1,000 resampled cohorts of each size.
d C-index among the AT experience of warfarin initiators in the concurrent cohort. This has been averaged across the 1,000 DRS models fit in the resampled cohorts of each size.
e C-index among the AT experience of dabigatran initiators in the concurrent cohort. This has been averaged across the 1,000 DRS models fit in the resampled cohorts of each size.
f C-index among the AT experience of all initiators in the concurrent cohort. This has been averaged across the 1,000 DRS models fit in the resampled cohorts of each size.
g Pseudo-bias estimated with the dry-run analysis is a measure of prognostic balance. Here it is presented in terms of the percent bias in the hazard ratio. This has been averaged across the 1,000 DRS models fit in the resampled historical cohorts of each size. Confidence intervals are 95% empirical intervals based on the 2.5th and 97.5th percentile.
Figure 3AT hazard ratios stratified by deciles of five DRSs, averaged across 1,000 models fit to resampled cohorts from 10%–100% sample size. Estimated as-treated (AT) hazard ratio (HR) vs. sample size, averaged across estimates stratified on deciles of DRS from models fit to 1,000 resampled cohorts at 10%, 25%, 50%, 75%, 90%, and 100% of the full historical cohort sample size. For reference, the crude AT HR and PS-stratified AT HR are presented below and above the plotted curves, respectively.