| Literature DB >> 33769477 |
Ryan W Stidham1,2,3, Yumu Liu4, Binu Enchakalody5, Tony Van6, Venkataramu Krishnamurthy7, Grace L Su1,6, Ji Zhu3,4, Akbar K Waljee1,2,3,6.
Abstract
BACKGROUND: Although imaging, endoscopy, and inflammatory biomarkers are associated with future Crohn disease (CD) outcomes, common laboratory studies may also provide prognostic opportunities. We evaluated machine learning models incorporating routinely collected laboratory studies to predict surgical outcomes in U.S. Veterans with CD.Entities:
Keywords: Crohn disease; Lasso; complications; prediction models
Mesh:
Year: 2021 PMID: 33769477 PMCID: PMC8314116 DOI: 10.1093/ibd/izab035
Source DB: PubMed Journal: Inflamm Bowel Dis ISSN: 1078-0998 Impact factor: 5.325
FIGURE 1.Illustration of prediction window at 2 separate time points used to model future CD-related surgery. The prediction window is composed of the time from the first clinic visit until 1 year before the surgery outcome or the date of the last visit. Two distinct time points were used to model outcome prediction: the last prediction day (1 year before surgery or last visit) and a random time point within the prediction window. In addition to single time points, the historical summary statistics of laboratory values before time points were also calculated and included in the models of surgery.
Patient Characteristics Summary With Selected Laboratory Values
| Variables | No Surgery (n = 2553) | Surgery (n = 256) | ||
|---|---|---|---|---|
| Sex (M) | 2384 (93.4%) | 238 (93.0%) | ||
| Race | White | 1977 (77.4%) | 199 (77.7%) | |
| Black | 225 (8.8%) | 27 (10.5%) | ||
| Asian/Pacific islander | 34 (1.3%) | 2 (0.78%) | ||
| Unknown | 317 (12.4%) | 28 (10.9%) | ||
| Ethnicity | Hispanic | 15 (0.6%) | 1 (0.4%) | |
| Non-Hispanic | 2251 (88.1%) | 239 (93.4%) | ||
| Unknown | 287 (11.2%) | 16 (6.3%) | ||
| Medication History | 5-aminosalicylates | 1649 (64.6%) | 181 (70.7%) | |
| Steroids | 1023 (40.0%) | 149 (58.2%) | ||
| Anti-TNF | 244 (9.6%) | 40 (15.6%) | ||
| Thiopurine or methotrexate | 702 (27.5%) | 116 (45.3%) | ||
| Combination therapy | 112 (4.4%) | 17 (6.6%) | ||
| Laboratory | HCT | Last measurement | 40.38 (5.12) | 40.12 (4.75) |
| Historical mean | 40.43 (4.05) | 39.81(4.19) | ||
| WBC count | Last measurement | 7.58 (2.81) | 8.25 (3.07) | |
| Historical mean | 7.68 (2.10) | 8.46 (2.51) | ||
| PLATE | Last measurement | 226.0 (184.0-277.0) | 257.0 (207.8-330.2) | |
| Historical mean | 239.8 (199.5-284.8) | 277.1 (231.7-338.0) | ||
| ALB | Last measurement | 3.71 (0.55) | 3.68 (0.58) | |
| Historical mean | 3.86 (0.41) | 3.71 (0.50) | ||
| BUN | Last measurement | 16.0 (12.0-21.0) | 13.0 (10.0-16.0) | |
| Historical mean | 15.3 (12.4-19.4) | 12.8 (9.8-16.0) | ||
| MCHC | Last measurement | 33.19 (1.26) | 33.66 (1.15) | |
| Historical mean | 33.50 (0.90) | 33.61 (0.94) | ||
| Days from last visit to Surgery | N/A | 411.5 (383.0-464.2) |
The summary statistics are presented as count (%), mean (SD), or median (first quartile to third quartile).
ALB indicates albumin; BUN, blood urea nitrogen; HCT, hematocrit; MCHC, mean corpuscular hemoglobin concentration; PLATE, platelets; WBC, white blood cell.
Comparison of Models Containing Different Data Elements for Predicting Future Surgery in CD
| Model | Sensitivity | Specificity | AuROC | Brier | AuROC* | Brier* |
|---|---|---|---|---|---|---|
| 1. Best model demographic + medication + last laboratory measurement + historical laboratory summary | 0.735 (0.013) | 0.726 (0.013) | 0.782 (0.0019) | 0.0451 (0.0002) | 0.775 (0.0447) | 0.0465 (0.0018) |
| 2. Demographic + medication + last laboratory measurement | 0.722 (0.011) | 0.714 (0.010) | 0.761 (0.0014) | 0.0455 (0.0002) | 0.761 (0.0446) | 0.0466 (0.0018) |
| 3. Demographic + medication | 0.631 (0.103) | 0.702 (0.012) | 0.714 (0.0016) | 0.0473 (0.0002) | 0.715 (0.0473) | 0.0482 (0.0012) |
| 4. Last laboratory measurement alone | 0.690 (0.009) | 0.670 (0.009) | 0.691 (0.0021) | 0.0477 (0.0002) | 0.673 (0.0494) | 0.0489 (0.0010) |
| 5. Random-forest method for all variables | 0.673 (0.017) | 0.652 (0.016) | 0.686 (0.0049) | 0.0488 (0.0002) | 0.675 (0.0526) | 0.0500 (0.0016) |
The first 4 columns of the table are based on cross-validation, reporting mean measures collected over 30 constructed datasets.
*The last 2 columns of the table are based on random splitting, reporting the mean value over 50 replications from a randomly chosen dataset.
FIGURE 2.Variable importance plot for time-dependent variables used to predict future surgery in CD. Anti-TNF use was associated with a lower probability of surgery within 1 year. Combination anti-TNF and immunomodulator use was also associated with avoidance of surgery at 1 year. Corticosteroid use had a strong influence on model predictions of surgery in 1 year. Immunomodulator monotherapy and 5-aminosalicylate use had little effect on the likelihood of surgery. High platelet values, high mean cell hemoglobin concentration values, low albumin values, and low blood urea nitrogen values were influential model components predicting future surgery. * First principal component.