| Literature DB >> 25896647 |
Adler Perotte1, Rajesh Ranganath2, Jamie S Hirsch3, David Blei4, Noémie Elhadad5.
Abstract
BACKGROUND: As adoption of electronic health records continues to increase, there is an opportunity to incorporate clinical documentation as well as laboratory values and demographics into risk prediction modeling.Entities:
Keywords: electronic health records; risk prediction; survival analysis; topic modeling
Mesh:
Year: 2015 PMID: 25896647 PMCID: PMC4482276 DOI: 10.1093/jamia/ocv024
Source DB: PubMed Journal: J Am Med Inform Assoc ISSN: 1067-5027 Impact factor: 4.497
Baseline statistics and independent variables for the five studied models
| Independent Variables | Development cohort ( | Validation cohort ( | eGFR model | LKF model | TKF model | LTKF model | RLT model |
|---|---|---|---|---|---|---|---|
| Age | 66.95 ± 11.43 | 67.17 ± 11.69 | X | X | X | X | X |
| Gender (M/F) | 912 (35%)/1697 (65%) | 107 (36%)/192 (64%) | X | X | X | X | X |
| eGFR | 50.34 ± 8.47 | 50.48 ± 7.60 | X | ||||
| Laboratory Test-based factors and biases (24 variables) | X | X | |||||
| Text-based factors and biases (60 variables) | X | X | |||||
| 25OH Vitamin D | 19.18 ± 7.25 | 16.83 ± 5.11 | X | ||||
| Bicarbonate | 25.20 ± 3.06 | 25.23 ± 3.22 | X | ||||
| BUN | 21.45 ± 8.06 | 21.04 ± 7.75 | X | ||||
| Calcium | 9.39 ± 0.42 | 9.37 ± 0.40 | X | ||||
| Chloride | 102.83 ± 3.50 | 102.64 ± 3.25 | X | ||||
| Creatinine | 1.15 ± 0.34 | 1.10 ± 0.36 | X | ||||
| C-reactive protein | 7.70 ± 8.36 | 6.13 ± 9.86 | X | ||||
| Hematocrit | 37.90 ± 4.80 | 37.68 ± 4.67 | X | ||||
| Hemoglobin | 12.35 ± 1.84 | 12.06 ± 1.66 | X | ||||
| (K) Potassium | 4.29 ± 0.45 | 4.27 ± 0.41 | X | ||||
| Magnesium | 1.85 ± 0.25 | 1.86 ± 0.25 | X | ||||
| (Na) Sodium | 138.97 ± 2.81 | 138.81 ± 2.71 | X | ||||
| Phosphate | 3.41 ± 0.67 | 3.39 ± 0.63 | X | ||||
| Protein | 7.29 ± 0.66 | 7.43 ± 0.69 | X | ||||
| Parathyroid Hormone | 140.69 ± 83.64 | 141.98 ± 82.44 | X | ||||
| Triglyceride | 147.85 ± 72.60 | 154.47 ± 78.15 | X | ||||
| Urine Protein/creatinine | 32.51 ± 30.56 | 31.21 ± 29.58 | X | ||||
| Urine protein qualitative | 2.12 ± 1.02 | 2.12 ± 0.93 | X | ||||
| Uric Acid | 6.35 ± 2.02 | 6.14 ± 1.98 | X |
eGFR = estimated glomerular filtration rate; LKF = Laboratory Test Kalman Filter; TKF = Text Kalman Filter; LTKF = Laboratory Test and Text Kalman Filter; RLT = recent laboratory tests.
*eGFR was calculated using the CKD-EPI equation.
Topics associated with increased risk of progression. (topic titles shown in parentheses were assigned manually once the topics were generated, and are presented as a way to label the topics)
| Topic 3 (heart failure) | Topic 32 (diabetes) | Topic 29 (dialysis) |
|---|---|---|
| Lasix | Units | q15 |
| Volume | Insulin | Dialysis |
| Edema | Subcutaneous | Fistula |
| Heart | Lantus | Volume |
| Failure | Glucose | Bid |
| Worsening | Diabetes | Lasix |
| Diuresis | Times | Placement |
| Severe | 70/30 | Improved |
| Diastolic | Diabetic | Heparin |
| Overload | Days | Examined |
Laboratory variables were chosen for their relatedness to CKD and associated comorbidities by a board certified nephrologist (see Table 1).
Dependent Variable
The outcome of interest was defined as progression to CKD stage IV (eGFR consistently <30 ml/min/173 m2 for ≥3 months).
Topics associated with decreased risk of progression. (Topic titles shown in parentheses were assigned manually once the topics were generated, and are presented as a way to label the topics)
| Topic 33 (family history) | Topic 35 (health maintenance) | Topic 41 (non-specific) | Topic 43 (gynecological) | Topic 45 (asthma) |
|---|---|---|---|---|
| Died | Died | History | Breast | Albuterol |
| Age | Flu | Pressure | Vaginal | Asthma |
| Years | Visit | Rate | Mammo | Inhaled |
| Mother | Fasting | Count | Cancer | Lung |
| Father | Colonoscopy | Three | hx | Obstructive |
| Brother | Year | Revealed | pap | Wheezing |
| Sister | Shot | Times | nl | Advair |
| Worked | Vaccine | Shortness | Age | Pulm |
| Children | wnl | Discharged | Will | Restrictive |
| Deceased | Check | Creatinine | Endometrial | Puffs |
Figure 1:Log hazard ratios for the eGFR Model. Progression of CKD from stage III to stage IV in this model was associated with low eGFR (P < .001), male gender (P = .0051), and younger age (P < .001). (* indicates P < .05, ** indicates P < .01, *** indicates P < .001).
Figure 2.Log hazard ratios for the Recent Laboratory Tests Model. Progression of CKD from stage III to stage IV in this model was associated with elevated levels of BUN (P < .001), creatinine (P < .001), triglycerides (P = .0061), and urine protein (quantitative, P = .019; qualitative, P < .001) as well as decreased levels of hematocrit (P = .0043), hemoglobin (P = .0034), calcium (P = .033), and serum protein (P = .0017). (* indicates P < .05, ** indicates P < .01, *** indicates P < .001).
Concordances and concordance comparisons for the 5 studied models: Laboratory Test and Text Kalman Filter (LTKF), Laboratory Test Kalman Filter (LKF), Text Kalman Filter (TKF), Recent Laboratory Tests (RLT), and estimated Glomerular Filtration Rate (eGFR)
| Δ LTKF | Δ LKF | Δ TKF | Δ RLT | Δ eGFR | Concordance | |
|---|---|---|---|---|---|---|
| LTKF | *** |
| *** | 0.849 | ||
| LKF | *** | ** | 0.836 | |||
| TKF | *** | 0.733 | ||||
| RLT | ** | 0.819 | ||||
| eGFR | 0.779 |
*P < .05, **P < .01, ***P < .001.