| Literature DB >> 36006985 |
Heidi Fischer1, Lei Qian1, Jacek Skarbinski2,3, Katia J Bruxvoort1,4, Rong Wei1, Kris Li1, Laura B Amsden2, Mariah S Wood2, Abigail Eaton2, Brigitte C Spence1, Sally F Shaw1, Sara Y Tartof1,5.
Abstract
OBJECTIVE: Though targeted testing for latent tuberculosis infection ("LTBI") for persons born in countries with high tuberculosis incidence ("HTBIC") is recommended in health care settings, this information is not routinely recorded in the electronic health record ("EHR"). We develop and validate a prediction model for birth in a HTBIC using EHR data.Entities:
Mesh:
Year: 2022 PMID: 36006985 PMCID: PMC9409495 DOI: 10.1371/journal.pone.0273363
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.752
Characteristics of KPSC patient population by country of birth, as documented in the electronic health record, January 1, 2008—December 31st, 2019.
| Characteristic | Born in-HTBIC | Not Born in HTBIC | Total | Missing |
|---|---|---|---|---|
| N = 496,257 | N = 1,540,143 | N = 2,036,400 | N = 5,446,017 | |
|
| ||||
| No | 308,353 (62.1) | 1,504,729 (97.7) | 1,813,082 (89) | 4,631,579 (85.0) |
| Yes | 183,670 (37.0) | 34,522 (2.2) | 218,192 (10.7) | 620,004 (11.4) |
| Unknown | 4,234 (0.9) | 892 (0.1) | 5126 (0.3) | 194434 (3.6) |
|
| ||||
| No | 345,387 (70) | 1,515,940 (98) | 1,861,327 (91) | 4,847,902 (89) |
| Yes | 149,353 (30) | 23,083 (1.5) | 172,436 (8.5) | 378,583 (7.0) |
| Unknown | 1,517 (0.3) | 1,120 (<0.1) | 2,637 (0.1) | 219,532 (4.0) |
|
| ||||
| Median (IQR) | 34 (25, 43) | 26 (17, 36) | 28 (19, 38) | 26 (17, 36) |
| Unknown, n (%) | 876 (0.2) | 3,268 (0.2) | 4144 (0.2) | 30,722 (0.6) |
|
| ||||
| White | 48,757 (9.8) | 691,669 (45) | 740,426 (36) | 1,574,527 (29) |
| Asian | 132,132 (27) | 80,553 (5.2) | 212,685 (10) | 501,022 (9.2) |
| Black | 8,987 (1.8) | 203,434 (13) | 212,421 (10) | 351,783 (6.5) |
| Hawaiian/Pacific Islander | 7,400 (1.5) | 8,577 (0.6) | 15,977 (0.8) | 35,177 (0.6) |
| Hispanic | 294,339 (59) | 538,884 (35) | 833,223 (41) | 2,028,575 (37) |
| Native Am./Alaskan | 658 (0.1) | 4,862 (0.3) | 5,520 (0.3) | 14,864 (0.3) |
| Multiple/Other/Unknown | 3,984 (0.8) | 12,164 (0.8) | 16,148 (0.8) | 940,069 (17) |
|
| ||||
| No | 495,926 (100) | 1,539,744 (100) | 2,035,670 (100) | 5,442,120 (100) |
| Yes | 331 (<0.1) | 399 (<0.1) | 730 (<0.1) | 3,897 (<0.1) |
|
| ||||
| No | 322,862 (65) | 1,078,472 (70) | 1,401,334 (69) | 4,690,499 (86) |
| Yes | 173,395 (35) | 461,671 (30) | 635,066 (31) | 755,518 (14) |
1high TB incidence country 2Bacillus Calmette–Guérin vaccination, 3hepatitis B virus
Fig 1Percent patients born in countries with high TB incidence for each predictor of interest in the training dataset.
Model performance with 95% confidence intervals.
| Model | AUCROC | AUPRC | Brier |
|---|---|---|---|
| Models on Training Dataset | |||
| Preferred Language | 0.68 (0.67,0.68) | 0.53 (0.53,0.54) | |
| Needs Interpreter | 0.64 (0.64,0.64) | 0.50 (0.50,0.50) | |
| Percent Non-US-Born in Census Tract | 0.66 (0.66,0.66) | 0.37 (0.37,0.37) | |
| Race Ethnicity | 0.77 (0.77,0.77) | 0.49 (0.48,0.49) | |
| BCG3 Vaccine | 0.50 (0.50,0.50) | 0.24 (0.24,0.24) | |
| HBV4 Screen | 0.52 (0.52,0.53) | 0.26 (0.26,0.26) | |
| Two-Way Interactions | 0.86 (0.86,0.86) | 0.72 (0.72,0.72) | 0.11 (0.11,0.11) |
| Main Effects | 0.86 (0.86,0.86) | 0.71 (0.71,0.71) | 0.12 (0.11,0.12) |
| Language, Race/Ethnicity, Census, Interpreter, HBV | 0.86 (0.86,0.86) | 0.71 (0.71,0.71) | 0.12 (0.12,0.12) |
| Language, Race/Ethnicity, Census, Interpreter | 0.86 (0.85,0.86) | 0.70 (0.70,0.70) | 0.12 (0.12,0.12) |
|
|
|
|
|
| Language, Race/Ethnicity | 0.84 (0.84,0.84) | 0.68 (0.68,0.68) | 0.12 (0.12,0.12) |
|
|
|
|
|
|
|
|
|
|
1area under the receiver operator curve
2area under the precision and recall curve
3Bacillus Calmette–Guérin vaccination
4Hepatitis B
Fig 2Odds ratios and 95 percent confidence intervals for predictors in univariate, main effects, and final models.
Final model performance on validation datasets using various cut-points.
| Model | PPV | FOR | Sensitivity | Specificity |
|---|---|---|---|---|
|
|
|
|
|
|
| Preferred Language Only | 0.84 | 0.17 | 0.37 | 0.98 |
| 19% Cut-point (KPNC estimate) | 0.74 | 0.13 | 0.58 | 0.93 |
| 24% Cut-point (KPSC Estimate) | 0.66 | 0.11 | 0.65 | 0.89 |
| 27% Cut-point (CA3 Estimate) | 0.61 | 0.11 | 0.68 | 0.86 |
| 34% Cut-point (LA4 County Estimate) | 0.53 | 0.09 | 0.74 | 0.79 |
|
| ||||
| Preferred Language Only | 0.77 | 0.15 | 0.29 | 0.98 |
| 19% Cut-point (KPNC estimate) | 0.61 | 0.09 | 0.64 | 0.9 |
| 24% Cut-point (KPSC Estimate) | 0.56 | 0.07 | 0.73 | 0.86 |
| 27% Cut-point (CA Estimate) | 0.54 | 0.07 | 0.75 | 0.85 |
| 34% Cut-point (LA County Estimate) | 0.51 | 0.06 | 0.78 | 0.82 |
1Positive predictive value
2False Omission Rate, 3California, 4Los Angeles
Screening metrics for LTBI and TB in patients with known country of birth for final model using various cut-points.
| Model Combination | N Screened | True Cases Identified | True Positive Rate | NNS |
|---|---|---|---|---|
| Preferred Language Only | 50,034 | 8601 | 0.18 | 5.8 |
| Final Model, 19% Cut-point | 163,321 | 23129 | 0.49 | 7.1 |
| Final Model, 24% Cut-point | 208,306 | 27142 | 0.57 | 7.7 |
| Final Model, 27% Cut-point | 223,607 | 28048 | 0.59 | 8.0 |
| Final Model, 34% Cut-point | 254,845 | 29884 | 0.63 | 8.5 |
| EHR Documented Birth in HTBIC | 132,125 | 31507 | 0.66 | 4.2 |
| Preferred Language Only | 264,731 | 291 | 0.38 | 909.7 |
| Final Model, 19% Cut-point | 671,991 | 931 | 0.69 | 721.8 |
| Final Model, 24% Cut-point | 844,211 | 1045 | 0.77 | 807.9 |
| Final Model, 27% Cut-point | 898,033 | 1059 | 0.78 | 848.0 |
| Final Model, 34% Cut-point | 1,011,946 | 1091 | 0.81 | 927.5 |
| EHR Documented Birth in HTBIC | 673,742 | 1050 | 0.78 | 641.7 |
1Number needed to screen defined as number screened divided by number of cases identified
2latent tuberculosis infection
3active tuberculosis disease