Harsheen Kaur, Sunghwan Sohn, Chung-Il Wi, Euijung Ryu, Miguel A Park, Kay Bachman, Hirohito Kita, Ivana Croghan, Jose A Castro-Rodriguez, Gretchen A Voge, Hongfang Liu, Young J Juhn.
Abstract
BACKGROUND: To date, no algorithm has been developed to automatically identify patients who meet Asthma Predictive Index (API) criteria from electronic health records (EHR). Our objective is to develop and validate a natural language processing (NLP) algorithm that identifies patients who meet API criteria.
Keywords: API; Asthma; Epidemiology; Informatics; NLP
Year: 2018 PMID: 29439692 PMCID: PMC5812028 DOI: 10.1186/s12890-018-0593-9
Source DB: PubMed Journal: BMC Pulm Med ISSN: 1471-2466 Impact factor: 3.317
Asthma Predictive Index (API) for asthma ascertainment [a]
| Major criteria | Minor criteria |
|---|---|
| 1. Physician diagnosis of asthma for parents | 1. Physician diagnosis of allergic rhinitis for patient |
| 2. Physician diagnosis of eczema for patient | 2. Wheezing apart from colds |
| | 3. Eosinophilia (≥ 4%) |
[a] Asthma is determined by frequent wheezing episodes (two or more wheezing episodes within one year) plus at least one major criterion or two minor criteria.
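The decision rule in the footnote can be written as a short predicate. The following is a minimal sketch, not the authors' NLP-API implementation: the `ApiFindings` record and its field names are hypothetical, it assumes the individual findings have already been extracted from the chart, and it counts eosinophilia as a minor criterion, as in the original API definition.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ApiFindings:
    """Hypothetical per-patient summary; field names are illustrative only."""
    max_wheeze_episodes_in_one_year: int
    parental_asthma: bool             # major criterion 1
    eczema: bool                      # major criterion 2
    allergic_rhinitis: bool           # minor criterion 1
    wheezing_apart_from_colds: bool   # minor criterion 2
    eosinophilia_pct: Optional[float]  # minor criterion 3; None if not measured


def meets_api(f: ApiFindings) -> bool:
    """Frequent wheezing plus (>= 1 major criterion or >= 2 minor criteria)."""
    frequent_wheezing = f.max_wheeze_episodes_in_one_year >= 2
    major = sum([f.parental_asthma, f.eczema])
    minor = sum([
        f.allergic_rhinitis,
        f.wheezing_apart_from_colds,
        f.eosinophilia_pct is not None and f.eosinophilia_pct >= 4.0,
    ])
    return frequent_wheezing and (major >= 1 or minor >= 2)
```

For example, a child with three wheezing episodes in one year and a parental asthma diagnosis satisfies the rule, while a child with a single episode does not, regardless of how many other criteria are present.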
Fig. 1. Overview of NLP-API algorithm (Abbreviation: PPI, Patient Provided Information)
Demographics of the test cohort
| Test cohort ( | |
|---|---|
| Age at the last follow-up date, years, median (interquartile range) | 5.3 (3.6, 6.7) |
| Male, n (%) | 209 (48%) |
| White, n (%) | 315 (74%) |
| Asthma (ascertained by abstractors), n (%) | 36 (8%) |
| Allergic rhinitis, n (%) | 39 (9%) |
| Eczema, n (%) | 102 (24%) |
| Family history of asthma, n (%) | 101 (23%) |
| Maternal smoking during pregnancy, n (%) | 33 (7%) |
| History of breastfeeding, n (%) | 354 (84%) |
Agreement of asthma ascertainment between NLP and manual chart review (criterion validity)
| Test cohort ( | Kappa index | Overall agreement rate | Sensitivity | Specificity | PPV [a] | NPV [b] |
|---|---|---|---|---|---|---|
| Overall | 0.86 | 97% | 86% | 98% | 88% | 98% |
| Sex | ||||||
| Male ( | 0.89 | 98% | 90% | 98% | 90% | 98% |
| Female ( | 0.82 | 97% | 80% | 99% | 85% | 98% |
| Race | ||||||
| Caucasian ( | 0.83 | 97% | 81% | 98% | 88% | 98% |
| Non-Caucasian ( | 0.94 | 99% | 100% | 98% | 90% | 100% |
| Gestational age | ||||||
| Late Preterm ( | 0.82 | 96% | 84% | 98% | 84% | 98% |
| Term ( | 0.90 | 98% | 88% | 99% | 93% | 99% |
[a] PPV: positive predictive value
[b] NPV: negative predictive value
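The agreement statistics in this table all derive from a 2x2 cross-tabulation of NLP results against manual chart review. The sketch below shows the standard formulas, with manual review treated as the reference standard. The function name and the counts in the usage note are hypothetical and are not taken from the study.

```python
def agreement_metrics(tp: int, fp: int, fn: int, tn: int) -> dict:
    """Agreement statistics for a 2x2 table (NLP result vs. reference standard).

    tp: both positive, fp: NLP positive only,
    fn: reference positive only, tn: both negative.
    Assumes every row and column total is non-zero.
    """
    n = tp + fp + fn + tn
    observed = (tp + tn) / n
    # Agreement expected by chance, computed from the marginal totals.
    expected = ((tp + fp) * (tp + fn) + (fn + tn) * (fp + tn)) / n ** 2
    return {
        "kappa": (observed - expected) / (1 - expected),
        "agreement": observed,
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "ppv": tp / (tp + fp),
        "npv": tn / (tn + fn),
    }
```

With illustrative counts `agreement_metrics(tp=40, fp=10, fn=10, tn=140)`, observed agreement is 0.90 and chance agreement is 0.625, giving a kappa of about 0.73. Kappa can be noticeably lower than raw agreement when one class dominates, which is why the table reports both.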
Associations of asthma status determined by NLP and manual chart review with known risk factors for asthma (construct validity)
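| | By NLP: No asthma ( | By NLP: Asthma ( | By NLP: OR [d] | By NLP: P value | By manual chart review: No asthma ( | By manual chart review: Asthma ( | By manual chart review: OR [d] | By manual chart review: P value |
|---|---|---|---|---|---|---|---|---|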
| Age [a], years, median (IQR) | 5.2 | 6.2 | 1.2 | .01 | 5.1 | 6.3 | 1.2 | .02 |
| Male, | 188 (47%) | 21 | 1.6 | .17 | 188 (48%) | 21 | 1.5 | .23 |
| White, | 290 (75%) | 25 | 0.8 | .62 | 288 (75%) | 27 | 1.0 | .97 |
| Birth weight, median (IQR) | 3.14 (2.5,3.5) | 2.8 (2.3,3.4) | 0.9 | .08 | 3.1 | 2.8 | 0.9 | .14 |
| Cesarean section, | 115 (29%) | 11 | 1.1 (0.5,2.3) | .79 | 116 (30%) | 10 | 0.9 | .81 |
| Gestational age, median (IQR) | 37 (36,39) | 36 (36,37) | 0.8 | .07 | 37 (36,39) | 36 (36,38) | 0.8 | .15 |
| Allergic rhinitis, | 31 | 8 | 3.4 | < .01 | 31 | 8 | 3.3 | < .01 |
| Eczema, | 87 | 15 | 2.7 | < .01 | 86 | 16 | 2.8 | < .01 |
| Family history of asthma, | 80 | 21 | 5.8 | < .01 | 80 | 21 | 5.4 (2.6,11.0) | < .01 |
| Family history of atopic diseases, | 150 (38%) | 14 | 1.0 | .83 | 148 (37%) | 16 | 1.3 | .43 |
| Passive smoke exposure, | 51 (14%) | 8 (24%) | 1.9 | .11 | 51 | 8 | 1.8 | .13 |
| Maternal smoking [b], | 25 | 8 | 4.4 | < .01 | 25 | 8 | 4.2 (1.7,10.3) | < .01 |
| Childcare attendance [c], | 165 (42%) | 18 | 1.4 | .28 | 164 (42%) | 19 | 1.5 | .21 |
| Breastfeeding, | 327 (85%) | 27 | 0.8 | .66 | 325 (84%) | 29 | 1.0 | .89 |
[a] Age at the last follow-up date
[b] Maternal smoking status during pregnancy
[c] Childcare attendance before 3 years of age
[d] Unadjusted odds ratio
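For a binary risk factor, an unadjusted odds ratio is the cross-product ratio of a 2x2 table of exposure by asthma status. The sketch below pairs it with a Wald 95% confidence interval on the log scale. The source does not state how the odds ratios or intervals were estimated, so the interval method is an assumption, and the function name and example counts are hypothetical. Continuous factors such as age would instead need a logistic regression model.

```python
import math


def odds_ratio(a: int, b: int, c: int, d: int, z: float = 1.96):
    """Unadjusted odds ratio with a Wald confidence interval.

    a: exposed with asthma,    b: unexposed with asthma,
    c: exposed without asthma, d: unexposed without asthma.
    All four cells must be non-zero.
    """
    estimate = (a * d) / (b * c)
    # Standard error of log(OR) under the Wald approximation.
    se = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(estimate) - z * se)
    upper = math.exp(math.log(estimate) + z * se)
    return estimate, lower, upper
```

With illustrative counts `odds_ratio(10, 20, 30, 240)`, the point estimate is 4.0. The interval is wide because the exposed cells are small, a pattern also visible in the wide intervals reported in the table.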