| Literature DB >> 29238227 |
Francis Nissen1, Jennifer K Quint2, Samantha Wilkinson1, Hana Mullerova3, Liam Smeeth1, Ian J Douglas1.
Abstract
OBJECTIVE: To describe the methods used to validate asthma diagnoses in electronic health records and summarize the results of the validation studies.Entities:
Keywords: NPV; PPV; database; epidemiology; sensitivity; specificity; validity
Year: 2017 PMID: 29238227 PMCID: PMC5716672 DOI: 10.2147/CLEP.S143718
Source DB: PubMed Journal: Clin Epidemiol ISSN: 1179-1349 Impact factor: 4.790
Figure 1Study screening process: PRISMA flow diagram.
Note: Reproduced from Moher D, Liberati A, Tetzlaff J, Altman DG, PRISMA Group. Preferred reporting items for systematic Reviews and meta-analyses: the PRISMA statement. BMJ. 2009;339:b2535.37
Abbreviation: EHR, electronic health record.
Characteristics of studies with validated asthma algorithms
| Author, year, country, (period) | Data source, population | Sample/case characteristics | Clinical event | Algorithm | Validation |
|---|---|---|---|---|---|
| Xi et al, | 2 large academic primary care clinics | 398 randomly selected patients | Asthma code | Search algorithms: | Manual review |
| Engelkes et al, | ICPI: Dutch GP EHR Primary care | 63,518 potential cases identified | Definite, probable, and doubtful cases of asthma | Combination of ICPI communication codes, clinician codes, drug names and free text generated by a machine-learning algorithm (RIPPER) | 22,699 cases manually validated, 14,303 asthma cases found |
| Afzal et al, | ICPI: Dutch GP EHR Primary care | 63,618 potential asthma cases identified, children aged 5–18 | Definite, probable, and doubtful cases of asthma | Combination of ICPI communication codes, clinician codes, drug names and free text generated by a machine-learning algorithm (RIPPER) | 5,032 patients manually validated by clinician |
| Dexheimer et al, | 1 pediatric ED | 15,163 assessed, 1,100 asthma patients all asthma patients (2–18 years) in a 3 month time window | Asthma code | Bayesian network system, previously used on claims data (Sanders) | Pediatric asthma/respiratory distress protocol filled in for identified patients |
| Wu et al, | Children enrolled in the Mayo Clinic sick-child daycare program, Secondary care | 112 children younger than 4 | ICD-9 codes | Natural language processing (logic) | Manual review by a clinician |
| Kozyrskyj et al, | SAGE: birth cohort of 16,320 children born in 1995 in Manitoba, Canada | 723 children from the group with completed questionnaires | Asthma | Database definitions in health care records | Pediatric allergist diagnosis of asthma |
| Pacheco et al, | NUgene Project | 7,970 people with DNA samples, of which 521 had an asthma diagnosis | Asthma diagnosis | Manual review of 100 cases for both algorithms | |
| Vollmer et al, | KPNW, Epic, OSCAR, TOPS ED, secondary care | 235,000 patients with continuous health plan eligibility aged 15–55 in January 1999 | ICD-9 codes | ||
| Donahue et al, | Harvard Pilgrim Health Care (HPHC); Primary, secondary and emergency care | Random sample of 100 patients | Asthma code | Asthma diagnosis and asthma drug dispensing | Manual review by clinicians |
| Premaratne et al, | Accident and EDs of two hospitals | All asthma patients January–March 1994 1,185 records, of which 209 did not have enough data | String containing “asth*” | String containing “asth*” in the free text records | |
|
| |||||
|
| |||||
| Engeland et al, | MBRN: population-based birth registry, all births in Norway since 1967 (more than 2.3 million) | 108,489 pregnancies, of which 4,549 mothers were recorded as having asthma in MBRN | Asthma | Asthma diagnosis in MBRN | NorPD: asthma medication |
| Coulter et al, | 7 general practices in the Oxford community health project | 2,443 on digital register Bronchodilators, inhaled CS, prophylactic drugs | Asthma diagnosis | Asthma diagnosis on register | Manual review against the list of patients on long-term medication |
| Ward et al, | GP Practice with 14,830 patients | 833 asthma patients, 659 responses | Asthma in GP database | One of the following criteria: | Questionnaire to determine bronchial hyperreactivity |
Abbreviations: CPP, cumulative patient profile; ICPI, integrated primary care information database; GP, general practitioner; EHR, electronic health record; SAGE, Study of Asthma, Genes and the Environment; KPNW, Kaiser Permanente Northwest Division; OSCAR, outside claims database; TOPS, The outpatient pharmacy system; ED, emergency department; ICS, inhaled corticosteroids; MBRN, Medical Birth Registry of Norway; CS, corticosteroids; NorPD, Norwegian Prescription Database.
Characteristics of studies with validated asthma algorithms
| Author, year, country, prevalence | Algorithm | Sensitivity, 95% CI | Specificity, 95% CI | PPV, 95% CI | NPV, 95% CI | Prevalence |
|---|---|---|---|---|---|---|
| Manual validation | ||||||
|
| ||||||
| Xi et al, | 1. Asthma in disease registry | 7% (5–10) | 99% (97–100) | 67% (38–87) | 73% (72–74) | 8.1% |
| 2. Billing code | 77% (75–83) | 89.2% (86–92) | 74% (67–80) | 91% (88–94) | ||
| 3. Asthma in CPP | 63% (59–68) | 92% (90–95) | 76% (68–83) | 87% (83–89) | ||
| 4. Asthma medications | 79% (75–83) | 64% (59–68) | 46% (41–50) | 88% (84–92) | ||
| 5. Asthma in chart notes | 85% (81–88) | 76% (72–80) | 58% (52–63) | 93% (89–95) | ||
| 6. Asthma in CPP OR billing code 493 | 90% (87–93) | 84% (80–88) | 69% (63–74) | 96% (93–97) | ||
| 7. Asthma in CPP OR billing code 493 (exclusion codes 491, 492, and 496) | 87% (83–90) | 85% (82–89) | 70% (63–76) | 94% (91–96) | ||
| 8. (Asthma in chart notes OR asthma medications) AND billing code 493 | 78% (74–82) | 92% (89–95) | 79% (72–85) | 91% (88–94) | ||
| 9. (Billing code 493 OR medications) AND asthma in chart note | 84% (80–88) | 84% (80–88) | 67% (61–73) | 93% (90–95) | ||
| 10. Billing diagnostic code 493 AND asthma in chart notes | 74% (70–78) | 93% (91–96) | 81% (73–87) | 90% (87–93) | ||
| Engelkes et al, | Definite, probable and doubtful cases | 63% | ||||
| Afzal et al, | Definite asthma | 98% | 95% | 66% | 6% | |
| Definite + probable | 96% | 90% | 82% | 29% | ||
| Definite, probable and doubtful cases | 95% | 67% | 57% | 32% | ||
| Dexheimer et al, | Algorithm constructed using a Bayesian network system | 64% | 7–10% | |||
| Wu et al, | ICD-9 codes | 31 | 93 | 57 | 82 | 4–17% |
| Natural language processing: logic | 81 | 95 | 84 | 94 | ||
| Natural language processing: machine learning | 85 | 97 | 88 | 95 | ||
| Kozyrskyj et al, | At least one asthma hospitalization, or two physician visits, or four prescription medications | 47% (35–60) | 92% (78–98) | 91% (76–98) | 11% | |
| At least one asthma hospitalization, or two physician visits, or two prescription medications | 67% (54–78) | 92% (78–98) | 94% (82–99) | |||
| At least one asthma hospitalization, or one physician visit, or two prescription medications | 77% (65–87) | 92% (78–98) | 94% (85–99) | |||
| At least one asthma hospitalization, or one physician visit, or two bronchodilators, or one controller medication | 80% (69–89) | 89% (74–97) | 93% (83–98) | |||
| At least one asthma hospitalization, or one physician visit, or two bronchodilators, or one bronchodilator and ketotifen or an oral steroid, or one controller medication | 80% (69–89) | 89% (74–97) | 93% (83–98) | |||
| At least one asthma hospitalization, or one physician visit, or one bronchodilator, or one controller medication | 82% (70–90) | 83% (67–94) | 90% (79–96) | |||
| Pacheco et al, 2009 | Initial algorithm | 70% (60–78) | 100% | 100% (90–100) | 77% (65–86) | 7.2% |
| Final algorithm | 95% (84–99) | 96% (87–99) | 95% (84–99) | 96% (87–99) | ||
| Vollmer et al, | Algorithm 1: population of 4460 | 95% | 4.1% | |||
| Algorithm 2: population of 2334 | 90% | |||||
| Algorithm 3: population of 545 | 70% | |||||
| Algorithm 4: population of 25 | 100% | |||||
| Algorithm 5: population of 11 | 50% | |||||
| Algorithm 6: population of 721 | 80% | |||||
| Algorithm 7: population of 99 | 27% | |||||
| Algorithm 8: population of 1528 | 80% | |||||
| Donahue et al, | Asthma code and drug dispensing | 86% | 3% | |||
| Premaratne et al, | String containing asth* in free text records | 80% (75–86) | 96% (96–99) | 91% (87–94) | 94% (93–95) | 20.6% |
|
| ||||||
| Comparison to an independent database | ||||||
|
| ||||||
| Engeland et al, | Asthma in MBRN and NorPD | 51% (49–52) | 98% (98–98) | 46% (45–48) | 4.20% | |
| Coulter et al, | Percentage of people on long term medication and recorded on the register | 58% | ||||
|
| ||||||
| Comparison to a questionnaire | ||||||
|
| ||||||
| Ward et al, | Total of all reviewed patients | 89% | 5.60% | |||
| Cases without bronchial hyperreactivity | 73% | |||||
| Controls with bronchial hyperreactivity | 78% | |||||
Abbreviations: PPV, positive predictive value; NPV, negative predictive value; NLP, natural language processing; ML, machine learning; MBRN, Medical Birth Registry of Norway; NorPD, Norwegian Prescription Database.
Quality assessment using QUADAS-2
| Study | Risk of bias
| |||
|---|---|---|---|---|
| Patient selection | Index test | Reference standard | Flow and timing | |
| Xi et al, | ☺ | ? | ☺ | ? |
| Engelkes et al, | ☺ | ☺ | ☹ | ☺ |
| Afzal et al, | ☹ | ☺ | ☺ | ☺ |
| Dexheimer et al, | ☺ | ☺ | ☺ | ☺ |
| Wu et al, | ☹ | ☺ | ? | ☺ |
| Kozyrskyj et al, | ☹ | ☹ | ☺ | ☺ |
| Pacheco et al, | ☹ | ☺ | ☺ | ☺ |
| Vollmer et al, | ☹ | ☺ | ☺ | ☺ |
| Donahue et al, | ☺ | ☹ | ☹ | ☺ |
| Premaratne et al, | ☺ | ☹ | ☺ | ☺ |
| Engeland et al, | ☹ | ☹ | ☹ | ☹ |
| Coulter et al, | ☹ | ☹ | ☹ | ? |
| Ward et al, | ☹ | ☹ | ☺ | ☹ |
Note: Happy face: low risk; sad face: high risk; question mark: unclear risk.