| Literature DB >> 35764999 |
Vishal Bali1, Jessica Weaver2, Vladimir Turzhitsky2, Jonathan Schelfhout2, Misti L Paudel3,4, Erin Hulbert3, Jesse Peterson-Brandt3, Anne-Marie Guerra Currie5, Dylan Bakka5.
Abstract
BACKGROUND: Chronic cough (CC) is difficult to identify in electronic health records (EHRs) due to the lack of specific diagnostic codes. We developed a natural language processing (NLP) model to identify cough in free-text provider notes in EHRs from multiple health care providers with the objective of using the model in a rules-based CC algorithm to identify individuals with CC from EHRs and to describe the demographic and clinical characteristics of individuals with CC.Entities:
Keywords: Chronic cough; Cough; Diagnostic test accuracy study; Electronic health records; Natural language processing; Sensitivity and specificity
Mesh:
Year: 2022 PMID: 35764999 PMCID: PMC9238070 DOI: 10.1186/s12890-022-02035-6
Source DB: PubMed Journal: BMC Pulm Med ISSN: 1471-2466 Impact factor: 3.320
Cough model performance metrics on the held-out test set
| Concept | Model type | Precision (PPV)* | Recall (Sensitivity)† | F1‡ | Support§ |
|---|---|---|---|---|---|
| Cough mention | Entity recognition | 0.98 | 0.99 | 0.99 | 341 |
| Positive cough mention | Classification | 0.96 | 0.68 | 0.80 | 179 |
| Negative cough mention | Classification | 0.96 | 0.84 | 0.89 | 86 |
Positive Predictive Value (PPV)
*Precision (PPV) indicates how well the model correctly identifies the desired cough context. PPV = true positives/(true positives + false positives)
†Recall (Sensitivity) indicates how well true positive cough contexts have been captured from provider notes. Recall = true positives/(true positives + false negatives)
‡F1 represents the harmonic mean of precision and recall and identifies the relative impacts of false positives and false negatives on the interpretation of the model’s performance. F1 = (2 × (precision × recall)/(precision + recall))
§Support represents the number of occurrences of the cough concept in the test set
Fig. 1Proportion of A cough encounters and B individuals with chronic cough identified from electronic health records, diagnosis codes/written medication orders, or a combination of both methods. EHR, electronic health record; NLP, natural language processing. Panel (A), individuals with cough encounters (N = 291,326). Panel (B), individuals with chronic cough (N = 8861)
Demographic characteristics and comorbidities of individuals with chronic cough
| Characteristic | Individuals with chronic cough |
|---|---|
| 18–39 | 508 (10.5) |
| 40–44 | 207 (4.3) |
| 45–49 | 324 (6.7) |
| 50–54 | 432 (9.0) |
| 55–59 | 505 (10.5) |
| 60–64 | 501 (10.4) |
| ≥ 65 | 2341 (48.6) |
| Female | 3215 (66.7) |
| Northeast | 677 (14.1) |
| Midwest | 2297 (47.7) |
| South | 1362 (28.3) |
| West | 480 (10.0) |
| Other | 2 (0.0) |
| Other lower respiratory disease | 3926 (81.5) |
| Respiratory infections | 3595 (74.6) |
| Disorders of lipid metabolism | 2996 (62.2) |
| Other connective tissue disease | 2974 (61.7) |
| Diseases of the heart | 2824 (58.6) |
| Hypertension | 2808 (58.3) |
| Non-traumatic joint disorders | 2774 (57.6) |
| Eye disorders | 2687 (55.8) |
| Other nutritional, endocrine, and metabolic disorders | 2550 (52.9) |
| Other skin disorders | 2534 (52.6) |
*Northeast: Connecticut, Maine, Massachusetts, New Hampshire, New Jersey, New York, Pennsylvania, Rhode Island, Vermont. Midwest: Illinois, Indiana, Iowa, Kansas, Michigan, Minnesota, Missouri, Nebraska, North Dakota, Ohio, South Dakota, Wisconsin. South: Alabama, Arkansas, Delaware, District of Columbia, Florida, Georgia, Kentucky, Louisiana, Maryland, Mississippi, North Carolina, Oklahoma, South Carolina, Tennessee, Texas, Virginia, West Virginia. West: Alaska, Arizona, California, Colorado, Hawaii, Idaho, Montana, Nevada, New Mexico, Oregon, Utah, Washington, Wyoming. Other: American Samoa, Armed Forces, Commonwealth of the Northern Mariana Islands, Federated State of Micronesia, Guam, Marshall Islands, Palau, Puerto Rico, Virgin Islands
†Comorbidities were as defined by the Agency for Healthcare Research and Quality’s Clinical Classifications Software Refined (CCSR) for ICD-10-CM Diagnoses [23].
All-cause health care resource use among individuals with chronic cough
| All-cause HCRU | Individuals with chronic cough | HCRU counts, PPPM |
|---|---|---|
| Ambulatory visit | 4666 (96.9) | 2.37 (2.31) |
| Office visit | 4624 (96.0) | 1.29 (1.12) |
| Hospital outpatient visit | 4281 (88.9) | 1.09 (1.85) |
| Emergency room visit | 2811 (58.3) | 0.12 (0.26) |
| Hospital inpatient visit | 1342 (27.9) | 0.02 (0.05) |
| Pharmacy | 4644 (96.4) | 3.18 (2.97) |
HCRU, health care resource use; PPPM, per patient per month; SD, standard deviation
All-cause health care costs for individuals with chronic cough
| All-cause cost type | Individuals with chronic cough ( |
|---|---|
| Mean (SD)* | |
| Total health care | $1931 (4281) |
| Medical | $1450 (3749) |
| Ambulatory | $771 (1813) |
| Office visit | $221 (448) |
| Hospital outpatient | $550 (1712) |
| Emergency room | $62 (171) |
| Inpatient | $468 (2804) |
| Other medical | $150 (764) |
| Pharmacy | $481 (1440) |
*All values are in US dollars and represent costs per patient per month; SD, standard deviation
Fig. 2Distribution of chronic cough eligibility dates. Dates reflect the month and year in which individuals first qualified as having chronic cough