| Literature DB >> 35767912 |
Eva S Klappe1, Ronald Cornet2, Dave A Dongelmans3, Nicolette F de Keizer2.
Abstract
BACKGROUND: During the Coronavirus disease 2019 (COVID-19) pandemic it became apparent that it is difficult to extract standardized Electronic Health Record (EHR) data for secondary purposes like public health decision-making. Accurate recording of, for example, standardized diagnosis codes and test results is required to identify all COVID-19 patients. This study aimed to investigate if specific combinations of routinely collected data items for COVID-19 can be used to identify an accurate set of intensive care unit (ICU)-admitted COVID-19 patients.Entities:
Keywords: COVID-19; Data accuracy; Electronic Health Records; Problem list; Real-time data extraction; Routinely collected data
Mesh:
Year: 2022 PMID: 35767912 PMCID: PMC9186787 DOI: 10.1016/j.ijmedinf.2022.104808
Source DB: PubMed Journal: Int J Med Inform ISSN: 1386-5056 Impact factor: 4.730
Fig. 1Flow chart to annotate a patient with a COVID-19 or non-COVID-19 label.
Search queries including routinely collected data items to identify an accurate set of COVID-19 patients. Search queries shown on white background are EHR data items that could be extracted real-time from the EHR. The search query in italic includes the data item that cannot be extracted real-time as it is retrospectively registered.
| Positive RT-PCR test result |
| The ICD-10 code for COVID-19 (U07.1 and/or U07.2) by healthcare professionals * |
| The ICD-10 code for COVID-19 (U07.1) by healthcare professionals ** |
| An infection label for COVID-19 (confirmed) |
* According to the WHO definition, both U07.1 and U07.2 indicate COVID-19 patients [37].
** According to the WHO definition [38], according to a (Dutch) manual for using the Diagnosis Thesaurus (DT) for healthcare professionals [40], and according to a (Dutch) manual for clinical coders [42], the ICD-10 code U07.1 is used to indicate a patient confirmed by RT-PCR testing and U07.2 can be used to indicate unconfirmed only suspected COVID-19 patients.
Fig. 2Dataset inclusion and exclusion and final gold standard dataset (n = 402) with 196 COVID-19 labeled patients and 206 non-COVID-19 labeled patients.
Performance of search queries including (combinations of) routinely collected data items to identify an accurate set of COVID-19 patients. The performance is determined using the gold standard dataset including the (non–)COVID-19 labels and two subsets. In white, the search queries including data items that could be extracted real-time from the EHR system are shown. In italic, the search queries including ICD-10 coding retrospectively registered by clinical coders are shown.
| Resulting cases (true and false) (n) | Recall (95% CI) | Specificity (95% CI) | Precision (95% CI) | F1 score | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Complete set (n = 402, 196 COVID-19; 206 non-COVID-19) | Feb-Apr (n = 208, 90 COVID-19; 118 non-COVID-19) | May-Dec (n = 194, 88 COVID-19; 106 non-COVID-19) | Complete set | Feb-Apr | May-Dec | Complete set | Feb-Apr | May-Dec | Complete set | Feb-Apr | May-Dec | Complete set | Feb-Apr | May-Dec | |
| Positive RT-PCR test result | 140 | 61 | 79 | 0.71 (0.65–0.78) | 0.68 (0.57–0.77) | 0.75 (0.65–0.82) | 1.0 (0.98–1.0) | 1.0 (0.97–1.00) | 1.0 (0.96–1.0) | 1.0 (0.97–1.0) | 1.0 (0.94–1.0) | 1.0 (0.95–1.0) | 0.83 | 0.81 | 0.85 |
| The ICD-10 code for COVID-19 (U07.1 and/or U07.2) by healthcare professionals | 295 | 200 | 95 | 0.86 (0.80–0.90) | 0.99 (0.94–1.00) | 0.75 (0.65–0.82) | 0.38 (0.32–0.45) | 0.06 (0.02–0.12) | 0.82 (0.72–0.89) | 0.57 (0.51–0.63) | 0.44 (0.37–0.52) | 0.83 (0.74–0.90) | 0.68 | 0.61 | 0.79 |
| The ICD-10 code for COVID-19 (U07.1) by healthcare professionals | 162 | 88 | 74 | 0.82 (0.75–0.87) | 0.97 (0.91–0.99) | 0.69 (0.59–0.78) | 0.99 (0.97–1.00) | 0.99 (0.95–1.0) | 0.99 (0.94–1.0) | 0.99 (0.96–1.00) | 0.99 (0.94–1.0) | 0.99 (0.93–1.0) | 0.89 | 0.98 | 0.81 |
| An infection label for COVID-19 (confirmed) by members of the infection department or by healthcare professionals | 212 | 110 | 102 | 0.97 (0.93–0.99) | 0.99 (0.94–1.0) | 0.95 (0.89–0.98) | 0.89 (0.84–0.93) | 0.82 (0.74–0.89) | 0.99 (0.95–1.0) | 0.90 (0.85–0.93) | 0.81 (0.72–0.88) | 0.99 (0.95–1.0) | 0.93 | 0.89 | 0.97 |
| Positive RT-PCR test result | 120 | 60 | 60 | 0.61 (0.54–0.68) | 0.67 (0.56–0.76) | 0.57 (0.47–0.66) | 1.0 (0.98–1.0) | 1.0 (0.97–1.0) | 1.0 (0.96–1.0) | 1.0 (0.97–1.0) | 1.0 (0.94–1.0) | 1.0 (0.94–1.0) | 0.76 | 0.80 | 0.72 |
| Positive RT-PCR test result | 113 | 59 | 54 | 0.58 (0.50–0.65) | 0.66 (0.55–0.75) | 0.51 (0.41–0.61) | 1.0 (0.98–1.0) | 1.0 (0.97–1.0) | 1.0 (0.96–1.0) | 1.0 (0.97–1.0) | 1.0 (0.94–1.0) | 1.0 (0.93–1.0) | 0.73 | 0.79 | 0.68 |
| Positive RT-PCR test result | 136 | 60 | 76 | 0.69 (0.62–0.76) | 0.67 (0.56–0.76) | 0.72 (0.62–0.80) | 1.0 (0.98–1.0) | 1.0 (0.97–1.) | 1.0 (0.96–1.0) | 1.0 (0.97–1.0) | 1.0 (0.94–1.0) | 1.0 (0.95–1.0) | 0.82 | 0.80 | 0.84 |
| The ICD-10 code (U07.1 and/or U07.2) by healthcare professionals | 182 | 106 | 76 | 0.84 (0.78–0.89) | 0.98 (0.92–1.0) | 0.72 (0.62–0.80) | 0.91 (0.87–0.95) | 0.85 (0.77–0.91) | 1.0 (0.96–1.0) | 0.90 (0.85–0.94) | 0.83 (0.74–0.90) | 1.0 (0.95–1.0) | 0.87 | 0.90 | 0.84 |
| The ICD-10 code for COVID-19 (U07.1) by healthcare professionals | 156 | 86 | 70 | 0.80 (0.73–0.85) | 0.96 (0.89–0.99) | 0.66 (0.56–0.75) | 1.0 (0.98–1.0) | 1.0 (0.97–1.0) | 1.0 (0.96–1.0) | 1.0 (0.98–1.0) | 1.0 (0.96–1.0) | 1.0 (0.95–1.0) | 0.89 | 0.98 | 0.80 |
| Positive RT-PCR test result | 315 | 201 | 114 | 0.96 (0.92–0.98) | 1.0 (0.96–1.0) | 0.92 (0.86–0.97) | 0.38 (0.32–0.45) | 0.06 (0.02–0.12) | 0.82 (0.72–0.89) | 0.60 (0.54–0.65) | 0.45 (0.38–0.52) | 0.86 (0.78–0.92) | 0.74 | 0.62 | 0.89 |
| Positive RT-PCR test result | 189 | 90 | 99 | 0.95 (0.91–0.98) | 0.99 (0.94–1.0) | 0.92 (0.86–0.97) | 0.99 (0.97–1.0) | 0.99 (0.95–1.0) | 0.99 (0.94–1.0) | 0.99 (0.96–1.0) | 0.99-(0.94–1.0) | 0.99 (0.95–1.0) | 0.97 | 0.99 | 0.96 |
| Positive RT-PCR test result | 216 | 111 | 105 | 0.99 (0.96–1.0) | 1.0 (0.96–1.0) | 0.98 (0.93–1.0) | 0.89 (0.84–0.93) | 0.82 (0.74–0.89) | 0.99 (0.94–1.0) | 0.90 (0.85–0.94) | 0.81 (0.73–0.88) | 0.99 (0.95–1.0) | 0.94 | 0.90 | 0.99 |
| The ICD-10 code (U07.1 and/or U07.2) by healthcare professionals | 325 | 204 | 121 | 0.99 (0.96–1.0) | 1.0 (0.96–1.0) | 0.98 (0.93–1.0) | 0.36 (0.30–0.43) | 0.03 (0.01–0.08) | 0.81 (0.71–0.88) | 0.60 (0.54–0.65) | 0.44 (0.37–0.51) | 0.86 (0.78–0.92) | 0.74 | 0.61 | 0.92 |
| The ICD-10 code for COVID-19 (U07.1) by healthcare professionals | 218 | 112 | 106 | 0.99 (0.96–1.0) | 1.0 (0.96–1.0) | 0.98 (0.93–1.0) | 0.88 (0.83–0.92) | 0.81 (0.73–0.88) | 0.98 (0.92–1.0) | 0.89 (0.84–0.93) | 0.80 (0.72–0.87) | 0.98 (0.93–1.0) | 0.94 | 0.89 | 0.98 |
| Positive RT-PCR test result | 118 | 59 | 59 | 0.60 (0.53–0.67) | 0.66 (0.55–0.75) | 0.56 (0.46–0.65) | 1.0 (0.98–1.0) | 1.0 (0.97–1.0) | 1.0 (0.96–1.0) | 1.0 (0.97–1.0) | 1.0 (0.94–1.0) | 1.0 (0.94–1.0) | 0.75 | 0.97 | 0.72 |
| Positive RT-PCR test result | 111 | 58 | 53 | 0.57 (0.49–0.64) | 0.64 (0.54–0.74) | 0.50 (0.40–0.60) | 1.0 (0.98–1.0) | 1.0 (0.97–1.0) | 1.0 (0.96–1.0) | 1.0 (0.97–1.0) | 1.0 (0.94–1.0) | 1.0 (0.93–1.0) | 0.72 | 0.78 | 0.67 |
| Positive RT-PCR test result | 327 | 204 | 123 | 1.0 (0.98–1.0) | 1.0 (0.96–1.0) | 1.0 (0.97–1.0) | 0.36 (0.30–0.43) | 0.03 (0.01–0.08) | 0.81 (0.71–0.88) | 0.60 (0.54–0.65) | 0.44 (0.37–0.51) | 0.86 (0.79–0.92) | 0.75 | 0.66 | 0.93 |
| Positive RT-PCR test result | 220 | 112 | 108 | 1.0 (0.98–1.0) | 1.0 (0.96–1.0) | 1.0 (0.97–1.0) | 0.88 (0.83–0.92) | 0.81 (0.73–0.88) | 0.98 (0.92–1.0) | 0.89 (0.84–0.93) | 0.80 (0.72–0.87) | 0.98 (0.93–1.0) | 0.94 | 0.89 | 0.99 |
Abbreviation: CI, confidence interval.
Fig. 3Search queries applied to the gold standard dataset (n = 402). The numbers indicate the search queries, shown in the legend.
Number and percentages of patients retrieved in the complete gold standard dataset (n = 402) based on search queries including routinely collected data items.
| Gold standard dataset = 402 patients | COVID-19 patients (n = 196) (n(%)) | Non-COVID-19 patients (n = 206) (n(%)) |
|---|---|---|
| RT-PCR test* | ||
| Confirmed (only ‘positive’) | 90 (45.9) | - |
| Confirmed (Both ‘positive’ and ‘negative’) | 50 (25.5) | - |
| Not-confirmed (only ‘negative’) | 18 (9.2) | 201 (97.6) |
| No RT-PCR tests available | 23 (11.7) | 3 (1.4) |
| Only other test results (no negative, no positive, not both) | 15 (7.7) | 2 (1.0) |
| ICD-10 codes on problem list coded by healthcare professionals** | ||
| U07.1 | 153 (78.1) | 2 (1.0) |
| Code is ‘closed’ | 131 (85.6) | 2 (100.0) |
| U07.2 | 8 (4.1) | 125 (60.7) |
| Code is ‘closed’ | 7 (87.5) | 117 (93.6) |
| Both U07.1 and U07.2*** | 7 (3.6) | - |
| U07.2 was older | 6 (85.7) | - |
| U07.1 was older | 1 (14.3) | - |
| Only other coding (no U07.1, no U07.2) | 28 (14.3) | 79 (38.2) |
| Infection labels**** | ||
| Confirmed (‘SARS’) | 129 (65.8) | 18 (8.7) |
| Infection note is suspected | 4 (3.1) | 17 (94.4) |
| Suspected (‘Suspected SARS’) | 1 (0.5) | 113 (54.6) |
| Infection note is confirmed | 0 (0.0) | 2 (1.8) |
| Both confirmed and suspected*** | 61 (31.1) | 4 (1.9) |
| Suspected was older | 58 (95.1) | 2 (50.0) |
| Confirmed was older | 3 (4.9) | 2 (50.0) |
| No infection labels | 5 (2.6) | 60 (29.0) |
| Only other infection labels (no SARS, no Suspected SARS) | 0 (0.0) | 11 (5.3) |
| ICD-10 codes by clinical coders | ||
| U07.1 | 194 (99.0) | 0 (0.0) |
| U07.2 | 2 (1.0) | 5 (2.4) |
| No coding | 0 (0.0) | 0 (0.0) |
| Only other coding (no U07.1, no U07.2) | 0 (0.0) | 201 (97.6) |
* Patients who did not have one positive and/or one negative test, but other test results (antibodies, invalid tests, cancelled tests) were considered ‘only other test results’. Not-confirmed indicated that patients did not have any positive RT-PCR test result.
** Problem list codes are considered ‘active’ or ‘closed’. Problems are closed when the episode is over, but the problem should still be visible in the problem list (i.e. it will be relevant for medical history). When problems are corrected, they should be removed from the problem list, according to the problem list policy in our hospital.
*** Patients with both confirmed and suspected in either infection labels and problem lists, the dates in ‘infection start moment’ and ‘date of observation’ were checked to determine whether confirmed and suspected was older for infection labels and problem lists respectively.
****Infection note is a free-text field indicating more details about the infection status, this displays the number of codes that had contradictory information in the infection note compared to the standardized infection label.
The confusion matrices and number of patients to determine the performance per search query for the complete gold standard dataset and two subsets. The complete dataset (All) included 402 patients (196 COVID-19; 206 non-COVID-19). The dataset with admissions between February – April 2020 (Feb-Apr) included 208 patients (90 COVID-19; 118 non-COVID-19). The dataset with admissions between May-December (May-Dec) included 194 patients (106 COVID-19; 88 non-COVID-19). In white, the search queries including data items that could be extracted real-time from the EHR system are shown. In italic, the search queries including ICD-10 coding retrospectively registered by clinical coders are shown.
| True Positive (TP) (n) | False Positive (FP) (n) | False Negative (FN) (n) | True Negative (TN) (n) | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| All | Feb-Apr | May-Dec | All | Feb-Apr | May-Dec | All | Feb-Apr | May-Dec | All | Feb-Apr | May-Dec | |
| Positive RT-PCR test result | 140 | 61 | 79 | 0 | 0 | 0 | 56 | 29 | 27 | 206 | 118 | 88 |
| The ICD-10 code for COVID-19 (U07.1 and/or U07.2) by healthcare professionals | 168 | 89 | 79 | 127 | 111 | 16 | 28 | 1 | 27 | 79 | 7 | 72 |
| The ICD-10 code for COVID-19 (U07.1) by healthcare professionals | 160 | 87 | 73 | 2 | 1 | 1 | 36 | 3 | 33 | 204 | 117 | 87 |
| An infection label for COVID-19 (confirmed) by members of the infection department or by healthcare professionals | 190 | 89 | 101 | 22 | 21 | 1 | 6 | 1 | 5 | 184 | 97 | 87 |
| Positive RT-PCR test result | 120 | 60 | 60 | 0 | 0 | 0 | 76 | 30 | 46 | 206 | 118 | 88 |
| Positive RT-PCR test result | 113 | 59 | 54 | 0 | 0 | 0 | 83 | 31 | 52 | 113 | 118 | 88 |
| Positive RT-PCR test result | 136 | 60 | 76 | 0 | 0 | 0 | 60 | 30 | 30 | 206 | 118 | 88 |
| The ICD-10 code (U07.1 and/or U07.2) by healthcare professionals | 164 | 88 | 76 | 18 | 18 | 0 | 32 | 2 | 30 | 188 | 100 | 88 |
| The ICD-10 code for COVID-19 (U07.1) by healthcare professionals | 156 | 86 | 70 | 0 | 0 | 0 | 40 | 4 | 36 | 206 | 118 | 88 |
| Positive RT-PCR test result | 188 | 90 | 98 | 127 | 111 | 16 | 8 | 0 | 8 | 79 | 7 | 72 |
| Positive RT-PCR test result | 187 | 89 | 98 | 2 | 1 | 1 | 9 | 1 | 8 | 204 | 117 | 87 |
| Positive RT-PCR test result | 194 | 90 | 104 | 22 | 21 | 1 | 2 | 0 | 2 | 184 | 97 | 87 |
| The ICD-10 code (U07.1 and/or U07.2) by healthcare professionals | 194 | 90 | 104 | 131 | 114 | 17 | 2 | 0 | 2 | 75 | 4 | 71 |
| The ICD-10 code for COVID-19 (U07.1) by healthcare professionals | 194 | 90 | 104 | 24 | 22 | 2 | 2 | 0 | 2 | 182 | 96 | 86 |
| Positive RT-PCR test result | 118 | 59 | 59 | 0 | 0 | 0 | 78 | 31 | 47 | 206 | 118 | 88 |
| Positive RT-PCR test result | 111 | 58 | 53 | 0 | 0 | 0 | 85 | 32 | 53 | 206 | 118 | 88 |
| Positive RT-PCR test result | 196 | 90 | 106 | 131 | 114 | 17 | 0 | 0 | 0 | 75 | 4 | 71 |
| Positive RT-PCR test result | 196 | 90 | 106 | 24 | 22 | 2 | 0 | 0 | 0 | 182 | 96 | 86 |
| Number | Search query |
|---|---|
| 1A | Positive RT-PCR test result |
| 2A | The ICD-10 code for COVID-19 (U07.1 and/or U07.2) by healthcare professionals |
| 3A | The ICD-10 code for COVID-19 (U07.1) by healthcare professionals |
| 4A | An infection label for COVID-19 (confirmed) |
| 5A | Positive RT-PCR test result |
| 6A | Positive RT-PCR test result |
| 7A | Positive RT-PCR test result |
| 8A | The ICD-10 code (U07.1 and/or U07.2) by healthcare professionals |
| 9A | The ICD-10 code for COVID-19 (U07.1) by healthcare professionals |
| 10A | Positive RT-PCR test result |
| 11A | Positive RT-PCR test result |
| 12A | Positive RT-PCR test result |
| 13A | The ICD-10 code (U07.1 and/or U07.2) by healthcare professionals |
| 14A | The ICD-10 code for COVID-19 (U07.1) by healthcare professionals |
| 15A | Positive RT-PCR test result |
| 16A | Positive RT-PCR test result |
| 17A | Positive RT-PCR test result |
| 18A | Positive RT-PCR test result |
| 1B | The ICD-10 code for COVID-19 (U07.1) by clinical coders |
| 2B | Positive RT-PCR test result |
| 3B | ICD-10 code (U07.1 and/or U07.2) by healthcare professionals |
| 4B | The ICD-10 code for COVID-19 (U07.1) by healthcare professionals |
| 5B | An infection label for COVID-19 |
| 6B | Positive RT-PCR test result |
| 7B | The ICD-10 code (U07.1 and/or U07.2) by healthcare professionals |
| 8B | The ICD-10 code for COVID-19 (U07.1) by healthcare professionals |
| 9B | An infection label for COVID-19 |
| 10B | Positive RT-PCR test result |
| 11B | Positive RT-PCR test result |
| 12B | Positive RT-PCR test result |
| 13B | The ICD-10 code (U07.1 and/or U07.2) by healthcare professionals |
| 14B | The ICD-10 code for COVID-19 (U07.1) by healthcare professionals |
| 15B | Positive RT-PCR test result |
| 16B | Positive RT-PCR test result |
| 17B | Positive RT-PCR test result |
| 18B | The ICD-10 code (U07.1 and/or U07.2) by healthcare professionals |
| 19B | The ICD-10 code for COVID-19 (U07.1) by healthcare professionals |
| Number | Search query |
|---|---|
| 1A | Positive RT-PCR test result |
| 2A | The ICD-10 code for COVID-19 (U07.1 and/or U07.2) by healthcare professionals |
| 3A | The ICD-10 code for COVID-19 (U07.1) by healthcare professionals |
| 4A | An infection label for COVID-19 (confirmed) |
| 5A | Positive RT-PCR test result |
| 6A | Positive RT-PCR test result |
| 7A | Positive RT-PCR test result |
| 8A | The ICD-10 code (U07.1 and/or U07.2) by healthcare professionals |
| 9A | The ICD-10 code for COVID-19 (U07.1) by healthcare professionals |
| 10A | Positive RT-PCR test result |
| 11A | Positive RT-PCR test result |
| 12A | Positive RT-PCR test result |
| 13A | The ICD-10 code (U07.1 and/or U07.2) by healthcare professionals |
| 14A | The ICD-10 code for COVID-19 (U07.1) by healthcare professionals |
| 15A | Positive RT-PCR test result |
| 16A | Positive RT-PCR test result |
| 17A | Positive RT-PCR test result |
| 18A | Positive RT-PCR test result |
| 1B | The ICD-10 code for COVID-19 (U07.1) by clinical coders |
| 2B | Positive RT-PCR test result |
| 3B | ICD-10 code (U07.1 and/or U07.2) by healthcare professionals |
| 4B | The ICD-10 code for COVID-19 (U07.1) by healthcare professionals |
| 5B | An infection label for COVID-19 |
| 6B | Positive RT-PCR test result |
| 7B | The ICD-10 code (U07.1 and/or U07.2) by healthcare professionals |
| 8B | The ICD-10 code for COVID-19 (U07.1) by healthcare professionals |
| 9B | An infection label for COVID-19 |
| 10B | Positive RT-PCR test result |
| 11B | Positive RT-PCR test result |
| 12B | Positive RT-PCR test result |
| 13B | The ICD-10 code (U07.1 and/or U07.2) by healthcare professionals |
| 14B | The ICD-10 code for COVID-19 (U07.1) by healthcare professionals |
| 15B | Positive RT-PCR test result |
| 16B | Positive RT-PCR test result |
| 17B | Positive RT-PCR test result |
| 18B | The ICD-10 code (U07.1 and/or U07.2) by healthcare professionals |
| 19B | The ICD-10 code for COVID-19 (U07.1) by healthcare professionals |
Confusion matrix example.
| Gold standard | |||
|---|---|---|---|
| Yes | No | ||
| Outcome of the algorithm | Yes | True Positive (TP) | False Positive (FP) |
| No | False Negative (FN) | True Negative (TN) | |