| Literature DB >> 35094685 |
Ross D Williams1, Aniek F Markus1, Jenna M Reps2, Peter R Rijnbeek3, Cynthia Yang1, Talita Duarte-Salles4, Scott L DuVall5, Thomas Falconer6, Jitendra Jonnagaddala7, Chungsoo Kim8, Yeunsook Rho9, Andrew E Williams10, Amanda Alberga Machado11, Min Ho An12, María Aragón4, Carlos Areia13, Edward Burn4,14, Young Hwa Choi15, Iannis Drakos16, Maria Tereza Fernandes Abrahão17, Sergio Fernández-Bertolín4, George Hripcsak6, Benjamin Skov Kaas-Hansen18,19, Prasanna L Kandukuri20, Jan A Kors1, Kristin Kostka21, Siaw-Teng Liaw7, Kristine E Lynch5, Gerardo Machnicki22, Michael E Matheny23,24, Daniel Morales25, Fredrik Nyberg26, Rae Woong Park27, Albert Prats-Uribe14, Nicole Pratt28, Gowtham Rao2, Christian G Reich21, Marcela Rivera29, Tom Seinen1, Azza Shoaibi2, Matthew E Spotnitz6, Ewout W Steyerberg30,31, Marc A Suchard32, Seng Chan You27, Lin Zhang33,34, Lili Zhou20, Patrick B Ryan2, Daniel Prieto-Alhambra14.
Abstract
BACKGROUND: We investigated whether we could use influenza data to develop prediction models for COVID-19 to increase the speed at which prediction models can reliably be developed and validated early in a pandemic. We developed COVID-19 Estimated Risk (COVER) scores that quantify a patient's risk of hospital admission with pneumonia (COVER-H), hospitalization with pneumonia requiring intensive services or death (COVER-I), or fatality (COVER-F) in the 30-days following COVID-19 diagnosis using historical data from patients with influenza or flu-like symptoms and tested this in COVID-19 patients.Entities:
Keywords: COVID-19; Patient-level prediction modelling; Risk score
Mesh:
Year: 2022 PMID: 35094685 PMCID: PMC8801189 DOI: 10.1186/s12874-022-01505-z
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.612
Data sources formatted to the Observational Medical Outcomes Partnership Common Data Model (OMOP-CDM) used in this research (data type: claims, electronic health/medical records (EHR/EMR), general practitioner (GP))
| Database | Database | Country | Data type | Contains COVID-19 data? | Time period |
|---|---|---|---|---|---|
| Columbia University Irving Medical Center Data Warehouse | CUIMC | United States | EMR | Yes | Influenza: 1990-2020 COVID-19: March-April 2020 |
| Health Insurance and Review Assessment | HIRA | South Korea | Claims | Yes | COVID-19: 1st January- 4th April 2020 |
| The Information System for Research in Primary Care | SIDIAP | Spain | GP and hospital admission EHRs linked | Yes | Influenza: 2006-2017 COVID-19: March 2020 |
| Tufts Research Data Warehouse | TRDW | United States | EMR | Yes | Influenza: 2006-2020 COVID-19: March 2020 |
| Department of Veterans Affairs | VA | United States | EMR | Yes | Influenza: 2009-2010, 2014-2019 COVID-19: 1st March- 20th April |
| Optum© De-Identified ClinFormatics® Data Mart Databasea | ClinFormatics | United States | Claims | No | 2000-2018 |
| Ajou University School of Medicine Database | AUSOM | South Korea | EHR | No | 1996 - 2018 |
| Australian Electronic Practice based Research Network | AU-ePBRN | Australia | GP and hospital admission EHRs linked | No | 2012-2019 |
| IBM MarketScan® Commercial Database | CCAE | United States | Claims | No | 2000-2018 |
| Integrated Primary Care Information | IPCI | Netherlands | GP | Yes | 2006-2020 |
| Japan Medical Data Center | JMDC | Japan | Claims | No | 2005-2018 |
| IBM MarketScan® Multi-State Medicaid Database | MDCD | United States | Claims | No | 2006-2017 |
| IBM MarketScan® Medicare Supplemental Database | MDCR | United States | Claims | No | 2000-2018 |
| Optum© de-identified Electronic Health Record Dataset | Optum EHR | United States | EHR | No | 2006-2018 |
aDevelopment database
Fig. 1A Flow chart representing the path of data in the study. This details the splits used internally for model development, the steps taken for model parsimonisation and validation and external validation
Population size, outcome proportion, and characteristics for the development database (influenza) and external validation databases for COVID-19 and influenza (N/A indicates this result is not available)
| Development | External validation: COVID-19 | External validation: influenza | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ClinFormatics | CUIMC | HIRA | SIDIAP | TRDW | VA | AUSOM | AU-ePBRN | CCAE | IPCI | JMDC | MDCD | MDCR | Optum EHR | |
| Number of participants | 2,082,277 | 2,731 | 1,985 | 37,950 | 395 | 1,446 | 3,105 | 2,791 | 3,146,801 | 29,132 | 1,276,478 | 536,806 | 248,989 | 1,654,157 |
| Hospitalization with pneumonia (Outcome proportion %) | 105,030 (5.04) | N/A | 89 (4.48) | 1,223 (1.11) | 21 (5.32) | 149 (10.30) | 49 (1.58) | 29 (1.04) | 33,824 (1.07) | 22 (0.08) | 728 (0.06) | 32,987 (6.15) | 31,059 (12.47) | 34,229 (2.07) |
| Hospitalization with pneumonia requiring intensive services or death (Outcome proportion %) | 29,905 (1.44) | 134 (4.91) | 22 (1.11) | N/A | 5 (1.27) | 38 (2.63) | 5 (0.16) | 3 (0.11) | 4,856 (0.02) | 24 (0.08) | 65 (0.01) | 7,226 (1.35) | 3,628 (1.46) | 7,368 (0.45) |
| Death (Outcome proportion %) | 11,407 (0.55) | 335 (12.27) | 43 (2.17) | 406 (1.07) | 1 (0.25) | 43 (2.97) | 5 (0.16) | 4 (0.14) | 965 (0.03) | 24 (0.08) | 75 (0.01) | 2,603 (0.48) | 1,354 (0.54) | 3,513 (0.21) |
| Age (% above 65) | 26.1 | 38.9 | 15.6 | 17.9 | 18.2 | 37.3 | 11.9 | 23.1 | 12.5 | 16.9 | 16.0 | 14.2 | 96.2 | 30.0 |
| Sex (%, male) | 44.4 | 47.2 | 43.5 | 43.4 | 49.6 | 81.4 | 41.7 | 44.5 | 42.7 | 43.7 | 56.8 | 29.2 | 45.9 | 40.1 |
| Cancer (%) | 12.6 | 17.1 | 9.8 | 6.3 | 11.6 | 17.0 | 7.7 | 8.2 | 6.2 | 3.7 | 2.5 | 8.9 | 35.2 | 10.6 |
| COPD (%) | 10.2 | 9.3 | 4.9 | 2.5 | 6.3 | 20.5 | 2.7 | 3.1 | 2.7 | 2.7 | 0.5 | 19.8 | 26.6 | 7.6 |
| Diabetes (%) | 20.5 | 30.9 | 23.1 | 8.0 | 19.7 | 35.2 | 3.8 | 13.0 | 11.4 | 6.7 | 8.3 | 27.4 | 36.1 | 15.3 |
| Heart disease (%) | 31.0 | 40.1 | 17.1 | 11.2 | 25.8 | 44.7 | 7.7 | 12.9 | 16.5 | 7.5 | 8.0 | 36.1 | 68.2 | 23.4 |
| Hypertension (%) | 44.2 | 51.6 | 26.3 | 14.8 | 38.5 | 63.0 | 13.9 | 27.0 | 29.1 | 12.4 | 11.4 | 49.8 | 80.4 | 36.1 |
| Hyperlipidemia (%) | 46.8 | 40.6 | 39.9 | 11.4 | 32.9 | 62.5 | 3.3 | 20.2 | 21.8 | 4.6 | 15.2 | 36.0 | 69.6 | 34.2 |
| Kidney disease (%) | 18.7 | 31.2 | 17.0 | 11.0 | 24.3 | 32.4 | 7.6 | 6.2 | 9.0 | 1.2 | 5.1 | 23.4 | 35.5 | 14.9 |
Results for internal validation in ClinFormatics
| Outcome | Predictors | No. Variables | AUC | AUPRC |
|---|---|---|---|---|
| Hospitalization with pneumonia | Conditions/drugs + age/sex | 521 | 0.852 | 0.224 |
| Age/sex | 2 | 0.818 | 0.164 | |
| COVER-H | 9 | 0.840 | 0.120 | |
| Hospitalization with pneumonia requiring intensive services or death | Conditions/drugs + age/sex | 349 | 0.860 | 0.070 |
| Age/sex | 2 | 0.821 | 0.049 | |
| COVER-I | 9 | 0.839 | 0.059 | |
| Fatality | Conditions/drugs + age/sex | 205 | 0.926 | 0.069 |
| Age/sex | 2 | 0.909 | 0.037 | |
| COVER-F | 9 | 0.896 | 0.039 |
Results of external validation of the COVER scores on COVID-19 patients with a GP, ER, or OP visit in 2020 (*Confidence interval is not reported as the number of outcomes is larger than 1000)
| Outcome | Database | AUC (95% confidence interval) | AUPRC |
|---|---|---|---|
Hospitalization with pneumonia (COVER-H) | HIRA | 0.806 (0.762-0.851) | 0.134 |
| SIDIAP | 0.748* | 0.072 | |
| TRDW | 0.731 (0.611-0.851) | 0.132 | |
| VA | 0.689 (0.649-0.729) | 0.179 | |
Hospitalization with pneumonia requiring intensive services or death (COVER-I) | CUIMC | 0.734 (0.699-0.769) | 0.100 |
| HIRA | 0.910 (0.889-0.931) | 0.053 | |
| VA | 0.763 (0.708-0.818) | 0.058 | |
Fatality (COVER-F) | CUIMC | 0.820 (0.796-0.840) | 0.400 |
| HIRA | 0.898 (0.857-0.940) | 0.150 | |
| SIDIAP | 0.895 (0.881-0.910) | 0.083 | |
| VA | 0.717 (0.642-0.791) | 0.068 |
Fig. 2The ROC and Calibration plots for the validations (internal and external) of the 3 Cover scores
Fig. 3A graphic showing how to calculate the 3 Cover scores with a nomogram to convert the raw score into a percentage risk. There is also a distribution of scores found using internal validation to allow for comparison of a patients score to the wider population