Literature DB >> 32409818

Unmasking the Actual COVID-19 Case Count.

Samuel C Kou1, Shihao Yang2, Chia-Jung Chang3, Teck-Hua Ho4, Lisa Graver5.   

Abstract

This report presents a novel approach to estimate the total number of COVID-19 cases in the United States, including undocumented infections, by combining the Centers for Disease Control and Prevention's influenza-like illness surveillance data with aggregated prescription data. We estimated that the cumulative number of COVID-19 cases in the United States by 4 April 2020 was > 2.5 million.
© The Author(s) 2020. Published by Oxford University Press for the Infectious Diseases Society of America.

Entities:  

Keywords:  CDC’s influenza-like illness report; aggregated prescription data; influenza-like illness; total case count; undocumented infection

Mesh:

Year:  2020        PMID: 32409818      PMCID: PMC7239241          DOI: 10.1093/cid/ciaa580

Source DB:  PubMed          Journal:  Clin Infect Dis        ISSN: 1058-4838            Impact factor:   9.079


( During the COVID-19 pandemic, many infections with mild to no symptoms are not reported due to various factors, including limited testing [1, 2]. There is a critical need to estimate the true scale of the pandemic for hot-spot detection, resource allocation, and intervention planning. Existing modeling approaches use epidemiology data [2] and digital technology/data [3-5] to estimate the scale of COVID-19. In this report, we present a novel approach to estimate the total number of COVID-19 cases, including undocumented infections, in the United States (US) by comparing data from the US Centers for Disease Control and Prevention (CDC) Outpatient Influenza-like Illness Surveillance Network (ILINet), which targets all influenza-like illness (ILI), overlapping with COVID-19, against the aggregated prescription data of oseltamivir [6], which targets influenza only. Our model shows that current official numbers are severely underestimated: We estimate that by the week ending 21 March 2020, there were > 1.3 million total COVID-19 infections in the US and that by the week ending 4 April 2020, there were > 2.5 million total infections in the US.

METHODS

The CDC defines ILI as “fever and a cough and/or a sore throat without a known cause other than influenza” [7], which covers the common symptoms of COVID-19. CDC generates weekly reports on the ILI level [8] and conducts laboratorial influenza virologic surveillance. Prior to mid-February 2020, these 2 surveillance measures moved in the same direction. Since mid-February, however, the 2 measures have diverged, with the difference between ILI and laboratory-confirmed influenza activities attributable to COVID-19 [7, 8]. If we can obtain an accurate measure for influenza level, we can then use the difference between the reported ILI level and the estimated influenza level to estimate the level of new COVID-19 cases on a weekly basis. We used aggregated weekly prescription data of oseltamivir, prescribed to treat influenza A and B but not COVID-19, to estimate the influenza level. Specifically, we used a linear model to calibrate the CDC-reported ILI level to the oseltamivir prescription data from January 2010 to mid-February 2020, and then produced estimates for influenza activity for mid-February to early April 2020 (Figure 1).
Figure 1.

The estimated influenza level before and after mid-February 2020. Prior to mid-February 2020, our estimated influenza level (blue line) closely matches the Centers for Disease Control and Prevention (CDC)–reported influenza-like illness (ILI) level (black line), but significant gaps between the 2 levels (red and black lines) emerge after mid-February, which can be attributed to COVID-19. To estimate the COVID-19 weekly case counts shown in the figure, we used the ILI total counts reported in ILINet, the reported 8.5% sampling rate of ILINet, and the reported 50% ± 8% rate of persons with symptomatic ILI seeking medical care for their illness. For the reported rates, see https://www.cdc.gov/flu/about/burden/preliminary-in-season-estimates.htm and https://www.cdc.gov/flu/about/burden/how-cdc-estimates.htm.

The estimated influenza level before and after mid-February 2020. Prior to mid-February 2020, our estimated influenza level (blue line) closely matches the Centers for Disease Control and Prevention (CDC)–reported influenza-like illness (ILI) level (black line), but significant gaps between the 2 levels (red and black lines) emerge after mid-February, which can be attributed to COVID-19. To estimate the COVID-19 weekly case counts shown in the figure, we used the ILI total counts reported in ILINet, the reported 8.5% sampling rate of ILINet, and the reported 50% ± 8% rate of persons with symptomatic ILI seeking medical care for their illness. For the reported rates, see https://www.cdc.gov/flu/about/burden/preliminary-in-season-estimates.htm and https://www.cdc.gov/flu/about/burden/how-cdc-estimates.htm.

RESULTS

Our estimated influenza level (blue line) closely matches the CDC-reported ILI level (Figure 1, black line) (correlation 0.974) prior to mid-February 2020, but significant gaps between the 2 levels (Figure 1, red and black lines) emerge after mid-February, which can be attributed to COVID-19. For the week ending 21 March 2020, we estimated that 47% of the reported ILI level could be from COVID-19, which corresponds to approximately 855 000 new symptomatic cases in the US. As the official confirmed number of new cases was 17 450 for that week [9], this result shows that there were > 800 000 unreported symptomatic cases. The figure also shows that the cumulative number of COVID-19 symptomatic cases in the US by the week ending 28 March 2020 was estimated to be > 2 million and that the cumulative number of symptomatic cases in the US by the week ending 4 April 2020 was estimated to be > 2.5 million. Our results show that the official numbers are severely underestimated, a conclusion that appears to be supported by a recent large-scale screening study covering > 6% of the Icelandic population [10] and another antibody survey study in Santa Clara County, California (although the study was cautioned for its design and potential sampling bias) [11]. Our study targeted symptomatic COVID-19 cases as we used the CDC-reported percentage of patients with symptomatic illness who would seek medical care in our estimation. Therefore, if we consider the substantial presymptomatic and asymptomatic cases revealed by the Icelandic study [10], the total number of COVID-19 infections in the US is likely to be even higher than our estimates.

DISCUSSION

Our estimation method is simple and intuitive. It contrasts the CDC-reported ILI level with the estimated influenza level from influenza-specific prescription data to obtain an estimate of the COVID-19 level. Our approach innovatively combined the traditional syndromic surveillance system with big data from pharmacy prescriptions. It provides a feasible solution for estimating unreported COVID-19 cases with mild symptoms. One limitation of our model is that the estimate might become more conservative through time due to administrative/government interventions. Toward the start of April, the syndromic surveillance system ILINet got more and more affected by the changes in the healthcare system, including increased use of telemedicine, the recommendation to limit hospital visits to only severe illness, and tightened social distancing. These changes affect the total number of hospital visits, patients’ inclination to seek outpatient healthcare, and doctors’ medication prescription. Thus, our estimates in early to mid-March could be more accurate as these changes had not yet taken place, and our estimate would serve as a lower bound for the symptomatic cases of COVID-19 in later weeks. Our study indicates the feasibility to estimate COVID-19 case count using multiple data sources. This approach can be used in conjunction with approaches utilizing digital data sources for COVID-19 case estimation [12, 13]. COVID-19 presents an unprecedented challenge. Conquering it requires unprecedented levels of collaboration and data sharing across government agencies, research institutes, and the private sector.
  6 in total

1.  Accurate estimation of influenza epidemics using Google search data via ARGO.

Authors:  Shihao Yang; Mauricio Santillana; S C Kou
Journal:  Proc Natl Acad Sci U S A       Date:  2015-11-09       Impact factor: 11.205

2.  Aggregated mobility data could help fight COVID-19.

Authors:  Caroline O Buckee; Satchit Balsari; Jennifer Chan; Mercè Crosas; Francesca Dominici; Urs Gasser; Yonatan H Grad; Bryan Grenfell; M Elizabeth Halloran; Moritz U G Kraemer; Marc Lipsitch; C Jessica E Metcalf; Lauren Ancel Meyers; T Alex Perkins; Mauricio Santillana; Samuel V Scarpino; Cecile Viboud; Amy Wesolowski; Andrew Schroeder
Journal:  Science       Date:  2020-03-23       Impact factor: 47.728

3.  Share mobile and social-media data to curb COVID-19.

Authors:  Rachel A McKendry; Geraint Rees; Ingemar J Cox; Anne Johnson; Michael Edelstein; Andrew Eland; Molly M Stevens; David Heymann
Journal:  Nature       Date:  2020-04       Impact factor: 49.962

4.  Digital technology and COVID-19.

Authors:  Daniel Shu Wei Ting; Lawrence Carin; Victor Dzau; Tien Y Wong
Journal:  Nat Med       Date:  2020-04       Impact factor: 53.440

5.  Substantial undocumented infection facilitates the rapid dissemination of novel coronavirus (SARS-CoV-2).

Authors:  Ruiyun Li; Sen Pei; Bin Chen; Yimeng Song; Tao Zhang; Wan Yang; Jeffrey Shaman
Journal:  Science       Date:  2020-03-16       Impact factor: 47.728

6.  Spread of SARS-CoV-2 in the Icelandic Population.

Authors:  Daniel F Gudbjartsson; Agnar Helgason; Hakon Jonsson; Olafur T Magnusson; Pall Melsted; Gudmundur L Norddahl; Jona Saemundsdottir; Asgeir Sigurdsson; Patrick Sulem; Arna B Agustsdottir; Berglind Eiriksdottir; Run Fridriksdottir; Elisabet E Gardarsdottir; Gudmundur Georgsson; Olafia S Gretarsdottir; Kjartan R Gudmundsson; Thora R Gunnarsdottir; Arnaldur Gylfason; Hilma Holm; Brynjar O Jensson; Aslaug Jonasdottir; Frosti Jonsson; Kamilla S Josefsdottir; Thordur Kristjansson; Droplaug N Magnusdottir; Louise le Roux; Gudrun Sigmundsdottir; Gardar Sveinbjornsson; Kristin E Sveinsdottir; Maney Sveinsdottir; Emil A Thorarensen; Bjarni Thorbjornsson; Arthur Löve; Gisli Masson; Ingileif Jonsdottir; Alma D Möller; Thorolfur Gudnason; Karl G Kristinsson; Unnur Thorsteinsdottir; Kari Stefansson
Journal:  N Engl J Med       Date:  2020-04-14       Impact factor: 91.245

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.