| Literature DB >> 35447417 |
Guangming Jiang1, Jiangping Wu2, Jennifer Weidhaas3, Xuan Li2, Yan Chen2, Jochen Mueller4, Jiaying Li4, Manish Kumar5, Xu Zhou6, Sudipti Arora7, Eiji Haramoto8, Samendra Sherchan9, Gorka Orive10, Unax Lertxundi10, Ryo Honda11, Masaaki Kitajima12, Greg Jackson13.
Abstract
As a cost-effective and objective population-wide surveillance tool, wastewater-based epidemiology (WBE) has been widely implemented worldwide to monitor the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) RNA concentration in wastewater. However, viral concentrations or loads in wastewater often correlate poorly with clinical case numbers. To date, there is no reliable method to back-estimate the coronavirus disease 2019 (COVID-19) case numbers from SARS-CoV-2 concentrations in wastewater. This greatly limits WBE in achieving its full potential in monitoring the unfolding pandemic. The exponentially growing SARS-CoV-2 WBE dataset, on the other hand, offers an opportunity to develop data-driven models for the estimation of COVID-19 case numbers (both incidence and prevalence) and transmission dynamics (effective reproduction rate). This study developed artificial neural network (ANN) models by innovatively expanding a conventional WBE dataset to include catchment, weather, clinical testing coverage and vaccination rate. The ANN models were trained and evaluated with a comprehensive state-wide wastewater monitoring dataset from Utah, USA during May 2020 to December 2021. In diverse sewer catchments, ANN models were found to accurately estimate the COVID-19 prevalence and incidence rates, with excellent precision for prevalence rates. Also, an ANN model was developed to estimate the effective reproduction number from both wastewater data and other pertinent factors affecting viral transmission and pandemic dynamics. The established ANN model was successfully validated for its transferability to other states or countries using the WBE dataset from Wisconsin, USA.Entities:
Keywords: Artificial neural network; COVID-19; Incidence; Prevalence; SARS-CoV-2; Wastewater-based epidemiology
Mesh:
Substances:
Year: 2022 PMID: 35447417 PMCID: PMC9006161 DOI: 10.1016/j.watres.2022.118451
Source DB: PubMed Journal: Water Res ISSN: 0043-1354 Impact factor: 13.400
Parameters of collected COVID and WBE data from the Utah state (USA), and their use in the ANN models.
| Category | Symbol | Type | ANN use | Definition | Units |
|---|---|---|---|---|---|
| Wastewater analysis | Date | Date/time | - | Wastewater sampling date | dd/mm/yyyy |
| VL | Numeric | F | SARS-CoV-2 viral load in wastewater | MGC/person/day | |
| ST | Categorical | - | Wastewater sampling technique | - | |
| Catchment | Loc | Categorical | - | The sewer or wastewater treatment plant for wastewater sampling | - |
| Pop | Numeric | F | population | person | |
| ADWF | Numeric | F | Average dry weather flow | ML/day | |
| Weather | Prain | Numeric | F | Daily precipitation at the sampling location | mm |
| Tair | Numeric | F | Average daily air temperature | oC | |
| Twater | Numeric | F | Average daily wastewater temperature | oC | |
| Clinical test | TR | Numeric | - | Ratio of population being tested clinically on the sampling date | - |
| TPR | Numeric | F | Positivity ratio of clinical tests | - | |
| Vaccination | Vcr | Numeric | F | The ratio of completed vaccination (2 injections) | - |
| Vir | Numeric | F | The ratio of initiated vaccination (1 injection) | - | |
| Case numbers | P1, P2, P3, …, P7 | Numeric | T | Daily new cases per 100, 000 population for the WWTP on the 1st, 2nd, 3rd, …, 7th day since the wastewater sampling date | Case/100,000 person |
| P3d, P7d, P14d, P3dF, P7dF, P14dF | Numeric | T | 3-day, 7-day and 14-day rolling and future average of daily new cases per 100, 000 population of the wastewater sampling date | Case/100,000 person | |
| Effective reproduction rate | Numeric | T | The COVID-19 effective reproduction rate, which represents how fast COVID is spreading in a given area by estimating the number of people that a newly infected person goes on to eventually infect. | - |
F and T indicate the data were used as features (input) and targets (response) of the ANN models, respectively. Some reserved parameters are indicated as “-”.
Two sets of test ratios and test positive ratios were collected, one for the Utah state and one for the counties. The state-level data is applied when the county-level data is not available. The clinical test ratio and positive ratio of specific wastewater catchment was determined by its overlapping with the county boundaries if possible.
ANN model structures for the evaluation of different capacities in estimating the COVID-19 epidemiological parameters.
| Group | No. of scenarios | ANN features | ANN targets |
|---|---|---|---|
| Prediction of incidence rate (ANN-IR) | 7 | Pop, ADWF, VL, Twater, Prain, Tair, TPR, Vcr, Vir | P1, P2, P3, …, P7 |
| Prediction of prevalence rate (ANN-PR) | 6 | Pop, ADWF, VL, Twater, Prain, Tair, TPR, Vcr, Vir | P3d, P7d, P14d, P3dF, P7dF, P14dF |
| Prediction of effective reproduction rate (ANN- | 1 | Pop, ADWF, VL, Twater, Prain, Tair, TPR, Vcr, Vir | |
| Contributions of inputs | 8 | All represents the complete sets of input data. Different combinations among the categories of weather (W), clinical testing (T), and vaccination (V). | Incidence, prevalence rate and |
| The eight scenarios are All, All-V, All-T, All-W, All-V-T, All-V-W, All-T-W, All-V-T-W. |
Fig. 1Correlations between all ANN input features and targets (P4 and P14d, representing incidence and prevalence rate, respectively, and Ri) in the WBE datasets obtained in Utah, USA. The numbers and circle sizes indicate the correlation coefficient; and blank cells indicate insignificant correlations by a cut-off p=0.01.
Fig. 2(A) Box plots of the correlation coefficient (R) and estimation error (MSE) of ANN models being trained to predict incidence rates, i.e., P1, P2, …, P7 (ANN-IR) and prevalence rates, i.e., P3d, P7d, P14d and P3dF, P7dF and P14dF (ANN-PR) in Utah, USA. (B) ANN outputs vs. clinical testing reported incidence rate, i.e. case numbers on the 4th day of the wastewater sampling date. (C) ANN outputs vs. the prevalence rate reported by clinical test, i.e., the 14-day running average of the wastewater sampling date.
Fig. 3Daily new cases (green circles) on the 4th day (A) and 14-day running average (B) case numbers of wastewater sampling date and ANN estimated cases (lines) for selected wastewater treatment plants with different populations, i.e., 500, 250, 100, 25, and 6 thousand people in Utah, USA.
Fig. 4The regression plot of the ANN estimated effective reproduction rate vs. the reported effective reproduction rate determined in conventional approach in Utah, USA.
The correlation coefficient (R) and estimation error (MSE) of the ANN-IR, ANN-PR and ANN-Ri models being trained with complete or partial inputs to predict incidence rate P4, prevalence rate P14d and effective production rate Ri, respectively.
| Inputs | Average | 95% CI | Average | 95% CI | |
|---|---|---|---|---|---|
| P4 | All | 0.72 | 0.002 | 525.7 | 3.02 |
| All-V | 0.68 | 0.009 | 583.3 | 13.70 | |
| All-T | 0.69 | 0.003 | 572.9 | 4.61 | |
| All-W | 0.68 | 0.008 | 587.5 | 12.48 | |
| All-V-T | 0.55 | 0.021 | 765.4 | 25.24 | |
| All-V-W | 0.62 | 0.007 | 677.4 | 9.30 | |
| All-T-W | 0.60 | 0.021 | 699.0 | 31.96 | |
| All-V-T-W | 0.44 | 0.046 | 921.6 | 136.41 | |
| P14d | All | 0.89 | 0.004 | 151.4 | 5.76 |
| All-V | 0.87 | 0.005 | 184.6 | 6.92 | |
| All-T | 0.86 | 0.011 | 199.5 | 15.77 | |
| All-W | 0.85 | 0.005 | 209.6 | 6.76 | |
| All-V-T | 0.78 | 0.012 | 290.5 | 14.40 | |
| All-V-W | 0.82 | 0.004 | 243.0 | 4.51 | |
| All-T-W | 0.76 | 0.008 | 313.1 | 9.98 | |
| All-V-T-W | 0.58 | 0.027 | 501.5 | 34.72 | |
| All | 0.85 | 0.003 | 0.0072 | 0.0001 | |
| All-V | 0.78 | 0.007 | 0.0104 | 0.0003 | |
| All-T | 0.83 | 0.016 | 0.0086 | 0.0008 | |
| All-W | 0.82 | 0.002 | 0.0090 | 0.0001 | |
| All-V-T | 0.72 | 0.025 | 0.0128 | 0.0010 | |
| All-V-W | 0.71 | 0.003 | 0.0134 | 0.0001 | |
| All-T-W | 0.78 | 0.004 | 0.0104 | 0.0002 | |
| All-V-T-W | 0.67 | 0.005 | 0.0148 | 0.0002 | |